Citation: Herbert F. Jelinek, Jemal H. Abawajy, David J. Cornforth, Adam Kowalczyk, Michael Negnevitsky, Morshed U. Chowdhury, Robert Krones, Andrei V. Kelarev. Multi-layer Attribute Selection and Classification Algorithm for the Diagnosis of Cardiac Autonomic Neuropathy Based on HRV Attributes[J]. AIMS Medical Science, 2015, 2(4): 396-409. doi: 10.3934/medsci.2015.4.396
[1] | Herbert F. Jelinek, Andrei V. Kelarev . A Survey of Data Mining Methods for Automated Diagnosis of Cardiac Autonomic Neuropathy Progression. AIMS Medical Science, 2016, 3(2): 217-233. doi: 10.3934/medsci.2016.2.217 |
[2] | Herbert F. Jelinek, Jemal H. Abawajy, Andrei V. Kelarev, Morshed U. Chowdhury, Andrew Stranieri . Decision trees and multi-level ensemble classifiers for neurological diagnostics. AIMS Medical Science, 2014, 1(1): 1-12. doi: 10.3934/medsci.2014.1.1 |
[3] | Abdulah Saeed, Alhanouf AlQahtani, Abdullah AlShafea, Abdrahman Bin Saeed, Meteb Albraik . The impact of telemedicine on cardiac patient outcomes: A study in Saudi Arabian hospitals. AIMS Medical Science, 2024, 11(4): 439-451. doi: 10.3934/medsci.2024030 |
[4] | Fatemeh keshavarzi, Mohsen Askarishahi, Maryam Gholamniya Foumani, Hossein Falahzadeh . Parametric and the Cox risk model in the analysis of factors affecting the time of diagnosis of retinopathy with patients type 2 diabetes. AIMS Medical Science, 2019, 6(2): 170-178. doi: 10.3934/medsci.2019.2.170 |
[5] | Andrei V. Kelarev, Xun Yi, Hui Cui, Leanne Rylands, Herbert F. Jelinek . A survey of state-of-the-art methods for securing medical databases. AIMS Medical Science, 2018, 5(1): 1-22. doi: 10.3934/medsci.2018.1.1 |
[6] | Turki M. Bin Mahfoz, Ahmad K. Alnemare . Optic neuropathy related to Onodi cell mucocele: a systematic review and meta-analysis of randomized controlled trials. AIMS Medical Science, 2021, 8(3): 203-223. doi: 10.3934/medsci.2021018 |
[7] | Norhayati Mohd Zainee, Kalaivani Chellappan, Joseph Vehi, Petrick Periyasamy . The vital sign and haematological profile of adult dengue fever: a retrospective study. AIMS Medical Science, 2021, 8(1): 56-69. doi: 10.3934/medsci.2021007 |
[8] | Wenyang Liu, Dan Ruan . Segmentation of Left Ventricle with a Coupled Length Regularization and Sparse Composite Shape Prior: a Variational Approach. AIMS Medical Science, 2015, 2(4): 295-302. doi: 10.3934/medsci.2015.4.295 |
[9] | Ola Shawky, Maha Alkhaldi, Deema Yousef, Adnan Alhindi, Teef Alosaimi, Amjad Jawhari, Aliah Aladwani . Incidence of first-time stroke in Taif, Saudi Arabia. AIMS Medical Science, 2022, 9(2): 293-303. doi: 10.3934/medsci.2022012 |
[10] | Segun Akinola, Reddy Leelakrishna, Vijayakumar Varadarajan . Enhancing cardiovascular disease prediction: A hybrid machine learning approach integrating oversampling and adaptive boosting techniques. AIMS Medical Science, 2024, 11(2): 58-71. doi: 10.3934/medsci.2024005 |
Diabetes mellitus and diabetic complications are predicted to increase world-wide. Cardiovascular complications of diabetes account for 65% of diabetic deaths and cardiac autonomic neuropathy (CAN) belongs to the major contributors to sudden cardiac deaths (SCD). The outcomes of the Action to Control Cardiovascular Risk in Diabetes (ACCORD) and National Diabetes Strategy (NDS) trial recommended regular screening of people with diabetes, especially for comorbidities including autonomic nervous system (ANS) dysfunction [1,2,3]. The increased risk of arrhythmias and SCD associated with CAN makes screening of people with diabetes an important health issue as early detection and intervention improve treatment effectiveness [4,5].
Mortality associated with CAN has been estimated to be approximately five times higher in diabetes with autonomic neuropathy (29%) compared to diabetes without autonomic neuropathy [5,6]. Prevalence of CAN lies between 22% and 60% in people with diabetes depending on duration of diabetes and treatment efficacy. CAN, even at the asymptomatic stage can lead to significant changes in the heart rate variability (HRV) metrics reflecting impaired heart rate and blood pressure regulation. Early detection of CAN and timely treatment are important for risk stratification and reducing the effects of diabetic neuropathy, which affects the function of all major body organs, leading to kidney failure, urinary dysfunction and cardiac arrhythmias.
In the present paper we propose a new machine learning algorithm for the diagnosis of CAN progression based on HRV attributes, the Multi-Layer Attribute Selection and Classification (MLASC) algorithm. It begins with attribute extraction and incorporates our new automated attribute selection procedure, Double Wrapper Subset Evaluator with Particle Swarm Optimization (DWSE-PSO) combined with a multi-layer classifier. We present the results of a comprehensive collection of experiments, which compare MLASC with other simpler versions and counterpart methods.
It is a new task to detect CAN automatically on the basis of HRV attributes, which are often available in relation to other health investigations. We conducted comprehensive experiments comparing several automated feature selection and optimization methods that have not been considered before for the task of classification of CAN progression using HRV attributes. In addition, we introduce a new feature selection method, and apply it for solving this task. Our experiments compare the effectiveness of these automated feature selection methods in conjunction with the application of hierarchical classifiers for the classification of CAN progression.
Our previous article [7] studied multi-level classifiers for neurological diagnostics using different attributes including traditional tests for identification of CAN and available in the Diabetes Screening Complications Research Initiative (DiScRi) database. The Ewing battery is commonly used for detecting CAN but is often not conclusive and therefore more sensitive and accurate tests are required. ECGs are routinely assessed in clinical practice and although they do not directly indicate CAN, HRV can be determined from the interbeat interval tachogram or from a continuous heart rate recording [8]. HRV as a clinical tool using ECG recordings has been shown to be a sensitive marker for risk of future arrhythmias or CAN and easier to use clinically compared to the Ewing battery.
Here we concentrate on developing optimal combinations of options to be employed at the different layers of MLASC for the diagnostics of CAN progression based on HRV attributes in people with diabetes. Background information on HRV attributes, CAN and the DiScRi database are included in Section 3. HRV information can include as many as 20-30 measures sensitive to different characteristics of the ECG time series that can be divided into time, frequency, and nonlinear measures. The use of HRV in diagnostics has been verified in identification of neonatal sepsis [9], arrhythmia risk [10], and in the risk of future adverse cardiac events. Which of the 20-30 measures are most sensitive for the diagnostics of CAN in diabetes progression has not been addressed in [8,11] and will be the subject of subsequent studies.
The present work was motivated by other recent results indicating the utility of incorporating several HRV measures in the classification of CAN [12,13]. Several other studies handled other different multi-stage systems [14,15,16,17]. However, this is the first article where multi-layer attribute selection is included in one of the layers of the main algorithm. The ability to use only HRV for accurate identification of CAN and CAN progression reduces the need for invasive testing such as obtaining cholesterol, BGL and HbA1c results.
HRV attributes are very important and are often collected for patients with various pathologies such as sepsis, heart failure and kidney failure. In order to investigate the role of HRV attributes and the capability of multi-level classifiers in improving diagnostic accuracy, we used a large database of health-related parameters and tests amalgamated in DiScRi [18] organized by Charles Sturt University.
Let us refer to [19,20,21] and [7] for preliminaries on the classification of disease progression associated with CAN. In particular, this article uses the classical set of five Ewing attributes specified in the following table (Table 1).
Test | Normal | Borderline | Abnormal |
(i) Heart rate response to standing (ratio of HR at 15 and 30 sec) | ≥ 1.04 | 1.01-1.03 | ≤ 1.00 |
(ii) Blood pressure response to standing (mmHg) | ≤ 10 | 11-29 | ≥ 30 |
(iii) Heart rate response to deep breathing (BPM) | ≥ 15 | 11-14 | ≤ 10 |
(iv) Valsalva maneuver and heart rate response (BPM) | ≥ 1.21 | 1.11-1.20 | ≤ 1.10 |
(v) Blood pressure response to sustained handgrip (mmHg) | ≥ 16 | 11-15 | ≤ 10 |
Following [7], we investigated three classifications of CAN progression introduced in [19,20]. They are the classifications into 2,3 and 4 classes: the presence or absence of CAN (2 classes), absence of CAN, early and definite CAN (3 classes) or adding the severe class.
The motivation to use HRV data is that HRV data are more often available and are easier to obtain in clinical practice than the Ewing battery features. HRV measures also provide many more variables compared to the five attributes in the Ewing battery.
Testing for CAN in people with diabetes has progressed from the simple Ewing tests to HRV analysis and analysis of more complex ECG features [22,23,24,25]. These new analysis techniques determine the changes in heart rate or ECG characteristics over time and have resulted in CAN being diagnosed before clinical signs are present with greater accuracy and reproducibility (see [26,27,28] for more details and further references). HRV analysis involves determining the interbeat intervals (RR intervals) between successive QRS complexes on an ECG. The next table (Table 2) summarizes various HRV analysis methods, see [29,30,31].
HRV method | Measure | Description |
Time Domain | SDNN | The standard deviation of the NN intervals |
RMSSD | The square root of the mean squared difference of the NN intervals | |
Frequency Domain | Total Power | Variance of N-N intervals over the temporal segment (freq < 0.4) |
VLF | Power in very low frequency range (freq < 0.04) | |
LF | Power in low frequency range (freq 0.04 to 0.15) | |
HF | Power in high frequency range (freq 0.15 to 0.4) | |
Nonlinear | SD1, SD2 | The standard deviations perpendicular to and along the line-of-identity of the Poincaré plot |
ApEn | Approximate entropy | |
SampEn | Sample entropy | |
D2 | Correlation dimension | |
DFA | Detrended fluctuation analysis: | |
α1 - Short-term fluctuation slope; α2- Long-term fluctuation slope | ||
ShanEn | Shannon entropy | |
Rényi En | Rényi entropy |
Nonlinear HRV measures have become popular in recent times as they are more robust against the nonstationarity and nonlinearity characteristics of the RR tachogram and are able to detect how aging and pathological conditions affect the nonstationarity of the signal [32,30]. Non-linear HRV features such as Detrended Fluctuation Analysis (DFA), estimate complexity inherent in the signal. The correlation dimension (D2) can also be applied [33]. Several entropy measures have been proposed such as approximate entropy, sample entropy, and tone-entropy [34,35]. These measures have subsequently led to the multiscale entropy measures including the multi-scale Rényi entropy, which is a generalization of the Shannon entropy [13].
To prepare data for experiments in this paper we kept the Ewing attributes and the HRV attributes in the data files and deleted all other attributes including biomarkers such as cholesterol profile, co-morbidities, HbA1c and blood sugar levels, which are associated with CAN in [8,36].
We used the following classifiers implemented in WEKA.
· Bayes Net (BN) provides network structure, conditional probability distributions and other data structures and facilities for Bayes Network learning. We used it with the K2 search algorithm.
· J48 classifier is one of the most efficient decision tree classifiers in WEKA employing C4.5 tree as described in [37].
· Random Forest (RF) applies a collection of random trees [38]. RF is also implemented in WEKA and Rattle [39].
· SMO trains a support vector classifier by applying sequential minimal optimization [40].
· Decorate ensemble based on Random Forest (DRF). This classifier outperformed several other ensemble classifiers with two tiers for the classification of CAN in [7]. Decorate constructs special artificial training examples to build diverse ensembles of classifiers.
Let us refer to [41,42,43,44] for preliminaries on these methods, and to [39,45] for more information on Random Forest.
Wrapper Subset Evaluator (WSE) in WEKA searches for a set of attributes tailored for the use with a particular classifier. It evaluates attribute sets by using a classifier indicated as an input parameter for the Wrapper as explained in [46]. WSE employs a number of search algorithms based on different strategies. The results produced by WSE were then compared with three search methods:
· Wrapper Subset Evaluator with Best First Search (WSE-BFS). This method produced good results in [46].
· Wrapper Subset Evaluator with Genetic Search (WSE-GS) using genetic algorithm described in [47] and applied previously by our research group [48].
· Wrapper Subset Evaluator with Particle Swarm Optimization search algorithm (WSE-PSO) explores the attribute space using the Particle Swarm Optimization (PSO) algorithm as explained in [49].
In addition, we applied the following attribute ranking methods implemented in WEKA.
· Classifier Attribute Evaluator (CAE) evaluates attributes applying a classifier and ranks all attributes according to their significance for the particular classifier.
· Information Gain Attribute Evaluator (IGAE) assesses attributes by calculating the information gain.
Each of the ranking methods orders all attributes according to its own algorithm for determining significance of the attributes. For each classifier, we used the resulting ranking to decide which attributes to select. To this end, we started with the most significant attribute and determined the effectiveness of the classifier based on this attribute alone. Then we incrementally added the next most significant attribute on the ranking list, and continued this process until the effectiveness of the classifier stopped improving.
MLASC algorithm has three layers explained in this section. Note that this is the first article where multi-layer attribute selection method is introduced and included in one of the layers of the main algorithm.
Layer 1 of MLASC begins by extracting a large collection of HRV attributes. A C++ program was written by the author to extract attributes. In particular, Rényi entropy was calculated with α = −5, −4, …, 4,5, and estimated probabilities pi of sequences of RR intervals of length π = 1,2,4,8,16.
Layer 2 of MLASC uses a hierarchical ensemble, Bagging of Decorate ensemble based on Random Forest (BDRF) and applies a new attribute selection method introduced in this paper.
The BDRF classifier was introduces in our previous article [7], where it produced the best outcomes for a different task. It combines Bagging with Decorate and is based on Random Forest. Bagging (bootstrap aggregating) generates a collection of new sets by resampling the given training set at random and with replacement. New classifiers are then trained, one for each of these new training sets. They are amalgamated via a majority vote.
Our new attribute selection method is a Double Wrapper Subset Evaluator with Particle Swarm Optimization (DWSE-PSO). The first sublayer of this method applies WSE-PSO to select the first subset of attributes for BDRF. All instances are then classified, using the selected first subset of features. The misclassified instances, where incorrect outcomes had been produced, are collected in a new dataset. After that in the second sublayer WSE-PSO is applied to the new dataset of misclassified instances again using BDRF, which results in a second subset of attributes. Two subsets are then combined together to compose the final set of chosen attributes.
Layer 3 of MLASC extracts from the whole training set all attributes chosen in Layer 2 and trains the BDRF classifier using all these attributes. The BDRF is then applied to validate the set using all attributes chosen in Layer 2.
To prepare the dataset for experiments, we selected all patients in the DiScRi database with available HRV attributes and recorded all DiScRi parameters for these patients in a csv file. We have kept only the HRV attributes and the Ewing battery results, deleting all other features, and created 3 copies of the data file to explore the progression of CAN indicated in the DiScRi database as “no CAN”, “early CAN” and “definite CAN”. In the first copy, a two-class paradigm was investigated with the last column containing the class value for classification including “definite CAN” and “no CAN”. In the second copy, we added three CAN classes as “no CAN”, “early CAN” or “definite CAN”. In the third copy, as the last column we added one of four CAN classes including “no CAN”, “early CAN”, “definite CAN” or “severe CAN”. In order to enable all classifiers available in WEKA Explorer to process these three files, the files were reformatted into ARFF format, which is the standard format used by all classifiers in WEKA. These three files were used in all experiments presented in the paper. For each classifier in every test, the data file was divided in two parts of equal size. The first part was used to apply feature selection strategy for the classifier. After the optimal set of features had been selected using the first part, we selected the attributes of the chosen optimal set of features in the second part of the file. This second part with chosen features was then used to determine the effectiveness of the classifier.
We carried out experiments to determine the optimum classifier algorithm and an optimum attribute selection algorithm using WEKA [42,43] and R [39,45] in all experiments.
To compare the effectiveness of classifiers we used the Area Under Curve (AUC), as the measure of performance. The value of AUC is included in the output of all classifiers in WEKA. Since WEKA is open source software, its source code has also always been available for inspection by the researchers. We refer the readers to [42] and [43] for more details.
We carried out experiments to compare the effectiveness of MLASC algorithm with other counterpart methods, and also with several simplified versions of MLASC, where the main ingredients of MLASC are replaced by simpler alternatives. In the figures with the results we include the average values of AUC obtained in the 5 × 2 cross-validation (5 × 2 CV), because it is shown in Table 1 of [52] that the average value is a standard measure of performance which is one of the most frequently used indications of effectiveness of classifiers used in the literature. To prevent over-fitting we used 5 × 2 cross-validation introduced and recommended in [51] for comparison of classifiers. This method carries out five iterations of twofold cross-validation. The results of cross-validation implemented in WEKA are included in the output of all classifiers automatically.
Figure 1 presents the results of our experiments comparing the performance of MLASC with other algorithms. In this figure BDRF, BN, DRF, J48, RF, SMO are applied to process the data directly and not as a part of larger multi-layer process. The 5 × 2 CV test recommended in [52] established that MLASC produced better outcomes and these differences are significant at the 95% confidence level.
Figure 2 presents the results of experiments comparing the performance of MLASC with other simplified versions, where both BDRF and DWSE-PSO, or at least one of these main ingredients of MLASC, are replaced by simpler alternatives.
Let us introduce notation to refer to such simpler versions in the diagram. If A is an attribute selection method and C is a classifier, then we use the symbol SV (A, C) to denote a simplified version of MLASC algorithm, where BDRF is replaced by C and the application of DWSE-PSO is replaced by A. Our experiments compare MLASC with the following simplified versions: SV(WSE-BFS, BN), SV(WSE-BFS, J48), SV(WSE-BFS, RF), SV(WSE-BFS, SMO), SV(WSE-BFS, DRF), SV(WSE-BFS, DRF), SV(WSE-GS, BN), SV(WSE-GS, J48), SV(WSE-GS, RF), SV(WSE-GS, SMO), SV(WSE-GS, DRF), SV(WSE-GS, BDRF), SV(WSE-PSO, BN), SV(WSE-PSO, J48), SV(WSE-PSO, RF), SV(WSE-PSO, SMO), SV(WSE-PSO, DRF), SV(WSE-PSO, BDRF), SV(CAE, BN), SV(CAE, J48), SV(CAE, RF), SV(CAE, SMO), SV(CAE, DRF), SV(CAE, BDRF), SV(IGAE, BN), SV(IGAE, J48), SV(IGAE, RF), SV(IGAE, SMO), SV(IGAE, DRF), SV(IGAE, BDRF), SV(DWSE-PSO, BN), SV(DWSE-PSO, J48), SV(DWSE-PSO, RF), SV(DWSE-PSO, SMO), SV(DWSE-PSO, DRF). This list contains all simplified versions of MLASC, where at least one of the main ingredients is replaced by a simpler version. Figure 2 presents the outcomes of these classifiers and MLASC grouped according to the attribute selection applied at Layer 2: a) applying WSA with best first search, b) genetic search, c) PSO, d) DWSA with PSO, e) CAE ranking, and f) IGAE ranking.
In general, it can be observed that selecting features using PSO (Figure 1c) provided the highest values of AUC and that the BDRF classifier provided the highest accuracy, regardless of which method was used for feature selection. The best outcome was obtained by the MLASC algorithm with an AUC equal to 0.96. The 5 × 2 CV test recommended in [52] established that these results are significant at the 95% confidence level.
In this article, we introduced Multi-Layer Attribute Selection and Classification (MLASC) algorithm for the diagnosis of CAN progression based on HRV attributes. We carried out a comprehensive set of experiments comparing the effectiveness of MLASC with other simplified versions and counterpart methods. The results obtained demonstrated that the MLASC algorithm produced best results. Diagnosis by this method achieved an AUC level of 0.96 when applying DWSA with PSO attribute selection. Classification of presence of CAN and CAN progression remained above 0.90 in the 3 and 4 class paradigms, which is high enough for clinical diagnostics.
The variation of success using different classifier algorithms when using a separate subset for each classifier supports the findings of Kohavi [46], by showing that the attributes chosen should be regarded as part of the classifier algorithm. The current application involving the Rényi entropy addresses methodological issues inherent in previous multiscale applications [52].
The results obtained by applying MLASC algorithm show that recognizing CAN progression from HRV data can provide an effective test which would benefit diabetic patients by early diagnosis and better treatment, and consequently a reduction in hospitalization and length of stay [53].
The authors wish to thank the three reviewers for comments that have helped to improve this article. We acknowledge the many students that have contributed to the data analyzed and reported in this paper. Part of the research presented was funded by the Diabetes Australia Research Trust, Albury and Tallangatta Council Funding, Deakin-Ballarat Collaboration Funding and Charles Sturt University Research Compacts grants. Roche Australia Pty is gratefully acknowledged for provided glucose meters and test strips.
All authors declare no conflicts of interest in this paper.
[1] | Colagiuri S, Colagiuri R, Ward J (1998) National diabetes strategy and implementation plan. Canberra: Paragon Printers. |
[2] | Pop-Busui R, Evans GW, Gerstein HC, et al. (2010) The ACCORD Study Group. Effects of cardiac autonomic dysfunction on mortality risk in the Action to Control Cardiovascular Risk in Diabetes (ACCORD) Trial. Diab Care 33: 1578-1584. |
[3] | Spallone V, Ziegler D, Freeman R, et al. (2011) Cardiovascular autonomic neuropathy in diabetes: clinical impact, assessment, diagnosis, and management. Diab Metab Res Rev 27: 639-653. |
[4] |
Dimitropoulos G, Tahrani AA, Stevens MJ (2014) Cardiac autonomic neuropathy in patients with diabetes mellitus. World J Diab 5: 17-39. doi: 10.4239/wjd.v5.i1.17
![]() |
[5] |
Vinik AI, Erbas T, Casellini CM (2013) Diabetic cardiac autonomic neuropathy, inflammation and cardiovascular disease. J Diabetes Investig 4: 4-18. doi: 10.1111/jdi.12042
![]() |
[6] |
Task Force of The European Society of Cardiology and The North American Society of Pacing and Electrophysiology (1996) Special report: heart rate variability standards of measurement, physiological interpretation, and clinical use. Circulation 93: 1043-1065. doi: 10.1161/01.CIR.93.5.1043
![]() |
[7] | Jelinek HF, Abawajy JH, Kelarev AV, et al. (2014) Decision trees and multi-level ensemble classifiers for neurological diagnostics. AIMS Med Sci 1: 1-12. |
[8] | Dietrich DF, Schindler C, Schwartz J, et al. (2006) Heart rate variability in an ageing population and its association with lifestyle and cardiovascular risk factors: results of the SAPALDIA study. Europace 8: 521-529. |
[9] | Lake DE, Richman JS, Griffin MP, et al. (2002) Sample entropy analysis of neonatal heart rate variability. Am J Physiol 283: 789-797. |
[10] |
La Rovere MT, Pinna GD, Maestri R, et al. (2003) Short-term heart rate variability strongly predicts sudden cardiac death in chronic heart failure patients. Circulation 107: 565-570. doi: 10.1161/01.CIR.0000047275.25795.17
![]() |
[11] |
Huikuri HV, Linnaluoto MK, Seppänen T, et al. (1992) Circadian rhythm of heart rate variability in survivors of cardiac arrest. Am J Cardiol 70: 610-615. doi: 10.1016/0002-9149(92)90200-I
![]() |
[12] | Cornforth DJ, Tarvainen MP, Jelinek HF (2014) Visualization methods for assisting detection of cardiovascular neuropathy. Engineering in Medicine and Biology Society (EMBC2014) 36th Annual International Conference of the IEEE, 26-30 Aug. 2014, 6675-6678. |
[13] | Tarvainen MP, Cornforth DJ, Jelinek HF (2014) Principal component analysis of heart rate variability data in assessing cardiac autonomic neuropathy. Engineering in Medicine and Biology Society (EMBC2014), 36th Annual International Conference of the IEEE, 26-30 Aug. 2014, 6667-6670. |
[14] | Abawajy J, Kelarev A, Chowdhury M (2013) Multistage approach for clustering and classification of ECG data. Comp Meth Pro Biomed 112: 720-730. |
[15] | Abawajy J, Kelarev A, Chowdhury M, et al. (2013) Predicting cardiac autonomic neuropathy category for diabetic data with missing values, Comp Bio Med 43: 1328-1333. |
[16] |
Stranieri A, Abawajy J, Kelarev A, et al. (2013) An approach for Ewing test selection to support the clinical assessment of cardiac autonomic neuropathy. Art Intel Med 58: 185-193. doi: 10.1016/j.artmed.2013.04.007
![]() |
[17] | Jelinek HF, Yatsko A, Stranieri A, et al. (2015) Diagnostic with incomplete nominal/discrete data. Art Intel Med 4: 22-35. |
[18] | Cornforth D, Jelinek HF (2007) Automated classification reveals morphological factors associated with dementia, App Soft Compu 8: 182-190. |
[19] | Ewing DJ, Campbell JW, Clarke BF (1980) The natural history of diabetic autonomic neuropathy. Q J Med 49: 95-100. |
[20] |
Ewing DJ, Martyn CN, Young RJ, et al. (1985) The value of cardiovascular autonomic functions tests: 10 years experience in diabetes. Diab Care 8: 491-498. doi: 10.2337/diacare.8.5.491
![]() |
[21] |
Khandoker AH, Jelinek HF, Palaniswami M (2009) Identifying diabetic patients with cardiac autonomic neuropathy by heart rate complexity analysis. BioMed Engine Online 8: 1-12. doi: 10.1186/1475-925X-8-1
![]() |
[22] | Thayer JF, Yamamoto SS, Brosschot JF (2010) The relationship of autonomic imbalance, heart rate variability and cardiovascular disease risk factors, Int J Cardiol 141: 122-131. |
[23] |
Karmakar CK, Khandoker AH, Jelinek HF, et al. (2013) Risk stratification of cardiac autonomic neuropathy based on multi-lag Tone-Entropy. Med Bio Engine Comp 51: 537-546. doi: 10.1007/s11517-012-1022-5
![]() |
[24] | Tan CO (2013) Heart rate variability: are there complex patterns? Front Physiol 4: 1-3. |
[25] | Imam MH, Karmakar C, Khandoker AH, et al. (2014) Analysing cardiac autonomic neuropathy in diabetes using electrocardiogram derived systolic-diastolic interval interactions. Compu Cardiol 41: 85-88. |
[26] |
Spallone V, Menzinger G (1997) Diagnosis of cardiovascular autonomic neuropathy in diabetes. Diabetes 46: 67-76. doi: 10.2337/diab.46.2.S67
![]() |
[27] | Jelinek HF, Pham P, Struzik ZR, et al. (2007) Short term ECG recording for the identification of cardiac autonomic neuropathy in people with diabetes mellitus. Proceedings of the 19th International Conference on Noise and Fluctuations, Tokyo, Japan, pp. 683-686. |
[28] | Khandoker AH, Weiss DN, Skinner JE, et al. (2011) PD2i heart rate complexity measure can detect cardiac autonomic neuropathy: an alternative test to Ewing battery. Compu Cardiol 38: 525-528. |
[29] | TFESC/NASPE (1996) Heart rate variability. Standards of measurement, physiological interpretation, and clinical use. Task Force of the European Society of Cardiology and the North American Society of Pacing and Electrophysiology. Euro Heart J 17: 354-381. |
[30] |
Goldberger AL, Amaral LAN, Hausdorff JM, et al. (2002) Fractal dynamics in physiology: Alterations with disease and aging. Pro Nat Aca Sci USA 99: 2466-2472. doi: 10.1073/pnas.012579499
![]() |
[31] |
Peng CK, Havlin S, Stanley HE, et al. (1995) Quantification of scaling exponents and crossover phenomena in nonstationary heartbeat time series. Chaos 5: 82-87. doi: 10.1063/1.166141
![]() |
[32] | Ho YL, Lin C, Lin YH, et al. (2011). The prognostic value of non-linear analysis of heart rate variability in patients with congestive heart failure—a pilot study of multiscale entropy. PLoS ONE 6: 1-6. |
[33] | Sturmberg JP, Bennett JM, Picard M, et al. (2015) The trajectory of life. Decreasing physiological network complexity through changing fractal patterns. Front Physiol 6: 1-11. |
[34] |
Oida ET, Moritani KT, Yamori Y (1999) Diabetic alteration of cardiac vago-sympathetic modulation assessed with tone-entropy analysis. Acta Physiol Scandi 165: 129-135. doi: 10.1046/j.1365-201x.1999.00494.x
![]() |
[35] |
Lake DE, Moorman JR (2011) Accurate estimation of entropy in very short physiological time series: the problem of atrial fibrillation detection in implanted ventricular devices. Am J Physiol Heart Cicul Physiol 300: H319-325. doi: 10.1152/ajpheart.00561.2010
![]() |
[36] | Jelinek HF, Khandoker A, Palaniswami M, et al. (2010) Tone-entropy analysis as a cardiac risk stratification tool. Compu Cardiol 37: 955-958. |
[37] | Quinlan R (1993) C4.5: Programs for Machine Learning. San Mateo, CA: Morgan Kaufmann Publishers. |
[38] |
Breiman L (2001) Random Forests. Machine Learning 45:5-32. doi: 10.1023/A:1010933404324
![]() |
[39] | Williams G (2011) Data mining with Rattle and R: the art of excavating data for knowledge discovery (use R!). New York, Dordrecht, Heidelberg, London: Springer. |
[40] | Platt J (1998) Fast training of support vector machines using sequential minimal optimization. In: Schoelkopf B, Burges C, Smola A, editors, Advances in Kernel Methods—Support Vector Learning 41-64. |
[41] | Witten H, Frank E, Hall MA (2011) Data mining: practical machine learning tools and techniques with java implementations. 3ed, New York, Sydney: Morgan Kaufmann, 2011. |
[42] | Bouckaert, RR, Frank E, Hall M, et al. WEKA manual for version 3-7-13, http://www.cs.waikato.ac.nz/ml/weka/, viewed 15 July 2015. |
[43] | Hall M, Frank E, Holmes G, et al. (2009) The WEKA data mining software: an update. SIGKDD Explor 11: 10-18. |
[44] | Negnevitsky M (2011) Artificial intelligence: a guide to intelligent systems. 3rd eds., New York: Addison Wesley. |
[45] | Williams GJ (2009) Rattle: a data mining GUI for R, The R J 1: 45-55. |
[46] |
Kohavi R, John GH (1997) Wrappers for feature subset selection. Art Intell 97:273-324. doi: 10.1016/S0004-3702(97)00043-X
![]() |
[47] | Goldberg DE (1989) Genetic algorithms in search, optimization and machine learning. New York: Addison-Wesley. |
[48] | Cornforth DJ, Jelinek HF, Teich MC, et al. (2004) Wrapper subset evaluation facilitates the automated detection of diabetes from heart rate variability measures. Proceedings of the International Conference on Computational Intelligence for Modelling Control and Automation (CIMCA'2004), University of Canberra, Australia, pp. 446-455. |
[49] | Moraglio A, Di Chio C, Poli R (2007) Geometric Particle Swarm Optimisation. Proceedings of the 10th European Conference on Genetic Programming, Berlin, Heidelberg, 125-136. |
[50] | Demsar J (2006) Statistical comparisons of classifiers over multiple data sets. J Machine Learning Res 7: 1-30. |
[51] |
Dietterich TG (1998) Approximate statistical tests for comparing supervised classification learning algorithms. Neural Compu 10: 1895-1924. doi: 10.1162/089976698300017197
![]() |
[52] | Cornforth D, Tarvainen M, Jelinek HF (2014) How to calculate Rényi entropy from Heart Rate Variability, and why it matters for detecting cardiac autonomic neuropathy. Front Bioeng Biotecho 2: 1-7. |
[53] |
Ziegler DA, Rathmann VW, Strom A, et al. (2015) Increased prevalence of cardiac autonomic dysfunction at different degrees of glucose intolerance in the general population: the KORA S4 survey. Diabetologia 58:1118-1128. doi: 10.1007/s00125-015-3534-7
![]() |
1. | Herbert F. Jelinek, Andrei V. Kelarev, A Survey of Data Mining Methods for Automated Diagnosis of Cardiac Autonomic Neuropathy Progression, 2016, 3, 2375-1576, 217, 10.3934/medsci.2016.2.217 | |
2. | Andrei V. Kelarev, Willy Susilo, Mirka Miller, Joe Ryan, Ideals of Largest Weight in Constructions Based on Directed Graphs, 2016, 15, 2278-9634, 8, 10.18052/www.scipress.com/BMSA.15.8 | |
3. | Herbert F. Jelinek, Marc T. P. Adam, Robert Krones, David J. Cornforth, Diagnostic Accuracy of Random ECG in Primary Care for Early, Asymptomatic Cardiac Autonomic Neuropathy, 2017, 11, 1932-2968, 1165, 10.1177/1932296817703670 | |
4. | K.H. Boon, M. Khalil-Hani, MB Malarvili, Paroxysmal atrial fibrillation prediction based on HRV analysis and non-dominated sorting genetic algorithm III, 2018, 153, 01692607, 171, 10.1016/j.cmpb.2017.10.012 | |
5. | Andrei V. Kelarev, Xun Yi, Hui Cui, Leanne Rylands, Herbert F. Jelinek, A survey of state-of-the-art methods for securing medical databases, 2018, 5, 2375-1576, 1, 10.3934/medsci.2018.1.1 | |
6. | Ahmad Shaker Abdalrada, Jemal Abawajy, Tahsien Al-Quraishi, Sheikh Mohammed Shariful Islam, Prediction of cardiac autonomic neuropathy using a machine learning model in patients with diabetes, 2022, 13, 2042-0188, 204201882210866, 10.1177/20420188221086693 | |
7. | Zahra Solatidehkordi, Salam Dhou, 2022, Detecting Diabetic Autonomic Neuropathy from Electronic Health Records Using Machine Learning, 978-1-6654-8016-1, 205, 10.1109/HealthCom54947.2022.9982752 | |
8. | Mayura Nagar, Pooja Raundale, Ahsan Khandoker, Herbert Jelinek, 2022, A Systematic Literature Review: Role of AI Algorithms for Automated Diagnosis of Diabetic Cardiac Autonomic Neuropathy [DCAN], 978-93-80544-44-1, 669, 10.23919/INDIACom54597.2022.9763159 | |
9. | Sharanya Senthamil Selvan, Sridhar P. Arjunan, Ramakrishnan Swaminathan, Dinesh Kant Kumar, Complexity Analysis in the PR, QT, RR and ST Segments of ECG for Early Assessment of Severity in Cardiac Autonomic Neuropathy, 2022, 12, 2076-3417, 5746, 10.3390/app12115746 | |
10. | Jeremy Man Ho Hui, Yan Hiu Athena Lee, Gary Tse, Tong Liu, Kamalan Jeevaratnam, Haipeng Liu, 2024, 9780323956864, 319, 10.1016/B978-0-323-95686-4.00023-X | |
11. | Moustafa Abdelwanis, Karim Moawad, Shahmir Mohammed, Ammar Hummieda, Shayaan Syed, Maher Maalouf, Herbert F. Jelinek, Sequential classification approach for enhancing the assessment of cardiac autonomic neuropathy, 2025, 190, 00104825, 109999, 10.1016/j.compbiomed.2025.109999 |
Test | Normal | Borderline | Abnormal |
(i) Heart rate response to standing (ratio of HR at 15 and 30 sec) | ≥ 1.04 | 1.01-1.03 | ≤ 1.00 |
(ii) Blood pressure response to standing (mmHg) | ≤ 10 | 11-29 | ≥ 30 |
(iii) Heart rate response to deep breathing (BPM) | ≥ 15 | 11-14 | ≤ 10 |
(iv) Valsalva maneuver and heart rate response (BPM) | ≥ 1.21 | 1.11-1.20 | ≤ 1.10 |
(v) Blood pressure response to sustained handgrip (mmHg) | ≥ 16 | 11-15 | ≤ 10 |
HRV method | Measure | Description |
Time Domain | SDNN | The standard deviation of the NN intervals |
RMSSD | The square root of the mean squared difference of the NN intervals | |
Frequency Domain | Total Power | Variance of N-N intervals over the temporal segment (freq < 0.4) |
VLF | Power in very low frequency range (freq < 0.04) | |
LF | Power in low frequency range (freq 0.04 to 0.15) | |
HF | Power in high frequency range (freq 0.15 to 0.4) | |
Nonlinear | SD1, SD2 | The standard deviations perpendicular to and along the line-of-identity of the Poincaré plot |
ApEn | Approximate entropy | |
SampEn | Sample entropy | |
D2 | Correlation dimension | |
DFA | Detrended fluctuation analysis: | |
α1 - Short-term fluctuation slope; α2- Long-term fluctuation slope | ||
ShanEn | Shannon entropy | |
Rényi En | Rényi entropy |
Test | Normal | Borderline | Abnormal |
(i) Heart rate response to standing (ratio of HR at 15 and 30 sec) | ≥ 1.04 | 1.01-1.03 | ≤ 1.00 |
(ii) Blood pressure response to standing (mmHg) | ≤ 10 | 11-29 | ≥ 30 |
(iii) Heart rate response to deep breathing (BPM) | ≥ 15 | 11-14 | ≤ 10 |
(iv) Valsalva maneuver and heart rate response (BPM) | ≥ 1.21 | 1.11-1.20 | ≤ 1.10 |
(v) Blood pressure response to sustained handgrip (mmHg) | ≥ 16 | 11-15 | ≤ 10 |
HRV method | Measure | Description |
Time Domain | SDNN | The standard deviation of the NN intervals |
RMSSD | The square root of the mean squared difference of the NN intervals | |
Frequency Domain | Total Power | Variance of N-N intervals over the temporal segment (freq < 0.4) |
VLF | Power in very low frequency range (freq < 0.04) | |
LF | Power in low frequency range (freq 0.04 to 0.15) | |
HF | Power in high frequency range (freq 0.15 to 0.4) | |
Nonlinear | SD1, SD2 | The standard deviations perpendicular to and along the line-of-identity of the Poincaré plot |
ApEn | Approximate entropy | |
SampEn | Sample entropy | |
D2 | Correlation dimension | |
DFA | Detrended fluctuation analysis: | |
α1 - Short-term fluctuation slope; α2- Long-term fluctuation slope | ||
ShanEn | Shannon entropy | |
Rényi En | Rényi entropy |