
Citation: Mauro Mandrioli, Gian Carlo Manicardi. Cytosine methylation in insects: new routes for the comprehension of insect complexity[J]. AIMS Biophysics, 2015, 2(4): 412-422. doi: 10.3934/biophy.2015.4.412
[1] | Abdulwahab Ali Almazroi . Survival prediction among heart patients using machine learning techniques. Mathematical Biosciences and Engineering, 2022, 19(1): 134-145. doi: 10.3934/mbe.2022007 |
[2] | Weidong Gao, Yibin Xu, Shengshu Li, Yujun Fu, Dongyang Zheng, Yingjia She . Obstructive sleep apnea syndrome detection based on ballistocardiogram via machine learning approach. Mathematical Biosciences and Engineering, 2019, 16(5): 5672-5686. doi: 10.3934/mbe.2019282 |
[3] | Lili Jiang, Sirong Chen, Yuanhui Wu, Da Zhou, Lihua Duan . Prediction of coronary heart disease in gout patients using machine learning models. Mathematical Biosciences and Engineering, 2023, 20(3): 4574-4591. doi: 10.3934/mbe.2023212 |
[4] | Jingchi Jiang, Xuehui Yu, Yi Lin, Yi Guan . PercolationDF: A percolation-based medical diagnosis framework. Mathematical Biosciences and Engineering, 2022, 19(6): 5832-5849. doi: 10.3934/mbe.2022273 |
[5] | Abigail Ferreira, Rui Lapa, Nuno Vale . A retrospective study comparing creatinine clearance estimation using different equations on a population-based cohort. Mathematical Biosciences and Engineering, 2021, 18(5): 5680-5691. doi: 10.3934/mbe.2021287 |
[6] | Atsushi Kawaguchi . Network-based diagnostic probability estimation from resting-state functional magnetic resonance imaging. Mathematical Biosciences and Engineering, 2023, 20(10): 17702-17725. doi: 10.3934/mbe.2023787 |
[7] | Michele L. Joyner, Cammey Cole Manning, Whitney Forbes, Valerie Bobola, William Frazier . Modeling Ertapenem: the impact of body mass index on distribution of the antibiotic in the body. Mathematical Biosciences and Engineering, 2019, 16(2): 713-726. doi: 10.3934/mbe.2019034 |
[8] | Wenzhu Song, Lixia Qiu, Jianbo Qing, Wenqiang Zhi, Zhijian Zha, Xueli Hu, Zhiqi Qin, Hao Gong, Yafeng Li . Using Bayesian network model with MMHC algorithm to detect risk factors for stroke. Mathematical Biosciences and Engineering, 2022, 19(12): 13660-13674. doi: 10.3934/mbe.2022637 |
[9] | Anastasia-Maria Leventi-Peetz, Kai Weber . Probabilistic machine learning for breast cancer classification. Mathematical Biosciences and Engineering, 2023, 20(1): 624-655. doi: 10.3934/mbe.2023029 |
[10] | Tingxi Wen, Hanxiao Wu, Yu Du, Chuanbo Huang . Faster R-CNN with improved anchor box for cell recognition. Mathematical Biosciences and Engineering, 2020, 17(6): 7772-7786. doi: 10.3934/mbe.2020395 |
Obstructive sleep apnea (OSA) is featured by repeated upper airway narrowing or obstruction, causing a substantial reduction or cessation of airflow during sleep [1,2]. These disruptions in breathing lead to intermittent hypercapnia and hypoxemia, causing metabolic dysregulation, endothelial dysfunction, and systemic inflammatory responses [2,3]. These pathophysiologic changes have contributed to various major comorbidities [2,3,4,5]. OSA is a global health concern and economic burden, with estimates of the incidence of moderate or severe OSA reaching 6%-17% in the general adult population and up to 49% in the elderly population [4,6], with most of the cases being undiagnosed [7].
No single physical examination finding can be used to diagnose or exclusively account for OSA. The gold standard diagnostic test for OSA relies on in-lab all-night polysomnography (PSG) [7,8]. However, this method is costly, and patients require continuous sleep monitoring by medical staff [2]; thus, limited availability of PSG may cause delays in diagnosing OSA and an increased disease burden [7,9].
Several body measurements predispose patients to OSA. Body weight, body mass index (BMI), central body fat distribution, and large neck and waist circumferences are possible risk factors [2,6,10]. In addition, increased age and males are also at risk for OSA [2,6]. Despite the epidemiological reports demonstrating that age, sex, and BMI are associated with OSA severity, an integrated predicting model that uses only these three factors to assess OSA risk has not yet been developed.
Machine learning approaches have expanded the ability to design efficient predictive and diagnostic tools and have gained popularity in many areas of medical research and clinical applications [11,12]. Logistic regression (LR) is one of the computational methods for probing multivariate data to detect the independent effects of a variable after adjusting for the contributions of the others [13]. An artificial neural network (ANN) is a supervised learning algorithm with advantages in analyzing nonlinear data, enhancing data interpretation, and designing effective predictive and diagnostic models to assist different clinical tasks [11,14].
The present study investigated the clinical predictive validity of age, sex, and BMI for sleep apnea and applied LR and ANN algorithms to develop a risk-predicting model using these three parameters as predictors.
This retrospective observational study used clinical data from the medical records of participants who had undergone PSG at the Sleep Center of Shuang Ho Hospital, Taipei Medical University, Taiwan, between January 2015 and January 2020.
The inclusion criteria for this study were: (1) participants aged≥18 years and (2) who had undergone a full-night PSG. Participants were excluded if (1) they had received any OSA treatments or been admitted to the hospital for any invasive surgery for OSA, (2) they had undergone PSG for continuous positive airway pressure (CPAP) titration, (3) the total recording time was < 4 h [8], and (4) the total sleep time was < 3 h [15].
Before the PSG examination, a baseline screening questionnaire was used to assess each participant's basic information, medication, and surgical history. The participants were measured for height, weight, and BMI (kg/m2) at the time of registration.
All the participants underwent a full-night PSG recorded by qualified technicians. All the sleep signals were obtained using an Embla N7000 Recording System (Medcare Flaga, Reykjavik, Iceland) and Sandman Elite sleep diagnostic software (Natus Neurology Inc., Middleton, WI, USA).
Experienced sleep technologists scored all sleep apnea and hypopnea events following the American Academy of Sleep Medicine 2012 updated guideline [8,16]. Each participant's apnea-hypopnea index (AHI) was calculated according to the total number of apnea and hypopnea events divided by the total sleeping time. Hypopnea was defined as a 30-89% reduction in nasal airflow for at least 10s, with arousal or at least a decrease of 3% oxygen saturation. Apnea was defined as a more than 90% reduction in oral airflow for at least 10s with or without arousal or decreased saturation.
The severity of OSA was classified as none (AHI < 5), mild (AHI = 5-15), moderate (AHI = 15-30), and severe (AHI ≥ 30) [2,8]. To develop the predictive models for the risk of moderate-to-severe OSA, we divided the participants into none-and-mild (AHI < 15) and moderate-to-severe (AHI ≥ 15) OSA groups for further analysis.
Taipei Medical University-Joint Institutional Review Board (TMU-JIRB: N201911007) approved the protocol of this study. Waivers of informed consent were approved by the TMU-JIRB (TMU-JIRB: N201911007) for this retrospective study involving the secondary analysis of existing data.
All analyses were performed using STATISTICA software (TIBCO Software, Tulsa, OK, USA). Continuous data are presented as mean and standard deviation, whereas categorical variables are expressed as frequencies and percentages. We used Fisher's exact test to determine associations between two categorical variables and Pearson's chi-squared (X2) test to determine the statistical significance between the means of two continuous variables in the participants with none-to-mild and moderate-to-severe OSA. The one-way analysis of variance (ANOVA) was used to determine the statistical significance between the means of ≥ 3 independent groups. A two-tailed p-value less than 0.05 was considered statistically significant.
All the LR and ANN models were established and modeled using STATISTICA software. In the models, age and BMI were set as continuous variables, and sex was designated as a categorical variable. For our ANNs, the employed architecture was feed-forward, combined with a back-propagation algorithm with multilayer perceptron (MLP). A bias term was used for each neural layer in the MLP. We set the number of neurons in the hidden layer from 1 to 50. We initially partitioned the eligible PSG data into a training-testing set and an independent validation set, containing ~80% and ~20% of the data, respectively. The data from the training-testing set were used to determine the generalizability of the LR and ANN analyses using five-fold cross-validation (Figure 1A). All participants were stratified by age, sex, BMI, and OSA severity to maintain consistent proportions of demographic data across the training-testing and validation sets. We reported the mean AUCs for the training and testing sets of the LR and ANN models through five-fold cross-validation. The model with the best AUC in the five testing sets was identified to have the best predictive performance and used as the final model. The final models were then applied to the independent validation set, and we calculated the accuracy, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and AUCs of the validation set derived from the confusion matrix [17] (Figure 1A). The classification threshold was set to 0.5 for all models.
During the study period, 11, 100 participants underwent PSG in the Sleep Center. Of these, 1, 665 participants were excluded from the baseline analyses for the following reasons: they had been admitted to the hospital for invasive surgery or CPAP titration (n = 247), the total recording time < 4h (n = 1, 386), the total sleep time < 3h (n = 32), missing BMI information (n = 7) or missing AHI information (n = 6) (Figure 1A). Overall, 9, 422 participants (2918 women and 6504 men, mean age: 49.2 ± 14.0 years) who had undergone PSG were enrolled in this study. Of these, 5, 920 (62.8%) had an AHI of ≥15, indicating moderate-to-severe OSA, and 3, 502 (37.2%) had an AHI < 15, indicating none-to-mild OSA.
A higher proportion of men have moderate-to-severe OSA than women (Table 1). Compared with participants with none-to-mild OSA, participants with moderate-to-severe OSA were older and had a higher BMI. In addition, participants with moderate-to-severe OSA had a shorter total sleep time, lower sleep efficiency, shorter rapid eye movement duration, and lower oxygen saturation during the PSG recording than those with none-to-mild OSA (Table 1).
Variables | Moderate-to-severe OSA (AHI ≥ 15) |
None-to-mild OSA (AHI < 15) |
p-value |
n | 5, 920 | 3, 502 | |
Age | 50.8 ± 13.5 | 46.6 ± 14.4 | < 0.0001 |
Female, n (%) | 1266 (21.4) | 1652 (47.2) | < 0.0001 |
Height (cm) | 167.2 ± 8.5 | 164.4 ± 8.9 | < 0.0001 |
Weight (kg) | 79.7±16.1 | 66.1 ± 13.1 | < 0.0001 |
BMI (kg/m2) | 28.4 ± 4.8 | 24.3 ± 3.8 | < 0.0001 |
Polysomnography | |||
Total recording time (min) | 366.6±11.9 | 367.8±13.4 | < 0.0001 |
Total sleep time (min) | 275.8±60.2 | 281.4±61.1 | < 0.0001 |
Sleep efficiency (%) | 75.2±16.4 | 76.5±16.5 | 0.0002 |
Non-REM sleep (min) | 240.6±51.7 | 242.2±51.3 | 0.133 |
REM sleep (min) | 35.2±22.5 | 39.1±23.7 | < 0.0001 |
AHI | 44.7±23.8 | 6.4±4.5 | < 0.0001 |
SO2 mean (%) | 94.5±2.6 | 96.6±1.3 | < 0.0001 |
Data are expressed as mean ± standard deviation. The p-value represents the comparison between the AHI ≥ 15 and AHI < 15 groups. |
A multivariate analysis was applied to adjust for age, sex, and BMI to determine the independent predictors of moderate-to-severe OSA. Table 2 presents the variables' crude and adjusted odds ratio (OR) values in the LR model. After adjustment for the associated variables, the regression analysis revealed that the female sex was associated with a 0.28-fold reduced risk of moderate-to-severe OSA (95% CI: 0.25-0.32; p < 0.0001), and every unit increase in age or BMI was associated with a 1.04-fold (95%CI: 1.04-1.05; p < 0.0001) or 1.27-fold (95%CI: 1.26-1.29; p < 0.0001) increase in the likelihood of moderate-to-severe OSA, respectively.
Unadjusted Model | Adjusted Model | |||
Variables | OR | 95% CI | OR | 95% CI |
Age | 1.02*† | 1.02-1.03 | 1.04*† | 1.04-1.05 |
Female | 0.30* | 0.28-0.33 | 0.28* | 0.25-0.32 |
BMI | 1.27*† | 1.25-1.28 | 1.27*† | 1.26-1.29 |
The odds ratio (OR) and 95% confidence intervals (CI) represent the association between the variables and the presence of moderate-to-severe OSA (AHI ≥ 15). *p < 0.0001; †Odds ratio of per unit changes. |
As mentioned, age, sex, and BMI were used as input parameters for developing the LR and ANN models for predicting moderate-to-severe OSA. As shown in Figure 1A, the 7328 participants in the training-testing group were randomly partitioned into training and testing sets for five-fold cross-validation. Table 3 showed that age, sex, BMI, and OSA severity distribution remained identical across the five testing sets and the independent validation sets.
Variable | Testing set 1 | Testing set 2 | Testing set 3 | Testing set 4 | Testing set 5 | Independent validation set | p-value |
n | 1466 | 1465 | 1466 | 1465 | 1466 | 2094 | |
AHI≥15, n (%) | 920 (62.8) | 919 (62.7) | 920 (62.8) | 919 (62.7) | 920 (62.8) | 1322 (63.1) | 1.000 |
Age | 49.0 ± 0.4 | 49.0 ± 0.4 | 49.9 ± 0.4 | 49.1 ± 0.4 | 49.0 ± 0.4 | 49.3 ± 0.3 | 0.503 |
Female, n (%) | 483 (33.0) | 438 (29.9) | 472 (32.2) | 445 (30.4) | 431 (29.4) | 649 (31.0) | 0.266 |
BMI (kg/m2) | 26.9 ± 4.9 | 26.9 ± 4.9 | 26.9 ± 5.0 | 26.9 ± 5.0 | 26.8 ± 4.8 | 26.9 ± 4.9 | 0.973 |
The p-value represents the multiple comparisons across the five testing and the independent validation sets. |
Following adequate training, the five testing sets that achieved the best prediction performance after five-fold cross-validation were ANN models with 3, 14, 25, 27, and 29 hidden neurons, respectively. The mean AUCs of the training sets were 0.808 ± 0.004 (Figure 2A) and 0.808 ± 0.015 for the testing sets (Figure 2B). Among the generated ANN models, the model containing 25 hidden neurons obtained the highest testing AUC and was selected as the final model. After being applied to the independent validation set, the ANN-based model achieved the accuracy, sensitivity, specificity, PPV, and NPV of 76.4%, 87.7%, 56.9%, 77.7%, 73.0%, with the AUC of 0.807, respectively. Results of the LR model revealed a mean AUC of 0.802 ± 0.004 for the training sets (Figure 2C) and 0.801 ± 0.015 for the testing sets (Figure 2D). After being validated to the independent validation set, the accuracy, sensitivity, specificity, PPV, and NPV were 76.3%, 87.5%, 57.0%, 77.7%, and 72. 7% with an AUC of 0.806 for the LR-based model. Figure 3 shows the ROC curve and AUC of the final ANN and LR models applied to the independent validation set.
The present study validated age, sex, and BMI as parameters for use in LR- and ANN models to estimate the risk of OSA in participants who had undergone PSG. The established LR- and ANN-based model integrating the three parameters achieved a notable prediction performance with an AUC of 0.806 and 0.807 for predicting moderate-to-severe OSA in the validation set. The relatively high AUC values indicated that the models could distinguish participants between groups of interest, namely, participants with none-to-mild and those with moderate-to-severe OSA. Patients with OSA are prone to various complications and require interventions [2,7]; our models may contribute to further understanding these three variables' fundamental role in assessing OSA risk.
OSA has been recognized as a global health and economic burden, and early identification and diagnosis of OSA is a crucial issue in preventive medicine [4,6,7]. Since standard diagnostic tests for OSA rely on expensive in-laboratory PSG, which can be challenging to schedule in high-demand situations, identifying crucial factors and developing a simple and reliable tool to estimate the risk of OSA is clinically relevant [7,9].
Anatomical and physiological differences in the oropharynx, larynx and upper respiratory tract, changes in respiratory physiology associated with different sleep structures, and differences in the distribution of adipose tissue have been suggested as reasons for sex differences in developing OSA [18]. Deterioration of the collapsibility of the pharyngeal airway and the sensitivity of the respiratory control system may contribute to the pathogenesis of OSA with aging [2,18]. Hormonal changes in menopausal women have been linked to an increased risk of OSA and a decreased sex difference in OSA with age [19,20]. The prevalence of OSA has been reported to increase 1.4 to 3.2-fold with each 10-year increment in age, and men had a 1.7 to 3.0-fold higher odds of developing OSA than women [2,6]. These findings support the rationale for using age and sex as parameters in a generic prediction model to predict OSA risk.
Obesity is a well-recognized risk factor for developing OSA, and a higher BMI is associated with greater severity of OSA for both sexes [19,21]. As already mentioned, anthropometric profiles, such as body weight, BMI, central body fat distribution, and neck and waist circumference, are predisposed to the risk of OSA [2,6,10]. In patients with OSA, occlusion of the nasal and oro-pharynx may occur during sleep when the tongue and epiglottis move backward and lean together. Individuals with obesity have excess soft tissue around the collapsible pharynx, making the narrow airway more prone to collapse [2,3,21]. In line with the findings of other studies, we observed that the participants with moderate-to-severe OSA had a higher BMI. Although other anthropometric profiles have been associated with OSA, neck and waist circumference were considered surrogates for BMI [4,21,22]. The study by Huxley et al. has also suggested that "measures of central obesity were superior to BMI as discriminators of cardiovascular risk factors, although the differences were small and unlikely to be of clinical relevance" [23]. Furthermore, the same study demonstrated that "combining BMI with any measure of central obesity did not improve the discriminatory capability of the individual measures" [23]. In this study, we proposed BMI as a simple, well-known, and readily available predictor of OSA for the prediction models.
Despite the epidemiological reports indicating that age, sex, and BMI are associated with OSA, an integrated predicting model using only these three attributes to assess OSA risk has not yet been developed. Advances in machine learning techniques have improved the ability of researchers and clinical applications to analyze massive amounts of data and develop appropriate diagnostic and predictive methods [24,25,26,27]. Several studies have applied machine learning algorithms to predict OSA risk. However, most of these studies have incorporated more complex parameters, such as sleep questionnaires [28,29,30], at-home PSG [15], cephalometric images [31], neck and waist circumference measurements [28,29,32,33], and facial-cervical measurements [28,32], into the predictive models.
In this context, our study used a large hospital-based sample of adults, allowing accurate comparison between none-to-mild and moderate-to-severe cases. The generated LR- and ANN-boosted models exhibited relatively high AUCs with adequate accuracy, sensitivity, PPV, and NPV, revealing the ability of these models to distinguish between the groups of interest, namely individuals with moderate-to-severe and none-to-mild OSA. Furthermore, our models used simple and easy-to-obtain parameters (age, sex, and BMI), which may provide a direction for developing predictive models under conditions of limited and constrained information.
Both LR and ANN are widely used models in biomedicine. This study's LR and ANN models achieved similar predictive validity in predicting OSA in patients undergoing PSG, which may be related to our data structure. ANN has the advantage of analyzing complex, multidimensional and nonlinear variables [34,35,36,37]. When the predictor-outcome relationships in the dataset are relatively simple and linear, LR using linear combinations of variables may achieve performance that approximates that of ANN [36,38]. This consideration was contextualized in the conclusion by Dreiseitl and Ohno-Machado [38] that "there is no single algorithm that performs better than all other algorithms on any given data set and application area." and also summarized by Sargent [34] that "both methods should continue to be used and explored in a complementary manner."
The present study has several limitations. First, in our cohort, the incidence of moderate-to-severe OSA was 62.8%, which is higher than the reported prevalence of OSA in the general population [4,19,20,39]. In clinical practice, patients referred for a PSG exam are usually expected to have OSA or another sleep disorder; therefore, the participant population in this study cannot be representative of the general population with a very low prevalence of OSA. Second, the data set used in this study was limited to a single center and included only Asian ethnic groups. Therefore, the trained model may not be generalizable beyond the institution or other ethnic groups [28,29]. Third, this retrospective observational study did not exclude undiagnosed medical conditions associated with OSA, such as neurological, cardiovascular, and pulmonary disorders, which may affect the model's predictive performance. Our model did not contain detailed anthropometric imaging or measurements, which may limit the model's ability to distinguish the disease-specific causes of OSA. Finally, the relatively low specificity for the models (57.0% and 56.9%) suggests a higher rate of false positives, meaning that a healthy person is more likely to receive an incorrect diagnosis of OSA using the proposed tool. As Jonas et al. [30] concluded, "There is uncertainty about the accuracy or clinical utility of all potential screening tools." a PSG to verify OSA severity and confirm the diagnosis is warranted for patients with clinically suspected sleep apnea.
Our study validated the diagnostic relevance of age, sex, and BMI for OSA. The generated LR- and ANN-based model effectively predicted OSA risk using these three simple parameters. Our results, therefore, may improve insights into OSA-related essential risk factors and provide knowledge to facilitate individualized risk stratification and formulate diagnostic decisions for sleep apnea.
This work was funded by the National Science and Technology Council (NSTC 111-2314-B-038-132-MY3) to CC Chung.
The authors declare there is no conflict of interest.
[1] | Adams RLP (1996) Principles of Medical Biology, vol. 5, JAI Press Inc., New York, 33-66. |
[2] |
Bird A (2002) DNA methylation patterns and epigenetic memory. Genes Dev 16: 6-21. doi: 10.1101/gad.947102
![]() |
[3] | Ehrlich M, Gama-Sosa MA, Huang LH, et al. (1982) Amount and distribution of 5-methylcytosine in human DNA from different types of tissues of cells. Nucleic Acids Res 10: 2709-2721. |
[4] |
Lister R, Pelizzola M, Dowen RH, et al. (2009) Human DNA methylomes at base resolution show widespread epigenomic differences. Nature 462: 315-322. doi: 10.1038/nature08514
![]() |
[5] |
Jones PA (2012) Functions of DNA methylation: islands, start sites, gene bodies and beyond. Nature Rev Genet 13: 484-492. doi: 10.1038/nrg3230
![]() |
[6] |
Walsh CP, Bestor TH (1999) Cytosine methylation and mammalian development. Genes Dev 13: 26-34. doi: 10.1101/gad.13.1.26
![]() |
[7] | Okano M, Bell DW, Haber DA, et al. 1999. DNA Methyltransferases Dnmt3a and Dnmt3b are essential for de novo methylation and mammalian development. Cell 99: 247-257. |
[8] |
Feng S, Jacobsen SE, Reik W (2010) Epigenetic reprogramming in plant and animal development. Science 330: 622-627. doi: 10.1126/science.1190614
![]() |
[9] | Liu K, Wang YF, Cantemir C, et al. (2003) Endogenous assays of DNA methyltransferases: evidence for differential activities of DNMT1, DNMT2, and DNMT3 in mammalian cells in vivo. Mol Cell Biol 23: 2709-2719. |
[10] |
Chen Z, Riggs AD (2011) DNA methylation and demethylation in mammals. J Biol Chem 286: 18347-18353. doi: 10.1074/jbc.R110.205286
![]() |
[11] |
Mathieu O, Reinders J, Caikovski M, et al. (2007) Transgenerational stability of the Arabidopsis epigenome is coordinated by CG methylation. Cell 130: 851-862. doi: 10.1016/j.cell.2007.07.007
![]() |
[12] | Johannes F, Porcher E, Teixeira FK, et al. (2009). Assessing the impact of transgenerational epigenetic variation on complex traits. PLoS Genet 5: e1000530. |
[13] |
Reinders J, Paszkowski J (2009) Unlocking the Arabidopsis epigenome. Epigenetics 4: 557-563. doi: 10.4161/epi.4.8.10347
![]() |
[14] |
Rando OJ, Verstrepen KJ (2007) Timescales of genetic and epigenetic inheritance. Cell 128: 655-668. doi: 10.1016/j.cell.2007.01.023
![]() |
[15] |
Morgan HD, Sutherland HG, Martin DI, et al. (1999) Epigenetic inheritance at the agouti locus in the mouse. Nature Genet 23: 314-318. doi: 10.1038/15490
![]() |
[16] |
Richards EJ (2006) Inherited epigenetic variation-revisiting soft inheritance. Nat Rev Genet 7: 395-401. doi: 10.1038/nrg1834
![]() |
[17] |
Glastad KM, Hunt BG, Yi SV, et al. (2011) DNA methylation in insects: on the brink of the epigenomic era. Insect Mol Biol 20: 553-565. doi: 10.1111/j.1365-2583.2011.01092.x
![]() |
[18] | Lyko F, Maleszka R (2011) Insects as innovative models for functional studies of DNA methylation. Trends Genet 27: 127-131. |
[19] | Esteller M (2004) DNA methylation: approaches and applications. CRC Press, Boca Raton, FL. |
[20] | Lockett GA, Kucharski R, Maleszka R (2012) DNA methylation changes elicited by social stimuli in the brains of worker honey bees. Genes Brain Behav 11: 235-242. |
[21] | Ikeda T, Furukawa S, Nakamura J, et al. (2011) CpG methylation in the hexamerin 110 gene in the European honeybee, Apis mellifera. J Insect Sci 11: 74. |
[22] |
Mandrioli M (2004) Epigenetic tinkering and evolution: is there any continuity in the functional role of cytosine methylation from invertebrates to vertebrates? Cell Mol Life Sci 61: 2425-2427. doi: 10.1007/s00018-004-4184-y
![]() |
[23] |
Field LM, Lyko F, Mandrioli M, et al. (2004) DNA methylation in insects. Insect Mol Biol 13: 109-115. doi: 10.1111/j.0962-1075.2004.00470.x
![]() |
[24] |
Luco RF, Allo M, Schor IE, et al. (2011) Epigenetics in alternative pre-mRNA splicing. Cell 144:16-26. doi: 10.1016/j.cell.2010.11.056
![]() |
[25] |
Simmen MW, Leitgeb S, Charlton J, et al. (1999) Non-methylated transposable elements and methylates genes in a chordate genome. Science 283: 1164-1167. doi: 10.1126/science.283.5405.1164
![]() |
[26] | Loxdale HD (2009) What’s in a clone: the rapid evolution of aphid asexual lineages in relation to geography, host plant adaptation and resistance to pesticides. In: Schon I, Martens K van Dijk P eds, Lost sex: The Evolutionary Biology of Parthenogenesis. Springer, Heidelberg, Germany, pp. 535-557. |
[27] | Suomalainen E, Saura A, Lokki J (1987) Cytology and evolution in parthenogenesis. CRC Press, Boca Raton. |
[28] | Dixon AFG (1987) Parthenogenetic reproduction and the rate of increase in aphids. In: A. Minks and P. Harrewijn (ed), Aphids, their Biology, Natural Enemies and Control. vol. A, Elsevier, The Netherlands, 269-287. |
[29] |
Janzen DH (1977) What are dandelions and aphids? Am Nat 111: 586-589. doi: 10.1086/283186
![]() |
[30] | Loxdale HD (2008a) Was Dan Janzen (1977) right about aphid clones being a ‘super-organism’, i.e. a single ‘evolutionary individual’? New insights from the use of molecular marker systems. Mitt DGaaE 16: 437-449 |
[31] | Loxdale HD (2008b) The nature and reality of the aphid clone: genetic variation, adaptation and evolution. Agr Forest Entomol 10: 81-90. |
[32] |
Pasquier C, Clément M, Dombrovsky A, et al. (2014) Environmentally selected aphid variants in clonality context display differential patterns of methylation in the genome. PLoS One 9: e115022. doi: 10.1371/journal.pone.0115022
![]() |
[33] | Jenkins RL (1991) Colour and symbionts of aphids. PhD Thesis, University of East Anglia, UK. |
[34] | Terradot L, Simon JC, Leterne N, et al. (1999) Molecular characterization of clones of the Myzus persicae complex differing in their ability to transmit the potato leafroll lutovirus (PLRV). Bull Entomol Res 89: 355-363. |
[35] |
Losey JE, Ives AR, Harmon J, et al. (1997) A polymorphism maintained by opposite patterns of parasitism and predation. Nature 388: 269-272. doi: 10.1038/40849
![]() |
[36] | Devonshire AL, Field LM, Foster SP, et al. (1999) The evolution of insecticide resistance in the peach-potato aphid, Myzus persicae. In: Denholm, I., Pickett, J.A. & Devonshire, A.L. (Eds), Insecticide resistance: from mechanisms to management. Wallingford, Oxon, CABI Publishing. |
[37] |
Brisson JA (2010) Aphid wing dimorphisms: linking environmental and genetic control of trait variation. Philos Trans R Soc Lond B Biol Sci 365: 605-616. doi: 10.1098/rstb.2009.0255
![]() |
[38] |
Le Trionnaire G, Hardie J, Jaubert-Possamai S, et al. (2008) Shifting from clonal to sexual reproduction in aphids: physiological and developmental aspects. Biol Cell 100: 441-451. doi: 10.1042/BC20070135
![]() |
[39] | Davis GK (2012) Cyclical parthenogenesis and viviparity in aphids as evolutionary novelties. J Exp Zool B Mol Dev Evol 318: 448-459. |
[40] | Aoki S (1977) Colophina clematis (Homoptera, Pemphigidae), an aphid species with soldiers. Kontyû 45: 276-282. |
[41] | Miyazaki M (1987) Forms and morphs of aphids. In: A. K. Minks and P. Harrewijin (eds), Aphids, Their Biology, Natural Enemies, and Control. Amsterdam: Elsevier), 27-50. |
[42] |
Fukatsu T (2010) A fungal past to insect color. Science 328: 574-575. doi: 10.1126/science.1190417
![]() |
[43] |
Tsuchida T, Koga R, Horikawa M, et al. (2010) Symbiotic bacterium modifies aphid body color. Science 330: 1102-1104. doi: 10.1126/science.1195463
![]() |
[44] |
Braendle C, Davis GK, Brisson JA, et al. (2006) Wing dimorphism in aphids. Heredity 97: 192-199. doi: 10.1038/sj.hdy.6800863
![]() |
[45] |
Stern DL, Foster WA (1996) The evolution of soldiers in aphids. Biol Rev Camb Philos Soc 71: 27-79. doi: 10.1111/j.1469-185X.1996.tb00741.x
![]() |
[46] |
Shibao H, Kutsukake M, Matsuyama S, et al. (2010) Mechanisms regulating caste differentiation in an aphid social system. Commun Integr Biol 3: 1-5. doi: 10.4161/cib.3.1.9694
![]() |
[47] |
Hattori M, Kishida O, Itino T (2013) Soldiers with large weapons in predator-abundant midsummer: reproductive plasticity in a eusocial aphid. Evol Ecol 27: 847-862. doi: 10.1007/s10682-012-9628-5
![]() |
[48] |
Moran NA, Jarvik T (2010) Lateral transfer of genes from fungi underlies carotenoid production in aphids. Science 328: 624-627. doi: 10.1126/science.1187113
![]() |
[49] | Valmalette JC, Dombrovsky A, Brat P, et al. (2012) Light- induced electron transfer and ATP synthesis in a carotene synthesizing insect. Scientific report 2: 579. |
[50] |
Dombrovsky A, Arthaud L, Ledger TN, et al. (2009) Profiling the repertoire of phenotypes influenced by environmental cues that occur during asexual reproduction. Genome Res 19: 2052-2063. doi: 10.1101/gr.091611.109
![]() |
[51] |
Hick CA, Field LM, Devonshire AL (1996) Changes in the methylation of amplified esterase DNA during loss and reselection of insecticide resistance in peach-potato aphids, Myzus persicae. Insect Biochem Mol Biol 26: 41-47. doi: 10.1016/0965-1748(95)00059-3
![]() |
[52] |
Field LM, Blackman RL, Tyler-Smith C, et al. (1999) Relationship between amount of esterase and gene copy number in insecticide-resistant Myzus persicae (Sulzer). Biochem J 339: 737-742. doi: 10.1042/bj3390737
![]() |
[53] |
Field LM (2000) Methylation and expression of amplified esterase genes in the aphid Myzus persicae (Sulzer). Biochem J 349: 863-868. doi: 10.1042/bj3490863
![]() |
[54] |
Walsh TK, Brisson JA, Robertson HM, et al. (2010) A functional DNA methylation system in the pea aphid, Acyrthosiphon pisum. Insect Mol Biol 19: 215-228. doi: 10.1111/j.1365-2583.2009.00974.x
![]() |
[55] |
Daxinger L, Whitelaw E (2010) Transgenerational epigenetic inheritance: more questions than answers. Genome Res 20: 1623-1628. doi: 10.1101/gr.106138.110
![]() |
[56] | Srinivasan DG, Brisson JA (2012) Aphids: a model for polyphenism and epigenetics. Genet Res Int 2012: 431531 |
[57] | Maynard Smith J, Szathmary E (1995) The major transitions in evolution. Oxford University Press. |
[58] |
Grosberg RK, Strathmann RR (2007) The evolution of multicellularity: a minor major transition? Annu Rev Ecol Evol Syst 38: 621-654. doi: 10.1146/annurev.ecolsys.36.102403.114735
![]() |
[59] |
Branda SS, Gonzalez-Pastor JE, Ben-Yehuda S, et al. (2001) Fruiting body formation by Bacillus subtilis. Proc Natl Acad Sci U S A 98:11621-11626. doi: 10.1073/pnas.191384198
![]() |
[60] |
Lurling M, Van Donk E (2000) Grazer-induced colony formation in Scenedesmus: are there costs to being colonial? Oikos 88: 111-118. doi: 10.1034/j.1600-0706.2000.880113.x
![]() |
[61] |
Kaiser D (2001) Building a multicellular organism. Annu Rev Genet 35:103-123. doi: 10.1146/annurev.genet.35.102401.090145
![]() |
[62] |
Ausmees N, Jacobs-Wagner C (2003) Spatial and temporal control of differentiation and cell cycle progression in Caulobacter crescentus. Annu Rev Microbiol 57: 225-247. doi: 10.1146/annurev.micro.57.030502.091006
![]() |
[63] |
Patalano S, Hore TA, Reik W, et al. (2012) Shifting behaviour: epigenetic reprogramming in eusocial insects. Curr Opin Cell Biol 24: 367-373 doi: 10.1016/j.ceb.2012.02.005
![]() |
[64] | Foret S, Kuxcharski R, Pellegrini M, et al. (2012) DNA methylation dynamics, metabolic fluxes, gene splicing, and alternative phenotypes in honey bees Proc Natl Acad Sci U S A 109: 4968-4973. |
[65] |
Wang Y, Jorda M, Jones PL, et al. (2006) Functional CpG methylation system in a social insect. Science 314: 645-647. doi: 10.1126/science.1135213
![]() |
[66] |
Wurm Y, Wang J, Riba-Grognuz O, et al. (2011) The genome of the fire ant Solenopsis invicta. Proc Natl Acad Sci U S A 108: 5679-5684. doi: 10.1073/pnas.1009690108
![]() |
[67] |
Suen G, Teiling C, Li L, et al. (2011) The genome sequence of the leaf-cutter ant Atta cephalotes reveals insights into its obligate symbiotic lifestyle. PLoS Genet 7: e1002007. doi: 10.1371/journal.pgen.1002007
![]() |
[68] |
Smith CR, Smith CD, Robertson HM, et al. (2011) Draft genome of the red harvester ant Pogonomyrmex barbatus. Proc Natl Acad Sci U S A 108: 5667-5672. doi: 10.1073/pnas.1007901108
![]() |
[69] |
Lyko F, Foret S, Kucharski R, et al. (2010) The honey bee epigenomes: differential methylation of brain DNA in queens and workers. PLoS Biol 8: e1000506. doi: 10.1371/journal.pbio.1000506
![]() |
[70] | Kucharski R, Maleszka J, Foret S, et al. (2008) Nutritional control of reproductive status in honeybees via DNA methylation. Science 319: 1827-1830. |
[71] |
Shi YY, Huang ZY, Zeng ZJ, et al. (2011) Diet and cell size both affect queen-worker differentiation through DNA methylation in honey bees (Apis mellifera, Apidae). PLoS One 6: e18808. doi: 10.1371/journal.pone.0018808
![]() |
[72] |
Weaver N (1966) Physiology of caste determination. Annu Rev Entomol 11: 79-102. doi: 10.1146/annurev.en.11.010166.000455
![]() |
[73] |
Strassmann JE (1983) Gerontocracy in the social wasp, Polistes exclamans. Anim Behav 31: 431-438. doi: 10.1016/S0003-3472(83)80063-3
![]() |
[74] |
Pigliucci M, Murren CJ, Schlichting CD (2006) Phenotypic plasticity and evolution by genetic assimilation. J Exp Biol 209: 2362-2367. doi: 10.1242/jeb.02070
![]() |
[75] | West-Eberhard MJ (2003) Developmental plasticity and evolution. New York: Oxford University Press. |
[76] |
Wittkopp PJ, Kalay G (2012) Cis-regulatory elements: molecular mechanisms and evolutionary processes underlying divergence. Nature Rev Genet 13: 59-69. doi: 10.1038/nri3362
![]() |
Variables | Moderate-to-severe OSA (AHI ≥ 15) |
None-to-mild OSA (AHI < 15) |
p-value |
n | 5, 920 | 3, 502 | |
Age | 50.8 ± 13.5 | 46.6 ± 14.4 | < 0.0001 |
Female, n (%) | 1266 (21.4) | 1652 (47.2) | < 0.0001 |
Height (cm) | 167.2 ± 8.5 | 164.4 ± 8.9 | < 0.0001 |
Weight (kg) | 79.7±16.1 | 66.1 ± 13.1 | < 0.0001 |
BMI (kg/m2) | 28.4 ± 4.8 | 24.3 ± 3.8 | < 0.0001 |
Polysomnography | |||
Total recording time (min) | 366.6±11.9 | 367.8±13.4 | < 0.0001 |
Total sleep time (min) | 275.8±60.2 | 281.4±61.1 | < 0.0001 |
Sleep efficiency (%) | 75.2±16.4 | 76.5±16.5 | 0.0002 |
Non-REM sleep (min) | 240.6±51.7 | 242.2±51.3 | 0.133 |
REM sleep (min) | 35.2±22.5 | 39.1±23.7 | < 0.0001 |
AHI | 44.7±23.8 | 6.4±4.5 | < 0.0001 |
SO2 mean (%) | 94.5±2.6 | 96.6±1.3 | < 0.0001 |
Data are expressed as mean ± standard deviation. The p-value represents the comparison between the AHI ≥ 15 and AHI < 15 groups. |
Unadjusted Model | Adjusted Model | |||
Variables | OR | 95% CI | OR | 95% CI |
Age | 1.02*† | 1.02-1.03 | 1.04*† | 1.04-1.05 |
Female | 0.30* | 0.28-0.33 | 0.28* | 0.25-0.32 |
BMI | 1.27*† | 1.25-1.28 | 1.27*† | 1.26-1.29 |
The odds ratio (OR) and 95% confidence intervals (CI) represent the association between the variables and the presence of moderate-to-severe OSA (AHI ≥ 15). *p < 0.0001; †Odds ratio of per unit changes. |
Variable | Testing set 1 | Testing set 2 | Testing set 3 | Testing set 4 | Testing set 5 | Independent validation set | p-value |
n | 1466 | 1465 | 1466 | 1465 | 1466 | 2094 | |
AHI≥15, n (%) | 920 (62.8) | 919 (62.7) | 920 (62.8) | 919 (62.7) | 920 (62.8) | 1322 (63.1) | 1.000 |
Age | 49.0 ± 0.4 | 49.0 ± 0.4 | 49.9 ± 0.4 | 49.1 ± 0.4 | 49.0 ± 0.4 | 49.3 ± 0.3 | 0.503 |
Female, n (%) | 483 (33.0) | 438 (29.9) | 472 (32.2) | 445 (30.4) | 431 (29.4) | 649 (31.0) | 0.266 |
BMI (kg/m2) | 26.9 ± 4.9 | 26.9 ± 4.9 | 26.9 ± 5.0 | 26.9 ± 5.0 | 26.8 ± 4.8 | 26.9 ± 4.9 | 0.973 |
The p-value represents the multiple comparisons across the five testing and the independent validation sets. |
Variables | Moderate-to-severe OSA (AHI ≥ 15) |
None-to-mild OSA (AHI < 15) |
p-value |
n | 5, 920 | 3, 502 | |
Age | 50.8 ± 13.5 | 46.6 ± 14.4 | < 0.0001 |
Female, n (%) | 1266 (21.4) | 1652 (47.2) | < 0.0001 |
Height (cm) | 167.2 ± 8.5 | 164.4 ± 8.9 | < 0.0001 |
Weight (kg) | 79.7±16.1 | 66.1 ± 13.1 | < 0.0001 |
BMI (kg/m2) | 28.4 ± 4.8 | 24.3 ± 3.8 | < 0.0001 |
Polysomnography | |||
Total recording time (min) | 366.6±11.9 | 367.8±13.4 | < 0.0001 |
Total sleep time (min) | 275.8±60.2 | 281.4±61.1 | < 0.0001 |
Sleep efficiency (%) | 75.2±16.4 | 76.5±16.5 | 0.0002 |
Non-REM sleep (min) | 240.6±51.7 | 242.2±51.3 | 0.133 |
REM sleep (min) | 35.2±22.5 | 39.1±23.7 | < 0.0001 |
AHI | 44.7±23.8 | 6.4±4.5 | < 0.0001 |
SO2 mean (%) | 94.5±2.6 | 96.6±1.3 | < 0.0001 |
Data are expressed as mean ± standard deviation. The p-value represents the comparison between the AHI ≥ 15 and AHI < 15 groups. |
Unadjusted Model | Adjusted Model | |||
Variables | OR | 95% CI | OR | 95% CI |
Age | 1.02*† | 1.02-1.03 | 1.04*† | 1.04-1.05 |
Female | 0.30* | 0.28-0.33 | 0.28* | 0.25-0.32 |
BMI | 1.27*† | 1.25-1.28 | 1.27*† | 1.26-1.29 |
The odds ratio (OR) and 95% confidence intervals (CI) represent the association between the variables and the presence of moderate-to-severe OSA (AHI ≥ 15). *p < 0.0001; †Odds ratio of per unit changes. |
Variable | Testing set 1 | Testing set 2 | Testing set 3 | Testing set 4 | Testing set 5 | Independent validation set | p-value |
n | 1466 | 1465 | 1466 | 1465 | 1466 | 2094 | |
AHI≥15, n (%) | 920 (62.8) | 919 (62.7) | 920 (62.8) | 919 (62.7) | 920 (62.8) | 1322 (63.1) | 1.000 |
Age | 49.0 ± 0.4 | 49.0 ± 0.4 | 49.9 ± 0.4 | 49.1 ± 0.4 | 49.0 ± 0.4 | 49.3 ± 0.3 | 0.503 |
Female, n (%) | 483 (33.0) | 438 (29.9) | 472 (32.2) | 445 (30.4) | 431 (29.4) | 649 (31.0) | 0.266 |
BMI (kg/m2) | 26.9 ± 4.9 | 26.9 ± 4.9 | 26.9 ± 5.0 | 26.9 ± 5.0 | 26.8 ± 4.8 | 26.9 ± 4.9 | 0.973 |
The p-value represents the multiple comparisons across the five testing and the independent validation sets. |