Identifying disease-associated microRNAs (miRNAs) is essential for understanding the etiology and pathogenesis of many diseases. Current computational approaches, however, face several challenges: they lack "negative samples", that is, miRNA-disease pairs confirmed to be unrelated, and they perform poorly when predicting miRNAs related to "isolated diseases", i.e., diseases with no known associated miRNAs. These limitations motivate novel computational methods. In this study, an inductive matrix completion model, referred to as IMC-MDA, was designed to predict miRNA-disease associations. For each miRNA-disease pair, IMC-MDA calculates a predicted score by combining the known miRNA-disease associations with the integrated disease similarities and miRNA similarities. Under leave-one-out cross-validation (LOOCV), IMC-MDA achieved an AUC of 0.8034, which shows better performance than previous methods. Furthermore, experiments have validated the predicted disease-related miRNAs for three major human diseases: colon cancer, kidney cancer, and lung cancer.
Citation: Zejun Li, Yuxiang Zhang, Yuting Bai, Xiaohui Xie, Lijun Zeng. IMC-MDA: Prediction of miRNA-disease association based on induction matrix completion[J]. Mathematical Biosciences and Engineering, 2023, 20(6): 10659-10674. doi: 10.3934/mbe.2023471
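The scoring idea described in the abstract, combining known associations with miRNA and disease similarities, can be illustrated with a generic inductive matrix completion sketch. This is not the authors' implementation: the squared-error objective, the gradient-descent optimizer, the hyperparameters, the function name `imc_scores`, and the toy matrices are all assumptions chosen for illustration. The sketch assumes the predicted score matrix takes the form S_m W Hᵀ S_dᵀ, where S_m and S_d are the miRNA and disease similarity matrices used as side features and W, H are low-rank factors learned from the known association matrix A.

```python
import numpy as np

def imc_scores(A, S_m, S_d, rank=2, lam=0.1, lr=0.01, n_iter=500, seed=0):
    """Generic inductive matrix completion (illustrative, not IMC-MDA itself).

    A   : (n_mirna, n_disease) known associations (1 = known, 0 = unknown)
    S_m : (n_mirna, n_mirna) miRNA similarity matrix, used as row features
    S_d : (n_disease, n_disease) disease similarity matrix, used as column features
    Minimizes 0.5*||S_m W H^T S_d^T - A||_F^2 + 0.5*lam*(||W||_F^2 + ||H||_F^2).
    """
    rng = np.random.default_rng(seed)
    W = 0.1 * rng.standard_normal((S_m.shape[1], rank))
    H = 0.1 * rng.standard_normal((S_d.shape[1], rank))
    losses = []
    for _ in range(n_iter):
        P = S_m @ W                      # projected miRNA features
        Q = S_d @ H                      # projected disease features
        R = P @ Q.T - A                  # residual against known associations
        losses.append(0.5 * np.sum(R ** 2)
                      + 0.5 * lam * (np.sum(W ** 2) + np.sum(H ** 2)))
        grad_W = S_m.T @ R @ Q + lam * W
        grad_H = S_d.T @ R.T @ P + lam * H
        W -= lr * grad_W
        H -= lr * grad_H
    return S_m @ W @ H.T @ S_d.T, losses

# Toy data (hypothetical): 4 miRNAs, 3 diseases.
A = np.array([[1, 0, 0],
              [1, 0, 0],
              [0, 1, 0],
              [0, 1, 1]], dtype=float)
S_m = np.array([[1.0, 0.8, 0.1, 0.1],
                [0.8, 1.0, 0.1, 0.1],
                [0.1, 0.1, 1.0, 0.7],
                [0.1, 0.1, 0.7, 1.0]])
S_d = np.array([[1.0, 0.1, 0.1],
                [0.1, 1.0, 0.6],
                [0.1, 0.6, 1.0]])

scores, losses = imc_scores(A, S_m, S_d)
print(np.round(scores, 2))
```

Because the scores depend on similarity features and not on a per-disease latent vector alone, a disease with an all-zero column in A can still receive non-trivial scores through its similarity to other diseases, which is the general motivation for inductive methods in the "isolated disease" setting.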