To comprehend the etiology and pathogenesis of many illnesses, it is essential to identify disease-associated microRNAs (miRNAs). However, there are a number of challenges with current computational approaches, such as the lack of "negative samples", that is, confirmed irrelevant miRNA-disease pairs, and the poor performance in terms of predicting miRNAs related with "isolated diseases", i.e. illnesses with no known associated miRNAs, which presents the need for novel computational methods. In this study, for the purpose of predicting the connection between disease and miRNA, an inductive matrix completion model was designed, referred to as IMC-MDA. In the model of IMC-MDA, for each miRNA-disease pair, the predicted marks are calculated by combining the known miRNA-disease connection with the integrated disease similarities and miRNA similarities. Based on LOOCV, IMC-MDA had an AUC of 0.8034, which shows better performance than previous methods. Furthermore, experiments have validated the prediction of disease-related miRNAs for three major human diseases: colon cancer, kidney cancer, and lung cancer.
Citation: Zejun Li, Yuxiang Zhang, Yuting Bai, Xiaohui Xie, Lijun Zeng. IMC-MDA: Prediction of miRNA-disease association based on induction matrix completion[J]. Mathematical Biosciences and Engineering, 2023, 20(6): 10659-10674. doi: 10.3934/mbe.2023471
To comprehend the etiology and pathogenesis of many illnesses, it is essential to identify disease-associated microRNAs (miRNAs). However, there are a number of challenges with current computational approaches, such as the lack of "negative samples", that is, confirmed irrelevant miRNA-disease pairs, and the poor performance in terms of predicting miRNAs related with "isolated diseases", i.e. illnesses with no known associated miRNAs, which presents the need for novel computational methods. In this study, for the purpose of predicting the connection between disease and miRNA, an inductive matrix completion model was designed, referred to as IMC-MDA. In the model of IMC-MDA, for each miRNA-disease pair, the predicted marks are calculated by combining the known miRNA-disease connection with the integrated disease similarities and miRNA similarities. Based on LOOCV, IMC-MDA had an AUC of 0.8034, which shows better performance than previous methods. Furthermore, experiments have validated the prediction of disease-related miRNAs for three major human diseases: colon cancer, kidney cancer, and lung cancer.
[1] | G. Meister, T. Tuschl, Mechanisms of gene silencing by double-stranded RNA, emphNature, 431 (2004), 343–349. https://doi.org/10.1038/nature02873 |
[2] | S. M. Hammond, An overview of microRNAs, Adv. Drug Deliv. Rev., 87 (2015), 3–14. https://doi.org/10.1016/j.addr.2015.05.001 |
[3] | S. Rajasekaran, D. Pattarayan, P. Rajaguru, P. S. Gandhi, R. K. Thimmulappa, MicroRNA Regulation of Acute Lung Injury and Acute Respiratory Distress Syndrome, J. Cell. Physiol., 231 (2016), 2097–2106. https://doi.org/10.1002/jcp.25316 doi: 10.1002/jcp.25316 |
[4] | Y. Meng, C. Lu, M. Jin, J. Xu, X. Zeng, J. Yang, A weighted bilinear neural collaborative filtering approach for drug repositioning, Brief. Bioinformatics, 2 (2022), bbab581. https://doi.org/10.1093/bib/bbab581 doi: 10.1093/bib/bbab581 |
[5] | Y. W. Kong, D. Ferland-McCollough, T. J. Jackson, M. Bushell, microRNAs in cancer management, Lancet Oncol., 13 (2012), e249–e258. https://doi.org/10.1016/S1470-2045(12)70073-6 doi: 10.1016/S1470-2045(12)70073-6 |
[6] | M. Chen, Y. Zhang, A. Li, Z. Li, W. Liu, Z. Chen, Bipartite heterogeneous network method based on co-neighbor for MiRNA-disease association prediction, Front. Genet., 10 (2019), 385. https://doi.org/10.3389/fgene.2019.00385 doi: 10.3389/fgene.2019.00385 |
[7] | L. Cai, M. Gao, X. Ren, X. Fu, J. Xu, P. Wang, et al., MILNP: Plant lncRNA-miRNA Interaction Prediction Based on Improved Linear Neighborhood Similarity and Label Propagation, Front. Plant Sci., 7 (2017), page 637. https://doi.org/10.3389/fpls.2022.861886 doi: 10.3389/fpls.2022.861886 |
[8] | L. Zhuo, S. Pan, J. Li, X. Fu Predicting miRNA-lncRNA interactions on plant datasets based on bipartite network embedding method, 207 (2022), 97–102. https://doi.org/10.1016/j.ymeth.2022.09.002 |
[9] | L. Peng, Y. Tu, L. Huang, Y. Li, X. Fu, X. Chen, DAESTB: inferring associations of small molecule–miRNA via a scalable tree boosting model based on deep autoencoder, Briefings in Bioinformatics, 23 (2022), bbac478. https://doi.org/10.1093/bib/bbac478 |
[10] | J. Wei, L. Zhuo, Z. Zhou, X. Lian, X. Fu, X. Yao, GCFMCL: predicting miRNA-drug sensitivity using graph collaborative filtering and multi-view contrastive learning, Briefings in Bioinformatics, 24 (2023), bbad247. https://doi.org/10.1093/bib/bbad247 |
[11] | Y. Li, C. Liang, K. Wong, J. Luo, Z. Zhang, Mirsynergy: detecting synergistic miRNA regulatory modules by overlapping neighbourhood expansion, Bioinformatics, 30 (2014), 2627–2635. https://doi.org/10.1093/bioinformatics/btu373 doi: 10.1093/bioinformatics/btu373 |
[12] | Q. Jiang, Y. Wang, Y. Hao, L. Juan, M. Teng, X. Zhang, et al., miR2Disease: a manually curated database for microRNA deregulation in human disease, Nucleic Acids Res., 37 (2009), D98–D104. https://doi.org/10.1093/nar/gkn714 doi: 10.1093/nar/gkn714 |
[13] | Z. Yang, F. Ren, C. Liu, S. He, G. Sun, Q. Gao, et al., dbDEMC: a database of differentially expressed miRNAs in human cancers, BMC Genom., 11 (2010), 1–8. https://doi.org/10.1186/1471-2164-11-S4-S5 doi: 10.1186/1471-2164-11-S4-S5 |
[14] | Q. Jiang, G. Wang, T. Zhang, Y. Wang, Predicting human microrna-disease associations based on support vector machine, 2010 IEEE Int. Confer. Bioinformatics Biomed., (2010), 467–472. https://doi.org/10.1109/BIBM.2010.5706611 |
[15] | P. Wang, W. Zhu, B. Liao, L. Cai, L. Peng, J. Yang, Predicting influenza antigenicity by matrix completion with antigen and antiserum similarity, Front. Microbiol., 9 (2018), 2500. https://doi.org/10.3389/fmicb.2018.02500 doi: 10.3389/fmicb.2018.02500 |
[16] | L. Shen, F. Liu, L. Huang, G. Liu, L. Zhou, L. Peng, VDA-RWLRLS: An anti-SARS-CoV-2 drug prioritizing framework combining an unbalanced bi-random walk and Laplacian regularized least squares, Comput. Biol. Med., 140 (2022), 105–119. https://doi.org/10.1016/j.compbiomed.2021.105119 doi: 10.1016/j.compbiomed.2021.105119 |
[17] | L. Cai, C. Lu, J. Xu, Y. Meng, P. Wang, X. Fu, et al., Drug repositioning based on the heterogeneous information fusion graph convolutional network, Brief. Bioinformatics, 22 (2021), bbab319. https://doi.org/10.1093/bib/bbab319 doi: 10.1093/bib/bbab319 |
[18] | Y. Chen, X. Fu, Z. Li, L. Peng, L. Zhuo, Prediction of lncRNA–protein interactions via the multiple information integration, Front. Bioeng. Biotechnol., 9 (2021), 647113. https://doi.org/10.3389/fbioe.2021.647113 |
[19] | J. Wei, L. Zhuo, S. Pan, X. Lian, X. Yao, X. Fu, Headtailtransfer: An efficient sampling method to improve the performance of graph neural network method in predicting sparse ncRNA–protein interactions, Comput. Biol. Med., 157 (2023), 106783. https://doi.org/10.1016/j.compbiomed.2023.106783 |
[20] | L. Zhuo, B. Song, Y. Liu, Z. Li, X. Fu, Predicting ncRNA–protein interactions based on dual graph convolutional network and pairwise learning, Brief. Bioinformatics, 23 (2022), bbac339. https://doi.org/10.1093/bib/bbac339 |
[21] | X. Zhang, X. Zeng, Integrative approaches for predicting microRNA function and prioritizing disease-related microRNA using biological interaction networks, Bio-inspired Comput. Model. Algorithms, (2019), 75–105. https://doi.org/10.1142/9789813143180_0003 |
[22] | Q. Zou, J. Li, L. Song, X. Zeng, G. Wang, Similarity computation strategies in the microRNA-disease network: a survey, Brief Funct. Genomics, 15 (2016), 55–64. https://doi.org/10.1093/bfgp/elv024 doi: 10.1093/bfgp/elv024 |
[23] | L. Cai, X. Ren, X. Fu, L. Peng, M. Gao, X. Zeng, iEnhancer-XG: interpretable sequencebased enhancers and their strength predictor, Bioinformatics, 37 (2021), 1060–1067. https://doi.org/10.1093/bioinformatics/btaa914 |
[24] | X. Fu, L. Cai, X. Zeng, Q. Zou, StackCPPred: a stacking and pairwise energy content-based prediction of cell-penetrating peptides and their uptake efficiency, Bioinformatics, 36 (2020), 3028–3034. https://doi.org/10.1093/bioinformatics/btaa131 |
[25] | X. Fu, L. Ke, L. Cai, X. Chen, X. Ren, M. Gao, Improved prediction of cell-penetrating peptides via effective orchestrating amino acid composition feature representation, IEEE Access, 7 (2019), 163547–163555. https://doi.org/10.1109/ACCESS.2019.2952738 |
[26] | W. Liu, T. Tang, X. Lu, X. Fu, Y. Yang, L. Peng, MPCLCDA: predicting circRNA–disease associations by using automatically selected meta-path and contrastive learning, Brief. Bioinformatics, 24 (2023), bbad227. https://doi.org/10.1093/bib/bbad227 |
[27] | L. Peng, C. Yang, Y. Chen, W. Liu, Predicting CircRNA-disease associations via feature convolution learning with heterogeneous graph attention network, IEEE J. Biomed. Health. Inform., 27 (2023), 3072–3082. https://doi.org/10.1109/JBHI.2023.3260863 |
[28] | T. Wang, W. Wang, X. Jiang, J. Mao, L. Zhuo, M. Liu, et al., ML-NPI: predicting interactions between noncoding RNA and protein based on meta-learning in a large-scale dynamic graph, J. Chem. Inf. Model., 64 (2023), 2912–2920. https://doi.org/10.1021/acs.jcim.3c01238 |
[29] | Z. Zhou, Z. Du, J. Wei, L. Zhuo, S. Pan, X. Fu, et al., MHAM-NPI: Predicting ncRNA-protein interactions based on multi-head attention mechanism, Comput. Biol. Med., 163 (2023), 107143. https://doi.org/10.1016/j.compbiomed.2023.107143 |
[30] | Q. Liao, X. Fu, L. Zhuo, H. Chen, An efficient model for predicting human diseases through miRNA based on multiple-types of contrastive learning, Front. Microbiol., 14 (2023), 1325001. https://doi.org/10.3389/fmicb.2023.1325001 |
[31] | W. Liu, H. Lin, L. Huang, L. Peng, T. Tang, Q. Zhao, et al., Identification of miRNA–disease associations via deep forest ensemble learning based on autoencoder, Brief. Bioinformatics, 23 (2022), bbac104. https://doi.org/10.1093/bib/bbac104 |
[32] | Q. Jiang, G. Wang, Y. Wang, An approach for prioritizing disease-related microRNAs based on genomic data integration, 2010 3rd Int. Confer. Biomed. Eng. Inform., 6 (2010), 2270–2274. https://doi.org/10.1109/BMEI.2010.5639313 doi: 10.1109/BMEI.2010.5639313 |
[33] | J. Xu, C. Li, J. Lv, Y. Li, Y. Xiao, T. Shao, et al., Prioritizing Candidate Disease miRNAs by Topological Features in the miRNA Target–Dysregulated Network: Case Study of Prostate Cancer, Mol. Cancer Ther., 10 (2011), 1857–1866. https://doi.org/10.1158/1535-7163.MCT-11-0055 doi: 10.1158/1535-7163.MCT-11-0055 |
[34] | X. Zeng, Y. Liao, Y. Liu, Q. Zou, Prediction and validation of disease genes using HeteSim Scores, IEEE/ACM Trans. Comput. Biol. Bioinform., 14 (2016), 687–695. 10.1109/TCBB.2016.2520947 doi: 10.1109/TCBB.2016.2520947 |
[35] | Q. Xiao, J. Luo, C. Liang, J. Cai, P. Ding, A graph regularized non-negative matrix factorization method for identifying microRNA-disease associations, Bioinformatics, 34 (2018), 239–248. https://doi.org/10.1093/bioinformatics/btx545 doi: 10.1093/bioinformatics/btx545 |
[36] | J. Xu, L. Cai, B. Liao, W. Zhu, P. Wang, Y. Meng, et al., Identifying potential mirnas–disease associations with probability matrix factorization, Front. Genet., 10 (2019), 1234. https://doi.org/10.3389/fgene.2019.01234 doi: 10.3389/fgene.2019.01234 |
[37] | X. Chen, G. Yan, Semi-supervised learning for potential human microRNA-disease associations inference, Sci. Rep., 4 (2014), 1–10. https://doi.org/10.1038/srep05501 doi: 10.1038/srep05501 |
[38] | W. Liu, X. Sun, L. Yang, K. Li, Y. Yang, X. Fu, NSCGRN: a network structure control method for gene regulatory network inference, Brief. Bioinform., (2022). https://doi.org/10.1093/bib/bbac156 |
[39] | Q. Qu, X. Chen, B. Ning, X. Zhang, H. Nie, L. Zeng, et al., Prediction of miRNA-disease associations by neural network-based deep matrix factorization, Methods, 212 (2023), 1–9. https://doi.org/10.1016/j.ymeth.2023.02.003 |
[40] | W. Liu, Y. Yang, X. Lu, X. Fu, R. Sun, L. Yang, et al., NSRGRN: a network structure refinement method for gene regulatory network inference, Brief. Bioinformatics, 24 (2023), bbad129. https://doi.org/10.1093/bib/bbad129 |
[41] | L. Peng, C. Yang, L. Huang, X. Chen, X. Fu, W. Liu, RNMFLP: predicting circRNA–disease associations based on robust nonnegative matrix factorization and label propagation, Brief. Bioinformatics, 24 (2023), bbac155. https://doi.org/10.1093/bib/bbad155 |
[42] | C. Gu, B. Liao, X. Li, K. Li, Network consistency projection for human miRNA-disease associations inference, Sci. Rep., 6 (2016), 1–10. https://doi.org/10.1038/srep36054 doi: 10.1038/srep36054 |
[43] | X. Chen, C. C. Yan, X. Zhang, Z. You, L. Deng, Y. Liu, et al., WBSMDA: within and between score for MiRNA-disease association prediction, Sci. Rep., 6 (2016), 1–9. https://doi.org/10.1038/srep21106 doi: 10.1038/srep21106 |
[44] | Y. Liu, X. Zeng, Z. He, Q. Zou, Inferring microRNA-disease associations by random walk on a heterogeneous network with multiple data sources, IEEE/ACM Trans. Comput. Biol. Bioinform., 14 (2016), 905–915. https://doi.org/10.1109/TCBB.2016.2550432 doi: 10.1109/TCBB.2016.2550432 |
[45] | A. Li, Y. Deng, Y. Tan, M. Chen, A novel mirna-disease association prediction model using dual random walk with restart and space projection federated method, PLoS One, 6 (2021), e0252971. https://doi.org/10.1371/journal.pone.0252971 doi: 10.1371/journal.pone.0252971 |
[46] | X. Chen, M. Liu, G. Yan, RWRMDA: predicting novel human microRNA–disease associations, Mol. BioSyst., 8 (2012), 2792–2798. https://doi.org/10.1039/c2mb25180a doi: 10.1039/c2mb25180a |
[47] | P. Xuan, K. Han, M. Guo, Y. Guo, J. Li, J. Ding, et al., Prediction of microRNAs Associated with Human Diseases Based on Weighted k Most Similar Neighbors, PloS One, 8 (2013), e70204. https://doi.org/10.1371/journal.pone.0070204 doi: 10.1371/journal.pone.0070204 |
[48] | P. Xuan, C. Sun, T. Zhang, Y. Ye, T. Shen, Y. Dong, Gradient boosting decision tree-based method for predicting interactions between target genes and drugs, Front. Genet., 10 (2019), 459. https://doi.org/10.3389/fgene.2019.00459 doi: 10.3389/fgene.2019.00459 |
[49] | H. Chen, Z. Zhang, Similarity-based methods for potential human microRNA-disease association prediction, BMC Med. Genom., 6 (2013), 1–9. https://doi.org/10.1186/1755-8794-6-12 doi: 10.1186/1755-8794-6-12 |
[50] | D. Wang, J. Wang, M. Lu, F. Song, Q. Cui, Inferring the human microRNA functional similarity and functional network based on microRNA-associated diseases, Bioinformatics, 26 (2010), 1644–1650. https://doi.org/10.1093/bioinformatics/btq241 doi: 10.1093/bioinformatics/btq241 |
[51] | P. Jain, I. S. Dhillon, Provable inductive matrix completion, arXiv preprint, (2013), arXiv: 1306.0626. |
[52] | D. Wang, J. Wang, M. Lu, F. Song, Q. Cui, H. Yu, et al., Large-scale prediction of microRNA-disease associations by combinatorial prioritization algorithm, Sci. Rep., 7 (2017), 1–15. https://doi.org/10.1038/srep43792 doi: 10.1038/srep43792 |