
Citation: Testimony J Olumade, Oluwafolajimi A Adesanya, Iyanuoluwa J Fred-Akintunwa, David O Babalola, Judith U Oguzie, Olusola A Ogunsanya, Uwem E George, Oluwawapelumi D Akin-Ajani, Damilola G Osasona. Infectious disease outbreak preparedness and response in Nigeria: history, limitations and recommendations for global health policy and practice[J]. AIMS Public Health, 2020, 7(4): 736-757. doi: 10.3934/publichealth.2020057
[1] | Geovanni Alberto Ruiz-Romero, Carolina Álvarez-Delgado . Effects of estrogens in mitochondria: An approach to type 2 diabetes. AIMS Molecular Science, 2024, 11(1): 72-98. doi: 10.3934/molsci.2024006 |
[2] | Fumiaki Uchiumi, Makoto Fujikawa, Satoru Miyazaki, Sei-ichi Tanuma . Implication of bidirectional promoters containing duplicated GGAA motifs of mitochondrial function-associated genes. AIMS Molecular Science, 2014, 1(1): 1-26. doi: 10.3934/molsci.2013.1.1 |
[3] | Naba Hasan, Waleem Ahmad, Feroz Alam, Mahboob Hasan . Ferroptosis-molecular mechanisms and newer insights into some diseases. AIMS Molecular Science, 2023, 10(1): 22-36. doi: 10.3934/molsci.2023003 |
[4] | Jian Zou, Fulton T. Crews . Glutamate/NMDA excitotoxicity and HMGB1/TLR4 neuroimmune toxicity converge as components of neurodegeneration. AIMS Molecular Science, 2015, 2(2): 77-100. doi: 10.3934/molsci.2015.2.77 |
[5] | Fumiaki Uchiumi, Akira Sato, Masashi Asai, Sei-ichi Tanuma . An NAD+ dependent/sensitive transcription system: Toward a novel anti-cancer therapy. AIMS Molecular Science, 2020, 7(1): 12-28. doi: 10.3934/molsci.2020002 |
[6] | Yutaka Takihara, Ryuji Otani, Takuro Ishii, Shunsuke Takaoka, Yuki Nakano, Kaori Inoue, Steven Larsen, Yoko Ogino, Masashi Asai, Sei-ichi Tanuma, Fumiaki Uchiumi . Characterization of the human IDH1 gene promoter. AIMS Molecular Science, 2023, 10(3): 186-204. doi: 10.3934/molsci.2023013 |
[7] | Amena W. Smith, Swapan K. Ray, Arabinda Das, Kenkichi Nozaki, Baerbel Rohrer, Naren L. Banik . Calpain inhibition as a possible new therapeutic target in multiple sclerosis. AIMS Molecular Science, 2017, 4(4): 446-462. doi: 10.3934/molsci.2017.4.446 |
[8] | Dora Brites . Cell ageing: a flourishing field for neurodegenerative diseases. AIMS Molecular Science, 2015, 2(3): 225-258. doi: 10.3934/molsci.2015.3.225 |
[9] | Giulia Ambrosi, Pamela Milani . Endoplasmic reticulum, oxidative stress and their complex crosstalk in neurodegeneration: proteostasis, signaling pathways and molecular chaperones. AIMS Molecular Science, 2017, 4(4): 424-444. doi: 10.3934/molsci.2017.4.424 |
[10] | Tsuyoshi Inoshita, Yuzuru Imai . Regulation of vesicular trafficking by Parkinson's disease-associated genes. AIMS Molecular Science, 2015, 2(4): 461-475. doi: 10.3934/molsci.2015.4.461 |
With the advent of technologies allowing for large-scale, high throughput data, a much clearer understanding of the genomic mechanisms behind gene regulation have been gained. The scientists found that there are unexpected far more noncoding RNAs comparing with protein-coding genes and, and these noncoding regions play important roles in determining the complexity observed in the human genome [1,2]. Within these noncoding regions, long noncoding RNAs (lncRNAs), which are functionally defined as noncoding regions of RNA that are at least 200 base-pairs in length, have attracted lots of attention. Certain lncRNAs appear to act locally, while others have more distal regulatory effects, even acting across multiple chromosomes [3]. Many studies have identified specific functions of particular lncRNAs, including embryonic mechanisms, cell cycle functions, innate immunity, and disease processes. However, there are still thousands of lncRNAs have no identified functions [1,3,4,5,6]. Some studies have been performed that produce relatively few numbers of lncRNA functions [7], and have shown that the function of lncRNAs is highly cell-type-specific: one lncRNA may inhibit particular genes in one type of cell while promoting the same gene in another. This phenomenon makes it even more difficult to identify lncRNA functions on a large scale. Due to this specificity, researchers propose that future lncRNA studies should be performed on specific cell types to identify particular regulatory mechanisms.
One of the most prominent and intriguing applications of lncRNA regulatory investigation comes from cancer studies [8,9]. It has been shown that lncRNAs appear to have high connectivity with numerous diseases, especially cancer. Because of the highly cell type-specific nature of lncRNA regulatory functions and the irregularity of cancer cell genetic information, studying lncRNA regulation in specific cancer types may provide promising insight into specific genomic regulations of common cancer cells. In a few documented cases, specific lncRNAs have been shown to be significantly differentially expressed in specific cancer types, such as prostate cancer and breast cancer [1]. For these reasons, it seems appropriate to further investigate lncRNA-gene interactions in particular cancer cells.
The wealth of gene expression datasets available provides an opportunity to computationally identify co-expressed gene modules(CEMs), each of which is defined as a highly structured expression pattern on a specific gene set [10,11]. These CEMs tend to be functionally related or co-regulated by the same transcriptional regulatory signals (e.g., transcription factors, lncRNA and so on) under a specific condition or in a particular disease cell type. Overall, successful derivation of the CEMs may grant a higher-level interpretation of large-scale gene expression data, improve functional annotation of condition-specific gene activities, facilitate inference of gene regulatory relationships, hence, provide a better mechanism level understanding of complex diseases.
The computational identification of CEMs can be solved by a biclustering approach [12], which is a two-dimensional data mining technique that simultaneously identifies co-expressed genes under a subset of conditions. a high proportion of enriched biclusters on real datasets. Within this study, we try to identify new lncRNA-gene interactions and transcription factor-lncRNA partnerships from cancer RNA-seq data using a biclustering approach. The biclustering method will allow for the identification of particular expression patterns across multiple datasets, indicating networks of lncRNA and gene interactions. This developed method will also provide a framework for future lncRNA interaction studies. We applied this method on two sets of TCGA breast cancer RNA-seq data to generated CEMs based on known lncRNA-gene interactions. Then, the predicted CEMs are linked to lncRNA by a statistic p-value and the new lncRNA-gene relationship are generated. The evaluation on the predicted results showed that the pipeline can find some target genes for given lncRNA, and meanwhile the performance still has some space to be improved. We further conducted a TF motif analysis on the predicted CEMs and provide potential regulation cooperation between TFs and lncRNAs. The related original data with codes, results and supplementary data can be downloaded on https://github.com/IvesG/sGavin.git.
Two sets of TCGA (The Cancer Genome Atlas) breast cancer RNA-seq data, one from the normal cell (referred as normal data) and the other from tumor cell (referred as tumor data) were downloaded from https://portal.gdc.cancer.gov/. The normal and tumor data consist of 113 and 1091 samples, respectively. And of the 113 normal samples, 112 of them are from the same patient among the tumors. Both datasets contain 60,483 genes, among which there are 19,824 protein-coding genes and 7,399 long intergenic noncoding RNAs (lincRNAs) genes. The RNA-seq data are all Upper Quartile normalized FPKM (UQ-FPKM) values.
A total of 1,081 experimentally validated lncRNA-associated regulatory entries were downloaded from LncReg [13], describing the comprehensive regulatory relationships among 258 lncRNAs and 571 genes. All these relationships were manually collected from PubMed with focus on the data generated by laboratory methods, and can be categorized into up/down/active/inactive based on regulatory relationships or transcription/post-transcription/translation/post-translation based on regulatory mechanisms.
As we focus on lncRNA-gene interactions, the relationships downloaded from LncReg were filtered to retain only relationships describing genes regulated by lncRNAs with specified species information (constrained to Homo sapiens and Mus musculus), resulting 925 relationships in total for the downstream analysis, covering interactions between 309 unique human genes and 103 human lncRNAs, as well as between 199 mouse genes and 100 mouse lncRNAs. It is noteworthy that these 925 relationships include 28 post-transcriptional regulations, 41 post-translational regulations, 714 transcriptional regulations, 23 translational regulations, 1 transcriptional & translational regulation and 118 unspecified relationships.
As the table from LncReg [13] only provides gene symbols, while the RNA-seq dataset uses Ensembl ID as gene's identifiers, we use Ensembl BioMart [14] to match gene symbols with Ensembl IDs for all the genes and lncRNAs. Then we got orthologous genes between mouse and human also using BioMart; we found orthologous human genes for all 199 mouse genes, and 38 overlapped with original human genes. For convenience, we recorded human genes, mouse genes that don't overlap with human genes, human lncRNAs and mouse lncRNAs that don't overlap with human lncRNAs as HG, MG, HL, and ML, respectively.
We combined the normal and tumor RNA-seq dataset together, then extracted expression values for all the HG, MG, HL, ML, protein-coding genes (PC, the remaining protein-coding genes except HG and MG) and lincRNAs (linc, the remaining lincRNA except HL and ML). Taking the genes as rows and the conditions as columns, we obtained the RNA-seq expression matrix on which biclustering will be performed to detect CEMs.
QUBIC is a biclustering analysis tool designed for co-expression analyses of genes based on their gene-expression patterns under multiple conditions. The software can generally identify all statistically significant groups, or biclusters, of genes with similar expression patterns under at least a specific number of experimental conditions, which tend to be more sensitive and more specific than other biclustering tools [15]. We use a quantile-based discretization method of QUBIC to generate a qualitative representing matrix for the RNA-seq expression matrix. Then we extracted the rows of known lncRNA regulated HG and MG from this representing matrix as seed 1 and HG, MG, HL, and ML rows as seed 2. Next bi-clustering analysis was performed on these two seeds to predict co-expressed gene modules (CEMs) in the qualitative representing matrix, respectively.
For an identified CEMs, we calculated the P-value of a bicluster enriched with genes regulated by a lncRNA using the hypergeometric function [16],
where r is the number of genes in a CEMs (with size n) that regulated by certain lncRNA, N is the total number of known lncRNA regulated genes in the whole genome, K is the number of genes regulated by that lncRNA in the whole genome.
We assumed that, if the known target genes of a given lncRNA are highly covered by a CEM with a significant p-value, the other genes in this CEM have high possibilities regulated by the given lncRNA. Thus, we used the smallest P-value for all possible lncRNAs as the p-value of the current bicluster and the relationships between lncRNA and genes in the bicluster are predicted.
To evaluate the performance of the new methods on the prediction of new relationships between lncRNA and genes, we randomly separate seed2 into two parts with equal size named seedpart1 and seedpart2, for multiple times. Then bi-clustering analysis will be performed on seedpart2 to predict co-expressed gene modules (CEMs). For seedpart1 we find its part which is covered by co-expressed gene modules (CEMs) from seedpart2. We calculate the cover ratios by the size of seedpart1 to be divided by the size of the covered part by CEMs generated from seedpart2. Also, we calculate the p-values for the coverage rates to present the statistical significance of them.
We choose several significant CEMs with sizes or conditions below 100, to conducted TF motif analysis. The promoter regions of the corresponding genes are inputted into the sub-routine findMotifs.pl of Homer [17], respectively. The script findMotifs.pl can firstly search for the upstream promoter sequences of a certain length automatically, and then perform motif finding on the promoters. For each run of findMotifs.pl on the datasets, we let the program output at most 5 top-ranking motifs, i.e. there will be up to 5 motifs discovered by findMotifs.pl for each CEMs. To evaluate the validity of the discovered motifs, findMotifs.pl automatically compares the similarity between the discovered motif profiles and the motif profiles archived in JASPAR [18] v2018 (http://jaspar.genereg.net/) under its default parameter setting. For each discovered motif having similarity with at least one motif archived in JASPAR, we present its motif logo as well as the information of its most similar motif in JASPAR.
All the known interactions between lncRNAs and genes are showcased in Figure 1A. The related data can be download from https://github.com/IvesG/sGavin.git data/LncReg0419 and more details are written in data/readme.txt. In figure 1A, dark-blue nodes represent LncRNAs, light-blue nodes represent proteins, pink edges represent interactions documented in Homo, green edges represent interactions documented in Mus, orange edges represent interactions documented in both Homo and Mus. Meanwhile, there are some labels on the edges, categorized based on regulatory mechanisms including PTL (post-translational regulation), TC (transcriptional regulation), PTC (post-transcriptional regulation), TL (translational regulation), and NS (not sure). The distribution above is displayed in Figure 1B and nearly three fourth (714/925) of them are identified at the transcriptional level. Other labels on the edges are categorized based on regulatory relationships including down, up, active and inactive. The distribution above is displayed in Figure 1C. The down relationships (575) are more than up relationships (308), and the proportion of active/inactive is scarce (4.5%).
Figure 1E showed the distribution of a number of genes regulated by each lncRNA. It can be found that most lncRNA (~78%) regulate less than 5 genes. To show the specific details of the number of genes regulated by each lncRNA, Figure 1D is made, each point in the Figure 1D reflect the number of lncRNA (horizontal coordinate) that regulate certain number of genes (longitudinal coordinates) e.g. the point with coordinate (4, 14) in Figure 1D indicate that there are 14 lncRNA and each of them regulate 4 genes. The lncRNA that regulate more genes in Figure 1D belongs to the more concentrated parts in Figure 1(A).
With the quantile-based discretization method and biclustering analysis, there are some co-expressed gene modules (CEMs) are found. The details of the way we identify CEMs are showcased in Figure 2C. Figure 2A shows the number of co-expressed gene modules(CEMs) we have got from seed1 and seed2 processed by max(min [19])-based (QUBIC1.0, [15,20]) and KL-based bi-clustering analysis (QUBIC2.0 [21]) respectively. And the distributions of numbers of genes and conditions for each CEM can be found in Figure 2D and Figure 2E. For instance, label seed1_1genes represent that Qubic1.0 is performed on seed1. To better illustrate the distributions, we have constrained the number of each gene and the number of each condition below 100 (in Figure 2B). It is found that the KL-based biclustering method tends to generate CEMs that contain fewer genes while more conditions than max(min [22])-based biclustering method from Figure 2D and Figure 2E. The difference between the results (whether the distribution of genes sizes and condition sizes or the number of CEMs that we have predicted) we get from seed1 and seed2 is subtle.
The proportion of CEMs that have significant P-values (below a pre-selected P-value cutoff) as well as proportions of the number of unique enriched lncRNA in each bicluster that belong to certain categories (i.e., number of lncRNA = 2, 3 or > 3) are calculated and shown in Figure 3A and Figure 3B. In the figures, Seed1_qubic1 represent the proportions from the results obtained using quantile discretization and using max(min [23])-based biclustering on seed 1, seed1_qubic2 represent using quantile discretization and using KL-based biclustering on seed1. In Figure 3A, it can be found that most CEMs have P-value more than 0.001 and seed1_qubic1 seems to have more significant P-value. Constrain P-value below 0.00001 and there is barely CEMs remained (less than 7%). In Figure 3B the majority of (more than70%) CEMs are with enriched lncRNA more than 3 and especially most (around 85%) of CEMs from seed1_qubic1.
For validation, we separated the HG + MG genes into two parts randomly and equally for 10 times and obtained 10 cover ratios correspondingly to check the accuracy of the previously predicted genes. The results of our validation are calculated and shown in Table 1. From Table 1 it can be found that all of the cover ratios are under 25% and the average ratio is 16.25%. We further calculated the p-value of the coverage rates. The results indicated that even the coverage date has a lot of space to be improved, the statistical significance of them are acceptable.
Group | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
Ratio | 14.00% | 19.60% | 14.00% | 15.70% | 14.50% | 21.30% | 9.80% | 24.30% | 14.00% | 15.30% |
P-value | 7.58E-07 | 5.14E-14 | 7.58E-07 | 8.11E-09 | 2.56E-07 | 1.24E-16 | 5.37E-03 | 1.27E-21 | 7.58E-07 | 2.65E-08 |
Since lncRNA plays an important role in regulation, they should have cooperation with transcription factor [23,24]. Thus we conduct the analysis about the DNA binding sites of related to CEMs [19,22,25]. As described in the Method section, we choose five CEMs to conducted TF motif analysis. The corresponding gene list files each containing 3, 6, 14, 20 and 21 genes. The predicted motifs and the comparison between them and JASPAR motifs are listed in Table 2, along with the function of the target TFs. In Table 2, the second column has the name of lncRNA related with this CEMs and the p-value of their correlations; The third column contains the motif consensus by Homer; The fourth column provides TF names of the most similar motifs in JASPAR, along with the similarity scores in the fifth column. These TFs may have cooperation with corresponding lncRNAs. In the first column, all the P-value of the LncRNA from the CEM is below 0.01 and the least P-value is from LncRNA HOTAIR. The supplementary table S1 with more details, including and the logo of discovered motifs and the functions of corresponding TFs, can be downloaded by visiting the GitHub link.
lncRNA (p-value) | Homer motifs | JASPAR TFs | Scores | |
1 | FOXCUT 3.6e-3 |
AACCAVTTHDCG | TFCP2 | 0.64 |
TCCTATCACACR | MEIS2 | 0.62 | ||
TTTTHAAAGGGG | CHR | 0.67 | ||
ARTGGTTGTWGA | FOXJ2 | 0.58 | ||
GCAATCTCGC | IRF4 | 0.66 | ||
2 | ANCR 1.1e-3 |
AGGGTGACAG | SPZ1 | 0.80 |
GGTATCTTAC | GATA5 | 0.64 | ||
CTCATAGGAG | GCM1 | 0.65 | ||
TAAGTGAAAG | PRDM1 | 0.86 | ||
CTTTTGGAAC | CHR | 0.65 | ||
3 | 250-280 2.2e-4 |
WYTRTCTTTGCG | RXR | 0.61 |
TCTTACGG | ELK1 | 0.71 | ||
GGCAAGGA | SD | 0.76 | ||
GAGGTATGTT | TEAD1 | 0.70 | ||
TGCCGGGAGCGT | POL | 0.64 | ||
4 | HOXD-AS1 6.1e-3 |
CTCGAGTAGG | PB0114 | 0.63 |
GCCCCCTGCA | PB0076 | 0.74 | ||
ACGYMYATKYCC | GFY | 0.59 | ||
AGCGGGTT | PH | 0.68 | ||
AGGCGCCGCGCC | SP1 | 0.69 | ||
5 | HOTAIR 5e-6 |
TGGCGCAGCGCG | PB | 0.67 |
GTACAACTTT | PB | 0.66 | ||
CMTSTGTCWCYK | NeuroG2 | 0.66 | ||
GTGATCCATT | RHOXF1 | 0.68 | ||
GGTMGRRGTGMW | TBX20 | 0.58 |
In order to further evaluate the biological significance of the identified CEMs, we tested the enrichment of the genes in each CEM in Gene ontology terms and KEGG pathways using clusterProfiler package of R project BioConductor under q-value cutoff 0.05, of which the description of the GO terms and KEGG pathways that the CEMs are enriched in are presented in Table 3 and Table 4 respectively. And the supplementary table S2 with more details, including original and adjusted P-value, proportion of the matched genes, gene's ID, etc., can be downloaded on GitHub link.
LncRNA | ID | Description | q-value |
FOXCUT | GO:0033613 | transmembrane receptor protein tyrosine kinase activity | 1.2937E-02 |
GO:0033613 | activating transcription factor binding | 1.2937E-02 | |
GO:0019199 | transmembrane receptor protein kinase activity | 1.2937E-02 | |
GO:0001085 | RNA polymerase Ⅱ transcription factor binding | 2.0767E-02 | |
250-280 | GO:0003735 | structural constituent of ribosome | 3.1600E-06 |
GO:0003729 | mRNA binding | 1.1140E-03 | |
GO:0008483 | transaminase activity | 1.1140E-03 | |
GO:0048027 | mRNA 5'-UTR binding | 1.1140E-03 | |
GO:0016769 | transferase activity, transferring nitrogenous groups | 1.1140E-03 | |
GO:0045182 | translation regulator activity | 1.5826E-03 | |
GO:0030170 | pyridoxal phosphate binding | 1.5826E-03 | |
GO:0070279 | vitamin B6 binding | 1.5826E-03 | |
GO:0019843 | rRNA binding | 1.5826E-03 | |
GO:0019842 | vitamin binding | 3.1903E-03 | |
HOXD-AS1 | GO:0004714 | transmembrane receptor protein tyrosine kinase activity | 1.2937E-02 |
GO:0033613 | activating transcription factor binding | 1.2937E-02 | |
GO:0019199 | transmembrane receptor protein kinase activity | 1.2937E-02 | |
GO:0001085 | RNA polymerase Ⅱ transcription factor binding | 2.0767E-02 | |
HOTAIR | GO:0005109 | frizzled binding | 1.9097E-03 |
GO:0001227 | transcriptional repressor activity, RNA polymerase Ⅱ transcription regulatory region sequence-specific binding | 1.9097E-03 | |
GO:0001664 | G-protein coupled receptor binding | 1.9686E-03 | |
GO:0001078 | transcriptional repressor activity, RNA polymerase Ⅱ core promoter proximal region sequence-specific binding | 1.1196E-02 | |
GO:0008201 | heparin binding | 1.2885E-02 | |
GO:0005539 | glycosaminoglycan binding | 1.8790E-02 | |
GO:1901681 | sulfur compound binding | 1.9016E-02 | |
GO:0045236 | CXCR chemokine receptor binding | 2.1785E-02 | |
GO:0008301 | DNA binding, bending | 2.1785E-02 | |
GO:0001223 | transcription coactivator binding | 2.1785E-02 | |
GO:0042813 | Wnt-activated receptor activity | 2.1785E-02 | |
GO:0035198 | miRNA binding | 2.2807E-02 | |
GO:0017147 | Wnt-protein binding | 2.5258E-02 | |
GO:1990841 | promoter-specific chromatin binding | 2.5258E-02 | |
GO:0000982 | transcription factor activity, RNA polymerase Ⅱ core promoter proximal region sequence-specific binding | 2.5258E-02 | |
GO:0001221 | transcription cofactor binding | 2.5587E-02 |
LncRNA | ID | Description | q-value |
FOXCUT | hsa05216 | Thyroid cancer | 8.8587E-03 |
hsa04510 | Focal adhesion | 8.8587E-03 | |
hsa05205 | Proteoglycans in cancer | 8.8587E-03 | |
hsa05218 | Melanoma | 1.7683E-02 | |
hsa05214 | Glioma | 1.7683E-02 | |
hsa04151 | PI3K-Akt signaling pathway | 1.8745E-02 | |
hsa05215 | Prostate cancer | 1.8745E-02 | |
hsa01522 | Endocrine resistance | 1.8745E-02 | |
hsa04919 | Thyroid hormone signaling pathway | 2.2317E-02 | |
hsa04152 | AMPK signaling pathway | 2.2317E-02 | |
hsa04068 | FoxO signaling pathway | 2.4442E-02 | |
hsa04550 | Signaling pathways regulating pluripotency of stem cells | 2.5129E-02 | |
hsa05224 | Breast cancer | 2.5129E-02 | |
hsa04218 | Cellular senescence | 2.7471E-02 | |
hsa05225 | Hepatocellular carcinoma | 2.7471E-02 | |
hsa04530 | Tight junction | 2.7471E-02 | |
250-280 | hsa03010 | Ribosome | 2.9900E-05 |
hsa01210 | 2-Oxocarboxylic acid metabolism | 3.7322E-03 | |
hsa00220 | Arginine biosynthesis | 3.7322E-03 | |
hsa00250 | Alanine, aspartate and glutamate metabolism | 4.7849E-03 | |
hsa01230 | Biosynthesis of amino acids | 7.9156E-03 | |
HOXD-AS1 | hsa05216 | Thyroid cancer | 8.8587E-03 |
hsa04510 | Focal adhesion | 8.8587E-03 | |
hsa05205 | Proteoglycans in cancer | 8.8587E-03 | |
hsa05218 | Melanoma | 1.7683E-02 | |
hsa05214 | Glioma | 1.7683E-02 | |
hsa04151 | PI3K-Akt signaling pathway | 1.8745E-02 | |
hsa05215 | Prostate cancer | 1.8745E-02 | |
hsa01522 | Endocrine resistance | 1.8745E-02 | |
hsa04919 | Thyroid hormone signaling pathway | 2.2317E-02 | |
hsa04152 | AMPK signaling pathway | 2.2317E-02 | |
hsa04068 | FoxO signaling pathway | 2.4442E-02 | |
hsa04550 | Signaling pathways regulating pluripotency of stem cells | 2.5129E-02 | |
hsa05224 | Breast cancer | 2.5506E-02 | |
hsa04218 | Cellular senescence | 2.7471E-02 | |
hsa05225 | Hepatocellular carcinoma | 2.7471E-02 | |
hsa04530 | Tight junction | 2.7471E-02 | |
HOTAIR | hsa04310 | Wnt signaling pathway | 5.7295E-03 |
Within this study, we have developed a method for elucidating lncRNA-gene and transcription factor-lncRNA interactions using a biclustering approach. The method was performed on 2 breast cancer RNA-seq datasets from TCGA. The bicluster method allows for the identification of particular expression patterns across multiple datasets, indicating networks of lncRNA and gene interactions. The developed method will also provide a way for future lncRNA interaction studies. Certainly, the predict performance still far from satisfactory, which is not unexpected since we only used RNA-Seq data. Actually, the interaction mechanism between lncRNA and genes are far more complex, and more data should be involved if we want to capture the whole picture of them. We are planning to include some other data, like proteomics and chromatin accessibility information, to improve the prediction. Besides, the evaluation on the relationship between lncRNA and predicted CEMs also has the potential to be improved, e.g. calculating the adjusted P-value or overall P-value in place of the original P-values used in this study. In view of the application, we will work on more specific examples of the regulatory functions of some particular lncRNAs and identify some hypothesized mechanisms of these regulatory functions. Also, the further analysis of the difference of lncRNA related genes between tumor and normal samples could provide more information for studying the process and mechanism of cancer occurrence and development, e.g. determination of the stage of developed tumors, which will be our concern in the future research.
This work was supported by the National Nature Science Foundation of China (NSFC) [61772313 and 61432010], Young Scholars Program of Shandong University [YSPSDU, 2015WLJH19], the Innovation Method Fund of China (2018IM020200), and Shanghai Municipal Science and Technology Major Project (2018SHZDZX01) and ZHANGJIANGLAB. Qin Ma's work was supported by an R01 Award from the National Institute of General Medical Sciences of the National Institutes of Health [GM131399-01]. This work used the Extreme Science and Engineering Discovery Environment (XSEDE), which is supported by the National Science Foundation [ACI-1548562]. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health and the National Science Foundation.
All authors declare no conflicts of interest in this paper.
[1] | Nigeria Population (2020) Worldometer. Available from: https://www.worldometers.info/world-population/nigeria-population/. |
[2] |
Muhammad F, Abdulkareem JH, Chowdhury AA (2017) Major Public Health Problems in Nigeria: A review. South East Asia J Public Health 7: 6-11. doi: 10.3329/seajph.v7i1.34672
![]() |
[3] |
Lecompte E, Fichet-Calvet E, Daffis S, et al. (2006) Mastomys natalensis and Lassa fever, West Africa. Emerg Infect Dis 12: 1971. doi: 10.3201/eid1212.060812
![]() |
[4] | Emperador DM, Yimer SA, Mazzola LT, et al. (2018) Diagnostic needs for lassa fever outbreak detection, clinical care, and vaccine development. Am J Trop Med Hyg 99: 524-525. |
[5] |
Mofolorunsho KC (2016) Outbreak of lassa fever in nigeria: Measures for prevention and control. Pan Afr Med J 23. doi: 10.11604/pamj.2016.23.210.8923
![]() |
[6] | CDC. Lassa Fever Fact Sheet. Available from: https://www.cdc.gov/vhf/lassa/index.html. |
[7] |
Achinge GI, Kur JT, Gyoh SK (2013) Lassa fever Outbreak in Makurdi, North Central Nigeria: What You Need To Know. IOSR J Dent Med Sci 7: 42-46. doi: 10.9790/0853-0754246
![]() |
[8] |
Frame JD, Baldwin JM, Gocke DJ, et al. (1970) Lassa fever, a new virus disease of man from West Africa. I. Clinical description and pathological findings. Am J Trop Med Hyg 19: 670-676. doi: 10.4269/ajtmh.1970.19.670
![]() |
[9] | Ogoina D (2013) Lassa fever: a clinical and epidemiological review. Niger Delta J Med Res 1: 1-10. |
[10] |
Siddle KJ, Eromon P, Barnes KG, et al. (2018) Genomic Analysis of Lassa Virus during an Increase in Cases in Nigeria in 2018. N Engl J Med 379: 1745-1753. doi: 10.1056/NEJMoa1804498
![]() |
[11] | Unah L (2020) Nigeria struggles with largest recorded Lassa fever outbreak. Devex. Available from: https://www.devex.com/news/nigeria-struggles-with-largest-recorded-lassa-fever-outbreak-96773. |
[12] |
Aylward B, Barboza P, Bawo L, et al. (2014) Ebola Virus Disease in West Africa - The First 9 Months of the Epidemic and Forward Projections. N Engl J Med 371: 1481-1495. doi: 10.1056/NEJMoa1411100
![]() |
[13] |
Folarin OA, Ehichioya D, Schaffner SF, et al. (2016) Ebola Virus Epidemiology and Evolution in Nigeria. J Infect Dis 214: S102-S109. doi: 10.1093/infdis/jiw190
![]() |
[14] | Otu A, Ameh S, Osifo-Dawodu E, et al. (2017) An account of the Ebola virus disease outbreak in Nigeria: Implications and lessons learnt. BMC Public Health 18: 1-8. |
[15] | WHO (2014) WHO declares end of Ebola outbreak in Nigeria. Available from: https://www.who.int/mediacentre/news/statements/2014/nigeria-ends-ebola/en/. |
[16] | Obi R. Yellow Fever Hemorrhagic Disease is just around the corner (1). The Leader Newspaper Online. Available from: https://theleaderassumpta.com/yellow-fever-hemorrhagic-disease-is-just-around-the-corner-1/. |
[17] | WHO (2015) Increased risk of urban yellow fever outbreaks in Africa. Available from: https://www.who.int/csr/disease/yellowfev/urbanoutbreaks/en/. |
[18] |
Nwachukwu WE, Yusuff H, Nwangwu U, et al. (2020) The response to re-emergence of yellow fever in Nigeria, 2017. Int J Infect Dis 92: 189-196. doi: 10.1016/j.ijid.2019.12.034
![]() |
[19] | WHO (2019) Yellow fever - Nigeria. Available from: https://www.who.int/csr/don/09-january-2019-yellow-fever-nigeria/en/. |
[20] | WHO (2019) Yellow fever - Nigeria. Available from: https://www.who.int/csr/don/17-december-2019-yellow-fever-nigeria/en/. |
[21] | Nigeria Centre for Disease Control (2020) An update of Yellow Fever outbreak in Nigeria. Available from: https://ncdc.gov.ng/diseases/sitreps/?cat=10&name=An%20update%20of%20Yellow%20Fever%20outbreak%20in%20Nigeria. |
[22] | Hambrosky J, Kroger A, Wolfe C (2015) Centres for Disease Control and Prevention - “Poliomyelitis”, Epidemiology and Prevention of Vaccine-Preventable Diseases Washington DC: Public Health Foundation. |
[23] | Last A (2007) Vaccine-linked polio hits Nigeria. BBC News. Available from: http://news.bbc.co.uk/2/hi/africa/7037462.stm. |
[24] |
Bolu O, Nnadi C, Damisa E, et al. (2018) Progress Toward Poliomyelitis Eradication—Nigeria, January–December 2017. Morb Mortal Wkly Rep 67: 253-256. doi: 10.15585/mmwr.mm6708a5
![]() |
[25] |
Ogoina D, Izibewule JH, Ogunleye A, et al. (2019) The 2017 human monkeypox outbreak in Nigeria - Report of outbreak experience and response in the Niger Delta University, Bayelsa State, Nigeria. PLoS One 14: e0214229. doi: 10.1371/journal.pone.0214229
![]() |
[26] | Nigeria Centre for Disease Control. An Update of Monkeypox Outbreak in Nigeria. Available from: https://ncdc.gov.ng/diseases/sitreps/?cat=8&name=An%20Update%20of%20Monkeypox%20Outbreak%20in%20Nigeria. |
[27] |
Yinka-Ogunleye A, Aruna O, Ogoina D, et al. (2018) Reemergence of human monkeypox in Nigeria, 2017. Emerg Infect Dis 24: 1149. doi: 10.3201/eid2406.180017
![]() |
[28] | Wilder-Smith A, Leong WY (2017) Importation of yellow fever into China: assessing travel patterns. J Travel Med 24: tax008. |
[29] |
Njidda AM, Oyebanji O, Obasanya J, et al. (2018) The Nigeria Centre for Disease Control. BMJ Global Health 3: 3-5. doi: 10.1136/bmjgh-2018-000712
![]() |
[30] | Nigeria Centre for Disease Control. Available from: https://ncdc.gov.ng/. |
[31] | Nigeria Centre for Disease Control. Field Epidemiology and Laboratory Trainig Program (NFELTP). Available from: https://www.ncdc.gov.ng/training/nfeltp. |
[32] | Nguku P, Oyemakinde A, Sabitu K, et al. (2014) Training and service in public health, Nigeria Field Epidemiology and Laboratory Training, 2008–2014. Pan Afr Med J 18: 2. |
[33] | WHO (2014) Nigeria is now free of Ebola virus transmission. Available from: https://www.who.int/mediacentre/news/ebola/20-october-2014/en/. |
[34] |
Musa E, Nasidi A, Shuaib F, et al. (2016) Nigeria's Ebola outbreak response: lessons for future epidemic preparedness. Arch Med 8. doi: 10.21767/1989-5216.1000171
![]() |
[35] | Shuaib F, Gunnala R, Musa EO, et al. (2014) Ebola virus disease outbreak—Nigeria, July–September 2014. Morb Mortal Wkly Rep 63: 866-872. |
[36] | WHO (2015) Ebola situation in Port Harcourt, Nigeria. Available from: https://www.who.int/mediacentre/news/ebola/3-september-2014/en/. |
[37] | WHO (2015) Ebola in Nigeria and Senegal: stable - for the moment. Available from: https://www.who.int/csr/disease/ebola/ebola-6-months/nigeria-senegal/en/. |
[38] | Olu OO, Lamunu M, Nanyunja M, et al. (2016) Contact Tracing during an Outbreak of Ebola Virus Disease in the Western Area Districts of Sierra Leone: Lessons for Future Ebola Outbreak Response. Front Public Health 4: 130. |
[39] | Abdulraheem IS (2002) Public health importance of lassa fever epidemiology, clinical features and current management review of literature. African J Clin Exp Microbiol 3: 33-37. |
[40] |
Ajayi NA, Nwigwe CG, Azuogu BN, et al. (2013) Containing a Lassa fever epidemic in a resource-limited setting: outbreak description and lessons learned from Abakaliki, Nigeria (January–March 2012). Int J Infect Dis 17: e1011-e1016. doi: 10.1016/j.ijid.2013.05.015
![]() |
[41] | Dan-nwafor CC, Furuse Y, Ilori EA, et al. (2019) Measures to control protracted large Lassa fever outbreak in Nigeria, 1 January to 28 April 2019. Eurosurveillance 24: 1900272. |
[42] | Ihekweazu C (2018) National guidelines for lassa fever case management. Abuja. Available from: https://ncdc.gov.ng/themes/common/docs/protocols/92_1547068532.pdf. |
[43] |
Okokhere P, Colubri A, Azubike C, et al. (2018) Clinical and laboratory predictors of Lassa fever outcome in a dedicated treatment facility in Nigeria: a retrospective, observational cohort study. Lancet Infect Dis 18: 684-695. doi: 10.1016/S1473-3099(18)30121-X
![]() |
[44] | Monkeypox Outbreak Response: Interim National Guidelines. Abuja; 2017. Available from: https://ncdc.gov.ng/themes/common/docs/protocols/50_1508912430.pdf. |
[45] |
Ajogbasile FV, Oguzie JU, Oluniyi PE, et al. (2020) Real-time Metagenomic Analysis of Undiagnosed Fever Cases Unveils a Yellow Fever Outbreak in Edo State, Nigeria. Sci Rep 10: 3180. doi: 10.1038/s41598-020-59880-w
![]() |
[46] | Nigeria's Population Projected to Double by 2050. Voice of America. 2019. Available from: https://www.voanews.com/africa/nigerias-population-projected-double-2050. |
[47] | Campbell J (2018) Nigeria Faces a Crippling Population Boom. Council on Foreign Relations. 2018. Available from: https://www.cfr.org/blog/nigeria-faces-crippling-population-boom. |
[48] | Kumar R, Pal R (2018) India achieves WHO recommended doctor population ratio: A call for paradigm shift in public health discourse!. J Fam Med Prim Care 7: 841-844. |
[49] | World Health Organisation (2006) The World Health Report 2006 - working together for health. Available from: https://www.who.int/whr/2006/en/. |
[50] | (2011) World Health OrganisationThe Abuja Declaration: Ten Years On. WHO. Geneva: World Health Organization. |
[51] | PACFAH, CS-SUNN (2017) Budgeting for Health and Nutrition in Nigeria: Trend Analysis. Trend in Health Allocation in Nigeria. Available from: http://www.cs-sunn.org/wp-content/uploads/2017/08/Download-Infographics.pdf. |
[52] | Adekoya F, Muanya C, Nelson C, et al. (2018) Analysts unveil highs, lows of 2019 budget. Guardian Available from: https://guardian.ng/news/analysts-unveil-highs-lows-of-2019-budget/. |
[53] | Aworinde T (2019) 2020 budget: Health, education in familiar territory of neglect. Punch Newspapers. Available from: https://punchng.com/2020-budget-health-education-in-familiar-territory-of-neglect/. |
[54] | Akintomide C, Tapsoba MS, Zhao MF (2014) Strengthening west Africa's public health systems. |
[55] |
Kirigia JM, Barry SP (2008) Health challenges in Africa and the way forward. Int Arch Med 1: 37-41. doi: 10.1186/1755-7682-1-27
![]() |
[56] | Uzoechina O (2008) “State Fragility” and the Challenges of Development in West Africa: Moving from Reaction to Prevention. ALC Research Report . |
[57] | Charrièrre F, Fresia M (2008) West Africa as a Migration and Protection area. UN High Comm Refug. Available from: http://www.refworld.org/docid/4a277db82.html. |
[58] |
Kakai CG, Okunromade OF, Dan-Nwafor CC, et al. (2020) Improving Cross-Border Preparedness and Response: Lessons Learned from 3 Lassa Fever Outbreaks Across Benin, Nigeria, and Togo, 2017–2019. Health Secur 18: S105-S112. doi: 10.1089/hs.2019.0080
![]() |
[59] |
Gething PW, Noor AM, Gikandi PW, et al. (2006) Improving imperfect data from health management information systems in Africa using space-time geostatistics. PLoS Med 3: 0825-0831. doi: 10.1371/journal.pmed.0030271
![]() |
[60] | (2017) World BankFrom Panic and Neglect to Investing in Health Security: Financing Pandemic Preparedness at a National Level. Washington DC: . Report No.: 115271 |
[61] | Nigeria Center for Disease Control (2018) National Action Plan for Health Security 2018–2022. Abuja. Available from: https://ncdc.gov.ng/themes/common/files/establishment/5e88f9e22d2b4e4563b527005c8a0c43.pdf. |
[62] |
Talisuna AO, Okiro EA, Yahaya AA, et al. (2020) Spatial and temporal distribution of infectious disease epidemics, disasters and other potential public health emergencies in the World Health Organisation Africa region, 2016–2018. Globalization Health 16: 1-12. doi: 10.1186/s12992-019-0540-4
![]() |
[63] | Owoseye A (2018) Nigeria needs N134bn to strengthen national health security - Official - Premium Times Nigeria. Available from: https://www.premiumtimesng.com/health/health-news/301626-nigeria-needs-n134bn-to-strengthen-national-health-security-health-official.html. |
[64] | World Health Organization (2017) Joint External Evaluation of IHR core capacities of the Federal Republic of Nigeria. Geneva. Available from: https://www.who.int/ihr/publications/WHO-WHE-CPI-REP-2017.46/en/. |
[65] |
Abubakar A, Idris S, Sabitu K, et al. (2010) Emergency preparedness and the capability to identify outbreaks: A case study of Sabon Gari Local Government Area, Kaduna state. Ann Niger Med 4: 21. doi: 10.4103/0331-3131.73877
![]() |
[66] |
Phalkey RK, Yamamoto S, Awate P, et al. (2015) Challenges with the implementation of an Integrated Disease Surveillance and Response (IDSR) system: systematic review of the lessons learned. Health Policy Plan 30: 131-143. doi: 10.1093/heapol/czt097
![]() |
[67] |
Wölfel R, Stoecker K, Fleischmann E, et al. (2015) Mobile diagnostics in outbreak response, not only for ebola: A blueprint for a modular and robust field laboratory. Eurosurveillance 20: 30055. doi: 10.2807/1560-7917.ES.2015.20.44.30055
![]() |
[68] |
Kouadio K, Okeibunor J, Nsubuga P, et al. (2016) Polio infrastructure strengthened disease outbreak preparedness and response in the WHO African Region. Vaccine 34: 5175-5180. doi: 10.1016/j.vaccine.2016.05.070
![]() |
[69] |
Kruk ME (2008) Emergency Preparedness and Public Health Systems. Lessons for Developing Countries. Am J Prev Med 34: 529-534. doi: 10.1016/j.amepre.2008.02.012
![]() |
[70] |
Hulland EN, Wiens KE, Shirude S, et al. (2019) Travel time to health facilities in areas of outbreak potential: Maps for guiding local preparedness and response. BMC Med 17: 1-16. doi: 10.1186/s12916-019-1459-6
![]() |
[71] |
De Rooij D, Belfroid E, Eilers R, et al. (2020) Qualitative Research: Institutional Preparedness During Threats of Infectious Disease Outbreaks. BioMed Res Int 2020. doi: 10.1155/2020/5861894
![]() |
[72] |
Matua GA, Van der Wal DM, Locsin RC (2015) Ebola hemorrhagic fever outbreaks: Strategies for effective epidemic management, containment and control. Braz J Infect Dis 19: 308-313. doi: 10.1016/j.bjid.2015.02.004
![]() |
[73] |
Annan AA, Yar DD, Owusu M, et al. (2017) Health care workers indicate ill preparedness for Ebola Virus Disease outbreak in Ashanti Region of Ghana. BMC Public Health 17: 546. doi: 10.1186/s12889-017-4474-6
![]() |
[74] |
Omole O, Welye H, Abimbola S (2015) Boko Haram insurgency: Implications for public health. Lancet 385: 941. doi: 10.1016/S0140-6736(15)60207-0
![]() |
[75] |
Burki T (2016) Health crisis intensifying in Nigeria's Borno State. Vol. 388, Lancet (London, England). Lancet 388: 645. doi: 10.1016/S0140-6736(16)31345-9
![]() |
[76] |
Okunromade OF, Lokossou VK, Anya I, et al. (2019) Performance of the Public Health System during a Full-Scale Yellow Fever Simulation Exercise in Lagos State, Nigeria, in 2018: How Prepared Are We for the Next Outbreak. Heal Secur 17: 485-494. doi: 10.1089/hs.2019.0048
![]() |
[77] | NCDC (2018) President Muhammadu Buhari Signs Bill For An Act To Establish The NCDC. Available from: https://www.ncdc.gov.ng/news/156/president-muhammadu-buhari-signs-bill-for-an-act-to-establish-the-ncdc. |
1. | Sen Yang, Yan Wang, Shuangquan Zhang, Xuemei Hu, Qin Ma, Yuan Tian, NCResNet: Noncoding Ribonucleic Acid Prediction Based on a Deep Resident Network of Ribonucleic Acid Sequences, 2020, 11, 1664-8021, 10.3389/fgene.2020.00090 | |
2. | Lijun Dou, Xiaoling Li, Hui Ding, Lei Xu, Huaikun Xiang, Is There Any Sequence Feature in the RNA Pseudouridine Modification Prediction Problem?, 2020, 19, 21622531, 293, 10.1016/j.omtn.2019.11.014 | |
3. | Guangmin Liang, Jin Wu, Lei Xu, A prognosis-related based method for miRNA selection on liver hepatocellular carcinoma prediction, 2021, 91, 14769271, 107433, 10.1016/j.compbiolchem.2020.107433 | |
4. | Hua-Sheng Chiu, Sonal Somvanshi, Ting-Wen Chen, Pavel Sumazin, 2021, Chapter 22, 978-1-0716-1696-3, 263, 10.1007/978-1-0716-1697-0_22 | |
5. | Juexin Wang, Yan Wang, Towards Machine Learning in Molecular Biology, 2020, 17, 1551-0018, 2822, 10.3934/mbe.2020156 | |
6. | Consolata Gakii, Paul O. Mireji, Richard Rimiru, Graph Based Feature Selection for Reduction of Dimensionality in Next-Generation RNA Sequencing Datasets, 2022, 15, 1999-4893, 21, 10.3390/a15010021 |
Group | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
Ratio | 14.00% | 19.60% | 14.00% | 15.70% | 14.50% | 21.30% | 9.80% | 24.30% | 14.00% | 15.30% |
P-value | 7.58E-07 | 5.14E-14 | 7.58E-07 | 8.11E-09 | 2.56E-07 | 1.24E-16 | 5.37E-03 | 1.27E-21 | 7.58E-07 | 2.65E-08 |
lncRNA (p-value) | Homer motifs | JASPAR TFs | Scores | |
1 | FOXCUT 3.6e-3 |
AACCAVTTHDCG | TFCP2 | 0.64 |
TCCTATCACACR | MEIS2 | 0.62 | ||
TTTTHAAAGGGG | CHR | 0.67 | ||
ARTGGTTGTWGA | FOXJ2 | 0.58 | ||
GCAATCTCGC | IRF4 | 0.66 | ||
2 | ANCR 1.1e-3 |
AGGGTGACAG | SPZ1 | 0.80 |
GGTATCTTAC | GATA5 | 0.64 | ||
CTCATAGGAG | GCM1 | 0.65 | ||
TAAGTGAAAG | PRDM1 | 0.86 | ||
CTTTTGGAAC | CHR | 0.65 | ||
3 | 250-280 2.2e-4 |
WYTRTCTTTGCG | RXR | 0.61 |
TCTTACGG | ELK1 | 0.71 | ||
GGCAAGGA | SD | 0.76 | ||
GAGGTATGTT | TEAD1 | 0.70 | ||
TGCCGGGAGCGT | POL | 0.64 | ||
4 | HOXD-AS1 6.1e-3 |
CTCGAGTAGG | PB0114 | 0.63 |
GCCCCCTGCA | PB0076 | 0.74 | ||
ACGYMYATKYCC | GFY | 0.59 | ||
AGCGGGTT | PH | 0.68 | ||
AGGCGCCGCGCC | SP1 | 0.69 | ||
5 | HOTAIR 5e-6 |
TGGCGCAGCGCG | PB | 0.67 |
GTACAACTTT | PB | 0.66 | ||
CMTSTGTCWCYK | NeuroG2 | 0.66 | ||
GTGATCCATT | RHOXF1 | 0.68 | ||
GGTMGRRGTGMW | TBX20 | 0.58 |
LncRNA | ID | Description | q-value |
FOXCUT | GO:0033613 | transmembrane receptor protein tyrosine kinase activity | 1.2937E-02 |
GO:0033613 | activating transcription factor binding | 1.2937E-02 | |
GO:0019199 | transmembrane receptor protein kinase activity | 1.2937E-02 | |
GO:0001085 | RNA polymerase Ⅱ transcription factor binding | 2.0767E-02 | |
250-280 | GO:0003735 | structural constituent of ribosome | 3.1600E-06 |
GO:0003729 | mRNA binding | 1.1140E-03 | |
GO:0008483 | transaminase activity | 1.1140E-03 | |
GO:0048027 | mRNA 5'-UTR binding | 1.1140E-03 | |
GO:0016769 | transferase activity, transferring nitrogenous groups | 1.1140E-03 | |
GO:0045182 | translation regulator activity | 1.5826E-03 | |
GO:0030170 | pyridoxal phosphate binding | 1.5826E-03 | |
GO:0070279 | vitamin B6 binding | 1.5826E-03 | |
GO:0019843 | rRNA binding | 1.5826E-03 | |
GO:0019842 | vitamin binding | 3.1903E-03 | |
HOXD-AS1 | GO:0004714 | transmembrane receptor protein tyrosine kinase activity | 1.2937E-02 |
GO:0033613 | activating transcription factor binding | 1.2937E-02 | |
GO:0019199 | transmembrane receptor protein kinase activity | 1.2937E-02 | |
GO:0001085 | RNA polymerase Ⅱ transcription factor binding | 2.0767E-02 | |
HOTAIR | GO:0005109 | frizzled binding | 1.9097E-03 |
GO:0001227 | transcriptional repressor activity, RNA polymerase Ⅱ transcription regulatory region sequence-specific binding | 1.9097E-03 | |
GO:0001664 | G-protein coupled receptor binding | 1.9686E-03 | |
GO:0001078 | transcriptional repressor activity, RNA polymerase Ⅱ core promoter proximal region sequence-specific binding | 1.1196E-02 | |
GO:0008201 | heparin binding | 1.2885E-02 | |
GO:0005539 | glycosaminoglycan binding | 1.8790E-02 | |
GO:1901681 | sulfur compound binding | 1.9016E-02 | |
GO:0045236 | CXCR chemokine receptor binding | 2.1785E-02 | |
GO:0008301 | DNA binding, bending | 2.1785E-02 | |
GO:0001223 | transcription coactivator binding | 2.1785E-02 | |
GO:0042813 | Wnt-activated receptor activity | 2.1785E-02 | |
GO:0035198 | miRNA binding | 2.2807E-02 | |
GO:0017147 | Wnt-protein binding | 2.5258E-02 | |
GO:1990841 | promoter-specific chromatin binding | 2.5258E-02 | |
GO:0000982 | transcription factor activity, RNA polymerase Ⅱ core promoter proximal region sequence-specific binding | 2.5258E-02 | |
GO:0001221 | transcription cofactor binding | 2.5587E-02 |
LncRNA | ID | Description | q-value |
FOXCUT | hsa05216 | Thyroid cancer | 8.8587E-03 |
hsa04510 | Focal adhesion | 8.8587E-03 | |
hsa05205 | Proteoglycans in cancer | 8.8587E-03 | |
hsa05218 | Melanoma | 1.7683E-02 | |
hsa05214 | Glioma | 1.7683E-02 | |
hsa04151 | PI3K-Akt signaling pathway | 1.8745E-02 | |
hsa05215 | Prostate cancer | 1.8745E-02 | |
hsa01522 | Endocrine resistance | 1.8745E-02 | |
hsa04919 | Thyroid hormone signaling pathway | 2.2317E-02 | |
hsa04152 | AMPK signaling pathway | 2.2317E-02 | |
hsa04068 | FoxO signaling pathway | 2.4442E-02 | |
hsa04550 | Signaling pathways regulating pluripotency of stem cells | 2.5129E-02 | |
hsa05224 | Breast cancer | 2.5129E-02 | |
hsa04218 | Cellular senescence | 2.7471E-02 | |
hsa05225 | Hepatocellular carcinoma | 2.7471E-02 | |
hsa04530 | Tight junction | 2.7471E-02 | |
250-280 | hsa03010 | Ribosome | 2.9900E-05 |
hsa01210 | 2-Oxocarboxylic acid metabolism | 3.7322E-03 | |
hsa00220 | Arginine biosynthesis | 3.7322E-03 | |
hsa00250 | Alanine, aspartate and glutamate metabolism | 4.7849E-03 | |
hsa01230 | Biosynthesis of amino acids | 7.9156E-03 | |
HOXD-AS1 | hsa05216 | Thyroid cancer | 8.8587E-03 |
hsa04510 | Focal adhesion | 8.8587E-03 | |
hsa05205 | Proteoglycans in cancer | 8.8587E-03 | |
hsa05218 | Melanoma | 1.7683E-02 | |
hsa05214 | Glioma | 1.7683E-02 | |
hsa04151 | PI3K-Akt signaling pathway | 1.8745E-02 | |
hsa05215 | Prostate cancer | 1.8745E-02 | |
hsa01522 | Endocrine resistance | 1.8745E-02 | |
hsa04919 | Thyroid hormone signaling pathway | 2.2317E-02 | |
hsa04152 | AMPK signaling pathway | 2.2317E-02 | |
hsa04068 | FoxO signaling pathway | 2.4442E-02 | |
hsa04550 | Signaling pathways regulating pluripotency of stem cells | 2.5129E-02 | |
hsa05224 | Breast cancer | 2.5506E-02 | |
hsa04218 | Cellular senescence | 2.7471E-02 | |
hsa05225 | Hepatocellular carcinoma | 2.7471E-02 | |
hsa04530 | Tight junction | 2.7471E-02 | |
HOTAIR | hsa04310 | Wnt signaling pathway | 5.7295E-03 |
Group | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
Ratio | 14.00% | 19.60% | 14.00% | 15.70% | 14.50% | 21.30% | 9.80% | 24.30% | 14.00% | 15.30% |
P-value | 7.58E-07 | 5.14E-14 | 7.58E-07 | 8.11E-09 | 2.56E-07 | 1.24E-16 | 5.37E-03 | 1.27E-21 | 7.58E-07 | 2.65E-08 |
lncRNA (p-value) | Homer motifs | JASPAR TFs | Scores | |
1 | FOXCUT 3.6e-3 |
AACCAVTTHDCG | TFCP2 | 0.64 |
TCCTATCACACR | MEIS2 | 0.62 | ||
TTTTHAAAGGGG | CHR | 0.67 | ||
ARTGGTTGTWGA | FOXJ2 | 0.58 | ||
GCAATCTCGC | IRF4 | 0.66 | ||
2 | ANCR 1.1e-3 |
AGGGTGACAG | SPZ1 | 0.80 |
GGTATCTTAC | GATA5 | 0.64 | ||
CTCATAGGAG | GCM1 | 0.65 | ||
TAAGTGAAAG | PRDM1 | 0.86 | ||
CTTTTGGAAC | CHR | 0.65 | ||
3 | 250-280 2.2e-4 |
WYTRTCTTTGCG | RXR | 0.61 |
TCTTACGG | ELK1 | 0.71 | ||
GGCAAGGA | SD | 0.76 | ||
GAGGTATGTT | TEAD1 | 0.70 | ||
TGCCGGGAGCGT | POL | 0.64 | ||
4 | HOXD-AS1 6.1e-3 |
CTCGAGTAGG | PB0114 | 0.63 |
GCCCCCTGCA | PB0076 | 0.74 | ||
ACGYMYATKYCC | GFY | 0.59 | ||
AGCGGGTT | PH | 0.68 | ||
AGGCGCCGCGCC | SP1 | 0.69 | ||
5 | HOTAIR 5e-6 |
TGGCGCAGCGCG | PB | 0.67 |
GTACAACTTT | PB | 0.66 | ||
CMTSTGTCWCYK | NeuroG2 | 0.66 | ||
GTGATCCATT | RHOXF1 | 0.68 | ||
GGTMGRRGTGMW | TBX20 | 0.58 |
LncRNA | ID | Description | q-value |
FOXCUT | GO:0033613 | transmembrane receptor protein tyrosine kinase activity | 1.2937E-02 |
GO:0033613 | activating transcription factor binding | 1.2937E-02 | |
GO:0019199 | transmembrane receptor protein kinase activity | 1.2937E-02 | |
GO:0001085 | RNA polymerase Ⅱ transcription factor binding | 2.0767E-02 | |
250-280 | GO:0003735 | structural constituent of ribosome | 3.1600E-06 |
GO:0003729 | mRNA binding | 1.1140E-03 | |
GO:0008483 | transaminase activity | 1.1140E-03 | |
GO:0048027 | mRNA 5'-UTR binding | 1.1140E-03 | |
GO:0016769 | transferase activity, transferring nitrogenous groups | 1.1140E-03 | |
GO:0045182 | translation regulator activity | 1.5826E-03 | |
GO:0030170 | pyridoxal phosphate binding | 1.5826E-03 | |
GO:0070279 | vitamin B6 binding | 1.5826E-03 | |
GO:0019843 | rRNA binding | 1.5826E-03 | |
GO:0019842 | vitamin binding | 3.1903E-03 | |
HOXD-AS1 | GO:0004714 | transmembrane receptor protein tyrosine kinase activity | 1.2937E-02 |
GO:0033613 | activating transcription factor binding | 1.2937E-02 | |
GO:0019199 | transmembrane receptor protein kinase activity | 1.2937E-02 | |
GO:0001085 | RNA polymerase Ⅱ transcription factor binding | 2.0767E-02 | |
HOTAIR | GO:0005109 | frizzled binding | 1.9097E-03 |
GO:0001227 | transcriptional repressor activity, RNA polymerase Ⅱ transcription regulatory region sequence-specific binding | 1.9097E-03 | |
GO:0001664 | G-protein coupled receptor binding | 1.9686E-03 | |
GO:0001078 | transcriptional repressor activity, RNA polymerase Ⅱ core promoter proximal region sequence-specific binding | 1.1196E-02 | |
GO:0008201 | heparin binding | 1.2885E-02 | |
GO:0005539 | glycosaminoglycan binding | 1.8790E-02 | |
GO:1901681 | sulfur compound binding | 1.9016E-02 | |
GO:0045236 | CXCR chemokine receptor binding | 2.1785E-02 | |
GO:0008301 | DNA binding, bending | 2.1785E-02 | |
GO:0001223 | transcription coactivator binding | 2.1785E-02 | |
GO:0042813 | Wnt-activated receptor activity | 2.1785E-02 | |
GO:0035198 | miRNA binding | 2.2807E-02 | |
GO:0017147 | Wnt-protein binding | 2.5258E-02 | |
GO:1990841 | promoter-specific chromatin binding | 2.5258E-02 | |
GO:0000982 | transcription factor activity, RNA polymerase Ⅱ core promoter proximal region sequence-specific binding | 2.5258E-02 | |
GO:0001221 | transcription cofactor binding | 2.5587E-02 |
LncRNA | ID | Description | q-value |
FOXCUT | hsa05216 | Thyroid cancer | 8.8587E-03 |
hsa04510 | Focal adhesion | 8.8587E-03 | |
hsa05205 | Proteoglycans in cancer | 8.8587E-03 | |
hsa05218 | Melanoma | 1.7683E-02 | |
hsa05214 | Glioma | 1.7683E-02 | |
hsa04151 | PI3K-Akt signaling pathway | 1.8745E-02 | |
hsa05215 | Prostate cancer | 1.8745E-02 | |
hsa01522 | Endocrine resistance | 1.8745E-02 | |
hsa04919 | Thyroid hormone signaling pathway | 2.2317E-02 | |
hsa04152 | AMPK signaling pathway | 2.2317E-02 | |
hsa04068 | FoxO signaling pathway | 2.4442E-02 | |
hsa04550 | Signaling pathways regulating pluripotency of stem cells | 2.5129E-02 | |
hsa05224 | Breast cancer | 2.5129E-02 | |
hsa04218 | Cellular senescence | 2.7471E-02 | |
hsa05225 | Hepatocellular carcinoma | 2.7471E-02 | |
hsa04530 | Tight junction | 2.7471E-02 | |
250-280 | hsa03010 | Ribosome | 2.9900E-05 |
hsa01210 | 2-Oxocarboxylic acid metabolism | 3.7322E-03 | |
hsa00220 | Arginine biosynthesis | 3.7322E-03 | |
hsa00250 | Alanine, aspartate and glutamate metabolism | 4.7849E-03 | |
hsa01230 | Biosynthesis of amino acids | 7.9156E-03 | |
HOXD-AS1 | hsa05216 | Thyroid cancer | 8.8587E-03 |
hsa04510 | Focal adhesion | 8.8587E-03 | |
hsa05205 | Proteoglycans in cancer | 8.8587E-03 | |
hsa05218 | Melanoma | 1.7683E-02 | |
hsa05214 | Glioma | 1.7683E-02 | |
hsa04151 | PI3K-Akt signaling pathway | 1.8745E-02 | |
hsa05215 | Prostate cancer | 1.8745E-02 | |
hsa01522 | Endocrine resistance | 1.8745E-02 | |
hsa04919 | Thyroid hormone signaling pathway | 2.2317E-02 | |
hsa04152 | AMPK signaling pathway | 2.2317E-02 | |
hsa04068 | FoxO signaling pathway | 2.4442E-02 | |
hsa04550 | Signaling pathways regulating pluripotency of stem cells | 2.5129E-02 | |
hsa05224 | Breast cancer | 2.5506E-02 | |
hsa04218 | Cellular senescence | 2.7471E-02 | |
hsa05225 | Hepatocellular carcinoma | 2.7471E-02 | |
hsa04530 | Tight junction | 2.7471E-02 | |
HOTAIR | hsa04310 | Wnt signaling pathway | 5.7295E-03 |