Functional enrichment analysis based on long noncoding RNA associations

Differential gene expression analysis using RNA-seq data is a popular approach for discovering specific regulation mechanisms under certain environmental settings. Gene ontology (GO) and KEGG pathway enrichment analyses are the main methods for investigating groups of genes that participate in common biological responses or have related functions. However, traditional approaches based only on differentially expressed genes often detect just a few significant GO terms and pathways, which are frequently insufficient to explain the full gene regulation mechanism.

Researchers from the National Taiwan Ocean University sequenced and assembled the transcriptomes of survivin (birc5) knock-down and wild-type control zebrafish embryos, and obtained a list of differentially expressed (DE) genes for traditional functional enrichment analysis. In addition to DE genes with significant fold-change levels, the researchers considered genes near or overlapping differentially expressed long noncoding RNAs (DE lncRNAs), which may directly or indirectly activate or inhibit target genes and play important roles in regulation networks. The original DE gene list and the DE lncRNA-associated genes were then combined to perform a comprehensive overrepresentation analysis.
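The association step described above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the `Interval` type, the `genes_near` helper, the 10 kb `window`, and all coordinates are hypothetical, since this summary does not state the distance criterion used in the paper.

```python
from typing import NamedTuple


class Interval(NamedTuple):
    name: str
    chrom: str
    start: int  # inclusive
    end: int    # inclusive


def genes_near(lncrnas, genes, window=10_000):
    """Return names of genes that overlap, or lie within `window` bp of, any lncRNA.

    The default window is an assumed value chosen only for illustration.
    """
    hits = set()
    for lnc in lncrnas:
        lo, hi = lnc.start - window, lnc.end + window
        for g in genes:
            # Two intervals on the same chromosome intersect when each
            # starts before the other ends.
            if g.chrom == lnc.chrom and g.start <= hi and g.end >= lo:
                hits.add(g.name)
    return hits


# Toy coordinates, invented for this example.
de_lncrnas = [Interval("lnc1", "chr1", 50_000, 52_000)]
annotated_genes = [
    Interval("geneA", "chr1", 51_500, 55_000),  # overlaps lnc1
    Interval("geneB", "chr1", 58_000, 60_000),  # 6 kb downstream of lnc1
    Interval("geneC", "chr1", 90_000, 95_000),  # outside the window
    Interval("geneD", "chr2", 50_500, 51_000),  # different chromosome
]
de_genes = {"geneA", "geneX"}

lnc_genes = genes_near(de_lncrnas, annotated_genes)
combined = de_genes | lnc_genes  # union used as input for enrichment
```

Taking the set union means a gene that is both differentially expressed and lncRNA-associated is counted once in the combined list.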

In this study, a total of 638 DE genes and 616 DE lncRNA-associated genes (lncGenes) were used together to search for significant GO terms and KEGG pathways. Compared to the traditional approach of using only a DE gene list, the proposed method of also employing DE lncRNA-associated genes identified several additional important GO terms and KEGG pathways. In GO enrichment analysis, 60% more GO terms were obtained, and several neuron development functional terms were retrieved as complete annotations. The researchers also observed that additional important pathways, such as the FoxO and MAPK signaling pathways, were retrieved; previous reports have shown these pathways to play important roles in the apoptosis and neuron development functions regulated by the survivin gene.
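Overrepresentation analysis of a gene list is commonly computed as a one-sided hypergeometric test per term or pathway. The sketch below assumes that standard formulation; the summary does not specify which test or tool the authors used, and the function name and all counts in the usage lines are hypothetical.

```python
from math import comb


def hypergeom_upper_tail(k, n, K, N):
    """P(X >= k) when n genes are drawn from a background of N genes,
    K of which carry the annotation (GO term or pathway)."""
    total = comb(N, n)
    upper = min(n, K)
    # math.comb returns 0 when the second argument exceeds the first,
    # so impossible combinations contribute nothing to the sum.
    favourable = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return favourable / total


# Invented counts: a background of 20,000 genes, 150 annotated to one pathway.
# With a larger input list, the same pathway may gain enough hits to stand out.
p_de_only = hypergeom_upper_tail(k=6, n=600, K=150, N=20_000)
p_combined = hypergeom_upper_tail(k=20, n=1_200, K=150, N=20_000)
```

In practice the resulting p-values would still need multiple-testing correction across all terms tested.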

RNA-seq analysis flowchart


Black dotted box: paired-end sequence reads from Birc5aMO and WT RNA sequencing were aligned with TopHat2, and expression changes were obtained with Cufflinks and Cuffcompare. Yellow dotted box: GO and KEGG pathway enrichment analysis using DE genes alone (traditional) and with lncGenes added to identify additional significant annotations.

The researchers demonstrated that incorporating genes near or overlapping DE lncRNAs into the DE gene list outperformed the traditional enrichment analysis method for biological functional interpretation. Accounting for these hidden interactions between lncRNAs and target genes could facilitate more comprehensive analyses.

Hung KS, Hsiao CC, Pai TW, Hu CH, Tzou WS, Wang WD, Chen YR. (2018) Functional enrichment analysis based on long noncoding RNA associations. BMC Syst Biol 12(Suppl 4):45.

