A miRNA Target Prediction Model Based on Distributed Representation Learning and Deep Learning.

Journal: Computational and mathematical methods in medicine
Published Date:

Abstract

MicroRNAs (miRNAs) are a kind of noncoding RNA, which plays an essential role in gene regulation by binding to messenger RNAs (mRNAs). Accurate and rapid identification of miRNA target genes is helpful to reveal the mechanism of transcriptome regulation, which is of great significance for the study of cancer and other diseases. Many bioinformatics methods have been proposed to solve this problem, but the previous research did not further study the encoding of the nucleotide sequence. In this paper, we developed a novel method combining word embedding and deep learning for human miRNA targets at the site-level prediction, which is inspired by the similarity between natural language and biological sequences. First, the word2vec model was used to mine the distribution representation of miRNAs and mRNAs. Then, the embedding is extracted automatically via the stacked bidirectional long short-term memory (BiLSTM) network. By testing, our method can effectively improve the accuracy, sensitivity, specificity, and -measure of other methods. Through our research, it is proved that the distributed representation can improve the accuracy of the deep learning model and better solve the miRNA target site prediction problem.

Authors

  • Yuzhuo Sun
    College of Big Data and Intelligent Engineering, Southwest Forestry University, Kunming, China.
  • Fei Xiong
    Department of Ultrasound, Deyang People's Hospital, Deyang, Sichuan, China.
  • Yongke Sun
    College of Material Science and Engineering, Southwest Forestry University, Kunming, China.
  • Youjie Zhao
    College of Big Data and Intelligent Engineering, Southwest Forestry University, Kunming, China.
  • Yong Cao
    Department of Neurosurgery, Beijing Tiantan Hospital, Capital Medical University, 100070, Beijing, China.