LncRNA-ID: Long non-coding RNA IDentification using balanced random forests.

Journal: Bioinformatics (Oxford, England)
Published Date:

Abstract

MOTIVATION: Long non-coding RNAs (lncRNAs), which are non-coding RNAs longer than 200 nucleotides, perform important biological functions such as regulating gene expression. To fully reveal the functions of lncRNAs, a fundamental step is to annotate them in various species. However, because lncRNAs tend to contain one or more open reading frames, it is not trivial to distinguish these long non-coding transcripts from protein-coding genes in transcriptomic data.

Authors

  • Rujira Achawanantakun
    Department of Computer Science and Engineering, Michigan State University, East Lansing, MI 48824, USA.
  • Jiao Chen
    Affiliated Hospital of Integrated Traditional Chinese and Western Medicine, Nanjing University of Chinese Medicine, Nanjing 210028, China.
  • Yanni Sun
    Department of Computer Science and Engineering, Michigan State University, East Lansing, MI 48824, USA.
  • Yuan Zhang
    Department of Urology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China.