DeepPlnc: Bi-modal deep learning for highly accurate plant lncRNA discovery.

Journal: Genomics
PMID:

Abstract

We present here a bi-modal CNN based deep-learning system, DeepPlnc, to identify plant lncRNAs with high accuracy while using sequence and structural properties. Unlike most of the existing software, it works accurately even in conditions with ambiguity of boundaries and incomplete sequences. It scored consistently high for performance metrics while breaching accuracy of >98% when tested across a large number of validated instances. During multiple benchmarkings DeepPlnc consistently outperformed all the compared tools and maintained a highly significant lead in the range of 2.5%- 4.6% from the second best performing tool (p-value << 0.01). DeepPlnc was used to annotate a de novo assembled transcriptome of a himalayan species where again it suggested its much better suitability for genome and transcriptome annotation purposes than the existing tools. DeepPlnc has been made freely available as a web-server and stand-alone program at https://scbb.ihbt.res.in/DeepPlnc/.

Authors

  • Ritu
    Studio of Computational Biology & Bioinformatics, The Himalayan Centre for High-throughput Computational Biology (HiCHiCoB, A BIC Supported by DBT, Govt. of India), CSIR-Institute of Himalayan Bioresource Technology (CSIR-IHBT), Palampur, HP 176061, India; Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, Uttar Pradesh 201002, India.
  • Sagar Gupta
    Studio of Computational Biology & Bioinformatics, The Himalayan Centre for High-throughput Computational Biology, (HiCHiCoB, A BIC supported by DBT, India)CSIR-Institute of Himalayan Bioresource Technology (CSIR-IHBT), Palampur (HP), India.
  • Nitesh Kumar Sharma
    Studio of Computational Biology & Bioinformatics, The Himalayan Centre for High-throughput Computational Biology, (HiCHiCoB, A BIC supported by DBT, India)CSIR-Institute of Himalayan Bioresource Technology (CSIR-IHBT), Palampur (HP), India.
  • Ravi Shankar
    Studio of Computational Biology & Bioinformatics, The Himalayan Centre for High-throughput Computational Biology, (HiCHiCoB, A BIC supported by DBT, India)CSIR-Institute of Himalayan Bioresource Technology (CSIR-IHBT), Palampur (HP), India.