Machine Learning-Assisted Direct RNA Sequencing with Epigenetic RNA Modification Detection via Quantum Tunneling.

Journal: Analytical chemistry
PMID:

Abstract

RNA sequence information holds immense potential as a drug target for diagnosing various RNA-mediated diseases and viral/bacterial infections. Massively parallel complementary DNA (c-DNA) sequencing helps but results in a loss of valuable information about RNA modifications, which are often associated with cancer evolution. Herein, we report machine learning (ML)-assisted high throughput RNA sequencing with inherent RNA modification detection using a single-molecule, long-read, and label-free quantum tunneling approach. The ML tools achieve classification accuracy as high as 100% in decoding RNA and 98% for decoding both RNA and RNA modifications simultaneously. The relationships between input features and target values have been well examined through Shapley additive explanations. Furthermore, transmission and sensitivity readouts enable the recognition of RNA and its modifications with good selectivity and sensitivity. Our approach represents a starting point for ML-assisted direct RNA sequencing that can potentially decode RNA and its epigenetic modifications at the molecular level.

Authors

  • Sneha Mittal
    Department of Chemistry, Indian Institute of Technology (IIT) Indore, Indore Madhya Pradesh 453552, India.
  • Milan Kumar Jena
    Department of Chemistry, Indian Institute of Technology (IIT) Indore, Indore Madhya Pradesh 453552, India.
  • Biswarup Pathak
    Department of Chemistry, Indian Institute of Technology (IIT) Indore, Indore Madhya Pradesh 453552, India.