Deep and accurate detection of m6A RNA modifications using miCLIP2 and m6Aboost machine learning.

Journal: Nucleic acids research
PMID:

Abstract

N6-methyladenosine (m6A) is the most abundant internal RNA modification in eukaryotic mRNAs and influences many aspects of RNA processing. miCLIP (m6A individual-nucleotide resolution UV crosslinking and immunoprecipitation) is an antibody-based approach to map m6A sites with single-nucleotide resolution. However, due to broad antibody reactivity, reliable identification of m6A sites from miCLIP data remains challenging. Here, we present miCLIP2 in combination with machine learning to significantly improve m6A detection. The optimized miCLIP2 results in high-complexity libraries from less input material. Importantly, we established a robust computational pipeline to tackle the inherent issue of false positives in antibody-based m6A detection. The analyses were calibrated with Mettl3 knockout cells to learn the characteristics of m6A deposition, including m6A sites outside of DRACH motifs. To make our results universally applicable, we trained a machine learning model, m6Aboost, based on the experimental and RNA sequence features. Importantly, m6Aboost allows prediction of genuine m6A sites in miCLIP2 data without filtering for DRACH motifs or the need for Mettl3 depletion. Using m6Aboost, we identify thousands of high-confidence m6A sites in different murine and human cell lines, which provide a rich resource for future analysis. Collectively, our combined experimental and computational methodology greatly improves m6A identification.

Authors

  • Nadine Körtel
    Institute of Molecular Biology (IMB), Mainz 55128, Germany.
  • Cornelia Rücklé
    Institute of Molecular Biology (IMB), Mainz 55128, Germany.
  • You Zhou
    Visionary Intelligence Ltd., Beijing, China.
  • Anke Busch
    Institute of Molecular Biology (IMB), Mainz 55128, Germany.
  • Peter Hoch-Kraft
    Institute of Molecular Biology (IMB), Mainz 55128, Germany.
  • F X Reymond Sutandy
    Institute of Molecular Biology (IMB), Mainz 55128, Germany.
  • Jacob Haase
    Institute of Molecular Medicine, Sect. Molecular Cell Biology, Martin Luther University Halle-Wittenberg, Charles Tanford Protein Center, Halle 06120, Germany.
  • Mihika Pradhan
    Institute of Molecular Biology (IMB), Mainz 55128, Germany.
  • Michael Musheev
    Institute of Molecular Biology (IMB), Mainz 55128, Germany.
  • Dirk Ostareck
    Department of Intensive Care Medicine, University Hospital RWTH Aachen, Aachen 52074, Germany.
  • Antje Ostareck-Lederer
    Department of Intensive Care Medicine, University Hospital RWTH Aachen, Aachen 52074, Germany.
  • Christoph Dieterich
    Section of Bioinformatics and Systems Cardiology, Klaus Tschira Institute for Integrative Computational Cardiology, Heidelberg.
  • Stefan Hüttelmaier
    Institute of Molecular Medicine, Sect. Molecular Cell Biology, Martin Luther University Halle-Wittenberg, Charles Tanford Protein Center, Halle 06120, Germany.
  • Christof Niehrs
    Institute of Molecular Biology (IMB), Mainz 55128, Germany.
  • Oliver Rausch
    STORM Therapeutics Ltd, Cambridge CB22 3AT, UK.
  • Dan Dominissini
    Cancer Research Center and Wohl Institute for Translational Medicine, Chaim Sheba Medical Center, Tel HaShomer, and Sackler School of Medicine, Tel Aviv University, Tel Aviv 6997801, Israel.
  • Julian König
    Institute of Molecular Biology (IMB), Mainz 55128, Germany.
  • Kathi Zarnack
    Buchmann Institute for Molecular Life Sciences (BMLS) & Faculty of Biological Sciences, Goethe University Frankfurt, Frankfurt 60438, Germany.