Challenges for machine learning in RNA-protein interaction prediction.

Journal: Statistical applications in genetics and molecular biology
Published Date:

Abstract

RNA-protein interactions have long being recognised as crucial regulators of gene expression. Recently, the development of scalable experimental techniques to measure these interactions has revolutionised the field, leading to the production of large-scale datasets which offer both opportunities and challenges for machine learning techniques. In this brief note, we will discuss some of the major stumbling blocks towards the use of machine learning in computational RNA biology, focusing specifically on the problem of predicting RNA-protein interactions from next-generation sequencing data.

Authors

  • Viplove Arora
    Data Science, Department of Physics, International School for Advanced Studies (SISSA), Trieste 34136, Italy.
  • Guido Sanguinetti
    IANC, School of Informatics, University of Edinburgh, Edinburgh EH8 9AB, UK Synthetic and Systems Biology, University of Edinburgh, Edinburgh EH9 3JD, UK.