Deep learning in regulatory genomics: from identification to design.

Journal: Current opinion in biotechnology
Published Date:

Abstract

Genomics and deep learning are a natural match since both are data-driven fields. Regulatory genomics refers to functional noncoding DNA regulating gene expression. In recent years, deep learning applications on regulatory genomics have achieved remarkable advances so-much-so that it has revolutionized the rules of the game of the computational methods in this field. Here, we review two emerging trends: (i) the modeling of very long input sequence (up to 200 kb), which requires self-matched modularization of model architecture; (ii) on the balance of model predictability and model interpretability because the latter is more able to meet biological demands. Finally, we discuss how to employ these two routes to design synthetic regulatory DNA, as a promising strategy for optimizing crop agronomic properties.

Authors

  • Xuehai Hu
    College of Informatics, Agricultural Bioinformatics Key Laboratory of Hubei Province, Huazhong Agricultural University, Wuhan 430070, P.R. China.
  • Alisdair R Fernie
    Max-Planck-Institute of Molecular Plant Physiology, Am Muehlenberg 1, Potsdam-Golm, 14476, Germany.
  • Jianbing Yan