Deep learning for plant genomics and crop improvement.

Journal: Current opinion in plant biology
Published Date:

Abstract

Our era has witnessed tremendous advances in plant genomics, characterized by an explosion of high-throughput techniques to identify multi-dimensional genome-wide molecular phenotypes at low costs. More importantly, genomics is not merely acquiring molecular phenotypes, but also leveraging powerful data mining tools to predict and explain them. In recent years, deep learning has been found extremely effective in these tasks. This review highlights two prominent questions at the intersection of genomics and deep learning: 1) how can the flow of information from genomic DNA sequences to molecular phenotypes be modeled; 2) how can we identify functional variants in natural populations using deep learning models? Additionally, we discuss the possibility of unleashing the power of deep learning in synthetic biology to create novel genomic elements with desirable functions. Taken together, we propose a central role of deep learning in future plant genomics research and crop genetic improvement.

Authors

  • Hai Wang
    School of Automotive and Traffic Engineering, Jiangsu University, Zhenjiang 212013, China.
  • Emre Cimen
    Institute for Genomic Diversity, Cornell University, Ithaca, NY 14853, USA; Computational Intelligence and Optimization Laboratory, Industrial Engineering Department, Eskisehir Technical University, Eskisehir 26000, Turkey.
  • Nisha Singh
    Institute for Genomic Diversity, Cornell University, Ithaca, NY 14853, USA; ICAR-National Institute for Plant Biotechnology, New Delhi 110012, India.
  • Edward Buckler
    Institute for Genomic Diversity, Cornell University, Ithaca, NY 14853, USA; United States Department of Agriculture, Agricultural Research Service, Ithaca, NY 14853, USA.