Yield prediction through integration of genetic, environment, and management data through deep learning.

Journal: G3 (Bethesda, Md.)
PMID:

Abstract

Accurate prediction of the phenotypic outcomes produced by different combinations of genotypes, environments, and management interventions remains a key goal in biology with direct applications to agriculture, research, and conservation. The past decades have seen an expansion of new methods applied toward this goal. Here we predict maize yield using deep neural networks, compare the efficacy of 2 model development methods, and contextualize model performance using conventional linear and machine learning models. We examine the usefulness of incorporating interactions between disparate data types. We find deep learning and best linear unbiased predictor (BLUP) models with interactions had the best overall performance. BLUP models achieved the lowest average error, but deep learning models performed more consistently with similar average error. Optimizing deep neural network submodules for each data type improved model performance relative to optimizing the whole model for all data types at once. Examining the effect of interactions in the best-performing model revealed that including interactions altered the model's sensitivity to weather and management features, including a reduction of the importance scores for timepoints expected to have a limited physiological basis for influencing yield-those at the extreme end of the season, nearly 200 days post planting. Based on these results, deep learning provides a promising avenue for the phenotypic prediction of complex traits in complex environments and a potential mechanism to better understand the influence of environmental and genetic factors.

Authors

  • Daniel R Kick
    United States Department of Agriculture, Agricultural Research Service Plant Genetics Research Unit, Columbia, MO 65211, USA.
  • Jason G Wallace
    Department of Crop & Soil Science, University of Georgia, Athens, GA 30602, USA.
  • James C Schnable
    Center for Plant Science Innovation and Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Lincoln, NE, USA. schnable@unl.edu.
  • Judith M Kolkman
    School of Integrative Plant Science, Cornell University, Ithaca, NY 14853, USA.
  • Barış Alaca
    Division of Plant Breeding Methodology, Department of Crop Science, University of Goettingen, Goettingen 37073, Germany.
  • Timothy M Beissinger
    United States Department of Agriculture, Agricultural Research Service, Columbia, Missouri Division of Plant Sciences, University of Missouri, Columbia, Missouri 65211.
  • Jode Edwards
    United States Department of Agriculture, Agricultural Research Service, Ames, IA 50011, USA.
  • David Ertl
    Research and Business Development, Iowa Corn Promotion Board, Johnston, IA 50131, USA.
  • Sherry Flint-Garcia
    United States Department of Agriculture, Agricultural Research Service Plant Genetics Research Unit, Columbia, MO 65211, USA.
  • Joseph L Gage
    Department of Crop and Soil Sciences, North Carolina State University, Raleigh, NC 27695, USA.
  • Candice N Hirsch
    Department of Agronomy and Plant Genetics, University of Minnesota, St Paul, MN, 55108, USA. cnhirsch@umn.edu.
  • Joseph E Knoll
    United States Department of Agriculture, Agricultural Research Service Crop Genetics and Breeding Research Unit, Tifton, GA 31793, USA.
  • Natalia de Leon
    Department of Agronomy, University of Wisconsin, Madison, WI 53706, USA.
  • Dayane C Lima
    Plant Breeding and Plant Genetics Program, University of Wisconsin, Madison, WI 53706, USA.
  • Danilo E Moreta
    School of Integrative Plant Science, Cornell University, Ithaca, NY 14853, USA.
  • Maninder P Singh
    Department of Plant, Soil and Microbial Sciences, Michigan State University, East Lansing, MI 48824, USA.
  • Addie Thompson
    Department of Plant, Soil and Microbial Sciences, Michigan State University, East Lansing, MI 48824, USA.
  • Teclemariam Weldekidan
    Plant and Soil Sciences, University of Delaware, Newark, DE 19716, USA.
  • Jacob D Washburn
    Institute for Genomic Diversity, Cornell University, Ithaca, NY 14853.