Environment ensemble models for genomic prediction in common bean (Phaseolus vulgaris L.).

Journal: The plant genome
Published Date:

Abstract

For important food crops such as the common bean (Phaseolus vulgaris, L.), global demand continues to outpace the rate of genetic gain for quantitative traits. In this study, we leveraged the multi-environment trial (MET) dataset from the cooperative dry bean nursery (CDBN) to investigate the use of ensemble models for genomic prediction. This set spans 70 locations and 30 years, and accounts for over 150 phenotypes and hundreds of genotypes sequenced for 1.2 million single nucleotide polymorphism markers. We tested three models (linear regression, ridge regression, and neural networks). Each of the three models was implemented using three different approaches: (1) combining all data into one model (singular model), (2) all available single locations were used to train individual submodels comprising one ensemble model (ensemble model), and (3) optimized sets of single locations were used to train individual submodels comprising one ensemble model (optimized ensemble model). The optimized ensemble approach worked best for low-variance locations because the model variance was reduced by averaging across submodels in the ensemble. For models with low prediction accuracy, the ensemble approach can increase accuracy. In certain locations, prediction accuracy was able to overcome narrow-sense heritability, indicating that genomic selection is more efficient than phenotypic selection in these locations. This study indicates that breeding program collaboration can be a way to bypass the bottleneck of low data volume, as pooled data from the CDBN MET produced prediction accuracies of 0.70 for days to flowering, 0.54 for days to maturity, 0.95 for seed weight, and 0.67 for seed yield in individual locations.

Authors

  • Isabella Chiaravallotti
    Department of Plant, Soil and Microbial Sciences, Michigan State University, East Lansing, Michigan, USA.
  • Owen Pauptit
    School of Informatics, University of Edinburgh, Edinburgh, Scotland, UK.
  • Valerio Hoyos-Villegas
    Department of Plant, Soil and Microbial Sciences, Michigan State University, East Lansing, Michigan, USA.