Deep learning versus parametric and ensemble methods for genomic prediction of complex phenotypes.

Journal: Genetics, selection, evolution : GSE
PMID:

Abstract

BACKGROUND: Transforming large amounts of genomic data into valuable knowledge for predicting complex traits has been an important challenge for animal and plant breeders. Prediction of complex traits has not escaped the current excitement on machine-learning, including interest in deep learning algorithms such as multilayer perceptrons (MLP) and convolutional neural networks (CNN). The aim of this study was to compare the predictive performance of two deep learning methods (MLP and CNN), two ensemble learning methods [random forests (RF) and gradient boosting (GB)], and two parametric methods [genomic best linear unbiased prediction (GBLUP) and Bayes B] using real and simulated datasets.

Authors

  • Rostam Abdollahi-Arpanahi
    Department of Animal Sciences, University of Florida, Gainesville, FL, USA.
  • Daniel Gianola
    Department of Animal Sciences, University of Wisconsin-Madison, Madison, 53706, USA. gianola@ansci.wisc.edu.
  • Francisco Peñagaricano
    Department of Animal Sciences, University of Florida, Gainesville, Florida University of Florida Genetics Institute, University of Florida, Gainesville, Florida 32610.