Enhancing genome-wide populus trait prediction through deep convolutional neural networks.

Journal: The Plant journal : for cell and molecular biology
PMID:

Abstract

As a promising model, genome-based plant breeding has greatly promoted the improvement of agronomic traits. Traditional methods typically adopt linear regression models with clear assumptions, neither obtaining the linkage between phenotype and genotype nor providing good ideas for modification. Nonlinear models are well characterized in capturing complex nonadditive effects, filling this gap under traditional methods. Taking populus as the research object, this paper constructs a deep learning method, DCNGP, which can effectively predict the traits including 65 phenotypes. The method was trained on three datasets, and compared with other four classic models-Bayesian ridge regression (BRR), Elastic Net, support vector regression, and dualCNN. The results show that DCNGP has five typical advantages in performance: strong prediction ability on multiple experimental datasets; the incorporation of batch normalization layers and Early-Stopping technology enhancing the generalization capabilities and prediction stability on test data; learning potent features from the data and thus circumventing the tedious steps of manual production; the introduction of a Gaussian Noise layer enhancing predictive capabilities in the case of inherent uncertainties or perturbations; fewer hyperparameters aiding to reduce tuning time across datasets and improve auto-search efficiency. In this way, DCNGP shows powerful predictive ability from genotype to phenotype, which provide an important theoretical reference for building more robust populus breeding programs.

Authors

  • Huaichuan Duan
    Key Laboratory of Coarse Cereal Processing, Ministry of Agriculture and Rural Affairs, School of Pharmacy, Key Laboratory of Medicinal and Edible Plants Resources Development of Sichuan Education Department, Sichuan Industrial Institute of Antibiotics, Chengdu University, Chengdu 610106, China.
  • Xiangwei Dai
    School of Computer Science and Artificial Intelligence, Changzhou University, Changzhou, China.
  • Quanshan Shi
    Key Laboratory of Medicinal and Edible Plants Resources Development of Sichuan Education Department, School of Pharmacy, Chengdu University, Chengdu, China.
  • Yan Cheng
    The First Clinical Medical College of Shaanxi University of Chinese Medicine, Xianyang, China.
  • Yutong Ge
    Key Laboratory of Medicinal and Edible Plants Resources Development of Sichuan Education Department, School of Pharmacy, Chengdu University, Chengdu, China.
  • Shan Chang
    Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China.
  • Wei Liu
    Department of Radiation Oncology, Mayo Clinic, Scottsdale, AZ, United States.
  • Feng Wang
    Department of Oncology, Binzhou Medical University Hospital, Binzhou, Shandong, China.
  • Hubing Shi
    Laboratory of Integrative Medicine, Clinical Research Center for Breast, State Key Laboratory of Biotherapy, West China Hospital, Sichuan University and Collaborative Innovation Center, Chengdu, Sichuan, PR China.
  • Jianping Hu
    Department of Radiology, The First Affiliated Hospital of Fujian Medical University, 20 ChaZhong Rd, Fuzhou, Fujian, 350005, PR China.