Leveraging Automated Machine Learning for Environmental Data-Driven Genetic Analysis and Genomic Prediction in Maize Hybrids.

Journal: Advanced science (Weinheim, Baden-Wurttemberg, Germany)
PMID:

Abstract

Genotype, environment, and genotype-by-environment (G×E) interactions play a critical role in shaping crop phenotypes. Here, a large-scale, multi-environment hybrid maize dataset is used to construct and validate an automated machine learning framework that integrates environmental and genomic data for improved accuracy and efficiency in genetic analyses and genomic predictions. Dimensionality-reduced environmental parameters (RD_EPs) aligned with developmental stages are applied to establish linear relationships between RD_EPs and traits to assess the influence of environment on phenotype. Genome-wide association study identifies 539 phenotypic plasticity trait-associated markers (PP-TAMs), 223 environmental stability TAMs (Main-TAMs), and 92 G×E-TAMs, revealing distinct genetic bases for PP and G×E interactions. Training genomic prediction models with both TAMs and RD_EPs increase prediction accuracy by 14.02% to 28.42% over that of genome-wide marker approaches. These results demonstrate the potential of utilizing environmental data for improving genetic analysis and genomic selection, offering a scalable approach for developing climate-adaptive maize varieties.

Authors

  • Kunhui He
    First Clinical College, Changsha Medical University, Changsha, 410219, P. R. China. iuhe0405@163.com.
  • Tingxi Yu
    State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences (CAAS), Beijing, China.
  • Shang Gao
    Department of Orthopedics, Orthopedic Center of Chinese PLA, Southwest Hospital, Third Military Medical University, Chongqing, 400038, P.R.China.
  • Shoukun Chen
    Key Laboratory of Ministry of Education on Safe Mining of Deep Metal Mines, School of Resources and Civil Engineering, Northeastern University, Shenyang 110819, China.
  • Liang Li
    School of Psychological and Cognitive Sciences, Peking University, Beijing, 100871, China.
  • Xuecai Zhang
    State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences (CAAS), CIMMYT-China Office, Beijing, 100081, China.
  • Changling Huang
    State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences (CAAS), CIMMYT-China Office, Beijing, 100081, China.
  • Yunbi Xu
    Institute of Crop Sciences, CIMMYT-China, Chinese Academy of Agricultural Sciences, Beijing 100081, China; CIMMYT-China Tropical Maize Research Center, School of Food Science and Engineering, Foshan University, Foshan, Guangdong 528231, China; Peking University Institute of Advanced Agricultural Sciences, Weifang, Shandong 261325, China. Electronic address: y.xu@cgiar.org.
  • Jiankang Wang
    National Laboratory of Solid State Microstructures, College of Engineering and Applied Sciences, Nanjing University, 22 Hankou Road, Nanjing 210093, P. R. China.
  • Boddupalli M Prasanna
    CIMMYT (International Maize and Wheat Improvement Center), ICRAF Campus, United Nations Avenue, Nairobi, Kenya.
  • Sarah Hearne
    CIMMYT, KM 45 Carretera Mexico-Veracruz, El Batan, Texcoco 56237, Mexico.
  • Xinhai Li
    State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences (CAAS), International Maize and Wheat Improvement Center (CIMMYT) China office, Beijing 100081, China.
  • Huihui Li
    School of Computer Science and Engineering, South China University of Technology, Guangzhou 510000, China. Electronic address: 29777562@qq.com.