High-throughput prediction of stalk cellulose and hemicellulose content in maize using machine learning and Fourier transform infrared spectroscopy.

Journal: Bioresource technology
PMID:

Abstract

Cellulose and hemicellulose are key cross-linked carbohydrates affecting bioethanol production in maize stalks. Traditional wet chemical methods for their detection are labor-intensive, highlighting the need for high-throughput techniques. This study used Fourier transform infrared (FTIR) spectroscopy combined with machine learning (ML) algorithms on 200 large-scale maize germplasms to develop robust predictive models for stalk cellulose, hemicellulose and holocellulose content. We identified several peak height features correlated with three contents, used them as input data for model building. Four ML algorithms demonstrated higher predictive accuracy, achieving coefficient of determination (R) ranging from 0.83 to 0.97. Notably, the Categorical Boosting algorithm yielded optimal models with coefficient of determination (R) exceeding 0.91 for the training set and over 0.81 for the test set. The approach combined FTIR spectroscopy with ML algorithms offers a precise and high-throughput tool for predicting stalk cellulose, hemicellulose and holocellulose contents, benefiting maize genetic breeding for bioenergy and biofuels.

Authors

  • Fanghui Chen
    The National Engineering Laboratory of Crop Resistance Breeding, School of Life Sciences, Anhui Agricultural University, Hefei 230036, China.
  • Xing Liu
    School of Food Science and Engineering, Hainan University 58 Renmin Avenue Haikou 570228 China zhangzeling@hainanu.edu.cn benchao312@hainanu.edu.cn xuhuan.hnu@foxmail.com qichen@hainanu.edu.cn sunzhichang11@163.com hmcao@hainanu.edu.cn.
  • Chengchen Lu
    The National Engineering Laboratory of Crop Resistance Breeding, School of Life Sciences, Anhui Agricultural University, Hefei 230036, China.
  • Mingxiu Ruan
    The National Engineering Laboratory of Crop Resistance Breeding, School of Life Sciences, Anhui Agricultural University, Hefei 230036, China.
  • Yujing Wen
    The National Engineering Laboratory of Crop Resistance Breeding, School of Life Sciences, Anhui Agricultural University, Hefei 230036, China.
  • Shaodong Wang
    Department of Thoracic Surgery, Peking University People's Hospital, Beijing, China.
  • Youhong Song
    School of Agronomy, Anhui Agricultural University, Hefei 230036, China.
  • Lin Li
    Department of Medicine III, LMU University Hospital, LMU Munich, Munich, Germany.
  • Liang Zhou
    Department of Otorhinolaryngology, Eye & ENT Hospital, Fudan University, Shanghai, 200031, China. liang.zhou@fdeent.org.
  • Haiyang Jiang
    College of Veterinary Medicine, China Agricultural University, Beijing Key Laboratory of Detection Technology for Animal-Derived Food Safety, Beijing Laboratory for Food Quality and Safety, No.2 Yuanmingyuan West Road, Haidian District, Beijing 100193, People's Republic of China.
  • Leiming Wu
    The National Engineering Laboratory of Crop Resistance Breeding, School of Life Sciences, Anhui Agricultural University, Hefei 230036, China. Electronic address: lmwu@ahau.edu.cn.