Machine learning-based identification of critical factors for cadmium accumulation in rice grains.

Journal: Environmental geochemistry and health
PMID:

Abstract

The aggregation of Cadmium (Cd) in rice grains is a significant threat to human healthy. The complexity of the soil-rice system, with its numerous influencing parameters, highlights the need to identify the crucial factors responsible for Cd aggregation. This study uses machine learning (ML) modeling to predict Cd aggregation in rice grains and identify the influencing factors. Data from 474 data points from 77 published works were analyzed, and eight ML models were established using different algorithms. The input variables were total soil Cd concentration (TS Cd) and extractable Cd concentration (Ex-Cd), while rice Cd concentration (Cd) was the output variable. Among the models, the Extremely Randomized Trees (ERT) model performed the best (TS Cd: R = 0.825; Ex-Cd: R = 0.792), followed by Random Forest (TS Cd: R = 0.721; Ex-Cd: R = 0.719). The ERT feature importance ranking analysis revealed that the essential factors responsible for Cd aggregation are cation exchange capacity (CEC), TS Cd, Water Management Model (WMM), and pH for total soil Cd as input variables. For extractable Cd as an input variable, the vital factors are CEC, Ex-Cd, pH, and WMM. The study highlights the importance of the Water Management Model and its impact on Cd concentration in rice grains, which has been overlooked in previous research.Please check and confirm that the authors and their respective affiliations have been correctly identified and amend if necessary.The authors and their respective affiliations are correct.Author details: Kindly check and confirm whether the corresponding author is correctly identified.It is correct.

Authors

  • Weichun Yang
    Institute of Environmental Science and Engineering, School of Metallurgy and Environment, Central South University, 410083, Changsha, China; Chinese National Engineering Research Center for Control & Treatment of Heavy Metal Pollution, 410083, Changsha, China.
  • Jiaxin Li
    Department of Gastroenterology, West China Hospital, Sichuan University, Chengdu, China.
  • Kai Nie
    School of Metallurgy and Environment, Institute of Environmental Science and Engineering, Central South University, Changsha, 410083, China.
  • Pengwei Zhao
    School of Metallurgy and Environment, Institute of Environmental Science and Engineering, Central South University, Changsha, 410083, China.
  • Hui Xia
    Key Laboratory of Environmental Medicine and Engineering of Ministry of Education, School of Public Health, Southeast University, Nanjing 210009, China.
  • Qi Li
    The First Affiliated Hospital of Yangtze University, Jingzhou, Hubei, China.
  • Qi Liao
    Department of Preventative Medicine, Zhejiang Provincial Key Laboratory of Pathophysiology, Medical School of Ningbo University, Ningbo, China.
  • Qingzhu Li
    School of Metallurgy and Environment, Central South University, Changsha, 410083, China; Chinese National Engineering Research Center for Control & Treatment of Heavy Metal Pollution, Changsha, 410083, China.
  • ChunHua Dong
    Soil and Fertilizer Institute of Hunan Province, 410125, Changsha, China.
  • Zhihui Yang
    Institute of Artificial Intelligence, School of Computer Science, Wuhan University, China. Electronic address: zhy@whu.edu.cn.
  • Mengying Si
    Institute of Environmental Science and Engineering, School of Metallurgy and Environment, Central South University, 410083, Changsha, China; Chinese National Engineering Research Center for Control & Treatment of Heavy Metal Pollution, 410083, Changsha, China. Electronic address: simysmile@csu.edu.cn.