Machine learning and chemometric methods for high-throughput authentication of 53 Root and Rhizome Chinese Herbal using ATR-FTIR fingerprints.

Journal: Journal of chromatography. B, Analytical technologies in the biomedical and life sciences
Published Date:

Abstract

To address the identification challenges caused by morphological similarities in Root and Rhizome Chinese Herbal (RRCH), this study developed a discrimination system integrating Attenuated Total Reflectance Fourier Transform Infrared Spectroscopy (ATR-FTIR) with multimodal machine learning. 53 kinds of RRCH collected from China were analyzed using ATR-FTIR to acquire spectral fingerprints. An innovative analytical framework was established, combining chemometric Partial Least Squares Discriminant Analysis (PLS-DA) with optimized machine learning models: t-distributed Stochastic Neighbor Embedding (t-SNE), optimized decision trees, optimized discriminant analysis, naive Bayes, optimized SVM, optimized KNN, SVM kernels, and optimized ensemble learning. Multivariate analysis revealed distinct spatial distribution patterns of chemical characteristics among the 53 RRCH species. t-SNE projections demonstrated significant cluster separation in two-dimensional feature space, confirming strong correlations between spectral fingerprints and phytochemical compositions. The SVM model outperformed others, achieving 100 % classification accuracy on both training and validation sets, with a markedly shorter identification time compared to PLS-DA. This ATR-FTIR-machine learning hybrid system enables high-throughput authentication of RRCH and establishes a scalable technical framework for herbal quality standardization. The methodology provides critical insights into chemical marker discovery through vibrational spectrum-feature relationship mapping, advancing intelligent discrimination of botanically similar medicinal materials.

Authors

  • Xiaoyu Liu
    State Grid Hebei Electric Power Co., Ltd., Marketing Service Center, Shijiazhuang 050035, China.
  • Xiaokang Liu
    State Key Laboratory of Biogeology and Environmental Geology, School of Earth Sciences, China University of Geosciences, Wuhan, Hubei, China.
  • Jiawei Wang
    Biomedicine Discovery Institute, Monash University, VIC 3800, Australia.
  • Daidi Zang
    Zhongshan Institute for Drug Discovery, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Zhongshan 582400, China.
  • Yang Yang
    Department of Gastrointestinal Surgery, The Third Hospital of Hebei Medical University, Shijiazhuang, China.
  • Qinhua Chen
    Shenzhen Baoan Authentic TCM Therapy Hospital, Shenzhen, 518101, China.
  • De-An Guo
    Department of Pharmaceutics, School of Pharmacy, Guangdong Pharmaceutical University, East of Outer Ring Road #280, Guangdong 510006, China; Zhongshan Institute for Drug Discovery, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Zhongshan 528400, China; Shanghai Research Center for Modernization of Traditional Chinese Medicine, National Engineering Research Center for TCM Standardization Technology, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai 201203, China. Electronic address: daguo@simm.ac.cn.