Segment-Weighting Similarity-Based Fragment-Learning Model for Single-Cell Raman Spectral Analysis.

Journal: Analytical chemistry
Published Date:

Abstract

Raman spectroscopy provides intrinsic biochemical profiles of all cellular biomolecules in a segmented manner, promising nondestructive and label-free phenotyping at the single-cell level. However, current analytical methods rarely utilize spectral biological characteristics and their fusion with data characteristics, limiting the application of these methods to biological Raman spectroscopy. Herein, a segment-weighting similarity-based fragment-learning (SWS-FL) model, integrating SWS-based feature extraction and fusion learning, is proposed to fuse biological and data characteristics for single-cell spectral analysis, which segments spectra into fragments and differentiates their biological characteristics for fusing feature matrices. The SWS-based feature extraction fabricates a group of low-dimensional feature vectors at multiple values, providing a more distinguishable feature space compared to conventional KNN. The weights of five fragments, including the fingerprint region, protein I region, mixed region, protein II region, and genetic material region, are assigned as 0.282, 0.302, 0.273, 0.276, and 0.239, respectively, which highlights the spectral biological characteristics. The fusion learning process synthesizes characteristics from all spectral fragments using an ANN, achieving accuracy with only 0.5% variation across values from 1 to 30, greatly enhancing the robustness of the model. In the five-classification task of breast cancer cells and their subtypes, the accuracy and kappa coefficient of SWS-FL can reach 94.9% and 0.943%, respectively, which are 5% and 7% higher than those of ANN. The generalization capability is also validated on the data set of lung cancer cells and their subtypes. This model provides a new path for the fusion of biological and data characteristics in spectral analysis and promises to be a powerful analytical framework in more spectroscopic areas.

Authors

  • LangLang Yi
    School of Life Science and Technology, Xidian University, Xi'an, Shaanxi 710126, China.
  • Qiudi Ye
    School of Computer Science and Technology, Xidian University, Xi'an, Shaanxi 710126, China.
  • Chaofan Wang
    School of Life Science and Technology, Xidian University, Xi'an, Shaanxi 710126, China.
  • Qingqing Hu
    School of Nursing, Zhejiang Chinese Medical University, 548 Binwen Road, Binjiang District, Hangzhou, 310053, Zhejiang Province, People's Republic of China.
  • Shiya Zhang
    School of Mathematics and Physics Science and Engineering, Hebei University of Engineering, Handan, Hebei 056038, China.
  • Xiaokun Shen
    School of Computer Science and Technology, Xidian University, Xi'an, Shaanxi 710126, China.
  • Minghui Liang
    School of Life Science and Technology, Xidian University, Xi'an, Shaanxi 710126, China.
  • Guoqian Li
    School of Life Science and Technology, Xidian University, Xi'an, Shaanxi 710126, China.
  • Klyuyev Dmitriy
    Institute of Life Sciences, Karaganda Medical University, Karaganda 100008, Kazakhstan.
  • Yang Guo
    Innovation Research Institute of Combined Acupuncture and Medicine, Shaanxi University of CM, Xianyang 712046, China.
  • Qiang Yu
    State Key Laboratory of Soil Erosion and Dryland Farming on the Loess Plateau, Northwest A&F University, Yangling 712100, China; College of Resources and Environment, University of Chinese Academy of Sciences, Beijing 100049, China. Electronic address: yuq@nwsuaf.edu.cn.
  • Bo Hu
    Department of Critical Care Medicine, Zhongnan Hospital of Wuhan University, Wuhan, Hubei, China.