Extracting True Virus SERS Spectra and Augmenting Data for Improved Virus Classification and Quantification.

Journal: ACS sensors
Published Date:

Abstract

Surface-enhanced Raman spectroscopy (SERS) is a transformative tool for infectious disease diagnostics, offering rapid and sensitive species identification. However, background spectra in biological samples complicate analyte peak detection, increase the limit of detection, and hinder data augmentation. To address these challenges, we developed a deep learning framework utilizing dual neural networks to extract true virus SERS spectra and estimate concentration coefficients in water for 12 different respiratory viruses. The extracted spectra showed a high similarity to those obtained at the highest viral concentration, validating their accuracy. Using these spectra and the derived concentration coefficients, we augmented spectral data sets across varying virus concentrations in water. XGBoost models trained on these augmented data sets achieved overall classification and concentration prediction accuracy of 92.3% with a coefficient of determination () > 0.95. Additionally, the extracted spectra and coefficients were used to augment data sets in saliva backgrounds. When tested against real virus-in-saliva spectra, the augmented spectra-trained XGBoost models achieved 91.9% accuracy in classification and concentration prediction with > 0.9, demonstrating the robustness of the approach. By delivering clean and uncontaminated spectra, this methodology can significantly improve species identification, differentiation, and quantification and advance SERS-based detection and diagnostics.

Authors

  • Yufang Liu
    Department of Statistics, Franklin College of Arts and Sciences, University of Georgia, Athens, Georgia 30602, United States.
  • Yanjun Yang
    Medical Engineering Technology and Data Mining Institute, Zhengzhou University, Zhengzhou, 450001, Henan, China.
  • Haoran Lu
    Department of Neurosurgery, Renmin Hospital of Wuhan University, 238 Jiefang Street, Wuhan, 430060, Hubei, China.
  • Jiaheng Cui
    School of Electrical and Computer Engineering, College of Engineering, The University of Georgia, Athens, Georgia 30602, United States.
  • Xianyan Chen
    Department of Statistics, University of Georgia Franklin College of Arts and Sciences, Athens, GA, USA.
  • Ping Ma
    Department of Statistics, University of Georgia, Athens, GA 30602, USA.
  • Wenxuan Zhong
    Department of Statistics, Franklin College of Arts and Sciences, University of Georgia, Athens, Georgia 30602, United States.
  • Yiping Zhao
    Department of Physics and Astronomy, The University of Georgia, Athens, Georgia30602, United States.