Deep Learning-Based Method for Compound Identification in NMR Spectra of Mixtures.

Journal: Molecules (Basel, Switzerland)
Published Date:

Abstract

Nuclear magnetic resonance (NMR) spectroscopy is highly unbiased and reproducible, which provides us a powerful tool to analyze mixtures consisting of small molecules. However, the compound identification in NMR spectra of mixtures is highly challenging because of chemical shift variations of the same compound in different mixtures and peak overlapping among molecules. Here, we present a pseudo-Siamese convolutional neural network method (pSCNN) to identify compounds in mixtures for NMR spectroscopy. A data augmentation method was implemented for the superposition of several NMR spectra sampled from a spectral database with random noises. The augmented dataset was split and used to train, validate and test the pSCNN model. Two experimental NMR datasets (flavor mixtures and additional flavor mixture) were acquired to benchmark its performance in real applications. The results show that the proposed method can achieve good performances in the augmented test set (ACC = 99.80%, TPR = 99.70% and FPR = 0.10%), the flavor mixtures dataset (ACC = 97.62%, TPR = 96.44% and FPR = 2.29%) and the additional flavor mixture dataset (ACC = 91.67%, TPR = 100.00% and FPR = 10.53%). We have demonstrated that the translational invariance of convolutional neural networks can solve the chemical shift variation problem in NMR spectra. In summary, pSCNN is an off-the-shelf method to identify compounds in mixtures for NMR spectroscopy because of its accuracy in compound identification and robustness to chemical shift variation.

Authors

  • Weiwei Wei
    Technology Center, China Tobacco Hunan Industrial Co., Ltd., Changsha 410014, China.
  • Yuxuan Liao
    College of Chemistry and Chemical Engineering, Central South University, Changsha 410083, China.
  • Yufei Wang
    State Key Laboratory of Oral Diseases & National Center for Stomatology & National Clinical Center for Oral Diseases, West China Hospital of Stomatology, Sichuan University, Chengdu, Sichuan, China.
  • Shaoqi Wang
    College of Chemistry and Chemical Engineering, Central South University, Changsha 410083, China.
  • Wen Du
    Technology Center, China Tobacco Hunan Industrial Co., Ltd., Changsha 410014, China.
  • Hongmei Lu
    College of Chemistry and Chemical Engineering, Central South University, Changsha, People's Republic of China.
  • Bo Kong
    Technology Center, China Tobacco Hunan Industrial Co., Ltd., Changsha 410014, China.
  • Huawu Yang
    Flavors and Fragrances Research Institute, Technology Center, China Tobacco Hunan Industrial Co., Ltd., Changsha 410014, China.
  • Zhimin Zhang
    School of Control Science and Engineering, Shandong University, Jinan, People's Republic of China. School of Information Technology and Electrical Engineering, University of Queensland, Queensland, Australia.