An end-to-end deep learning method for mass spectrometry data analysis to reveal disease-specific metabolic profiles.

Journal: Nature communications
PMID:

Abstract

Untargeted metabolomic analysis using mass spectrometry provides comprehensive metabolic profiling, but its medical application faces challenges of complex data processing, high inter-batch variability, and unidentified metabolites. Here, we present DeepMSProfiler, an explainable deep-learning-based method, enabling end-to-end analysis on raw metabolic signals with output of high accuracy and reliability. Using cross-hospital 859 human serum samples from lung adenocarcinoma, benign lung nodules, and healthy individuals, DeepMSProfiler successfully differentiates the metabolomic profiles of different groups (AUC 0.99) and detects early-stage lung adenocarcinoma (accuracy 0.961). Model flow and ablation experiments demonstrate that DeepMSProfiler overcomes inter-hospital variability and effects of unknown metabolites signals. Our ensemble strategy removes background-category phenomena in multi-classification deep-learning models, and the novel interpretability enables direct access to disease-related metabolite-protein networks. Further applying to lipid metabolomic data unveils correlations of important metabolites and proteins. Overall, DeepMSProfiler offers a straightforward and reliable method for disease diagnosis and mechanism discovery, enhancing its broad applicability.

Authors

  • Yongjie Deng
    State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou, Guangdong, China.
  • Yao Yao
    Key Laboratory for Organic Electronics and Information Displays (KLOEID) & Jiangsu Key Laboratory for Biosensors, Institute of Advanced Materials (IAM), National Synergetic Innovation Center for Advanced Materials (SICAM), Nanjing University of Posts and Telecommunications, 9 Wenyuan Road, Nanjing 210023, China.
  • Yanni Wang
    Department of Maternal, Child and Adolescent Health, School of Public Health, Lanzhou University, Lanzhou, Gansu, China.
  • Tiantian Yu
    Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China.
  • Wenhao Cai
    Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China.
  • Dingli Zhou
    Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China.
  • Feng Yin
    State Key Laboratory of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing 210096, China.
  • Wanli Liu
    Microscopic Image and Medical Image Analysis Group, College of Medicine and Biological Information Engineering, Northeastern University, Shenyang, 110169, China.
  • Yuying Liu
    Guangxi Medical University, Nanning Guangxi, 530021, P.R.China.
  • Chuanbo Xie
    State Key Laboratory of Oncology in South China, Guangdong Provincial Clinical Research Center for Cancer, Sun Yat-sen University Cancer Center, Guangzhou, China.
  • Jian Guan
    State Key Laboratory of Multiphase Complex Systems, Institute of Process Engineering, Chinese Academy of Sciences, Beijing 100190, China; College of Chemical Engineering, University of Chinese Academy of Sciences, Beijing 100049, China; Key Laboratory of Science and Technology on Particle Materials, Institute of Process Engineering, Chinese Academy of Sciences, Beijing 361021, China.
  • Yumin Hu
    State Key Laboratory of Oncology in South China, Guangdong Provincial Clinical Research Center for Cancer, Sun Yat-sen University Cancer Center, Guangzhou, China. huym@sysucc.org.cn.
  • Peng Huang
    College of Food Science, Sichuan Agricultural University, Ya'an 625014, China.
  • Weizhong Li
    Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou 500040, China.