A deep learning framework for hepatocellular carcinoma diagnosis using MS1 data.

Journal: Scientific reports
Published Date:

Abstract

Clinical proteomics analysis is of great significance for analyzing pathological mechanisms and discovering disease-related biomarkers. Using computational methods to accurately predict disease types can effectively improve patient disease diagnosis and prognosis. However, how to eliminate the errors introduced by peptide precursor identification and protein identification for pathological diagnosis remains a major unresolved issue. Here, we develop a powerful end-to-end deep learning model, termed "MS1Former", that is able to classify hepatocellular carcinoma tumors and adjacent non-tumor (normal) tissues directly using raw MS1 spectra without peptide precursor identification. Our model provides accurate discrimination of subtle m/z differences in MS1 between tumor and adjacent non-tumor tissue, as well as more general performance predictions for data-dependent acquisition, data-independent acquisition, and full-scan data. Our model achieves the best performance on multiple external validation datasets. Additionally, we perform a detailed exploration of the model's interpretability. Prospectively, we expect that the advanced end-to-end framework will be more applicable to the classification of other tumors.

Authors

  • Wei Xu
    College of Food and Bioengineering, Henan University of Science and Technology, Luoyang, 471023 China.
  • Liying Zhang
    School of Information Engineering, Zhengzhou University, Zhengzhou, Henan, PR China.
  • Xiaoliang Qian
    School of Electrical and Information Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, China.
  • Nannan Sun
    Hangzhou SanOmics Information Technology Co., Ltd., Hangzhou 310015, P. R. China.
  • Xiao Tu
    College of Basic Medical Science, Zhejiang Chinese Medical University, 548 Binwen Rd, Hangzhou, 310053, China.
  • Dengfeng Zhou
    SanOmics AI Co., Ltd, Lingping District, Hangzhou, 311103, China.
  • Xiaoping Zheng
    Department of Urology, The Third Xiangya Hospital, Central South University, Changsha, Hunan, 410013, China.
  • Jia Chen
    Department of Oncology Internal Medicine, Nantong Tumor Hospital, Affiliated Tumor Hospital of Nantong University, Nantong, China.
  • Zewen Xie
    Hangzhou SanOmics Information Technology Co., Ltd., Hangzhou 310015, P. R. China.
  • Tao He
  • Shugang Qu
    Hangzhou SanOmics Information Technology Co., Ltd., Hangzhou 310015, P. R. China.
  • Yinjia Wang
    The First People's Hospital of Kunming, Intensive Care Unit, Kunming, 650032, China. wangyinj@163.com.
  • Keda Yang
    Key Laboratory of Artificial Organs and Computational Medicine in Zhejiang Province, Shulan International Medical College, Zhejiang Shuren University, Hangzhou 310015, P. R. China.
  • Kunkai Su
    State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, National Clinical Research Center for Infectious Diseases, the First affiliated Hospital, School of Medicine, Zhejiang University, Hangzhou, Zhejiang, China.
  • Shan Feng
    Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou 310024, Zhejiang, China; Institute of Biology, Westlake Institute for Advanced Study, Hangzhou 310024, Zhejiang, China.
  • Bin Ju
    Hangzhou Wowjoy Information Technology Co., Ltd, Hangzhou, China. bin.ju@wowjoy.cn.