HeapMS: An Automatic Peak-Picking Pipeline for Targeted Proteomic Data Powered by 2D Heatmap Transformation and Convolutional Neural Networks.

Journal: Analytical chemistry
Published Date:

Abstract

The process of peak picking and quality assessment for multiple reaction monitoring (MRM) data demands significant human effort, especially for signals with low abundance and high interference. Although multiple peak-picking software packages are available, they often fail to detect peaks with low quality and do not report cases with low confidence. Furthermore, visual examination of all chromatograms is still necessary to identify uncertain or erroneous cases. This study introduces HeapMS, a web service that uses artificial intelligence to assist with peak picking and the quality assessment of MRM chromatograms. HeapMS applies a rule-based filter to remove chromatograms with low interference and high-confidence peak boundaries detected by Skyline. Additionally, it transforms two histograms (representing light and heavy peptides) into a single encoded heatmap and performs a two-step evaluation (quality detection and peak picking) using image convolutional neural networks. HeapMS offers three categories of peak picking: uncertain peak picking that requires manual inspection, deletion peak picking that requires removal or manual re-examination, and automatic peak picking. HeapMS acquires the chromatogram and peak-picking boundaries directly from Skyline output. The output results are imported back into Skyline for further manual inspection, facilitating integration with Skyline. HeapMS offers the benefit of detecting chromatograms that should be deleted or require human inspection. Based on defined categories, it can significantly reduce human workload and provide consistent results. Furthermore, by using heatmaps instead of histograms, HeapMS can adapt to future updates in image recognition models. The HeapMS is available at: https://github.com/ccllabe/HeapMS.

Authors

  • Chi-Ching Lee
    Department of Computer Science and Information Engineering, Chang Gung University, Taoyuan, Taiwan.
  • Yu-Chieh Lin
  • Teng Yu Pan
    Department of Computer Science and Information Engineering, Chang Gung University, 33302 Taoyuan, Taiwan.
  • Cheng Hann Yang
    Department of Computer Science and Information Engineering, Chang Gung University, 33302 Taoyuan, Taiwan.
  • Pei-Hsuan Li
    Department of Computer Science and Information Engineering, Chang Gung University, Taoyuan, Taiwan. Electronic address: M0829002@cgu.edu.tw.
  • Sin You Chen
    Department of Computer Science and Information Engineering, Chang Gung University, 33302 Taoyuan, Taiwan.
  • Jhih Jie Gao
    Department of Computer Science and Information Engineering, Chang Gung University, 33302 Taoyuan, Taiwan.
  • Chi Yang
    Molecular Medicine Research Center, Chang Gung University, 33302 Taoyuan, Taiwan.
  • Lichieh Julie Chu
    Molecular Medicine Research Center, Chang Gung University, 33302 Taoyuan, Taiwan.
  • Po-Jung Huang
    Department of Biomedical Sciences, Chang Gung University, Taoyuan, Taiwan.
  • Yuan-Ming Yeh
    Genomic Medicine Core Laboratory, Chang Gung Memorial Hospital, Linkou, Taiwan. Electronic address: ymyeh@cgmh.org.tw.
  • Petrus Tang
    Graduate Institute of Biomedical Sciences, College of Medicine, Chang Gung University, Taoyuan, Taiwan.
  • Yu-Sun Chang
    Molecular Medicine Research Center, Chang Gung University, 33302 Taoyuan, Taiwan.
  • Jau-Song Yu
    Molecular Medicine Research Center, Chang Gung University, 33302 Taoyuan, Taiwan.
  • Yung-Chin Hsiao
    Molecular Medicine Research Center, Chang Gung University, 33302 Taoyuan, Taiwan.