Pre-trained Maldi Transformers improve MALDI-TOF MS-based prediction.

Journal: Computers in biology and medicine
PMID:

Abstract

For the last decade, matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) has been the reference method for species identification in clinical microbiology. Hampered by a historical lack of open data, machine learning research towards models specifically adapted to MALDI-TOF MS remains in its infancy. Given the growing complexity of available datasets (such as large-scale antimicrobial resistance prediction), a need for models that (1) are specifically designed for MALDI-TOF MS data, and (2) have high representational capacity, presents itself. Here, we introduce Maldi Transformer, an adaptation of the state-of-the-art transformer architecture to the MALDI-TOF mass spectral domain. We propose the first self-supervised pre-training technique specifically designed for mass spectra. The technique is based on shuffling peaks across spectra, and pre-training the transformer as a peak discriminator. Extensive benchmarks confirm the efficacy of this novel design. The final result is a model exhibiting state-of-the-art (or competitive) performance on downstream prediction tasks. In addition, we show that Maldi Transformer's identification of noisy spectra may be leveraged towards higher predictive performance. All code supporting this study is distributed on PyPI and is packaged under: https://github.com/gdewael/maldi-nn.

Authors

  • Gaetan De Waele
    Department of Data Analysis and Mathematical Modelling, Ghent University, Ghent, Belgium.
  • Gerben Menschaert
    BioBix, Department of Data Analysis and Mathematical Modelling, Ghent University, Coupure Links 653, 900, Gent, Belgium. Electronic address: gerben.menschaert@ugent.be.
  • Peter Vandamme
    Laboratory of Microbiology, Ghent University, K. L. Ledeganckstraat 35, Ghent, 9000, Belgium.
  • Willem Waegeman
    KERMIT, Department of Data Analysis and Mathematical Modelling, Ghent University, 9000 Ghent, Belgium.