Geometric deep learning improves generalizability of MHC-bound peptide predictions.

Journal: Communications biology
Published Date:

Abstract

The interaction between peptides and major histocompatibility complex (MHC) molecules is pivotal in autoimmunity, pathogen recognition and tumor immunity. Recent advances in cancer immunotherapies demand for more accurate computational prediction of MHC-bound peptides. We address the generalizability challenge of MHC-bound peptide predictions, revealing limitations in current sequence-based approaches. Our structure-based methods leveraging geometric deep learning (GDL) demonstrate promising improvement in generalizability across unseen MHC alleles. Further, we tackle data efficiency by introducing a self-supervised learning approach on structures (3D-SSL). Without being exposed to any binding affinity data, our 3D-SSL outperforms sequence-based methods trained on ~90 times more data points. Finally, we demonstrate the resilience of structure-based GDL methods to biases in binding data on an Hepatitis B virus vaccine immunopeptidomics case study. This proof-of-concept study highlights structure-based methods' potential to enhance generalizability and data efficiency, with possible implications for data-intensive fields like T-cell receptor specificity predictions.

Authors

  • Dario F Marzella
    Center for Molecular and Biomolecular Informatics, Radboudumc, Greet Grooteplein 26-28, 6525, Nijmegen, GA, The Netherlands.
  • Giulia Crocioni
    Netherlands eScience Center, Amsterdam, The Netherlands.
  • Tadija Radusinović
    University of Amsterdam, Amsterdam, The Netherlands.
  • Daniil Lepikhov
    Medical BioSciences department, Radboudumc, Radboud University Medical Center, 6525 GA, Nijmegen, The Netherlands.
  • Heleen Severin
    Medical BioSciences department, Radboudumc, Radboud University Medical Center, 6525 GA, Nijmegen, The Netherlands.
  • Dani L Bodor
    Netherlands eScience Center, Amsterdam, The Netherlands.
  • Daniel T Rademaker
    Centre for Molecular and Biomolecular Informatics (CMBI), Radboudumc, 260 Nijmegen, The Netherlands.
  • ChiaYu Lin
    Netherlands eScience Center, Amsterdam, The Netherlands.
  • Sonja Georgievska
    Netherlands eScience Center, Science Park 140, 1098 XG, Amsterdam, The Netherlands.
  • Nicolas Renaud
    Netherlands eScience Center, Science Park 140, 1098 XG, Amsterdam, The Netherlands.
  • Amy L Kessler
    Department of Gastroenterology and Hepatology, Erasmus MC, University Medical Center Rotterdam, 3015 GD, Rotterdam, The Netherlands.
  • Pablo Lopez-Tarifa
    Netherlands eScience Center, Amsterdam, The Netherlands.
  • Sonja I Buschow
    Department of Gastroenterology and Hepatology, Erasmus MC, University Medical Center Rotterdam, 3015 GD, Rotterdam, The Netherlands.
  • Erik Bekkers
    AMLAB, Amsterdam, The Netherlands.
  • Li C Xue
    Bijvoet Center for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Utrecht, The Netherlands.