Predicting peptide presentation by major histocompatibility complex class I: an improved machine learning approach to the immunopeptidome.

Journal: BMC bioinformatics
Published Date:

Abstract

BACKGROUND: To further our understanding of immunopeptidomics, improved tools are needed to identify peptides presented by major histocompatibility complex class I (MHC-I). Many existing tools are limited by their reliance upon chemical affinity data, which is less biologically relevant than sampling by mass spectrometry, and other tools are limited by incomplete exploration of machine learning approaches. Herein, we assemble publicly available data describing human peptides discovered by sampling the MHC-I immunopeptidome with mass spectrometry and use this database to train random forest classifiers (ForestMHC) to predict presentation by MHC-I.

Authors

  • Kevin Michael Boehm
    Weill Cornell/Rockefeller/Sloan Kettering Tri-Institutional MD-PhD Program, 1300 York Avenue, New York, NY, USA. kmb2012@med.cornell.edu.
  • Bhavneet Bhinder
    Caryl and Israel Englander Institute for Precision Medicine, Weill Cornell Medical College, 413 East 69th Street, New York, NY, USA.
  • Vijay Joseph Raja
    Department of Biochemistry, Weill Cornell Medical College, 1300 York Avenue, New York, NY, USA.
  • Noah Dephoure
    Department of Biochemistry, Weill Cornell Medical College, 1300 York Avenue, New York, NY, USA.
  • Olivier Elemento
    Institute for Precision Medicine.