Improved pan-specific prediction of MHC class I peptide binding using a novel receptor clustering data partitioning strategy.

Journal: HLA

PMID: 27762504

Abstract

Pan-specific prediction of receptor-ligand interaction is conventionally done using machine-learning methods that integrate information about both receptor and ligand primary sequences. To achieve optimal performance using machine learning, dealing with overfitting and data redundancy is critical. Most often, so-called ligand clustering methods have been used to deal with these issues in the context of pan-specific receptor-ligand predictions, and for the MHC system the approach has proven highly effective for extrapolating information from a limited set of receptors with well-characterized binding motifs to others with no or very limited experimental characterization. The success of this approach has, however, proven to depend strongly on the similarity of the query molecule to the molecules with characterized specificity used in the machine-learning process. Here, we outline an alternative strategy that aims to address this limitation by constructing data sets optimal for training of pan-specific receptor-ligand predictions, focusing on receptor similarity rather than ligand similarity. We show that, in benchmarks covering affinity prediction, MHC ligand identification, and MHC epitope identification, this receptor clustering method consistently performs better than the conventional ligand clustering method on alleles with remote similarity to the training set.
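
The idea of partitioning data by receptor similarity rather than ligand similarity can be illustrated with a minimal sketch. The abstract does not specify the clustering algorithm, similarity measure, or threshold, so everything below is an assumption made for illustration: receptors are represented by equal-length toy sequences, similarity is the fraction of identical positions, clusters are formed by single linkage at a hypothetical threshold, and whole clusters are assigned greedily to folds. The function names and the toy data are hypothetical and are not taken from the paper.

```python
from __future__ import annotations

from collections import defaultdict


def identity(a: str, b: str) -> float:
    """Fraction of identical positions between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return sum(x == y for x, y in zip(a, b)) / len(a)


def cluster_receptors(receptors: dict[str, str], threshold: float) -> list[set[str]]:
    """Single-linkage clustering: join receptors whose identity >= threshold."""
    parent = {name: name for name in receptors}

    def find(x: str) -> str:
        # Union-find root lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    names = sorted(receptors)
    for i, a in enumerate(names):
        for b in names[i + 1:]:
            if identity(receptors[a], receptors[b]) >= threshold:
                parent[find(a)] = find(b)

    groups: dict[str, set[str]] = defaultdict(set)
    for name in names:
        groups[find(name)].add(name)
    return list(groups.values())


def partition_by_receptor(data, receptors, n_folds, threshold):
    """Split (receptor, peptide, value) records into folds so that all
    records from one receptor cluster land in the same fold."""
    clusters = cluster_receptors(receptors, threshold)

    counts: dict[str, int] = defaultdict(int)
    for receptor, _, _ in data:
        counts[receptor] += 1

    def size(cluster: set[str]) -> int:
        return sum(counts[r] for r in cluster)

    # Largest clusters first; each goes to the currently smallest fold.
    clusters.sort(key=lambda c: (-size(c), sorted(c)))
    fold_sizes = [0] * n_folds
    fold_of: dict[str, int] = {}
    for cluster in clusters:
        target = fold_sizes.index(min(fold_sizes))
        for r in cluster:
            fold_of[r] = target
        fold_sizes[target] += size(cluster)

    folds = [[] for _ in range(n_folds)]
    for record in data:
        folds[fold_of[record[0]]].append(record)
    return folds


# Toy, made-up receptor sequences and measurements (illustrative only).
toy_receptors = {
    "rec_a1": "YFAMYQENMA",
    "rec_a2": "YFAMYQENMT",
    "rec_b1": "HSTVRWDLKG",
    "rec_b2": "HSTVRWDLKA",
    "rec_c1": "QQQQQPPPPP",
}
toy_data = [
    ("rec_a1", "PEPTIDEAA", 0.81),
    ("rec_a1", "PEPTIDEBB", 0.12),
    ("rec_a2", "PEPTIDEAA", 0.77),
    ("rec_b1", "PEPTIDECC", 0.40),
    ("rec_b2", "PEPTIDECC", 0.45),
    ("rec_c1", "PEPTIDEDD", 0.05),
]
toy_folds = partition_by_receptor(toy_data, toy_receptors, n_folds=2, threshold=0.8)
```

With this kind of split, a model evaluated on one fold never sees a closely related receptor during training, which mimics prediction for alleles that are remote from the training set. A ligand clustering split would instead group records by peptide similarity and could place near-identical receptors on both sides of the split.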

Authors

A H Mattsson
Evaxion Biotech, Copenhagen, Denmark.

J V Kringelum
Evaxion Biotech, Copenhagen, Denmark.

C Garde
Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark.

M Nielsen
Center for Biological Sequence Analysis, Department of Bio and Health Informatics, Technical University of Denmark, Lyngby, Denmark.

Keywords

Alleles; Animals; Binding Sites; Epitopes; Gene Expression; Gorilla gorilla; Histocompatibility Antigens Class I; Humans; Ligands; Macaca; Machine Learning; Mice; Oligopeptides; Pan troglodytes; Protein Binding; Protein Interaction Domains and Motifs; Software; Structural Homology, Protein

