Mapping HDX-MS Data to Protein Conformations through Training Ensemble-Based Models.

Journal: Journal of the American Society for Mass Spectrometry
Published Date:

Abstract

An original approach that adopts machine learning inference to predict protein structural information using hydrogen-deuterium exchange mass spectrometry (HDX-MS) is described. The method exploits an in-house optimization program that increases the resolution of HDX-MS data from peptides to amino acids. A system is trained using Gradient Tree Boosting as a type of machine learning ensemble technique to assign a protein secondary structure. Using limited training data we generate a discriminative model that uses optimized HDX-MS data to predict protein secondary structure with an accuracy of 75%. This research could form the basis for new methods exploiting artificial intelligence to model protein conformations by HDX-MS.

Authors

  • Ramin E Salmas
    Department of Chemistry, Britannia House, King's College London, London SE1 1DB, U.K.
  • Matthew J Harris
    Department of Chemistry, Britannia House, King's College London, London SE1 1DB, U.K.
  • Antoni J Borysik
    Department of Chemistry, Britannia House, King's College London, London SE1 1DB, U.K.