Predicting phosphorylation sites using machine learning by integrating the sequence, structure, and functional information of proteins.

Journal: Journal of translational medicine
PMID:

Abstract

BACKGROUND: Post-translational modification (PTM) is a biological process that alters proteins and is therefore involved in the regulation of various cellular activities and pathogenesis. Protein phosphorylation is an essential process and one of the most-studied PTMs: it occurs when a phosphate group is added to serine (Ser, S), threonine (Thr, T), or tyrosine (Tyr, Y) residue. Dysregulation of protein phosphorylation can lead to various diseases-most commonly neurological disorders, Alzheimer's disease, and Parkinson's disease-thus necessitating the prediction of S/T/Y residues that can be phosphorylated in an uncharacterized amino acid sequence. Despite a surplus of sequencing data, current experimental methods of PTM prediction are time-consuming, costly, and error-prone, so a number of computational methods have been proposed to replace them. However, phosphorylation prediction remains limited, owing to substrate specificity, performance, and the diversity of its features.

Authors

  • Salma Jamal
    School of Biotechnology, Jawaharlal Nehru University, New Delhi-110067, India phone/fax: +91-11-26738728; fax: +91-11-26702040.
  • Waseem Ali
    JH-Institute of Molecular Medicine, Jamia Hamdard, New Delhi, India.
  • Priya Nagpal
    School of Biotechnology, Jawaharlal Nehru University, New Delhi, India.
  • Abhinav Grover
    Department of Pharmacology, Lady Hardinge Medical College and Associated Hospitals, New Delhi, India.
  • Sonam Grover