Variable Length Character N-Gram Embedding of Protein Sequences for Secondary Structure Prediction.

Journal: Protein and peptide letters
Published Date:

Abstract

BACKGROUND: The prediction of a protein's secondary structure from its amino acid sequence is an essential step towards predicting its 3-D structure. The prediction performance improves by incorporating homologous multiple sequence alignment information. Since homologous details not available for all proteins. Therefore, it is necessary to predict the protein secondary structure from single sequences.

Authors

  • Ashish Kumar Sharma
    Department of Computer Science and Engineering, Indian Institute of Technology (BHU), Varanasi, Uttar Pradesh, India.
  • Rajeev Srivastava
    Computer Science and Engineering Department, Indian Institute of Technology (Banaras Hindu University) Varanasi, Varanasi, Uttar Pradesh, India.