Multi-scale structural analysis of proteins by deep semantic segmentation.

Journal: Bioinformatics (Oxford, England)
Published Date:

Abstract

MOTIVATION: Recent advances in computational methods have facilitated large-scale sampling of protein structures, leading to breakthroughs in protein structural prediction and enabling de novo protein design. Establishing methods to identify candidate structures that can lead to native folds or designable structures remains a challenge, since few existing metrics capture high-level structural features such as architectures, folds and conformity to conserved structural motifs. Convolutional Neural Networks (CNNs) have been successfully used in semantic segmentation-a subfield of image classification in which a class label is predicted for every pixel. Here, we apply semantic segmentation to protein structures as a novel strategy for fold identification and structure quality assessment.

Authors

  • Raphael R Eguchi
    Department of Biochemistry, School of Medicine, Stanford University, Shriram Center for Bioengineering and Chemical Engineering, 443 via Ortega, Room 036, Stanford, CA 94305, USA.
  • Po-Ssu Huang
    Department of Bioengineering , Stanford University , Shriram Center for Bioengineering and Chemical Engineering, 443 Via Ortega , Stanford , California 94305 , United States.