Machine and Deep Learning for Prediction of Subcellular Localization.

Journal: Methods in molecular biology (Clifton, N.J.)
Published Date:

Abstract

Protein subcellular localization prediction (PSLP) plays an important role in computational biology: it identifies the location and function of proteins in cells without high cost or laborious effort. Over the past few decades, many methods based on different algorithms have been proposed for subcellular localization prediction, and machine learning and deep learning approaches account for a large share of them. To provide an overview of these methods, the first part of this article briefly reviews several state-of-the-art machine learning methods for subcellular localization prediction. The materials used in subcellular localization prediction are then described, and a simple prediction method is introduced that takes protein sequences as input and uses a convolutional neural network as the classifier. Finally, a list of notes indicates the major problems that may occur with this method.
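
To make the sequence-in, class-out idea concrete, the following is a minimal sketch of a convolutional classifier over one-hot encoded protein sequences. It is not the chapter's implementation: the encoding, the layer sizes, the maximum length, the class labels, and all function names are assumptions chosen for illustration, and the weights are random rather than trained, so the printed probabilities carry no biological meaning.

```python
import math
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues
AA_INDEX = {aa: i for i, aa in enumerate(AMINO_ACIDS)}


def one_hot(sequence, max_len):
    """Encode a sequence as a max_len x 20 matrix.

    Padding positions and non-standard residues stay all-zero (an assumption).
    """
    matrix = [[0.0] * len(AMINO_ACIDS) for _ in range(max_len)]
    for pos, aa in enumerate(sequence[:max_len].upper()):
        idx = AA_INDEX.get(aa)
        if idx is not None:
            matrix[pos][idx] = 1.0
    return matrix


def conv1d_relu(x, filters, biases):
    """Valid 1D convolution along the sequence, followed by ReLU.

    filters[f][k][c] is the weight of filter f at kernel offset k, channel c.
    """
    width = len(filters[0])
    out = []
    for start in range(len(x) - width + 1):
        row = []
        for filt, bias in zip(filters, biases):
            total = bias
            for k in range(width):
                for c in range(len(x[0])):
                    total += filt[k][c] * x[start + k][c]
            row.append(max(0.0, total))
        out.append(row)
    return out


def global_max_pool(feature_map):
    """Keep the strongest response of each filter over all positions."""
    return [max(column) for column in zip(*feature_map)]


def softmax(logits):
    """Numerically stable softmax."""
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def random_params(n_filters=4, width=3, n_classes=3, seed=0):
    """Untrained, randomly initialised weights (hypothetical sizes)."""
    rng = random.Random(seed)
    n_channels = len(AMINO_ACIDS)
    return {
        "filters": [[[rng.uniform(-0.1, 0.1) for _ in range(n_channels)]
                     for _ in range(width)] for _ in range(n_filters)],
        "conv_bias": [0.0] * n_filters,
        "dense": [[rng.uniform(-0.1, 0.1) for _ in range(n_filters)]
                  for _ in range(n_classes)],
        "dense_bias": [0.0] * n_classes,
    }


def predict(sequence, params, max_len=50):
    """Forward pass: one-hot -> conv + ReLU -> global max pool -> dense -> softmax."""
    x = one_hot(sequence, max_len)
    pooled = global_max_pool(conv1d_relu(x, params["filters"], params["conv_bias"]))
    logits = [sum(w * v for w, v in zip(row, pooled)) + b
              for row, b in zip(params["dense"], params["dense_bias"])]
    return softmax(logits)


# Hypothetical class labels and an arbitrary example sequence.
LABELS = ["nucleus", "cytoplasm", "membrane"]
probs = predict("MKTAYIAKQRQISFVKSHFSRQ", random_params())
for label, p in zip(LABELS, probs):
    print(f"{label}: {p:.3f}")
```

In practice such a network would be trained on annotated sequences with a deep learning framework; the pure-Python forward pass here only shows how a variable-length sequence becomes a fixed-size probability vector over compartments.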

Authors

  • Gaofeng Pan
    Department of Computer Science and Engineering, University of South Carolina, Columbia, SC, USA.
  • Chao Sun
    Hospital for Skin Diseases and Institute of Dermatology, Chinese Academy of Medical Sciences and Peking Union Medical College, Nanjing, China.
  • Zijun Liao
    Department of Computer Science and Engineering, University of South Carolina, Columbia, SC, USA.
  • Jijun Tang
    School of Computer Science and Engineering, Tianjin University, Tianjin, 300072, China. jtang@cse.sc.edu.