Deep-GB: A novel deep learning model for globular protein prediction using CNN-BiLSTM architecture and enhanced PSSM with trisection strategy.

Journal: IET systems biology
PMID:

Abstract

Globular proteins (GPs) play vital roles in a wide range of biological processes, encompassing enzymatic catalysis and immune responses. Enzymes, among these globular proteins, facilitate biochemical reactions, while others, such as haemoglobin, contribute to essential physiological functions such as oxygen transport. Given the importance of these considerations, accurately identifying Globular proteins is essential. To address the need for precise GP identification, this research introduces an innovative approach that employs a hybrid-based deep learning model called Deep-GP. We generated two datasets based on primary sequences and developed a novel feature descriptor called, Consensus Sequence-based Trisection-Position Specific Scoring Matrix (CST-PSSM). The model training phase involved the application of deep learning techniques, including the bidirectional long short-term memory network (BiLSTM), gated recurrent unit (GRU), and convolutional neural network (CNN). The BiLSTM and CNN were hybridised for ensemble learning. The CST-PSSM-based ensemble model achieved the most accurate predictive outcomes, outperforming other competitive predictors across both training and testing datasets. This demonstrates the potential of harnessing deep learning for precise GB prediction as a robust tool to expedite research, streamline drug discovery, and unveil novel therapeutic targets.

Authors

  • Sonia Zouari
    National Engineering School of Sfax, University of Sfax, Sfax, Tunisia.
  • Farman Ali
    Department of Computer Science, Abdul Wali Khan University Mardan, Pakistan.
  • Atef Masmoudi
    College of Computer Science, King Khalid University, Abha, Saudi Arabia.
  • Sarah Abu Ghazalah
    Department of Informatics and Computer System, College of Computer Science, King Khalid University, Abha, Saudi Arabia.
  • Wajdi Alghamdi
    Data Science & Soft Computing Lab, and Department of Computing, Goldsmiths, University of London, UK.
  • Faris A Kateb
    Department of Information Technology, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi Arabia.
  • Nouf Ibrahim
    Family Medicine Clinic, Makkah Armed Force Medical Center, Makkah, Saudi Arabia.