WinBinVec: Cancer-Associated Protein-Protein Interaction Extraction and Identification of 20 Various Cancer Types and Metastasis Using Different Deep Learning Models.
Journal:
IEEE Journal of Biomedical and Health Informatics
Published Date:
Oct 5, 2021
Abstract
Biophysical protein-protein interactions (PPIs) play dominant roles in the initiation and progression of many cancer-related pathways. A given PPI might play different roles in different cancer types. Hence, prioritizing the PPIs in each cancer type would help detect cancer-associated pathways, improve the understanding of cancer biology, and facilitate drug discovery. Several studies to date have proposed computational methods for extracting PPI essentiality in different cancer types based on the PPI network. The main drawback of these studies is that they do not use rich data sources such as genomic variant data. An amino acid sequence encodes useful information about protein structure and behavior. We represent each amino acid sequence based on its variants/mutations in seven different ways: binary vectors, pathogenicity scores, binding affinity changes upon mutations, gene expression-based network of the interactions, biophysicochemical properties, g-gap dipeptide, and one-hot vectors. Based on these representations, we design seven different deep learning models and compare their accuracy in predicting 20 different cancer types from the TCGA cohort. WinBinVec, a window-based model, outperforms the other models. Moreover, WinBinVec contains a PPI essentiality module that helps extract the essentiality probability of each PPI for every cancer type. Source code and data: https://github.com/sabdollahi/WinBinVec.
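
As an illustration of three of the sequence representations named in the abstract (one-hot vectors, g-gap dipeptide, and binary vectors), the following is a minimal Python sketch. It is not taken from the WinBinVec repository. The function names are hypothetical, the g-gap dipeptide encoding follows the commonly used definition (frequency of residue pairs separated by g positions), and the binary vector is assumed here to flag mutated positions, which may differ from the authors' exact construction.

```python
from itertools import product

# The 20 standard amino acids, in a fixed order used for all encodings.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
AA_INDEX = {aa: i for i, aa in enumerate(AMINO_ACIDS)}
# All 400 ordered residue pairs: "AA", "AC", ..., "YY".
DIPEPTIDES = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]


def one_hot(sequence):
    """Encode each residue as a 20-dimensional indicator vector."""
    rows = []
    for aa in sequence:
        row = [0] * len(AMINO_ACIDS)
        row[AA_INDEX[aa]] = 1
        rows.append(row)
    return rows


def g_gap_dipeptide(sequence, g=0):
    """Return the 400 frequencies of residue pairs separated by g positions.

    g=0 gives the ordinary (adjacent) dipeptide composition.
    """
    total = len(sequence) - g - 1  # number of pairs available
    if total <= 0:
        return [0.0] * len(DIPEPTIDES)
    counts = dict.fromkeys(DIPEPTIDES, 0)
    for i in range(total):
        counts[sequence[i] + sequence[i + g + 1]] += 1
    return [counts[p] / total for p in DIPEPTIDES]


def mutation_binary_vector(length, mutated_positions):
    """Assumed representation: 1 at each mutated position (1-based), else 0."""
    vec = [0] * length
    for pos in mutated_positions:
        vec[pos - 1] = 1
    return vec


if __name__ == "__main__":
    seq = "ACAC"
    print(one_hot(seq)[0][:3])                   # first residue, first 3 columns
    print(g_gap_dipeptide(seq, g=0)[:3])         # frequencies of AA, AC, AD
    print(mutation_binary_vector(len(seq), [2]))
```

Fixed-length encodings such as the g-gap dipeptide vector give one 400-dimensional input regardless of protein length, whereas one-hot and per-position binary vectors grow with the sequence, which is one plausible reason to process them in windows.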