Prediction of Essential Genes in Comparison States Using Machine Learning.

Journal: IEEE/ACM transactions on computational biology and bioinformatics
Published Date:

Abstract

Identifying essential genes in comparison states (EGS) is vital to understanding cell differentiation, performing drug discovery, and identifying disease causes. Here, we present a machine learning method termed Prediction of Essential Genes in Comparison States (PreEGS). To capture the alteration of the network in comparison states, PreEGS extracts topological and gene expression features of each gene in a five-dimensional vector. PreEGS also recruits a positive sample expansion method to address the problem of unbalanced positive and negative samples, which is often encountered in practical applications. Different classifiers are applied to the simulated datasets, and the PreEGS based on the random forests model (PreEGSRF) was chosen for optimal performance. PreEGSRF was then compared with six other methods, including three machine learning methods, to predict EGS in a specific state. On real datasets with four gene regulatory networks, PreEGSRF predicted five essential genes related to leukemia and five enriched KEGG pathways. Four of the predicted essential genes and all predicted pathways were consistent with previous studies and highly correlated with leukemia. With high prediction accuracy and generalization ability, PreEGSRF is broadly applicable for the discovery of disease-causing genes, driver genes for cell fate decisions, and complex biomarkers of biological systems.

Authors

  • Jiang Xie
    Soil and Fertilizer & Resources and Environmental Institute, Jiangxi Academy of Agricultural Sciences, Nanchang, China.
  • Chang Zhao
    Department of Orthopedics, Orthopedic Hospital of Guangdong Province, The Third Affiliated Hospital of Southern Medical University, No. 183, Zhongshan Rd West, Guangzhou, 510630, China. zhaochang81@126.com.
  • Jiamin Sun
  • Jiaxin Li
    Department of Gastroenterology, West China Hospital, Sichuan University, Chengdu, China.
  • Fuzhang Yang
  • Jiao Wang
    Key Lab of Cell Differentiation and Apoptosis of Ministry of Education, Shanghai Jiao Tong University School of Medicine, Shanghai, China.
  • Qing Nie
    Department of Mathematics, University of California, Irvine, Irvine, CA USA.