A Machine Learning Approach for Predicting Essentiality of Metabolic Genes.

Journal: Methods in molecular biology (Clifton, N.J.)
PMID:

Abstract

The identification of essential genes is a key challenge in systems and synthetic biology, particularly for engineering metabolic pathways that convert feedstocks into valuable products. Assessment of gene essentiality at a genome scale requires large and costly growth assays of knockout strains. Here we describe a strategy to predict the essentiality of metabolic genes using binary classification algorithms. The approach combines elements from genome-scale metabolic models, directed graphs, and machine learning into a predictive model that can be trained on small knockout data. We demonstrate the efficacy of this approach using the most complete metabolic model of Escherichia coli and various machine learning algorithms for binary classification.

Authors

  • Lilli J Freischem
    Department of Physics, University of Oxford, Oxford, UK.
  • Diego A OyarzĂșn
    School of Biological Sciences, University of Edinburgh, Edinburgh, EH9 3JH, UK. d.oyarzun@ed.ac.uk.