EPIMUTESTR: a nearest neighbor machine learning approach to predict cancer driver genes from the evolutionary action of coding variants.

Journal: Nucleic acids research
PMID:

Abstract

Discovering rare cancer driver genes is difficult because their mutational frequency is too low for statistical detection by computational methods. EPIMUTESTR is an integrative nearest-neighbor machine learning algorithm that identifies such marginal genes by modeling the fitness of their mutations with the phylogenetic Evolutionary Action (EA) score. Over cohorts of sequenced patients from The Cancer Genome Atlas representing 33 tumor types, EPIMUTESTR detected 214 previously inferred cancer driver genes and 137 new candidates never identified computationally before of which seven genes are supported in the COSMIC Cancer Gene Census. EPIMUTESTR achieved better robustness and specificity than existing methods in a number of benchmark methods and datasets.

Authors

  • Saeid Parvandeh
    Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA.
  • Lawrence A Donehower
    Department of Molecular Virology and Microbiology, Houston, TX 77030, USA.
  • Katsonis Panagiotis
    Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA.
  • Teng-Kuei Hsu
    Department of Biochemistry & Molecular Biology, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA.
  • Jennifer K Asmussen
    Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA.
  • Kwanghyuk Lee
    Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA.
  • Olivier Lichtarge
    Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA.