Machine learning optimized DriverDetect software for high precision prediction of deleterious mutations in human cancers.

Journal: Scientific reports
Published Date:

Abstract

The detection of cancer-driving mutations is important for understanding cancer pathology and therapeutics development. Prediction tools have been created to streamline the computation process. However, most tools available have heterogeneous sensitivity or specificity. We built a machine learning-derived algorithm, DriverDetect that combines the outputs of seven pre-existing tools to improve the prediction of candidate driver cancer mutations. The algorithm was trained with cancer gene-specific mutation datasets of cancer patients to identify cancer drivers. DriverDetect performed better than the individual tools or their combinations in the validation test. It has the potential to incorporate future novel prediction algorithms and can be retrained with new datasets, offering an expanded application to pan-cancer analysis for cross-cancer study. (115 words).

Authors

  • Herrick Yu Kan Koh
    Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore.
  • Ulysses Tsz Fung Lam
    Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore.
  • Kenneth Hon-Kim Ban
    Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore. kenneth_ban@nus.edu.sg.
  • Ee Sin Chen
    Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore. bchces@nus.edu.sg.