Identification of Genomic Signatures for Colorectal Cancer Survival Using Exploratory Data Mining.

Journal: International journal of molecular sciences
Published Date:

Abstract

Clinicopathological presentations are critical for establishing a postoperative treatment regimen in Colorectal Cancer (CRC), although the prognostic value is low in Stage 2 CRC. We implemented a novel exploratory algorithm based on artificial intelligence (explainable artificial intelligence, XAI) that integrates mutational and clinical features to identify genomic signatures by repurposing the FoundationOne Companion Diagnostic (F1CDx) assay. The training data set ( = 378) consisted of subjects with recurrent and non-recurrent Stage 2 or 3 CRC retrieved from TCGA. Genomic signatures were built for identifying subgroups in Stage 2 and 3 CRC patients according to recurrence using genomic parameters and further associations with the clinical presentation. The summarization of the top-performing genomic signatures resulted in a 32-gene genomic signature that could predict tumor recurrence in CRC Stage 2 patients with high precision. The genomic signature was further validated using an independent dataset ( = 149), resulting in high-precision prognosis (AUC: 0.952; PPV = 0.974; NPV = 0.923). We anticipate that our genomic signatures and NCCN guidelines will improve recurrence predictions in CRC molecular stratification.

Authors

  • Justin J Hummel
    Institute for Data Science and Informatics, University of Missouri, Columbia, MO 65212, USA.
  • Danlu Liu
    Electrical Engineering and Computer Science Department, University of Missouri, Columbia, MO 65211, USA.
  • Erin Tallon
    Institute for Data Science and Informatics, University of Missouri, Columbia, MO 65212, USA.
  • John Snyder
    Department of Statistics, University of Missouri, Columbia, MO 65212, USA.
  • Wesley Warren
    Institute for Data Science and Informatics, University of Missouri, Columbia, MO 65212, USA.
  • Chi-Ren Shyu
    Institute for Data Science & Informatics, University of Missouri, Columbia, MO 65211, USA; Electrical Engineering and Computer Science Department, University of Missouri, Columbia, MO 65211, USA; Department of Medicine, School of Medicine, University of Missouri, Columbia, MO 65212, USA. Electronic address: shyuc@missouri.edu.
  • Jonathan Mitchem
    Institute for Data Science and Informatics, University of Missouri, Columbia, MO 65212, USA.
  • Rene Cortese
    Institute for Data Science and Informatics, University of Missouri, Columbia, MO 65212, USA.