Explainable AI Model Reveals Informative Mutational Signatures for Cancer-Type Classification.

Journal: Cancers
Published Date:

Abstract

: The prediction of cancer types is primarily reliant on driver genes and their specific mutations. The advancement in novel omics technologies has led to the acquisition of additional genetic data. When integrated with artificial intelligence models, there is considerable potential for this to enhance the accuracy of cancer diagnosis. As mutational signatures can provide insights into repair mechanism malfunctions, they also have the potential for more accurate cancer diagnosis. : First, we compared unsupervised and supervised machine learning approaches to predict cancer types. We employed deep and artificial neural network architectures with an explainable component like layerwise relevance propagation to extract the most relevant features for the cancer-type prediction. Ten-fold cross-validation and an extensive grid search were used to optimize the neural network architecture using driver gene mutations, mutational signatures and topological mutation information as input. The PCAWG dataset was used as input to discriminate between 17 primary sites and 24 cancer types. : Overall, our approach showed that the most relevant mutation information to discriminate between cancer types is increased by >10% using the whole genome or intergenic and intronic genome regions instead of exome information. Furthermore, the most relevant features for most cancer types, except for two, are in the mutational signatures and not the topological mutation information. : Informative mutational signatures outperformed the prediction of cancer types in comparison to driver gene mutations and added a new layer of diagnostic information. As the degree of information within the mutational signatures is not solely based on the frequency of occurrence, it is even possible to separate cancer types from the same primary site by the different relevant mutations. Furthermore, the comparison of informative mutational signatures allowed the cancer-type assignment of specific impaired repair mechanisms.

Authors

  • Jonas Wagner
    Institute of Bioinformatics, University Medicine Greifswald, 17475 Greifswald, Germany.
  • Jan Oldenburg
    Institute for ImplantTechnology and Biomaterials e.V, Rostock, Germany. jan.oldenburg@uni-rostock.de.
  • Neetika Nath
  • Stefan Simm
    Institute of Bioinformatics, University Medicine Greifswald, 17475 Greifswald, Germany.

Keywords

No keywords available for this article.