Machine Learning in Epigenomics: Insights into Cancer Biology and Medicine.

Journal: Biochimica et biophysica acta. Reviews on cancer
Published Date:

Abstract

The recent deluge of genome-wide technologies for the mapping of the epigenome and resulting data in cancer samples has provided the opportunity for gaining insights into and understanding the roles of epigenetic processes in cancer. However, the complexity, high-dimensionality, sparsity, and noise associated with these data pose challenges for extensive integrative analyses. Machine Learning (ML) algorithms are particularly suited for epigenomic data analyses due to their flexibility and ability to learn underlying hidden structures. We will discuss four overlapping but distinct major categories under ML: dimensionality reduction, unsupervised methods, supervised methods, and deep learning (DL). We review the preferred use cases of these algorithms in analyses of cancer epigenomics data with the hope to provide an overview of how ML approaches can be used to explore fundamental questions on the roles of epigenome in cancer biology and medicine.

Authors

  • Emre Arslan
    Department of Genomic Medicine, MD Anderson Cancer Center, Houston, TX 77030, United States of America.
  • Jonathan Schulz
    Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA.
  • Kunal Rai
    Department of Genomic Medicine, MD Anderson Cancer Center, Houston, TX 77030, United States of America. Electronic address: krai@mdanderson.org.