Machine Learning in Epidemiology and Health Outcomes Research.

Journal: Annual review of public health
Published Date:

Abstract

Machine learning approaches to modeling of epidemiologic data are becoming increasingly more prevalent in the literature. These methods have the potential to improve our understanding of health and opportunities for intervention, far beyond our past capabilities. This article provides a walkthrough for creating supervised machine learning models with current examples from the literature. From identifying an appropriate sample and selecting features through training, testing, and assessing performance, the end-to-end approach to machine learning can be a daunting task. We take the reader through each step in the process and discuss novel concepts in the area of machine learning, including identifying treatment effects and explaining the output from machine learning models.

Authors

  • Timothy L Wiemken
    Center for Health Outcomes Research, Saint Louis University, Saint Louis, Missouri 63104, USA; email: timothy.wiemken@health.slu.edu.
  • Robert R Kelley
    Department of Computer Science, Bellarmine University, Louisville, Kentucky 40205, USA; email: rkelley@bellarmine.edu.