Machine Learning Methods for Precision Medicine Research Designed to Reduce Health Disparities: A Structured Tutorial.

Journal: Ethnicity & disease

Published Date: Apr 2, 2020

Abstract

Precision medicine research designed to reduce health disparities often involves studying multi-level datasets to understand how diseases manifest disproportionately in one group over another, and how scarce health care resources can be directed precisely to those most at risk for disease. In this article, we provide a structured tutorial for medical and public health researchers on the application of machine learning methods to conduct precision medicine research designed to reduce health disparities. We review key terms and concepts for understanding machine learning papers, including supervised and unsupervised learning, regularization, cross-validation, bagging, and boosting. Metrics are reviewed for evaluating machine learners and major families of learning approaches, including tree-based learning, deep learning, and ensemble learning. We highlight the advantages and disadvantages of different learning approaches, describe strategies for interpreting "black box" models, and demonstrate the application of common methods in an example dataset with open-source statistical code in R.

Authors

Sanjay Basu

Center for Primary Care and Outcomes Research, Center for Population Health Sciences, Departments of Medicine and Health Research and Policy, Stanford University, Palo Alto, CA basus@stanford.edu.
James H Faghmous

Independent Researcher, Los Angeles, CA.
Patrick Doupe

Zalando SE, Berlin, Germany. Electronic address: patrick.doupe@zalando.de.

Keywords

Clinical Decision-Making Health Status Disparities Humans Machine Learning Precision Medicine

External Resources

View on PubMed Access via DOI PubMed (32269464)

Machine Learning Methods for Precision Medicine Research Designed to Reduce Health Disparities: A Structured Tutorial.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals