A data-driven approach to predicting diabetes and cardiovascular disease with machine learning.

Journal: BMC medical informatics and decision making
Published Date:

Abstract

BACKGROUND: Diabetes and cardiovascular disease are two of the main causes of death in the United States. Identifying and predicting these diseases in patients is the first step towards stopping their progression. We evaluate the capabilities of machine learning models in detecting at-risk patients using survey data (and laboratory results), and identify key variables within the data contributing to these diseases among the patients.

Authors

  • An Dinh
    Department of Mathematics and Computer Science, Eastern Oregon University, La Grande, OR, USA.
  • Stacey Miertschin
    Department of Mathematics and Statistics, Winona State University, Winona, MN, USA.
  • Amber Young
    Department of Statistics, Purdue University, West Lafayette, IN, USA.
  • Somya D Mohanty
    Department of Computer Science, University of North Carolina at Greensboro, Greensboro, NC, USA. sdmohant@uncg.edu.