Exploring Prediabetes Pathways Using Explainable AI on Data from Electronic Medical Records.

Journal: Studies in health technology and informatics
Published Date:

Abstract

This study leverages data from a Canadian database of primary care Electronic Medical Records to develop machine learning models predicting type 2 diabetes mellitus (T2D), prediabetes, or normoglycemia. These models are used as a basis for extracting counterfactual explanations and derive personalized changes in biomarkers to prevent T2D onset, particularly in the still reversible prediabetic state. The models achieve satisfactory performance. Furthermore, feature importance analysis underscores the significance of fasting blood sugar and glycated hemoglobin, while counterfactuals explanations emphasize the centrality of keeping body mass index and cholesterol indicators within or close to the clinically desirable ranges. This research highlights the potential of machine learning and counterfactual explanations in guiding preventive interventions that may help slow down the progression from prediabetes to T2D on an individual basis, eventually fostering a recovery from prediabetes to a normoglycemic state.

Authors

  • Davide Console
    Politecnico di Milano, Milan, Italy.
  • Marta Lenatti
    National Research Council of Italy (CNR), Institute of Electronics, Information Engineering and Telecommunications (IEIIT), Italy.
  • Davide Simeone
    Politecnico di Milano, Milan, Italy.
  • Karim Keshavjee
    Research Lab for Advanced System Modelling, Ryerson University, Toronto, ON, M5B 2K3, Canada.
  • Aziz Guergachi
    Research Lab for Advanced System Modelling, Ryerson University, Toronto, ON, M5B 2K3, Canada.
  • Maurizio Mongelli
    Consiglio Nazionale delle Ricerche (CNR), Institute of Electronics, Information Engineering and Telecommunications (IEIIT), 16149 Genoa, Italy.
  • Alessia Paglialonga
    Institute of Electronics, Computer and Telecommunication Engineering (IEIIT), National Research Council of Italy (CNR), Milan, Italy.