Exploration of Black Boxes of Supervised Machine Learning Models: A Demonstration on Development of Predictive Heart Risk Score.

Journal: Computational intelligence and neuroscience

Published Date: May 12, 2022

Abstract

Machine learning (ML) often provides applicable high-performance models to facilitate decision-makers in various fields. However, this high performance is achieved at the expense of the interpretability of these models, which has been criticized by practitioners and has become a significant hindrance in their application. Therefore, in highly sensitive decisions, black boxes of ML models are not recommended. We proposed a novel methodology that uses complex supervised ML models and transforms them into simple, interpretable, transparent statistical models. This methodology is like stacking ensemble ML in which the best ML models are used as a base learner to compute relative feature weights. The index of these weights is further used as a single covariate in the simple logistic regression model to estimate the likelihood of an event. We tested this methodology on the primary dataset related to cardiovascular diseases (CVDs), the leading cause of mortalities in recent times. Therefore, early risk assessment is an important dimension that can potentially reduce the burden of CVDs and their related mortality through accurate but interpretable risk prediction models. We developed an artificial neural network and support vector machines based on ML models and transformed them into a simple statistical model and heart risk scores. These simplified models were found transparent, reliable, valid, interpretable, and approximate in predictions. The findings of this study suggest that complex supervised ML models can be efficiently transformed into simple statistical models that can also be validated.

Authors

Mirza Rizwan Sajid

Department of Statistics, University of Gujrat, Hafiz Hayat Campus Jalalpur Road, Gujrat 50700, Pakistan.
Arshad Ali Khan

Faculty of Computing, Universiti Malaysia Pahang, Pekan 26600, Malaysia.
Haitham M Albar

Department of Surgery, College of Medicine, Majmaah University, Almajmaah 11952, Saudi Arabia.
Noryanti Muhammad

Centre of Excellence for Artificial Intelligence & Data Science, Department of Knowledge Management & Information Technology, Universiti Malaysia Pahang, Gambang 26300, Malaysia.
Waqas Sami

Department of Community Medicine and Public Health, College of Medicine, Majmaah University, Almajmaah 11952, Saudi Arabia.
Syed Ahmad Chan Bukhari

Department of Pathology, Yale School of Medicine, New Haven, CT, USA. ahmad.chan@yale.edu.
Iram Wajahat

Allied Institute of Medical Sciences, Gujranwala 52250, Pakistan.

Keywords

Cardiovascular Diseases Humans Machine Learning Neural Networks, Computer Risk Factors Supervised Machine Learning Support Vector Machine

External Resources

View on PubMed Access via DOI PubMed (35602638)

Exploration of Black Boxes of Supervised Machine Learning Models: A Demonstration on Development of Predictive Heart Risk Score.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals