Machine Learning Meets Transparency in Osteoporosis Risk Assessment: A Comparative Study of ML and Explainability Analysis
Journal:
arXiv
Published Date:
May 1, 2025
Abstract
The present research tackles the difficulty of predicting osteoporosis risk
via machine learning (ML) approaches, emphasizing the use of explainable
artificial intelligence (XAI) to improve model transparency. Osteoporosis is a
significant public health concern, sometimes remaining untreated owing to its
asymptomatic characteristics, and early identification is essential to avert
fractures. The research assesses six machine learning classifiers: Random
Forest, Logistic Regression, XGBoost, AdaBoost, LightGBM, and Gradient Boosting
and utilizes a dataset based on clinical, demographic, and lifestyle variables.
The models are refined using GridSearchCV to calibrate hyperparameters, with
the objective of enhancing predictive efficacy. XGBoost had the greatest
accuracy (91%) among the evaluated models, surpassing others in precision
(0.92), recall (0.91), and F1-score (0.90). The research further integrates XAI
approaches, such as SHAP, LIME, and Permutation Feature Importance, to
elucidate the decision-making process of the optimal model. The study indicates
that age is the primary determinant in forecasting osteoporosis risk, followed
by hormonal alterations and familial history. These results corroborate
clinical knowledge and affirm the models' therapeutic significance. The
research underscores the significance of explainability in machine learning
models for healthcare applications, guaranteeing that physicians can rely on
the system's predictions. The report ultimately proposes directions for further
research, such as validation across varied populations and the integration of
supplementary biomarkers for enhanced predictive accuracy.