Polynomial-SHAP analysis of liver disease markers for capturing of complex feature interactions in machine learning models.

Journal: Computers in biology and medicine
Published Date:

Abstract

Liver disease diagnosis is pivotal for effective patient management, and machine learning techniques have shown promise in this domain. In this study, we investigate the impact of Polynomial-SHapley Additive exPlanations analysis on enhancing the performance and interpretability of machine learning models for liver disease classification. Our results demonstrate significant improvements in accuracy, precision, recall, F1_score, and Matthews correlation coefficient across various algorithms when polynomial- SHapley Additive exPlanations analysis is applied. Specifically, the Light Gradient Boosting Machine model achieves exceptional performance with 100 % accuracy in both scenarios. Furthermore, by comparing the results obtained with and without the approach, we observe substantial differences in the performance, highlighting the importance of incorporating Polynomial-SHapley Additive exPlanations analysis for improved model performance. The Polynomial features and SHapley Additive exPlanations values also enhance the interpretability of machine learning models by capturing complex feature interactions, enabling users to gain deeper insights into the underlying mechanisms driving the diagnosis. Moreover, data rebalancing using Synthetic Minority Over-sampling Technique and parameter tuning were employed to optimize the performance of the models. These findings underscore the significance of employing this analytical approach in machine-learning-based diagnostic systems for liver diseases, offering superior performance and enhanced interpretability for informed decision-making in clinical practice.

Authors

  • Chukwuebuka Joseph Ejiyi
    College of Nuclear Technology and Automation Engineering, Sichuan Industrial Internet Intelligent Monitoring and Application Engineering Research Center, Chengdu University of Technology, Sichuan, Chengdu, China; Network and Data Security Key Laboratory of Sichuan Province, University of Electronic Science and Technology of China, Chengdu, Sichuan, China.
  • Dongsheng Cai
    Sichuan Industrial Internet Intelligent Monitoring and Application Engineering Technology Research Centre, Chengdu University of Technology, Chenghua District, Chengdu, Sichuan, People's Republic of China. caidongsheng@cdut.edu.cn.
  • Makuachukwu B Ejiyi
    Pharmacy Department, University of Nigeria Nsukka, Nsukka, Enugu State, Nigeria.
  • Ijeoma A Chikwendu
    School of Information and Communication Engineering, University of Electronic Science and Technology of China, Sichuan, PR China.
  • Kenneth Coker
    School of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu, China; Department of Electrical and Electronic Engineering, Ho Technical University Ghana, Ghana.
  • Ariyo Oluwasanmi
    School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu 610054, China.
  • Oluwatoyosi F Bamisile
    College of Nuclear Technology and Automation Engineering, Sichuan Industrial Internet Intelligent Monitoring and Application Engineering Research Center, Chengdu University of Technology, Sichuan, Chengdu, China.
  • Thomas U Ejiyi
    Department of Pure and Industrial Chemistry, University of Nigeria Nsukka, Nsukka, Enugu State, Nigeria.
  • Zhen Qin
    College of Forestry, Southwest Forestry University, Kunming, Yunnan, China.