A dynamic model using k-NN algorithm for predicting diabetes and breast cancer.

Journal: Computers in biology and medicine
Published Date:

Abstract

Healthcare remains a critical focus due to its direct impact on human well-being. Diabetes, currently the fastest-growing chronic disease globally, poses severe health risks, including cardiovascular complications and kidney failure. Simultaneously, breast cancer has become the most prevalent cancer among women, particularly those in their 40s, surpassing other types. Early detection and diagnosis of these two diseases remain a substantial challenge, yet they are crucial for reducing mortality rates. Machine learning algorithms emerged as powerful tools in healthcare for disease classification and prediction, with the k-nearest neighbors (k-NN) being one of the most widely used supervised learning algorithm. Different traditional machine learning methods have been proposed, which are heavily specialized for specific datasets. More deeply, traditional k-NN relies on a static k-value, which may not provide optimal results across diverse datasets. This paper proposes a dynamic k-NN model that adjusts 'k' value based on local data characteristics, enhancing prediction accuracy. The proposed model is testing on two publicly available datasets; PIMA Diabetes and Breast Cancer Wisconsin (BCW) datasets. Our results are evaluated using different metrics that are; accuracy, precision, recall, F1_score, and execution time. The results of these metrics are as follows; (81.17%, 97.37%), (83.33% 100%), (54.55%, 86.05%), and (65.93%, 92.5%) for PIMA and BCW datasets respectively. These results demonstrate that the proposed model outperformed several state-of-the-art models. Thus, further highlighting its effectiveness and efficiency in medical data classification.

Authors

  • Hussein A A Al-Khamees
    Computer Techniques Engineering Department, College of Engineering and Technology, Al-Mustaqbal University, 51001, Babil, Iraq.
  • Nor Samsiah Sani
    Center for Artificial Intelligence Technology, Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, 43600, Selangor, Malaysia. Electronic address: norsamsiahsani@ukm.edu.my.
  • Ahmed Sileh Gifal
    Department of Information Technology, College of Computer Science and Information Technology, University of Kerbala, 56001, Karbala, Iraq.
  • Luan Xiang Wei Liu
    Center for Artificial Intelligence Technology, Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, 43600, Selangor, Malaysia.
  • Mohd Isrul Esa
    Center for Artificial Intelligence Technology, Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, 43600, Selangor, Malaysia.