Proactive recognition and early detection in communities through targeted HIV screening.

Journal: Scientific reports
Published Date:

Abstract

Human Immunodeficiency Virus (HIV) remains a critical public health concern, and is a significant global health challenge, particularly in developing countries. Early HIV detection supports targeted interventions, and substantially reduce the HIV burden. In many resource-limited settings, early detection of HIV is hindered by stigma, limited access to testing, and low risk awareness. This study aims to enhance HIV screening in resource-limited settings by employing machine learning models to predict HIV risk using demographic and lifestyle variables. We analyzed data from 39,295 individuals in Shiraz, Iran, identifying key predictors, including drug injection, age, having a spouse with a history of HIV, occupation, and prison record. We trained and validated an Extreme Gradient Boosting (XGBoost) model using stratified five-fold cross-validation on the dataset. The XGBoost model achieved high accuracy (0.89; Confidence Interval (CI) 95% [0.88-0.89]), very-good discriminatory ability (Area Under the ROC Curve (AUC = 0.84 [0.83-0.84], with a fair-to-good agreement (Cohen's Kappa of 0.51 [0.51-0.52]). Moreover, the performance of the proposed method (PREDICT-HIV) was consistent across test folds. Our findings align with previous studies, emphasizing the importance of socio-demographic and behavioral factors in HIV risk prediction. The model's robustness suggests its potential for practical implementation, aiding early identification and intervention in high-risk groups. Future research should incorporate additional socioeconomic variables and validate the model in diverse populations to enhance global HIV prevention efforts. The web application, implemented using the Django framework, is freely available online for public access. PREDICT-HIV may support earlier identification and intervention in underserved populations, improving the efficiency of HIV screening programs.

Authors

  • Mehdi Nejat
    Department of Biostatistics and Epidemiology, School of Health, and Student Research Committee, School of Health, Isfahan University of Medical Sciences, Isfahan, Iran.
  • Hamid Reza Marateb
    Biomedical Engineering DepartmentEngineering FacultyUniversity of Isfahan Isfahan 8415683111 Iran.
  • Mehrshad Alirezaei Farahani
    Biomedical Engineering Department, Engineering Faculty, University of Isfahan, Isfahan, Iran.
  • Mohammad Zakaria Rajabi
    Biomedical Engineering Department, Engineering Faculty, University of Isfahan, Isfahan, Iran.
  • Maryam Nasirian
    Department of Biostatistics and Epidemiology, School of Health, Isfahan University of Medical Sciences, Isfahan, Iran.
  • Miguel Angel Mañanas
    Biomedical Engineering Research Centre (CREB), Automatic Control Department (ESAII) Universitat Politècnica de Catalunya-Barcelona Tech (UPC), Barcelona, Spain.
  • Mohammad Javad Tarrahi
    Department of Epidemiology and Biostatistics, School of Health, Isfahan University of Medical Sciences, Hezar Jerib St., Azadi Ave., Isfahan, 8174673461, Iran. mj.tarrahi@hlth.mui.ac.ir.
  • Marjan Mansourian
    Department of Epidemiology and BiostatisticsSchool of HealthIsfahan University of Medical Sciences Isfahan 8174673461 Iran.

Keywords

No keywords available for this article.