Applications of machine learning techniques to predict filariasis using socio-economic factors.

Journal: Epidemiology and infection
Published Date:

Abstract

Filariasis is one of the major public health concerns in India. Approximately 600 million people spread across 250 districts of India are at risk of filariasis. To predict this disease, a pilot scale study was carried out in 30 villages of Karimnagar district of Telangana from 2004 to 2007 to collect epidemiological and socio-economic data. The collected data are analysed by employing various machine learning techniques such as Naïve Bayes (NB), logistic model tree, probabilistic neural network, J48 (C4.5), classification and regression tree, JRip and gradient boosting machine. The performances of these algorithms are reported using sensitivity, specificity, accuracy and area under ROC curve (AUC). Among all employed classification methods, NB yielded the best AUC of 64% and was equally statistically significant with the rest of the classifiers. Similarly, the J48 algorithm generated 23 decision rules that help in developing an early warning system to implement better prevention and control efforts in the management of filariasis.

Authors

  • Phani Krishna Kondeti
    Bioinformatics Group, Department of Applied Biology, CSIR-Indian Institute of Chemical Technology, Hyderabad-500 007, Andhra Pradesh, India.
  • Kumar Ravi
    Centre for Excellence in Analytics, Institute for Development and Research in Banking Technology, Hyderabad-500 057, Telangana, India.
  • Srinivasa Rao Mutheneni
    Bioinformatics Group, Department of Applied Biology, CSIR-Indian Institute of Chemical Technology, Hyderabad-500 007, Andhra Pradesh, India.
  • Madhusudhan Rao Kadiri
    Bioinformatics Group, Department of Applied Biology, CSIR-Indian Institute of Chemical Technology, Hyderabad-500 007, Andhra Pradesh, India.
  • Sriram Kumaraswamy
    Bioinformatics Group, Department of Applied Biology, CSIR-Indian Institute of Chemical Technology, Hyderabad-500 007, Andhra Pradesh, India.
  • Ravi Vadlamani
    Centre for Excellence in Analytics, Institute for Development and Research in Banking Technology, Hyderabad-500 057, Telangana, India.
  • Suryanaryana Murty Upadhyayula
    National Institute of Pharmaceutical Education and Research, Guwahati-781 032, Assam, India.