Analyzing Demographic Grocery Purchase Patterns in Kenyan Supermarkets Through Unsupervised Learning Techniques.

Journal: Inquiry : a journal of medical care organization, provision and financing
PMID:

Abstract

Kenya is experiencing a significant increase in the prevalence of non-communicable diseases (NCDs) such as cardiovascular diseases, hypertension, Type 2 diabetes, and certain cancers (bowel, lung, prostate, and uterine). This case is not unique to Kenya but is common in many Low and Middle-Income Countries (LMICs) in Africa. Many NCDs, are linked to diets high in added sugars, sodium, saturated fat, and low in fiber. There is a notable lack of information regarding the demographic differences among supermarket customers and their purchasing habits of healthy versus unhealthy foods in some parts of Africa. This gap in knowledge hinders the ability to connect grocery purchase patterns to NCDs, including obesity. Supermarkets in LMICs offer valuable demographic insights through grocery data. This research utilizes NOVA classification tool, data mining and unsupervised machine learning techniques to analyze grocery purchase patterns in 10 supermarkets across 5 counties in Kenya between 2022 and 2023. The apriori algorithm was used to create association rules and an analysis was done on the association rules to find out the relationship between demography (location, gender, and age) with purchase patterns. Individual data was collected along with transaction data, since the supermarkets logged transactions done by loyalty card customers. The main aim is to provide guidance to policymakers in public health. We collected 3 934 122 unique transactions and each transaction was associated with a customer who was identified with a unique customer ID. Findings from this research demonstrate that 53% of food purchases from these transactions were mainly industrially processed food items and males above the age of 50 years were the main consumers of these food items. The findings lead to the conclusion that this purchase trend has a chance of rising NCDs in older people. Therefore we recommend that policymakers adopt our recommendations to safeguard public health.

Authors

  • Reinpeter Momanyi
    African Population and Health Research Center, Nairobi, Kenya.
  • Steve Bicko Cygu
    African Population and Health Research Center, Nairobi, Kenya.
  • Agnes Kiragga
    Data Science Program, Africa Population Health Centre, Nairobi, Kenya.
  • Henry Owoko Odero
    African Population and Health Research Center, Nairobi, Kenya.
  • Maureen Ng'etich
    African Population and Health Research Center, Nairobi, Kenya.
  • Gershim Asiki
    African Population and Health Research Center, Nairobi, Kenya.
  • Tatenda Duncan Kavu
    African Population and Health Research Center, Nairobi, Kenya.