Cloud based DevOps Framework for Identifying Risk Factors of Hospital Utilization
Journal:
arXiv
Published Date:
Apr 18, 2025
Abstract
A scalable and reliable system is required to analyze the National Health and
Nutrition Examination Survey (NHANES) data efficiently to understand hospital
utilization risk factors. This study aims to investigate the integration of
continuous integration and deployment (CI/CD) practices in data science
workflows, specifically focusing on analyzing NHANES data to identify the
prevalence of diabetes, obesity, and cardiovascular diseases. An end-to-end
cloud-based DevOps framework is proposed for data analysis which examines risk
factors associated with hospital utilization and evaluates key hospital
utilization metrics. We have also highlighted the modular structure of the
framework that can be generalized for any other domains beyond healthcare. In
the framework, an online data update method is provided which can be extended
further using both real and synthetic data. As such, the framework can be
especially useful for sparse dataset domains such as environmental science,
robotics, cybersecurity, and cultural heritage and arts.