Influenza forecasting for French regions combining EHR, web and climatic data sources with a machine learning ensemble approach.

Journal: PloS one
PMID:

Abstract

Effective and timely disease surveillance systems have the potential to help public health officials design interventions to mitigate the effects of disease outbreaks. Currently, healthcare-based disease monitoring systems in France offer influenza activity information that lags real-time by one to three weeks. This temporal data gap introduces uncertainty that prevents public health officials from having a timely perspective on the population-level disease activity. Here, we present a machine-learning modeling approach that produces real-time estimates and short-term forecasts of influenza activity for the twelve continental regions of France by leveraging multiple disparate data sources that include, Google search activity, real-time and local weather information, flu-related Twitter micro-blogs, electronic health records data, and historical disease activity synchronicities across regions. Our results show that all data sources contribute to improving influenza surveillance and that machine-learning ensembles that combine all data sources lead to accurate and timely predictions.

Authors

  • Canelle Poirier
    Computational Health Informatics Program, Boston Children's Hospital, Boston, MA, United States.
  • Yulin Hswen
    Department of Epidemiology and Biostatistics, University of California, San Francisco, San Francisco, United States.
  • Guillaume Bouzille
    Univ Rennes, CHU Rennes, Inserm, LTSI - UMR 1099, F-35000 Rennes, France.
  • Marc Cuggia
    Univ Rennes, CHU Rennes, Inserm, LTSI - UMR 1099, F-35000 Rennes, France.
  • Audrey Lavenu
    Université de Rennes 1, Faculté de médecine, Rennes, France.
  • John S Brownstein
    Boston Children's Hospital, Harvard Medical School, Boston, Massachusetts, USA.
  • Thomas Brewer
    Innovation Program, Boston Children's Hospital, Boston, MA, United States of America.
  • Mauricio Santillana
    Harvard School of Engineering and Applied Sciences, Cambridge, Massachusetts, United States of America; Boston Children's Hospital Informatics Program, Boston, Massachusetts, United States of America; Harvard Medical School, Boston, Massachusetts, United States of America.