Factor Analysis and Prediction of Disease Risk Based on Large Ensembles of Models: Application to Virus Yellows in Sugar Beet.

Journal: Phytopathology

Published Date: Jun 13, 2025

Abstract

Identifying disease risk factors, characterizing their effects, and forecasting disease risk across space and time are crucial tasks in human, animal, and plant epidemiology. Statistical and machine learning models have largely superseded purely descriptive analyses of data in handling these tasks. In addition, these models have demonstrated their full potential in the current era, characterized by an unprecedented abundance of data. However, applying these models to real-world, large-scale data sets raises critical questions: Which model should be used? Which explanatory variables should be selected? What data should be allocated for training and validation…? The answers to these questions often have a significant impact on the analysis outcomes. One way to address some of these challenges is to analyze risk factors and predict risk by using an ensemble of models rather than relying on a single model. This approach is developed in this article and implemented in the case of virus yellows in sugar beet in France. Among the explanatory variables correlated with the severity of virus yellows, we identified winter and spring temperatures (positive correlation), spring humidity and precipitation (negative correlation), the proportion of cereal crops (positive correlation), the proportion of grasslands (negative correlation), and the distance to sugar beet seed production fields (negative correlation). Additionally, we found that predictions are generally more robust when using a spatial aggregation of models compared to relying on the best individual model. Our approach is highly versatile and can be applied to characterize and predict the spatio-temporal distributions of diverse diseases.

Authors

D Chauvin

INRAE, BioSP, Avignon, France; dorian2.c2@gmail.com.
E Gabriel

INRAE, BioSP, Avignon, France; edith.gabriel@inrae.fr.
D Martinetti

INRAE, BioSP, Paris, France; davide.martinetti@inrae.fr.
J Papaïx

INRAE, BioSP, Paris, France; julien.papaix@inrae.fr.
C Martinez

INRAE, Ecodev, Avignon, France; cesar.martinez@inrae.fr.
G Geniaux

INRAE, Ecodev, Avignon, France; ghislain.geniaux@inrae.fr.
F Joudelat

ITB, Paris, France; f.joudelat@itbfr.org.
S Soubeyrand

INRAE, BioSP, Avignon, France; samuel.soubeyrand@inrae.fr.

Keywords

No keywords available for this article.

External Resources

View on PubMed Access via DOI PubMed (40512064)

Factor Analysis and Prediction of Disease Risk Based on Large Ensembles of Models: Application to Virus Yellows in Sugar Beet.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals

Factor Analysis and Prediction of Disease Risk Based on Large Ensembles of Models: Application to Virus Yellows in Sugar Beet.

Abstract

Authors

Keywords

External Resources

Don't Miss the Future of Medicine

Popular Topics

Recent Journals