Challenges for Predictive Modeling With Neural Network Techniques Using Error-Prone Dietary Intake Data.

Journal: Statistics in medicine
PMID:

Abstract

Dietary intake data are routinely drawn upon to explore diet-health relationships, and inform clinical practice and public health. However, these data are almost always subject to measurement error, distorting true diet-health relationships. Beyond measurement error, there are likely complex synergistic and sometimes antagonistic interactions between different dietary components, complicating the relationships between diet and health outcomes. Flexible models are required to capture the nuance that these complex interactions introduce. This complexity makes research on diet-health relationships an appealing candidate for the application of modern machine learning techniques, and in particular, neural networks. Neural networks are computational models that can capture highly complex, nonlinear relationships, so long as sufficient data are available. While these models have been applied in many domains, the impacts of measurement error on the performance of predictive modeling have not been widely investigated. In this work, we demonstrate the ways in which measurement error erodes the performance of neural networks and illustrate the care that is required for leveraging these models in the presence of error. We demonstrate the role that sample size and replicate measurements play in model performance, indicate a motivation for the investigation of transformations to additivity, and illustrate the caution required to prevent model overfitting. While the past performance of neural networks across various domains makes them an attractive candidate for examining diet-health relationships, our work demonstrates that substantial care and further methodological development are both required to observe increased predictive performance when applying these techniques compared to more traditional statistical procedures.

Authors

  • Dylan Spicker
    Department of Mathematics and Statistics, University of New Brunswick (Saint John), Saint John, New Brunswick, Canada.
  • Amir Nazemi
    Department of Systems Design Engineering, University of Waterloo, Waterloo, Ontario, Canada.
  • Joy Hutchinson
    School of Public Health Sciences, University of Waterloo, Waterloo, Ontario, Canada.
  • Paul Fieguth
    Department of Systems Design Engineering, University of Waterloo, Waterloo, Ontario, Canada.
  • Sharon Kirkpatrick
    School of Public Health Sciences, University of Waterloo, Waterloo, Ontario, Canada.
  • Michael Wallace
    Department of Medicine, Mayo Clinic, Director, Digestive Diseases Research Program, Editor in Chief Gastrointestinal Endoscopy, President, Florida Gastroenterology Society, Jacksonville, Florida, USA.
  • Kevin W Dodd
    Biometry Research Group, Division of Cancer Prevention, National Cancer Institute, Bethesda, Maryland, USA.