I tried a bunch of things: The dangers of unexpected overfitting in classification of brain data.

Journal: Neuroscience and biobehavioral reviews

Published Date: Oct 6, 2020

Abstract

Machine learning has enhanced the abilities of neuroscientists to interpret information collected through EEG, fMRI, and MEG data. With these powerful techniques comes the danger of overfitting of hyperparameters which can render results invalid. We refer to this problem as 'overhyping' and show that it is pernicious despite commonly used precautions. Overhyping occurs when analysis decisions are made after observing analysis outcomes and can produce results that are partially or even completely spurious. It is commonly assumed that cross-validation is an effective protection against overfitting or overhyping, but this is not actually true. In this article, we show that spurious results can be obtained on random data by modifying hyperparameters in seemingly innocuous ways, despite the use of cross-validation. We recommend a number of techniques for limiting overhyping, such as lock boxes, blind analyses, pre-registrations, and nested cross-validation. These techniques, are common in other fields that use machine learning, including computer science and physics. Adopting similar safeguards is critical for ensuring the robustness of machine-learning techniques in the neurosciences.

Authors

Mahan Hosseini

The School of Computing, University of Kent, United Kingdom.
Michael Powell

Manada Technology LLC, United States.
John Collins

Physics Department, Penn State University, United States.
Chloe Callahan-Flintoft

Army Research Lab, Aberdeen Proving Grounds, United States.
William Jones

The School of Computing, University of Kent, United Kingdom.
Howard Bowman

The School of Computing, University of Kent, United Kingdom; School of Psychology, University of Birmingham, United Kingdom.
Brad Wyble

Psychology Department, Penn State University, United States. Electronic address: BWyble@gmail.com.

Keywords

Brain Humans Machine Learning Magnetic Resonance Imaging

External Resources

View on PubMed Access via DOI PubMed (33035522)

I tried a bunch of things: The dangers of unexpected overfitting in classification of brain data.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals