Predictive coding narrows the gap between convolutional networks and human brain function in misspelled-word reading
Journal:
bioRxiv
Published Date:
Jan 14, 2026
Abstract
Humans can readily recognize words even when they are misspelled, though with slower responses, demonstrating remarkable robustness in reading. The computational mechanisms underlying this combination of robustness and cost in reading remain unclear. Convolutional neural networks (CNNs) have been used to model primate visual and human word processing. However, standard CNNs lack recurrent and feedback connections abundant in the ventral visual stream, which are thought to support robust vision. Here, we incorporated brain-inspired predictive coding dynamics into a CNN to investigate whether predictive coding can enhance brain-likeness in misspelled-word reading. The CNN backbone was trained to classify images of a 1000-word Finnish vocabulary (supervised), and the predictive coding modules were trained to reconstruct the activity in the previous layer (unsupervised). The model, with and without the predictive coding dynamics, was then evaluated using the same real and misspelled word stimuli presented to human participants during a magnetoencephalography (MEG) recording. Predictive coding improved model performance on misspelled words, particularly reducing the accuracy gap between real and word-like misspelled words, mirroring human behavioral patterns. Furthermore, representational similarity and ridge regression analyses showed stronger correspondence between model activations and human MEG responses. These findings provide converging behavioral and neural evidence that predictive coding dynamics narrows the gap between CNN performance and human brain function in misspelled-word reading, offering a biologically plausible computational mechanism for the brains robust reading ability.
Author summaryWhen we read, we can often understand a word even if it is misspelled, e.g, "lamguage", a form we have never learned. Models of visual word recognition in the brain should also exhibit such flexibility. Convolutional neural networks (CNNs) are gaining popularity as models of visual processing in the brain, including reading, yet they typically fail when faced with misspelled words. In our study, we enhanced a CNN by adding feedback connections that perform predictive coding, i.e. continuously attempt to reconstruct the input from the initial output and adjust the output until it can do so. Through this predictive coding loop, the model learned to refine its internal representations, mimicking how the brain may process unexpected or noisy inputs. We found that this predictive coding network not only identified the closest real word form of misspelled words more effectively than a conventional feedforward model but also produced activity patterns resembling those measured from human brain using magnetoencephalography. These results suggest that predictive coding could be a key computational principle enabling the brains remarkable flexibility in reading and provide insights into how biological mechanisms could inspire brain-like computational models.