Evidence of a predictive coding hierarchy in the human brain listening to speech.

Journal: Nature human behaviour
PMID:

Abstract

Considerable progress has recently been made in natural language processing: deep learning algorithms are increasingly able to generate, summarize, translate and classify texts. Yet, these language models still fail to match the language abilities of humans. Predictive coding theory offers a tentative explanation to this discrepancy: while language models are optimized to predict nearby words, the human brain would continuously predict a hierarchy of representations that spans multiple timescales. To test this hypothesis, we analysed the functional magnetic resonance imaging brain signals of 304 participants listening to short stories. First, we confirmed that the activations of modern language models linearly map onto the brain responses to speech. Second, we showed that enhancing these algorithms with predictions that span multiple timescales improves this brain mapping. Finally, we showed that these predictions are organized hierarchically: frontoparietal cortices predict higher-level, longer-range and more contextual representations than temporal cortices. Overall, these results strengthen the role of hierarchical predictive coding in language processing and illustrate how the synergy between neuroscience and artificial intelligence can unravel the computational bases of human cognition.

Authors

  • Charlotte Caucheteux
    Facebook AI Research, Paris, France. ccaucheteux@fb.com.
  • Alexandre Gramfort
    Paris-Saclay Center for Data Science, Université Paris-Saclay, 91440 Orsay, France; INRIA, Parietal team, Saclay, 91120 Palaiseau, France; LTCI, Télécom ParisTech, 75013 Paris, France.
  • Jean-Rémi King
    From the Departments of Neurology (J.C., K.D., A.M., C.C., K.M.B., A.V., J.U.O., S.P., S.A., D.R., M.M., A.E., B.R.) and Neurosurgery (E.S.C.), Columbia University, and the Department of Psychology, New York University (J.-R.K.) - both in New York.