Modeling rapid language learning by distilling Bayesian priors into artificial neural networks.
Journal:
Nature Communications
Published Date:
May 20, 2025
Abstract
Humans can learn languages from remarkably little experience. Developing computational models that explain this ability has been a major challenge in cognitive science. Existing approaches have been successful at explaining how humans generalize rapidly in controlled settings but are usually too restrictive to tractably handle naturalistic data. We show that learning from limited naturalistic data is possible with an approach that bridges the divide between two popular modeling traditions: Bayesian models and neural networks. This approach distills a Bayesian model's inductive biases (the factors that guide generalization) into a neural network that has flexible representations. Like a Bayesian model, the resulting system can learn formal linguistic patterns from limited data. Like a neural network, it can also learn aspects of English syntax from naturally occurring sentences. Thus, this model provides a single system that can learn rapidly and can handle naturalistic data.
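The abstract does not spell out implementation details, but the general idea of distilling a prior into a network can be illustrated with meta-learning: tasks are sampled from a prior, and a network's initial weights are trained so that it adapts to any such task from a handful of examples. The sketch below is only an illustrative toy, not the paper's actual method or data: the "prior" over template languages, the function names (sample_task, adapt, meta_train), and the Reptile-style outer loop are all assumptions made for this example.

```python
# Minimal sketch of distilling a prior into a network via meta-learning.
# Assumptions: a toy prior over binary "template" languages and a
# Reptile-style update; these are illustrative, not the paper's method.
import copy
import torch
import torch.nn as nn

def sample_task(seq_len=8, n_examples=32):
    """Sample a toy task from a simple 'prior' over patterns.

    Each task picks a random binary template; sequences matching the
    template are labeled 1, others 0. This stands in for sampling a
    grammar from a Bayesian prior over languages.
    """
    template = torch.randint(0, 2, (seq_len,)).float()
    x = torch.randint(0, 2, (n_examples, seq_len)).float()
    x[: n_examples // 2] = template  # force half the examples to match
    y = (x == template).all(dim=1).float().unsqueeze(1)
    return x, y

def make_model(seq_len=8):
    return nn.Sequential(nn.Linear(seq_len, 32), nn.ReLU(), nn.Linear(32, 1))

def adapt(model, x, y, steps=5, lr=0.1):
    """Inner loop: fit a copy of the model to one sampled task."""
    adapted = copy.deepcopy(model)
    opt = torch.optim.SGD(adapted.parameters(), lr=lr)
    loss_fn = nn.BCEWithLogitsLoss()
    for _ in range(steps):
        opt.zero_grad()
        loss_fn(adapted(x), y).backward()
        opt.step()
    return adapted

def meta_train(meta_model, n_tasks=1000, meta_lr=0.05):
    """Outer loop (Reptile-style): nudge the meta-parameters toward the
    task-adapted parameters so the initialization absorbs the prior."""
    for _ in range(n_tasks):
        x, y = sample_task()
        adapted = adapt(meta_model, x, y)
        with torch.no_grad():
            for p_meta, p_task in zip(meta_model.parameters(),
                                      adapted.parameters()):
                p_meta += meta_lr * (p_task - p_meta)
    return meta_model

if __name__ == "__main__":
    model = meta_train(make_model())
    # After meta-training, a new task drawn from the same prior should be
    # learnable from only a few gradient steps.
    x_new, y_new = sample_task()
    fast = adapt(model, x_new, y_new, steps=3)
    acc = ((torch.sigmoid(fast(x_new)) > 0.5).float() == y_new).float().mean()
    print(f"few-shot accuracy on a new sampled task: {acc:.2f}")
```

In this toy setup, the meta-trained initialization plays the role of the distilled prior: rapid generalization on new tasks comes from the initialization rather than from explicit Bayesian inference at test time.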