Early phonetic learning without phonetic categories: Insights from large-scale simulations on realistic input.

Journal: Proceedings of the National Academy of Sciences of the United States of America
Published Date:

Abstract

Before they even speak, infants become attuned to the sounds of the language(s) they hear, processing native phonetic contrasts more easily than nonnative ones. For example, between 6 to 8 mo and 10 to 12 mo, infants learning American English get better at distinguishing English and [l], as in "rock" vs. "lock," relative to infants learning Japanese. Influential accounts of this phenomenon initially proposed that infants group sounds into native vowel- and consonant-like phonetic categories-like and [l] in English-through a statistical clustering mechanism dubbed "distributional learning." The feasibility of this mechanism for learning phonetic categories has been challenged, however. Here, we demonstrate that a distributional learning algorithm operating on naturalistic speech can predict early phonetic learning, as observed in Japanese and American English infants, suggesting that infants might learn through distributional learning after all. We further show, however, that, contrary to the original distributional learning proposal, our model learns units too brief and too fine-grained acoustically to correspond to phonetic categories. This challenges the influential idea that what infants learn are phonetic categories. More broadly, our work introduces a approach to the study of early phonetic learning, together with a quantitative modeling framework that can handle realistic input. This allows accounts of early phonetic learning to be linked to concrete, systematic predictions regarding infants' attunement.

Authors

  • Thomas Schatz
    Department of Linguistics and UMIACS, University of Maryland, College Park, Maryland 20742, USA thomas.schatz@laposte.net.
  • Naomi H Feldman
    Department of Linguistics, University of Maryland, College Park, MD 20742.
  • Sharon Goldwater
    School of Informatics, University of Edinburgh, Edinburgh EH8 9AB, United Kingdom.
  • Xuan-Nga Cao
    Cognitive Machine Learning, École Normale Supérieure-École des Hautes Études en Sciences Sociales-Paris Sciences et Lettres Research University-CNRS-Institut National de Recherche en Informatique et en Automatique, 75012 Paris, France.
  • Emmanuel Dupoux
    EHESS, ENS, PSL Research University, CNRS, INRIA, France. Electronic address: emmanuel.dupoux@ens.fr.