Learning about color from language.

Journal: Communications psychology
Published Date:

Abstract

Certain colors are strongly associated with certain adjectives (e.g. red is hot, blue is cold). Some of these associations are grounded in visual experiences such as seeing glowing red embers. Surprisingly, despite having no visual experience, many congenitally blind people show very similar color associations which are likely learned through language. We show that these associations are indeed embedded in the statistical structure of language. We apply a projection method to word embeddings trained on corpora of spoken and written language to identify color-adjective associations as they are represented in English. These projections were predictive of color-adjective associations reported by blind and sighted English speakers. The most predictive projections were generated by embeddings derived from a corpus of fiction, which outperformed even the state-of-the-art large language model, GPT-4. By augmenting the training corpora in various ways we discover the types of sentences most responsible for conveying the color-adjective associations to the models. We find that word embedding models learn these associations from indirect (second-order) co-occurrences, and that when prompted, people are able to identify some of the words that are most informative for associating colors with specific adjectives. Learning through linguistic co-occurrences is one way word meanings can be continually aligned across language users despite large variations in perceptual experience.

Authors

  • Qiawen Liu
    Department of Psychology, University of Wisconsin-Madison, Madison, WI, 53706, USA. ql3814@princeton.edu.
  • Jeroen van Paridon
    Department of Psychology, University of Wisconsin-Madison, Madison, WI, 53706, USA.
  • Gary Lupyan
    Department of Psychology, University of Wisconsin-Madison, Madison, WI, 53706, USA.

Keywords

No keywords available for this article.