Incorporating Demographic Embeddings Into Language Understanding.

Journal: Cognitive science
PMID:

Abstract

Meaning depends on context. This applies in obvious cases like deictics or sarcasm as well as more subtle situations like framing or persuasion. One key aspect of this is the identity of the participants in an interaction. Our interpretation of an utterance shifts based on a variety of factors, including personal history, background knowledge, and our relationship to the source. While obviously an incomplete model of individual differences, demographic factors provide a useful starting point and allow us to capture some of this variance. However, the relevance of specific demographic factors varies between situations-where age might be the key factor in one context, ideology might dominate in another. To address this challenge, we introduce a method for combining demographics and context into situated demographic embeddings-mapping representations into a continuous geometric space appropriate for the given domain, showing the resulting representations to be functional and interpretable. We further demonstrate how to make use of related external data so as to apply this approach in low-resource situations. Finally, we show how these representations can be incorporated into improve modeling of real-world natural language understanding tasks, improving model performance and helping with issues of data sparsity.

Authors

  • Justin Garten
    Department of Computer Science, University of Southern California.
  • Brendan Kennedy
    Department of Computer Science, University of Southern California.
  • Joe Hoover
    Brain and Creativity Institute, University of Southern California.
  • Kenji Sagae
    Department of Linguistics, University of California, Davis.
  • Morteza Dehghani
    Department of Computer Science, University of Southern California.