The role of valence, dominance, and pitch in perceptions of artificial intelligence (AI) conversational agents' voices.

Journal: Scientific reports
PMID:

Abstract

There is growing concern that artificial intelligence conversational agents (e.g., Siri, Alexa) reinforce voice-based social stereotypes. Because little is known about social perceptions of conversational agents' voices, we investigated (1) the dimensions that underpin perceptions of these synthetic voices and (2) the role that acoustic parameters play in these perceptions. Study 1 (N = 504) found that perceptions of synthetic voices are underpinned by Valence and Dominance components similar to those previously reported for natural human stimuli and that the Dominance component was strongly and negatively related to voice pitch. Study 2 (N = 160) found that experimentally manipulating pitch in synthetic voices directly influenced dominance-related, but not valence-related, perceptions. Collectively, these results suggest that greater consideration of the role that voice pitch plays in dominance-related perceptions when designing conversational agents may be an effective method for controlling stereotypic perceptions of their voices and the downstream consequences of those perceptions.

Authors

  • Victor Kenji M Shiramizu
    School of Psychological Sciences & Health, University of Strathclyde, Glasgow, Scotland.
  • Anthony J Lee
    Division of Psychology, University of Stirling, Stirling, Scotland.
  • Daria Altenburg
    Department of Marketing, Innovation and Organisation, Ghent University, Ghent, Belgium.
  • David R Feinberg
    Department of Psychology, Neuroscience & Behaviour, McMaster University, Hamilton, Canada.
  • Benedict C Jones
    School of Psychological Sciences & Health, University of Strathclyde, Glasgow, Scotland. benedict.jones@strath.ac.uk.