Public attitudes value interpretability but prioritize accuracy in Artificial Intelligence.

Journal: Nature communications
Published Date:

Abstract

As Artificial Intelligence (AI) proliferates across important social institutions, many of the most powerful AI systems available are difficult to interpret for end-users and engineers alike. Here, we sought to characterize public attitudes towards AI interpretability. Across seven studies (N = 2475), we demonstrate robust and positive attitudes towards interpretable AI among non-experts that generalize across a variety of real-world applications and follow predictable patterns. Participants value interpretability positively across different levels of AI autonomy and accuracy, and rate interpretability as more important for AI decisions involving high stakes and scarce resources. Crucially, when AI interpretability trades off against AI accuracy, participants prioritize accuracy over interpretability under the same conditions driving positive attitudes towards interpretability in the first place: amidst high stakes and scarce resources. These attitudes could drive a proliferation of AI systems making high-impact ethical decisions that are difficult to explain and understand.

Authors

  • Anne-Marie Nussberger
    Center for Humans and Machines, Max Planck Institute for Human Development, Berlin, Germany. nussberger@mpib-berlin.mpg.de.
  • Lan Luo
    School of Civil Engineering and Architecture, Nanchang University, Nanchang, PR China.
  • L Elisa Celis
    Department of Statistics and Data Science, Yale University, New Haven, CT, USA.
  • M J Crockett
    Department of Psychology and University Center for Human Values, Princeton University, Princeton, NJ, USA. mj.crockett@princeton.edu.