Predicting personality from patterns of behavior collected with smartphones.

Journal: Proceedings of the National Academy of Sciences of the United States of America
PMID:

Abstract

Smartphones enjoy high adoption rates around the globe. Rarely more than an arm's length away, these sensor-rich devices can easily be repurposed to collect rich and extensive records of their users' behaviors (e.g., location, communication, media consumption), posing serious threats to individual privacy. Here we examine the extent to which individuals' Big Five personality dimensions can be predicted on the basis of six different classes of behavioral information collected via sensor and log data harvested from smartphones. Taking a machine-learning approach, we predict personality at broad domain ([Formula: see text] = 0.37) and narrow facet levels ([Formula: see text] = 0.40) based on behavioral data collected from 624 volunteers over 30 consecutive days (25,347,089 logging events). Our cross-validated results reveal that specific patterns in behaviors in the domains of 1) communication and social behavior, 2) music consumption, 3) app usage, 4) mobility, 5) overall phone activity, and 6) day- and night-time activity are distinctively predictive of the Big Five personality traits. The accuracy of these predictions is similar to that found for predictions based on digital footprints from social media platforms and demonstrates the possibility of obtaining information about individuals' private traits from behavioral patterns passively collected from their smartphones. Overall, our results point to both the benefits (e.g., in research settings) and dangers (e.g., privacy implications, psychological targeting) presented by the widespread collection and modeling of behavioral data obtained from smartphones.

Authors

  • Clemens Stachl
    Department of Psychological Methodology and Assessment, Ludwig-Maximilians-University Munich, Munich, Germany.
  • Quay Au
    Department of Statistics, Computational Statistics, Ludwig-Maximilians-Universität München, 80539 Munich, Germany.
  • Ramona Schoedel
    Department of Psychology, Psychological Methods and Assessment, Ludwig-Maximilians-Universität München, 80802 Munich, Germany.
  • Samuel D Gosling
    Department of Psychology, University of Texas at Austin, Austin, TX 78712.
  • Gabriella M Harari
    Department of Communication, Media and Personality Laboratory, Stanford University, Stanford, CA 94305.
  • Daniel Buschek
    Research Group Human Computer Interaction and Artificial Intelligence, Department of Computer Science, University of Bayreuth, 95447 Bayreuth, Germany.
  • Sarah Theres Völkel
    Media Informatics Group, Ludwig-Maximilians-Universität München, 80337 Munich, Germany.
  • Tobias Schuwerk
    Department of Psychology, Developmental Psychology, Ludwig-Maximilians-Universität München, 80802 Munich, Germany.
  • Michelle Oldemeier
    Department of Psychology, Psychological Methods and Assessment, Ludwig-Maximilians-Universität München, 80802 Munich, Germany.
  • Theresa Ullmann
    Institute for Medical Information Processing, Biometry, and Epidemiology, Ludwig-Maximilians-Universität München, 81377 Munich, Germany.
  • Heinrich Hussmann
    Media Informatics Group, Ludwig-Maximilians-Universität München, 80337 Munich, Germany.
  • Bernd Bischl
    Department of Statistics, Ludwig-Maximilians-University, Munich, Germany.
  • Markus Bühner
    Department of Psychological Methodology and Assessment, Ludwig-Maximilians-University Munich, Munich, Germany.