Using sequences of life-events to predict human lives.

Journal: Nature Computational Science
PMID:

Abstract

Here we represent human lives in a way that is structurally similar to language, and we exploit this similarity to adapt natural language processing techniques to examine the evolution and predictability of human lives based on detailed event sequences. We do this by drawing on a comprehensive registry dataset that covers Denmark across several years and includes information about life-events related to health, education, occupation, income, address and working hours, recorded with day-to-day resolution. We create embeddings of life-events in a single vector space and show that this embedding space is robust and highly structured. Our models allow us to predict diverse outcomes ranging from early mortality to personality nuances, outperforming state-of-the-art models by a wide margin. Using methods for interpreting deep learning models, we probe the algorithm to understand the factors that enable our predictions. Our framework allows researchers to discover potential mechanisms that affect life outcomes, as well as the associated possibilities for personalized interventions.

Authors

  • Germans Savcisens
    DTU Compute, Technical University of Denmark, Lyngby, Denmark.
  • Tina Eliassi-Rad
    Network Science Institute, Northeastern University, Boston, MA, USA.
  • Lars Kai Hansen
    DTU Compute, Technical University of Denmark, Lyngby, Denmark.
  • Laust Hvas Mortensen
    Section for Epidemiology, Department of Public Health, University of Copenhagen, Copenhagen, Denmark.
  • Lau Lilleholt
    Department of Psychology, University of Copenhagen, Copenhagen, Denmark.
  • Anna Rogers
    Computer Science Department, IT University of Copenhagen, Copenhagen, Denmark.
  • Ingo Zettler
    Department of Psychology, University of Copenhagen, Copenhagen, Denmark.
  • Sune Lehmann
    Copenhagen Center for Social Data Science, University of Copenhagen, Copenhagen, Denmark.