The JIBO Kids Corpus: A speech dataset of child-robot interactions in a classroom environment.

Journal: JASA express letters
PMID:

Abstract

This paper describes an original dataset of children's speech, collected through the use of JIBO, a social robot. The dataset encompasses recordings from 110 children, aged 4-7 years old, who participated in a letter and digit identification task and extended oral discourse tasks requiring explanation skills, totaling 21 h of session data. Spanning a 2-year collection period, this dataset contains a longitudinal component with a subset of participants returning for repeat recordings. The dataset, with session recordings and transcriptions, is publicly available, providing researchers with a valuable resource to advance investigations into child language development.

Authors

  • Natarajan Balaji Shankar
    Department of Electrical and Computer Engineering, University of California Los Angeles, Los Angeles, California 90095, USA.
  • Amber Afshan
    Department of Electrical and Computer Engineering, University of California Los Angeles, Los Angeles, California 90095, USA.
  • Alexander Johnson
    Department of Electrical and Computer Engineering, University of California Los Angeles, Los Angeles, California 90095, USA.
  • Aurosweta Mahapatra
    Department of Electrical and Computer Engineering, University of California Los Angeles, Los Angeles, California 90095, USA.
  • Alejandra Martin
    Department of Education, University of California Los Angeles, Los Angeles, California 90095, USA.
  • Haolun Ni
    Department of Electrical and Computer Engineering, University of California Los Angeles, Los Angeles, California 90095, USA.
  • Hae Won Park
    MIT Media Lab, Cambridge, MA, United States.
  • Marlen Quintero Perez
    Department of Education, University of California Los Angeles, Los Angeles, California 90095, USA.
  • Gary Yeung
    Department of Electrical and Computer Engineering, University of California Los Angeles, Los Angeles, California 90095, USA.
  • Alison Bailey
    Department of Education, University of California Los Angeles, Los Angeles, California 90095, USA.
  • Cynthia Breazeal
    Media Lab, Massachusetts Institute of Technology, Cambridge, MA, USA.
  • Abeer Alwan
    University of California, Los Angeles, CA, USA.