The JIBO Kids Corpus: A speech dataset of child-robot interactions in a classroom environment
Author(s)
Shankar, Natarajan Balaji; Afshan, Amber; Johnson, Alexander; Mahapatra, Aurosweta; Martin, Alejandra; Ni, Haolun; Park, Hae Won; Perez, Marlen Quintero; Yeung, Gary; Bailey, Alison; Breazeal, Cynthia; Alwan, Abeer; ... Show more Show less
DownloadPublished version (1.605Mb)
Publisher with Creative Commons License
Publisher with Creative Commons License
Creative Commons Attribution
Terms of use
Metadata
Show full item recordAbstract
This paper describes an original dataset of children's speech, collected through the use of JIBO, a social robot. The dataset encompasses recordings from 110 children, aged 4–7 years old, who participated in a letter and digit identification task and extended oral discourse tasks requiring explanation skills, totaling 21 h of session data. Spanning a 2-year collection period, this dataset contains a longitudinal component with a subset of participants returning for repeat recordings. The dataset, with session recordings and transcriptions, is publicly available, providing researchers with a valuable resource to advance investigations into child language development.
Date issued
2024-11-01Department
Massachusetts Institute of Technology. Media LaboratoryJournal
JASA Express Letters
Publisher
Acoustical Society of America
Citation
Natarajan Balaji Shankar, Amber Afshan, Alexander Johnson, Aurosweta Mahapatra, Alejandra Martin, Haolun Ni, Hae Won Park, Marlen Quintero Perez, Gary Yeung, Alison Bailey, Cynthia Breazeal, Abeer Alwan; The JIBO Kids Corpus: A speech dataset of child-robot interactions in a classroom environment. JASA Express Lett. 1 November 2024; 4 (11): 115201.
Version: Final published version