CHILDES Chinese Corpora

This page provides an index to CHILDES data from Chinese languages.

You can browse the Chinese database online from this link.

For Mandarin, we use Simplified characters; for Cantonese, we use Traditional characters. To convert the Mandarin corpora to Traditional characters, you can use this converter.

Alternatively, you can download this complete set of transcripts in Traditional characters. However, that version may lag slightly behind the latest corrections.

Corpus          Age Range  N    Media  Comments

Cantonese
HKU-70          2;6-5;6    70   audio  Cross-sectional corpus designed to complement the CanCorp
Lee/Wong/Leung  1;05-3;08  8    -      Longitudinal study; language development of Cantonese-speaking children each recorded for approximately one year.
PaidoCantonese  1;5-2;5    80   audio  in PhonBank

Mandarin
AcadLang        -          15   -      -
Chang1          3-6        24   -      Toy play
Chang2          3-4        16   -      Anecdote and Book Reading
ChangPlay       3,4,5      21   audio  Toy Play Narratives
ChangPN         3-9        181  audio  Personal Narratives
Erbaugh         2;0-3;9    4    audio  home recordings
LiReading       4,5,6      214  -      Shared book reading
LiZhou          3,4,5,6    80   -      Peer and role playing talk
NSCtoys         -          -    -      Not yet open to public
TCCM-Reading    2          20   -      book reading
TCCM            1;7-3;4    10   -      longitudinal in Taiwan
Tong            1;7-3;4    1    video  longitudinal study linked to video
Xinjiang        4-8        60   audio  children in Xinjiang
Zhou1           3-6        15   -      Play sessions with mother
Zhou2           3-6        15   -      Play sessions with mother
Zhou3           0;8-4;5    1    -      longitudinal case study
ZhouAssessment  3,4,5,6    334  -      assessment interactions
ZhouDinner      3,4,5,6    80   audio  Dinner conversations
ZhouNarratives  3,4,5,6    200  audio  Hungry Caterpillar and Robber stories

Taiwanese
Tsay            2;0-2;6    4    audio  in PhonBank