CHILDES Chinese Corpora

This page provides an index to CHILDES data from Chinese languages.

You can browse the Chinese database online from this link.

For Mandarin, we use Simplified characters. For Cantonese, we used Traditional characters. If you want to convert the Mandarin corpora to Traditional, you can use this converter.
Corpus Age_Range N Media Comments
Cantonese
HKU-70 2;6-5;6 70 audio Cross-sectional corpus designed to complement the CanCorp
Lee/Wong/Leung 1;05-3;08 8 - Longitudinal study; language development of Cantonese-speaking children each recorded for approximately one year.
PaidoCantonese 1;5-2;5 80 audio in PhonBank
Mandarin
AcadLang - 15 - -
Beijing 1;9.3-2;2.7 10 audio longitudinal
Chang1 3-6 24 - Toy play
Chang2 3-4 16 - Anecdote and Book Reading
ChangPN 3-9 181 audio Personal Narratives
Context 2 25+25 - 25 Mandarin and 25 English-speaking children
LiZhou 3,4,5,6 80 - Peer and role playing talk
TCCM 1;7-3;4 10 - longitudinal in Taiwan
Tong 1;7-3;4 1 video longitudinal study linked to video
Xinjiang 4-8 60 audio children in Xinjiang
Zhou1 3-6 15 - Play sessions with mother
Zhou2 3-6 15 - Play sessions with mother
Zhou3 0;8-4;5 1 - longitudinal case study
ZhouDinner 3,4,5,6 80 audio Dinner conversations
ZhouNarratives 3,4,5,6 200 audio Hungry Caterpillar and Robber stories
Taiwanese
Tsay 2;0-2;6 4 audio in PhonBank