CHILDES Chinese Corpora

This page provides an index to CHILDES data from Chinese languages.

You can browse the Chinese database online from this link.

For consistency, we use Simplified characters for processing through MOR and for display on the browser pages. However, for easier reading, you can download the Mandarin corpora in Traditional characters from here. If this version is not sufficiently up to date, you can send an update request to Brian MacWhinney.

Corpus          Age Range    N      Media  Comments

Cantonese
HKU-70          2;6-5;6      70     audio  Cross-sectional corpus designed to complement the CanCorp
Lee/Wong/Leung  1;05-3;08    8      -      Longitudinal study; language development of Cantonese-speaking children, each recorded for approximately one year
PaidoCantonese  1;5-2;5      80     audio  In PhonBank

Mandarin
AcadLang        -            -      -      -
Beijing         1;9.3-2;2.7  10     -      Longitudinal
Chang1          3-6          24     -      Toy play
Chang2          3-4          16     -      Anecdote and book reading
Context         2            25+25  -      25 Mandarin-speaking and 25 English-speaking children
TCCM            1;7-3;4      10     -      Longitudinal, in Taiwan
Tong            1;7-3;4      1      video  Longitudinal study linked to video
Xinjiang        4-8          60     -      Children in Xinjiang
Xu/Min/Chen     1;3-3;0      5      -
Zhou1           3-6          15     -      Play sessions with mother
Zhou2           3-6          15     -      Play sessions with mother
ZhouDinner      3,4,5,6      80     -      Dinner conversations
ZhouNarratives  3,4,5,6      200    -      Hungry Caterpillar and Robber stories
ZhouPeer        3,4,5,6      80     -      Peer and role-playing talk

Taiwanese
Tsay            2;0-2;6      4      audio  In PhonBank