CHILDES Corpora

This page provides an index to CHILDES corpora, organized by language group and data type.

Signed contribution forms are available here.

Corpora that focus on early child phonology can be found at the PhonBank site. Most PhonBank corpora contain transcriptions of child productions only, without the surrounding conversation. However, six PhonBank corpora (Davis, Lyon, Paris, Providence, Yamaguchi, and WeistJarosz) have full transcriptions. Those corpora are included here in the CHILDES database, and their transcriptions in Phon format can be found at the PhonBank website.

Collection      Description
Bilingual       children learning two or more languages
Celtic          Irish and Welsh
Clinical-MOR    language disorders - English
Clinical        language disorders - other languages
Chinese         Cantonese, Mandarin, Taiwanese
DutchAfrikaans  Dutch and Afrikaans
EastAsian       Korean, Indonesian, Thai
Eng-AAE         North America
Eng-NA          North America
Eng-UK          United Kingdom
French          French
German          German
Japanese        Japanese
Romance         Catalan, Italian, Portuguese, Romanian
Scandinavian    Danish, Swedish, Icelandic, Norwegian
Slavic          Croatian, Czech, Polish, Russian, Serbian, Slovenian
Spanish         Spanish
Other - 1       Arabic, Basque, Berber, Cree
Other - 2       Estonian, Farsi, Greek, Hebrew, Hungarian
Other - 3       Jamaican, Nungon, Quechua, Sesotho, Tamil, Turkish
Frogs           Frog story narratives
MAIN            MAIN narratives
Narrative       Other narratives
XLing           Crosslinguistic studies