This page provides an index to CHILDES corpora, organized by language group and data type. In accordance with TalkBank rules, any use of data from these corpora must cite at least one corpus reference (see citation info on corpus page) and acknowledge CHILDES grant support -- NICHD HD082736.
Signed contribution forms are available here .
Corpora that focus on early child phonology can be found at the PhonBank site . The majority of PhonBank corpora contain transcriptions of child productions without inclusion of the overall conversation. However, there are six PhonBank corpora (Davis, Lyon, Paris, Providence, Yamaguchi, and WeistJarosz) that have full transcriptions. Those corpora are included here in the CHILDES database and their transcriptions in Phon format can be found at the PhonBank website.
|Bilingual||children learning two or more languages||Celtic||Irish and Welsh|
|Clinical-MOR||language disorders - English||Clinical||language disorders - other languages|
|Chinese||Cantonese, Mandarin, Taiwanese||DutchAfrikaans||Dutch and Afrikaans|
|EastAsian||Korean, Indonesian, Thai||Eng-AAE||North America|
|Eng-NA||North America||Eng-UK||United Kingdom|
|Japanese||Japanese||Romance||Catalan, Italian, Portuguese, Romanian|
|Scandinavian||Danish, Swedish, Icelandic, Norwegian||Slavic||Bulgarian, Croatian, Czech, Polish, Russian, Serbian, Slovenian|
|Spanish||Spanish||Other - 1||Arabic, Basque, Berber, Cree|
|Other - 2||Estonian, Farsi, Greek, Hebrew, Hungarian||Other - 3||Jamaican, Nungon, Quechua, Sesotho, Tamil, Turkish|
|Frogs||Frog story narratives||MAIN||MAIN narratives|
|Narrative||Other narratives||XLing||Crosslinguistic studies|