CHILDES Korean Ko Corpus

Eon-Suk Ko
English Language and Literature
Chosun University


Participants: 30
Type of Study: naturalistic, cross-sectional
Location: Seoul
Media type: audio
DOI: doi:10.21415/8ZYX-QC86

Browsable transcripts

Download transcripts

Link to media folder

Citation information

J. Jo & Ko, E.-S. (2018) Korean mothers attune the frequency and acoustic saliency of sound symbolic words to the linguistic maturity of their children, Frontiers in Psychology 9:2225, doi: 10.3389/fpsyg.2018.02225

Ko, E.-S., Jo, J., On, K.-W. & Zhang, B.-T. (2020) Introducing the Ko Corpus of Korean mother-child interaction, Frontiers in Psychology. doi: 10.3389/fpsyg.2020.602623

In accordance with TalkBank rules, any use of data from this corpus must be accompanied by at least one of the above references.

Authors of any published work citing the data are requested to send a copy to the contributor.

Project Description

The participants included in this corpus are 30 mother-child pairs. This cross-sectional data set includes 10 dyads in each of the Preverbal (M=8 months), Early speech (M=13 months), and Multi-word (M=27 months) group. The data were collected at a mock apartment laboratory in Seoul National University. It consists of a foyer, a living room, a bedroom and a kitchen. We equipped the apartment with age-appropriate books and toys to simulate natural environment of a chlid's home.

The infant-mother dyads were first given 40 minutes of free-play time. Mothers were told to interact with their infants as they would normally do at home. Researchers monitored the interaction at a different room remotely via four wall-mounted cameras. After the play session, we made a phone call to the mother and reminded her to move on to the next task of calling one of their family members or friends and talk for a few minutes. Most of them had a chat with their mother or husband on the phone. In the few cases where the father came along, he entered the lab to chat with the mother. The researchers then entered the room, and talked with the mother for a few minutes. The total duration of the recording is about 50 minutes for each file.

The data were transcribed in Korean in the main participant tiers. We also provide a Roman transliteration in the %ort tier, adopting the Yale romanization system.


The collection and the preparation of the data were financially supported by Global Research Network Program through the Ministry of Education of the Republic of Korea and the National Research Foundation of Korea (NRF-2016S1A2A2912606) and Institute for Information & communications Technology Planning & Evaluation (IITP) grant funded by the Korea government(MSIT) (2019-0-01367, Infant-Mimic Neurocognitivive Developmental Machine Learning from Interaction Experience with Real World (BabyMind)). I thank Prof. Byung-Tak Zhang of Seoul National University for letting me collect the data in his lab.