CHILDES Dutch Gillis Corpus

Steven Gillis
Department of Linguistics
University of Antwerp


Participants: 1
Type of Study: case study
Location: Netherlands
Media type: no longer available
DOI: doi:10.21415/T5G30F

Browsable transcripts

Download transcripts

Citation information

Publications using these data should cite:

In accordance with TalkBank rules, any use of data from this corpus must be accompanied by the above reference.

Other publications in which this corpus was used include:

Project Description

This directory contains a longitudinal corpus from a boy learning Dutch. The corpus was donated to the CHILDES by Steven Gillis, Department of Germanic Linguistics, University of Antwerp, Belgium. The data are in CHAT format without English glosses.

The child, Maarten, was a Flemish boy learning Dutch. Biweekly videotapings were taken at the child’s home between the ages of 0;11.15 and 1;11.28. Recordings began when the child’s vocalizations exhibited what Dore, Franklin, Miller, and Ramer (1976) called phonetically consistent forms. They lasted until the child’s MLU exceeded 1.5 for three consecutive sessions. The entire corpus consists of 29,324 intelligible child utterances. The child was recorded for an average of 3 hours a week for a total of 104 hours of recording (average: 1:18 hours per recording, with a range of 0:15:18 hours to 3:44:52 hours). The sessions included interactions between the child and an adult (usually his mother) as well as solitary play. All recordings were made in an unstructured regular home setting.

The video recordings were transcribed according to the CHAT conventions and include the child’s vocalizations in Dutch UNIBET transcription on the %pho tier. There are no adult glosses of the child’s utterances. The transcripts also include the adults’ utterances in normal graphemic transcription on the main tier and the child’s and the adults’ nonverbal behavior (gestures, gaze direction, object manipulation), notes on the synchronization of the verbal and the nonverbal behaviors, and description of the context. All this information can be found on the %sit tier, which is at the present written in Dutch.

In the %sit line, dashes separate actions. The match of actions to the phonology is sometimes indicated. Three-letter codes indicate the actor and the addressee. For example, MXA means that M did X to A. MXA &1 MYB means that M did X to A and while this is going on M does Y to B.