CHILDES English Manchester Corpus

Elena V. M. Lieven

Max Planck Institute for Evolutionary Anthropology
lieven@eva.mpg.de

Julian Pine
Department of Psychology
University of Liverpool
julian.pine@liverpool.ac.uk

Caroline Rowland
Department of Psychology
University of Liverpool
crowland@liverpool.ac.uk
website

Anna Theakston
Department of Psychology
University of Manchester
theaksto@fs4.psy.man.ac.uk
website

Participants:	12
Type of Study:	normal play activities with mother
Location:	England
Media type:	audio archived
DOI:	doi:10.21415/T54G6D

Citation information

Theakston, A. L., Lieven, E. V. M., Pine, J. M., & Rowland, C. F. (2001). The role of performance limitations in the acquisition of verb-argument structure: an alternative account. Journal of Child Language, 28, 127-152.

In accordance with TalkBank rules, any use of data from this corpus must be accompanied by at least one of the above references.

Project Description

This corpus consists of transcripts of audio recordings from a longitudinal study of 12 English-speaking children between the ages of approximately 2 and 3 years. The children were recruited through newspaper advertisements and local nurseries. All the children were first borns, monolingual and were cared for primarily by their mothers. Although socioeconomic status was not taken into account with respect to recruitment, the children were from predominantly middle-class families. There were six boys and six girls, half from Manchester and half from Nottingham. At the beginning of the study, the children ranged in age from 1;8.22 to 2;0.25 with MLUs ranging between 1.06 to 2.27 in morphemes. The children’s ages are available in the headers to each transcript. There birthdates are as follows:

Anne 10-SEP-1996
Aran 01-SEP-1994
Becky 08-JUL-1994
Carl 07-AUG-1994
Dominic 26-MAR-1997
Gail 29-JUN-1995
Joel 27-JUN-1994
John 16-AUG-1994
Liz 02-MAY-1994
Nick 07-JUL-1994
Ruth 30-MAY-1995
Warren 19-NOV-1994

The transcripts for each child are numbered from 1 to 34 corresponding to the tape number and labeled (a) and (b) to correspond to the two 30-minute sessions within each recording. The following recording sessions were missed and therefore have no corresponding transcript: Aran14a/b, Carl14b, Carl24a/b, John15a/b, John16a/b, Ruth4a/b, Warren3b.

Procedure

The children were audiotaped in their homes for an hour on two separate occasions in every 3-week period for one year. They engaged in normal play activities with their mothers. For the first 30 minutes of each hour they played with their own toys whilst for the second 30 minutes, toys provided by the experimenter were available to the child. For the duration of the recordings, the experimenter attempted as far as possible to remain in the background to allow contextual notes to be taken.

Transcription

All speech was transcribed with the exception of speech not directed to the child(i.e. speech between adults, telephone calls etc.). However, if the child produced an utterance in response to such speech, the relevant utterances were transcribed. Generally speaking, contextual information was added only when the utterance would otherwise be unclear. Of course, because the children were not videotaped, we had only the experimenter’s notes for such information. Punctuation was kept to a minimum – double commas indicate tag questions and single commas were used to indicate vocatives.

Phonological Forms

The data were collected with the intention of looking specifically at early grammatical development. We were not interested in the specific phonological forms the children used. Therefore, unless the child used what appeared to be child-specific forms, the target word was transcribed rather than an approximation of the child’s phonological form. This also helped with coding using the MOR program.

Error Coding

The data were coded for the following errors (where ‘0’ indicates a missing speech component). For all of the errors the marker [*] was added to the main line and a dependent tier was added showing the correct form.

Missing morphemes two dog-0s, he’s go-0ing
Case errors her do it, me get it
Missing auxiliaries it 0is going there, I 0am getting a drink
Word Class Errors a that one
Agreement errors a bricks, does she likes it?, it don’t go there
Pronominal Errors carry you (when the child wants to be carried)
Wrong word I put it off (where the context indicates take is appropriate)
Overgeneralisations it broked, I stayed it on there.

Although we have attempted to be consistent in coding, errors may have been missed. In particular, missing auxiliaries and copulas have often not been coded. Where it was impossible to identify exactly what the error was, the error was simply marked on the main line with [*]. Anyone wishing to work on particular error types should carry out a detailed analysis of the child’s use of a particular system (e.g., pronoun case marking) rather than relying on pulling out errors by searching for the [*] error marker.