CHILDES Danish-Japanese Hayashi Corpus

Mariko Hayashi
Institute of Psychology
University of Århus
ostmh@hum.aau.dk
website

Participants:	1
Type of Study:	naturalistic
Location:	Denmark
Media type:	no longer available
DOI:	doi:10.21415/T5SG7P

Citation information

Publications using these data should cite:

Hayashi, M. (1993). A longitudinal study of the language development in bilingual chil-dren. Unpublished doctoral dissertation, University of Aarhus.

Klausen, T., Subritzky, M. S., & Hayashi, M. (1992). Initial production of inflections in bilingual children. In G. Turner & D. Messer (Eds.), Critical influences on language acquisition and development. London: Macmillan.

In accordance with TalkBank rules, any use of data from this corpus must be accompanied by at least one of the above references.

Project Description

This corpus includes longitudinal data from a child growing up in a Japanese-Danish bilingual family in the age range of 12 to 29 months. The data were collected by Mariko Hayashi, University of Aarhus, Denmark, in the context of her doctoral study investigating language development in bilingual children. Pseudonyms have been used to preserve informant anonymity. The child is called “Anders.” Anders was a first-born boy, and had no siblings during the period studied. The father had an university education, the mother college education, thus the family belonged to the educated middle class.

Anders’ mother was Japanese and his father was Danish. The family resided in Den-mark, where the community language is Danish. The parents spoke their respective native tongue to the child from the beginning. Occasional code-switching, especially by the father, occurred to a certain extent. The parents spoke mainly English, and occasionally Japanese and Danish to each other. Anders and his mother spent summer vacation in Japan at the child’s age of 21 to 23 months. In this period, Anders was exposed exclusively to Japanese.

Anders was taken care of by his mother in the day time. He had a couple of Danish- speaking playmates he was occasionally together with. In the evenings and the weekends the father took care of the child as well. The father’s parents, who spoke Danish, lived in the neighborhood and visited the family regularly. People who visited the mother spoke either Japanese or English, as the mother did not understand much Danish. The father and the mother, as mentioned above, spoke mainly English to each other. Otherwise, the child was not exposed to English.

The language Anders was exposed most to was Japanese, as it was the mother who took care of him in the day time. He also spent a three-month summer vacation in Japan, where he was exposed exclusively to Japanese. In his productive vocabulary Japanese began to be dominant at 20 months. The dominance of the Japanese language became especially clear during and after his visit to Japan. Although Anders did not show any clear sign for comprehending English, he did pick up a few English expressions such as “see you” and “two.”

Monthly videotapings of the child of about an hour’s duration were made in the age range of 11 to 38 months. All recordings were made in the child’s own home by Hayashi. With a few exceptions, both parents were present at each session. Each visit included until a certain time testing on the Uzgiris-Hunt Infant Assessment Scales (1978) as well. For a certain period, the parents kept a record of lexical items, which was used as a supplement to the videotapings. The mother made audio recordings during their stay in Japan as well.

Thirty minutes of each session were transcribed based on standard orthography by Hayashi, who is a native speaker of Japanese as well as a fluent speaker of Danish. All transcripts were checked by a native speaker of Danish. Three or four different situations, typically dinner, free play, and book reading, were selected for transcription. Furthermore, care was taken so that the mother and the father were more or less equally included in the portion of recording to be transcribed. Utterances are identified after prosodic criteria such as intonation and pauses, whereas utterances themselves are divided into units based on clarity of articulation and fluency. Limited attention is paid to overlapping, retracings, and hesitations. A deviated phonological form is described in the phonetic tier. However, it does not provide a precise phonetic analysis. Speech errors are not coded.

The corpus contains the following 17 files:

File Date of recording Age of Child
and03.cha 02-NOV-1986 1;0.15
and04.cha 07-DEC-1986 1;1.20
and05.cha 18-JAN-1987 1;3.1
and06.cha 15-FEB-1987 1;3.28
and07.cha 08-MAR-1987 1;4.21
and08.cha 12-APR-1987 1;5.25
and09.cha 04-MAY-1987 1;6.17
and10.cha 31-MAY-1987 1;7.14
and11.cha 29-JUN-1987 1;8.12
and12.cha 03-AUG-1987 1;9.16
and13.cha 05-SEP-1987 1;10.18
and14.cha 31-OCT-1987 2;0.14
and15.cha 28-NOV-1987 2;1.11
and16.cha 07-JAN-1988 2;2.20
and17.cha 14-FEB-1988 2;3.27
and18.cha 17-MAR-1988 2,5.0
and19.cha 15-APR-1988 2;5.28

Warnings:

Overlapping is not accurately transcribed in these data.
Retracings and hesitations are not accurately transcribed in these data.
These data contain limited information regarding the context.
Repetitions of identical units/utterances are transcribed twice at most.
Productive units within an utterance are identified on the basis of articulation and fluency criteria.
The phonetic tier is used to describe more accurately the child’s pronunciation of a given sound. However, it does not provide a precise phonetic analysis.
Regular inflections of nouns and verbs are preceded by a dash in the main text line. Irregularly inflected nouns and verbs are not divided into morphemes.
There are three letters of the Danish alphabet that cannot be typed onto the comput-er using ASCII codes. Based on the conventional method, these letters are replaced by ae, oe, and aa.