SARDES, Unit Taal
Free University of Amsterdam
|Participants:||176 transcripts from children ages 4, 5, 7, 9, and 10|
|Type of Study:||narrative|
This research resulted in the two doctoral theses cited below.
Publications using these data should cite:
Aarssen, J. (1996). Relating events in two languages: Acquisition of cohesive devices by Turkish-Dutch bilingual children at school age. Studies in Multilingualism, Vol. 2. Tilburg: Tilburg University Press.
Bos, P. (1997). Development of bilingualism: A study of school-age Moroccan children in the Netherlands. Studies in Multilingualism, Vol. 8. Tilburg: Tilburg University Press.
Mayer, M. (1969). Frog, where are you? New York: Dial Press.
In accordance with TalkBank rules, any use of data from this corpus must be accompanied by at least one of the above references.
This database contains 1021 transcripts collected in the Netherlands, Turkey, and Morocco by Jeroen Aarssen and Petra Bos at Tilburg University. Bilingual data (either Turkish-Dutch or Moroccan Arabic-Dutch) were collected within the framework of a longitudinal study of the development of bilingualism among Turkish and Moroccan children in the Netherlands.
In 2003, work began at two sites on digitizing the audio files for this corpus. The Turkish data were being digitized by Ad Backus (firstname.lastname@example.org) and the Moroccan data by Louis Boumans at Nijmegen University.
The bilingual informants ranged in age from 4 to 10. The design of the study is pseudo-longitudinal, with two consecutive cohorts of 25 informants. The younger cohort was followed for four rounds (from age 4 to age 7) and the older cohort for three rounds (from age 8 to age 10). The first round of data collection took place in 1991, and data collection was repeated in 1992, 1993, and 1994. The interval between successive rounds of data collection was about 1 year.
Turkish, Moroccan Arabic, and Dutch monolingual control data were also collected, in Turkey, Morocco, and the Netherlands, respectively. The Dutch control data were collected according to the same pseudo-longitudinal design as described above. The Turkish and Moroccan control data, however, were collected cross-sectionally from three different age groups (ages 5, 7, and 9).
Each transcript contains retellings of six short six-picture stories and the frog story (Mayer, 1969). The six short stories were constructed according to the following set-up: two stories with a clearly identifiable main character; two with two equivalent main characters; and two without a clearly identifiable main character.
The file names use the following code. First comes the child’s
pseudonym. Next comes a number for the child’s age group. These numbers
are often off by a year, so please rely on the ages as given inside the
files. Then comes a letter for the language of the interaction
(t=Turkish, m=Moroccan Arabic, n=Dutch). For the monolingual children,
no letter is given. The files are structured into five directories:
|ArabBiling:||350 files, ages 4–10|
|ArabMono:||71 files, ages 5, 7, 9|
|DutchMono:||175 files, ages 4–10|
|TurkBiling:||350 files, ages 4–10|
|TurkMono:||75 files, ages 5, 7, 9|
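The file name code described above can be unpacked mechanically. The following is a minimal sketch only: it assumes that a file name consists of a pseudonym made up of letters, followed directly by the age-group digits, an optional language letter, and an optional extension. The exact layout, the example names, and the names `FileCode` and `parse_file_name` are hypothetical and should be checked against the actual files. Because the age-group numbers are often off by a year, the parsed value should not be used in place of the age given inside the file.

```python
import re
from typing import NamedTuple, Optional

# Language letters as listed in the file name code.
LANGUAGES = {"t": "Turkish", "m": "Moroccan Arabic", "n": "Dutch"}

# ASSUMED layout: letters (pseudonym), digits (age group), optional language letter.
_NAME = re.compile(r"^(?P<pseudonym>[A-Za-z]+)(?P<age_group>\d+)(?P<lang>[tmn])?$")


class FileCode(NamedTuple):
    pseudonym: str
    age_group: int           # often off by a year; rely on the age inside the file
    language: Optional[str]  # None when no letter is given (monolingual children)


def parse_file_name(name: str) -> FileCode:
    """Split a file name into pseudonym, age group, and language of interaction."""
    stem = name.rsplit(".", 1)[0] if "." in name else name
    match = _NAME.match(stem)
    if match is None:
        raise ValueError(f"file name does not follow the assumed layout: {name!r}")
    lang = match.group("lang")
    return FileCode(
        pseudonym=match.group("pseudonym"),
        age_group=int(match.group("age_group")),
        language=LANGUAGES[lang] if lang else None,
    )


# Hypothetical file names, for illustration only.
print(parse_file_name("ali04t.cha"))
print(parse_file_name("emma07.cha"))
```

A name without a language letter yields `language=None`, which corresponds to the monolingual case in the code above.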
The following codes are used to represent certain Moroccan Arabic
characters in Roman script:
|Character  ||Code  ||Character  ||Code  ||Unicode|
|ch of "loch"||x||pharyngealized h||h2||bar h|
|j||j||emphatic t||t2||t glottal|
|uvular r||gh||emphatic s||s2||s glottal|
|ayn||c||emphatic d||d2||d glottal|
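For automatic processing, the codes in the table above can be held in a lookup table. The sketch below is illustrative only: it assumes that the two-character codes (`gh`, `h2`, `t2`, `s2`, `d2`) should always be read as a single unit when they occur, which may not hold in every transcript (for example, where `g` and `h` are separate sounds). The names `CODES` and `split_codes` and the example string are hypothetical.

```python
# Codes and their descriptions, taken from the table above.
CODES = {
    "x": 'ch of "loch"',
    "j": "j",
    "gh": "uvular r",
    "c": "ayn",
    "h2": "pharyngealized h",
    "t2": "emphatic t",
    "s2": "emphatic s",
    "d2": "emphatic d",
}


def split_codes(word):
    """Split a transcribed word into units, preferring two-character codes.

    ASSUMPTION: a two-character code always forms one unit when it occurs.
    Characters that are not part of any code are passed through unchanged.
    """
    units = []
    i = 0
    while i < len(word):
        pair = word[i:i + 2]
        if len(pair) == 2 and pair in CODES:
            units.append(pair)
            i += 2
        else:
            units.append(word[i])
            i += 1
    return units


# Made-up string, used only to show how the splitting works.
for unit in split_codes("t2ghxa"):
    print(unit, "->", CODES.get(unit, "(no special code)"))
```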
This research was supported by the Linguistic Research Foundation (Grant No. 300-172-002), which is funded by the Netherlands Organization for Scientific Research, NWO.