The CHILDES database contains transcript and media data collected from conversations with children.
Conversations involving adults are available from TalkBank.
All of the data is transcribed in CHAT format. The use of all CHILDES and TalkBank data is governed by
the Creative Commons License.
Please remember to read and follow the Ground Rules for data-sharing.
Accessing TalkBank Data
Working with transcripts and media locally
- You need to download the transcripts and unzip them.
- If the corpus is linked to audio or video, you need to
download those files and place the media into the transcript folders.
- You need to download and install the CLAN program.
- To open a transcript, you double-click on it. If there is associated media, you can play the media
using escape-8 for continuous playback or command-click for playing single utterances.
Downloading media with plug-ins
If you find it tedious to download files one by one, you can configure your browser to download all the items on a web page in one go.
Many methods are available, but these are the ones we have tested:
- Firefox: Add-in "DownThemAll". After adding and restarting, you can right click on a page full of
items you want to download and you select "dTa One Click" and a new page will open up
allowing you to control your downloading of all the items on the original page.
- Chrome: You can use the Chrono Download Manager which is free at the Google Web Store. After installing you get an icon at the top of
Chrome with three colors and a down arrow. Go the the folder from which you want to download items, then lick on the Chrono icon
and (if necessary) click on the magnifying glass to bring up the Chrono Sniffer to grab all the files you want.
- Safari: no add-in is needed, if you follow these steps:
Open the page that has the files you want to download
Press Command-Option-A (to show Safari's Activity window) and you will see all links on that page, including the links to other pages
Select those links belonging to the files or pages you want to download
Press Command-C to copy the links
Press Command-Option-L to open the Downloads window
Click on the downloads window in order to activate it
Press Command-V to paste the links (from memory)
- Internet Explorer: Not possible.
Downloading media with wget
You can also use the Unix wget command for downloading. Here are the steps for Mac OSX:
- Install HomeBrew by going to brew.sh and copying the command given there to your terminal and running.
$ brew update
$ brew install wget
- For example, to retrieve all the *.mp3 audio ih the McDivitt folder, you can run this one-line wget command:
$ wget -c --user=gordon --ask-password -e robots=off -r -l inf --no-remove-listing -nH --no-parent -R 'index.html*' -A '*.mp3' http://homebank.talkbank.org/media/Password/McDivitt/
- Enter your password
This creates a folder "media" into the calling directory with the appropriate substructure.
For Windows, Visual WGet may be a good option. For Linux, you can get wget from github, compile it, and use this same command.