ID: 555250.0, MPI für Psycholinguistik / Yearbook 2011
The Nijmegen corpus of casual French
Authors: Torreira, Francisco; Adda-Decker, Martine; Ernestus, Mirjam
Language: English
Date of Publication (YYYY-MM-DD): 2010
Title of Journal: Speech Communication
Volume: 52
Start Page: 201
End Page: 212
Review Status: Peer-review
Audience: Not Specified
Intended Educational Use: No
Abstract / Description: This article describes the preparation, recording and orthographic transcription of a new speech corpus, the Nijmegen Corpus of Casual French (NCCFr). The corpus contains a total of over 36 h of recordings of 46 French speakers engaged in conversations with friends. Casual speech was elicited during three different parts, which together provided around 90 min of speech from every pair of speakers. While Parts 1 and 2 did not require participants to perform any specific task, in Part 3 participants negotiated a common answer to general questions about society. Comparisons with the ESTER corpus of journalistic speech show that the two corpora contain speech of considerably different registers. A number of indicators of casualness, including swear words, casual words, verlan, disfluencies and word repetitions, are more frequent in the NCCFr than in the ESTER corpus, while the use of double negation, an indicator of formal speech, is less frequent. In general, these estimates of casualness are constant through the three parts of the recording sessions and across speakers. Based on these facts, we conclude that our corpus is a rich resource of highly casual speech, and that it can be effectively exploited by researchers in language science and technology.
External Publication Status: published
Document Type: Article
Communicated by: Karin Kastens
Affiliations: MPI für Psycholinguistik
External Affiliations: External Organizations - Center for Language Studies
Identifiers:
URL: http://pubman.mpdl.mpg.de/pubman/item/escidoc:1479...
DOI: 10.1016/j.specom.2009.10.004
LOCALID: escidoc:147913