Corpus Esteve-Prieto

1. Orthographically and pragmatically transcribed corpus (using PhonProject)

General description. Corpus of audiovisual session of four monolingual Catalan speaking-children (An, On, Bi, and Ma), from 6 to 32 months of age. Sessions from 6 to 11 months are sessions have been segmented, annotated, and orthographically and pragmatically transcribed using Phon. Children were recorded on a weekly basis from 6 to 12 months, fortnightly from 12 to 18 months, and every three weeks from 19 to 32 months of age. They were carried out at their home during spontaneous interactions with an adult, usually their mothers. Sessions last between 30 and 40 minutes. The linguistic environment of the children is almost exclusively Catalan, since all parents speak exclusively Catalan to their children and to each other. Moreover, the children all come from small town in the same region of Catalonia, Alt Penedès, located 50 km to the south of Barcelona. According to the information available from the official statistics website of Catalonia (www.idescat.cat), in their towns Catalan is spoken regularly by about 90% of the population.

 Materials. Audiovisual recordings from 6 to 32 months of age. Sessions from 7 to 11 months include the orthographic and pragmatic transcription of all children's productions. The pragmatic annotation has taken into account non-vocal cues like hand movements, eye gaze direction, or face and body movements. From the observation of these cues, vocalizations have been classified as non-communicative (if the vocalizations conveys no communicative intentionality), protest (if children are expressing disapproval or disagreement), request (when the child tries to reach an object), response (when the child replies a mother's question), satisfaction (if the child is happy and it is verbally expressed), statement (when the child initializes the communication with the mother while looking at her), surprise (if the child vocalizes after an unexpected event), vocative (when the child is calling at somebody), and fuzzy intention (when vocalizations are communicative but with no specific intention).

 

Name Age Gender Sessions Corpus
An from 7 to 31 month-old girl 45 link
On from 5 to 31 month-old girl 39 link
Bi from 6 to 32 month-old boy 42 link
Ma from 6 to 31 month-old boy 40 link

 

Team. The GrEP members that worked in the corpus are Núria Esteve, who segmented and annotated the corpora, and Pilar Prieto, who supervised the whole process.

 

Sponsorship. The project is funded by the projects awarded by the Spanish Ministry of Science and Education HUM2006-01758/FILO, 2006-2009 ("Estructura prosòdica i adquisició de la prosòdia en català i espanyol") and FFI2009-07648/FILO, 2009-2011, appart from the Batista and Roca project, funded by the Catalan Government (2009 PBR 00018). All three projects are directed by Pilar Prieto (ICREA-Universitat Pompeu Fabra).