Corpus Serra-Solé

 

1. Phonetically and orthographically transcribed corpus (using PhonProject)

General description. Corpus including data of four monolnigual Catalan speaking-children (Gi, Gu, La, and Pe), available using Phon program. All recorded sessions have been segmented, annotated, and phonetically and orthographically transcribed. Children were recorded on a monthly basis from 1;00 until 2;09, at home during spontaneous interactions with an adult. Sessions last between 30 and 45 minutes. The corpus has been obtained from the CHILDES database.

 

Materials. Phonetic and orthographic transcription of all children's productions. Phonetic transcription is made of all children's productions (IPA Actual), together with the phonetic transcription of that word in adult language (IPA Target).

 

Name Age Gender Sessions Corpus
Gi 1;07–2;09 F 9 link
Gu 1;01–2;04 M 13 link
La 1;07–2;04 F 7 link
Pe 1;00–2;03 M 14 link

 

Team. The GrEP members (Group of Prosodic Studies) which segmented and phonetically transciped this corpus are: Eduard artés, Ana Estrella, and Maria del Mar Vanrell for Gi; Núria Esteve, Maria del Mar Vanrell,Núria Argemí, and Roger Craviotto for Gu; Io Salmons, Ana Estrella i Maria del Mar Vanrell for La; Io Salmons, Ana Estrella, Núria Argemí, Roger Craviotto, and Verònica Crespo-Sendra for Pe.

 

Sponsorship. This project was funded by the project awarded by the Spanish Ministry of Science and Education entitled "Estructura prosòdica i adquisició de la prosòdia en català i espanyol" (HUM2006-01758/FILO, 2006-2009), directed by Pilar Prieto (ICREA-Universitat Pompeu Fabra).

 

 

2. Orthographically transcribed corpus (using CLAN)

General description. Corpus including data of ten children: five monolingual Catalan speaking-children, four bilingual Catalan-Spanish speaking-children, and one monolingual Spanish speaking-child, available downloading both .pdf files or .cha files (CLAN program). María's father recorded her fortnightly recorded from 1;07 until 3;10. Children belong to midle-class families and were recorded on a monthly basis from 1;00 until 2;09, at home during spontaneous interactions with an adult. Sessions last between 30 and 45 minutes.

 

Materials. Orthographically transcribed sessions, which can be downloaded both in .pdf files and .cha files (in order to use .cha fie, the CLAN program must be installed). All the material is obtained from the CHILDES database.

Monolingual Catalan speaking-children
Name Age Gender Sessions Corpus
        .pdf .cha
Alvar 1;2.28–3;1.13 M 20 download download
Gi 1;7.14–4;2.3 F 11 download download
Gu 1;0.0–4;0.0 M 8 download download
La 1;7,20–4;0.10 F 7 download download
Pe 1;0.27–3;6.21 M 31 download download

 

Bilingual Catalan-Spanish speaking-children
Name Age Gender Sessions Corpus
        .pdf .cha
Antoni 1;4.1–3;0.24 M 23 download download
Caterina 1;1.17–4;3.21 F 18 download download
Josep Andreu 0;10,1–4;0.3 M 24 download download
Martí 0;10.14–4;0.13 M 24 download download

 

Nen monolingüe de castellà
Name Age Gender Sessions Corpus
        .pdf .cha
Eduard 1;4.8–3;10.20 M 11 download donwload
 

Team. Montserrat Cortés, Connie Schultz, Elisabet Serrat, Vicens Torrens, Melina Aparici, Cristina Vila, and Montse Capdevila worked on this project

 

Sponsorship. The research project was entitled “Language acquisition in Catalan and Spanish children” and was directed by Miquel Serra (University of Barcelona) and Rosa Solé (Universitat Autonoma de Barcelona). It has received support from the Spanish research council (Grants DGICYT PB84/0455; PB89/0317; PB91/0851; PB94/0886).