Leo Corpus

 

♥ Trilingual first language acquisition of Mandarin, Cantonese and English in a Hong Kong child in an innovative input model

♥ Nurturing infant trilingualism involving a national language, an international language and a local “dialect” in Chinese-dominant contexts

 

To use the data from the Leo Corpus, please cite:

Mai, Z. & Yip, V. (under review) Trilingual first language acquisition: Leo Corpus.

 

Leo and his trilingual input

 

 

Leo was born and raised in Hong Kong. His mother is an early bilingual equally proficient in Cantonese and Mandarin, and his father a native speaker of a Chinese dialect and Mandarin. Both parents speak English as their second language, but the language between them is Mandarin, with occasional switches to Cantonese and English. The family initially adopted a strict ‘one parent-one language’ practice in which the mother spoke to the child in Cantonese and the father in Mandarin. Leo’s paternal grandmother also addressed the child consistently in Mandarin.

Leo’s systematic exposure to English began at 1;01 when his mother started to interact with him in English every other day in a week and Cantonese the rest of the days, hence the “one day-one language model”. From 1;02, Leo also received regular English input from native speakers of British and American English in weekly playgroup or private tutorial play sessions, and later from teachers at international nursery schools and kindergartens, where English is the medium of instruction. Another important source of English input comes from a Filipino domestic helper, who interacted with Leo frequently in English from 2;02.

 
 

The Corpus

 

 

Leo has been video-recorded by a parent or research assistant using Sony HandyCam and H2N audio recorder every week since 0;06. In the initial release of the Leo Corpus, we selected 54 recordings made in 18 consecutive months from 1;06-2;11. Each recording consists of 30-minute video documentation of the child and an adult engaged in daily routines such as games, role-playing, story-telling and dining at home. Each recording session focused on only one language as the target language, with only one adult interacting with the child. The adult interlocutors were chosen to sample the input Leo regularly received, that is, Mandarin from his grandmother and father, Cantonese from his mother, and English from his mother, helper and school teachers (represented by a research assistant, who is a native speaker of American English, in the recordings). Below is a full list of recordings and main adult interlocutors. The resulting corpus contains a total of 44,846 utterances, of which 11,884 were produced by the target child, and 32,982 by the adults.

The speech data were transcribed and tagged by trained research assistants. The child’s mother manually checked all the transcripts for accuracy. The corpus, including transcripts and corresponding audio recordings, has been deposited in the Child Language Data Exchange System (CHILDES, MacWhinney, 2000, doi:10.21415/T5V398). For access to the video and wav sound files, please contact the authors.

 

Table: List of recordings in the Leo Corpus

 
 

Video Highlights

 

Here are some linguistically interesting highlights, selected from the video documentation of Leo interacting with adults in the preparation stage (0;07-1;05)  and the released period (1;06-2;11) of the Leo Corpus.

 

Preparation stage (0;07-1;05, not released, contact authors for access)

 

0;07 Mandarin


Leo attempting to articulate Mandarin basic number words yi (one), er (two) and san (three) at 0;07

 


 
1;02 English


Leo engaged in English interactions after only one month of English exposure at 1;02

 

 

1;03 Cantonese


Leo working with mother to differentiate between bei6 (nose) and bei2 (thigh, leg) in Cantonese at 1;03

 

 

1;04 English


Leo responding to English instructions after 3 months of English exposure at 1;04

 

 

1;05 English


Leo offering simple answers to English who-questions after five months of English exposure at 1;05

 

 

Leo Corpus (1;06-2;11, transcripts and audio files released on CHILDES)

 

1;07 Mandarin


Leo, at one-word stage, drawing on his multilingual lexicon to interact with his Mandarin-speaking father at 1;07

 

 

*FAT:      啊 这个 什么 呀 ?

%mor:     sfp|a1 pro|zhe4ge4=this pro:wh|shen2me=what sfp|ya ?

%gra:      1|0|INCROOT 2|3|DET 3|1|OBJ 4|1|SFP 5|1|PUNCT

*CHI:       [- yue] 車車 .

%mor:     n|ce1&DIM=vehicle .

*FAT:      [- eng] yes .

%mor:     co|yes .

%gra:      1|0|INCROOT 2|1|PUNCT

*FAT:      车车 [/] 车车 .

%mor:     n|che1&DIM=vehicle .

%gra:      1|0|INCROOT 2|1|PUNCT

*CHI:       [- yue] 豬豬 .

%mor:     n|zyu1&DIM=pig .

*FAT:      豬豬@s:yue 在 哪儿 啊 ?

%mor:     L2|豬豬 prep|zai4=at pro:wh|na3-DIA=which sfp|a1 ?

%gra:      1|0|INCROOT 2|1|JCT 3|2|PREPO 4|1|SFP 5|1|PUNCT

*FAT:      告诉 爸爸 .

%mor:     v|gao4su4=tell n:relat|ba4&DIM=father .

%gra:      1|2|MOD 2|0|INCROOT 3|2|PUNCT

*CHI:       &m .

%exp:      the child pointed on the book .

*FAT:      这个 啊 ?

%mor:     pro|zhe4ge4=this sfp|a1 ?

%gra:      1|0|INCROOT 2|1|SFP 3|1|PUNCT

*CHI:       &a &ju &zhu .

*FAT:      哦 这个 呢 ?

%mor:     co|o1=chant pro|zhe4ge4=this sfp|ne ?

%gra:      1|2|COM 2|0|INCROOT 3|2|SFP 4|2|PUNCT

*CHI:       啊 豬豬@s:yue .

%mor:     sfp|a1 L2|豬豬 .

%gra:      1|0|INCROOT 2|1|SFP 3|1|PUNCT

*FAT:      这个 也 是 豬豬@s:yue 啊 ?

%mor:     pro|zhe4ge4=this adv|ye3=also v:cop|shi4=is L2|豬豬 sfp|a1 ?

%gra:      1|3|SUBJ 2|3|JCT 3|0|ROOT 4|3|PRED 5|3|SFP 6|3|PUNCT

*FAT:      哦 .

%mor:     co|o1=chant .

%gra:      1|0|INCROOT 2|1|PUNCT

*FAT:      这个 是 什么 呀 ?

%mor:     pro|zhe4ge4=this v:cop|shi4=is pro:wh|shen2me=what sfp|ya ?

%gra:      1|2|SUBJ 2|0|ROOT 3|2|OBJ 4|2|SFP 5|2|PUNCT

*CHI:       猫猫 .

%mor:     n|mao1&DIM=pussycat .

%gra:      1|0|INCROOT 2|1|PUNCT

%exp:      the father laughed .

*FAT:      诶 , 对了 !

%mor:     co|ei1=ey cm|cm co|dui4le=correct !

%gra:      1|2|COM 2|0|INCROOT 3|2|COM 4|2|PUNCT

*FAT:      猫猫 .

%mor:     n|mao1&DIM=pussycat .

%gra:      1|0|INCROOT 2|1|PUNCT

*FAT:      猫猫 [/] 猫猫 是 怎么 叫 的 呀 ?

%mor:     n|mao1&DIM=pussycat v:cop|shi4=is adv:wh|zen3me=how v|jiao4=call nom|de sfp|ya ?

%gra:      1|4|SUBJ 2|4|JCT 3|4|JCT 4|0|ROOT 5|4|POSS 6|4|SFP 7|4|PUNCT

*CHI:       owl@s:eng .

%mor:     L2|owl .

%gra:      1|0|INCROOT 2|1|PUNCT

*FAT:      那 是 狗狗 吧 .

%mor:     pro|na4=that v:cop|shi4=is n|gou3&DIM=dog sfp|ba .

%gra:      1|3|DET 2|3|MOD 3|0|INCROOT 4|3|SFP 5|3|PUNCT

*FAT:      猫猫 是 喵 .

%mor:     n|mao1&DIM=pussycat v:cop|shi4=is on|miao1=meow .

%gra:      1|2|SUBJ 2|0|ROOT 3|2|PRED 4|2|PUNCT

*CHI:       猫 .

%mor:     n|mao1=pussycat .

%gra:      1|0|INCROOT 2|1|PUNCT

*FAT:      喵 .

%mor:     on|miao1=meow .

%gra:      1|0|INCROOT 2|1|PUNCT

 

1;08 Cantonese


Leo interacting with mother in single-word sentences in Cantonese at 1;08

 

 

1;09 English


Leo entering two-word stage in English at 1;09

 

 

2;00 English


Leo discussing colours with a native speaker of English at 2;00

 

 

*HAN:      oh , very nice .

%mor:      co|oh cm|cm adv|very adj|nice .

%gra:       1|4|COM 2|1|LP 3|4|JCT 4|0|ROOT 5|4|PUNCT

*HAN:      I like your dress .

%mor:      pro:sub|I v|like det:poss|your n|dress .

%gra:       1|2|SUBJ 2|0|ROOT 3|4|MOD 4|2|OBJ 5|2|PUNCT

*HAN:      what color is it ?

%mor:      pro:int|what n|color aux|be&3S pro:per|it ?

%gra:       1|2|MOD 2|4|SUBJ 3|4|AUX 4|0|ROOT 5|4|PUNCT

*CHI:       blue .

%mor:      adj|blue .

%gra:       1|0|INCROOT 2|1|PUNCT

*HAN:      blue .

%mor:      adj|blue .

%gra:       1|0|INCROOT 2|1|PUNCT

*HAN:      wow , I like my dress too .

%mor:      co|wow cm|cm pro:sub|I co|like det:poss|my n|dress adv|too .

%gra:       1|6|COM 2|1|LP 3|6|SUBJ 4|6|COM 5|6|MOD 6|0|ROOT 7|6|NJCT 8|6|PUNCT

*HAN:      what color is my dress ?

%mor:      pro:int|what n|color cop|be&3S det:poss|my n|dress ?

%gra:       1|2|MOD 2|3|SUBJ 3|0|ROOT 4|5|MOD 5|3|PRED 6|3|PUNCT

*CHI:       yellow .

%mor:      n|yellow .

%gra:       1|0|INCROOT 2|1|PUNCT

*HAN:      yellow .

%mor:      n|yellow .

%gra:       1|0|INCROOT 2|1|PUNCT

 

2;00 Mandarin


Leo producing a “long passive” in Mandarin at 2;00

 

 

*FAT:        哎哟 , 那 爸爸 看 一下 这个 跳伞 .

%mor:      co|ai1yo1=arrgh cm|cm pro|na4=that n:relat|ba4&DIM=father v|kan4=look adv|yi1xia4=a_bit pro|zhe4ge4=this v|tiao2san2=parachute .

%gra:       1|5|COM 2|5|JCT 3|5|SUBJ 4|5|SUBJ 5|8|SUBJ 6|8|JCT 7|8|SUBJ 8|0|ROOT 9|8|PUNCT

*FAT:        哎呀 , 爸爸 好 想 去 跳伞 呐 .

%mor:      co|ai1ya1=jeez cm|cm n:relat|ba4&DIM=father adv|hao3=very v:aux|xiang3=want v:dirc|qu4=go v|tiao2san2=parachute sfp|na .

%gra:       1|3|COM 2|3|MOD 3|7|SUBJ 4|5|JCT 5|7|JCT 6|5|VD 7|0|ROOT 8|7|SFP 9|7|PUNCT

*FAT:        爸爸 还 没 跳 过 伞 呢 .

%mor:      n:relat|ba4&DIM=father adv|hai2=still neg|mei2=not v|tiao4=jump asp|guo4 n|san3=umbrella sfp|ne .

%gra:       1|4|SUBJ 2|4|JCT 3|4|NEG 4|0|ROOT 5|6|MOD 6|4|OBJ 7|4|SFP 8|4|PUNCT

*CHI:       被 bi4bi2@s:yue 跳 了 .

%mor:      n|bei2=quilt L2|bi4bi2 v|tiao4=jump asp|le .

%gra:       1|3|SUBJ 2|3|JCT 3|0|ROOT 4|3|OBJ 5|3|PUNCT

*FAT:        嗯 ?

%mor:      co|en1=okay ?

%gra:       1|0|INCROOT 2|1|PUNCT

*CHI:       被 bi4bi2@s:yue 跳 了 .

%mor:      n|bei2=quilt L2|bi4bi2 v|tiao4=jump asp|le .

%gra:       1|3|SUBJ 2|3|JCT 3|0|ROOT 4|3|OBJ 5|3|PUNCT

*FAT:        什么 ?

%mor:      pro:wh|shen2me=what ?

%gra:       1|0|INCROOT 2|1|PUNCT

*CHI:       被 bi4bi2@s:yue 跳 了 .

%mor:      n|bei2=quilt L2|bi4bi2 v|tiao4=jump asp|le .

%gra:       1|3|SUBJ 2|3|JCT 3|0|ROOT 4|3|OBJ 5|3|PUNCT

*FAT:        什么 被 bi4bi2@s:yue 跳 了 ?

%mor:      pro:wh|shen2me=what n|bei2=quilt L2|bi4bi2 v|tiao4=jump asp|le ?

%gra:       1|2|MOD 2|4|SUBJ 3|4|JCT 4|0|ROOT 5|4|OBJ 6|4|PUNCT

*CHI:       绳 被 bi4bi2@s:yue 跳 了 .

%mor:      n|sheng2=rope n|bei2=quilt L2|bi4bi2 v|tiao4=jump asp|le .

%gra:       1|2|MOD 2|4|SUBJ 3|4|JCT 4|0|ROOT 5|4|OBJ 6|4|PUNCT

*MOT:     伞 被 bi4bi2@s:yue 跳 了 .

%mor:      n|san3=umbrella n|bei2=quilt L2|bi4bi2 v|tiao4=jump asp|le .

%gra:       1|2|MOD 2|4|SUBJ 3|4|JCT 4|0|ROOT 5|4|OBJ 6|4|PUNCT

*FAT:        [- eng] okay .

%mor:      co|okay .

%gra:       1|0|INCROOT 2|1|PUNCT

*FAT:        你 跳 过 伞 啊 南哥 ?

%mor:      pro|ni3=you v|tiao4=jump asp|guo4 n|san3=umbrella sfp|a1 n:name|nan2ge2 ?

%gra:       1|2|SUBJ 2|0|ROOT 3|4|MOD 4|2|OBJ 5|2|SFP 6|2|OBJ 7|2|PUNCT

*FAT:        你 跳 过 伞 没 有 ?

%mor:      pro|ni3=you v|tiao4=jump asp|guo4 n|san3=umbrella neg|mei2=not v|you3=have ?

%gra:       1|2|SUBJ 2|6|SUBJ 3|6|JCT 4|6|SUBJ 5|6|NEG 6|0|ROOT 7|6|PUNCT

%exp:       the child stood up and jumped .

*CHI:       &tite .

*FAT:        哦 , 你 要 跳 是吧 ?

%mor:      co|o1=chant cm|cm pro|ni3=you v|yao=want v|tiao4=jump tag|shi4ba=right ?

%gra:       1|5|COM 2|5|JCT 3|4|SUBJ 4|2|PREPO 5|0|ROOT 6|5|COM 7|5|PUNCT

*CHI:       跳绳 .

%mor:      v|tiao4sheng2=skip_rope .

%gra:       1|0|ROOT 2|1|PUNCT

*FAT:        哦 , 跳伞 跟 跳绳 不 一样 .

%mor:      co|o1=chant cm|cm v|tiao2san2=parachute prep|gen1=with v|tiao4sheng2=skip_rope neg|bu4=not adj|yi1yang4=same .

%gra:       1|3|COM 2|3|JCT 3|0|ROOT 4|3|JCT 5|4|PREPO 6|7|NEG 7|3|OBJ 8|3|PUNCT

*FAT:        跳伞 [/] 跳伞 跟 跳绳 不 一样 .

%mor:      v|tiao2san2=parachute v|gen1=follow v|tiao4sheng2=skip_rope neg|bu4=not adj|yi1yang4=same .

%gra:       1|3|SUBJ 2|3|JCT 3|0|ROOT 4|5|NEG 5|3|OBJ 6|3|PUNCT

*FAT:        哦 .

%mor:      co|o1=chant .

%gra:       1|0|INCROOT 2|1|PUNCT

*FAT:        哇 .

%mor:      co|wa1=wah .

%gra:       1|0|INCROOT 2|1|PUNCT

*CHI:       bi4bi2@s:yue 想 跳绳 .

%mor:      L2|bi4bi2 v:aux|xiang3=want v|tiao4sheng2=skip_rope .

%gra:       1|3|SUBJ 2|3|AUX 3|0|ROOT 4|3|PUNCT

*FAT:        bi4bi2@s:yue 想 跳绳 啊 ?

%mor:      L2|bi4bi2 v:aux|xiang3=want v|tiao4sheng2=skip_rope sfp|a1 ?

%gra:       1|3|JCT 2|3|AUX 3|0|ROOT 4|3|SFP 5|3|PUNCT

 

2;04 English


Leo producing a non-target-like Chinese-like null object “I want” in English at 2;04

 

 

*MOT:     why do you not want this ?

%mor:      pro:rel|why mod|do pro:per|you neg|not v|want pro:dem|this ?

%gra:       1|5|LINK 2|5|AUX 3|5|SUBJ 4|3|NEG 5|0|ROOT 6|5|OBJ 7|5|PUNCT

*MOT:     because it’s +…

%mor:      conj|because pro:per|it~aux|be&3S +…

%gra:       1|3|LINK 2|3|SUBJ 3|0|ROOT 4|3|PUNCT

*CHI:       green .

%mor:      n|green .

%gra:       1|0|INCROOT 2|1|PUNCT

*MOT:     because it’s green ?

%mor:      conj|because pro:per|it~cop|be&3S adj|green ?

%gra:       1|3|LINK 2|3|SUBJ 3|0|ROOT 4|3|PRED 5|3|PUNCT

*CHI:       yeah .

%mor:      co|yeah .

%gra:       1|0|INCROOT 2|1|PUNCT

*MOT:     okay .

%mor:      co|okay .

%gra:       1|0|INCROOT 2|1|PUNCT

*MOT:     <green is a> [/] green is a good color .

%mor:      n|green cop|be&3S det:art|a adj|good n|color .

%gra:       1|2|SUBJ 2|0|ROOT 3|5|DET 4|5|MOD 5|2|PRED 6|2|PUNCT

%exp:       the mother put the green crayon next to the green water bottle for comparison .

*MOT:     see [/] see .

%mor:      v|see .

%gra:       1|0|ROOT 2|1|PUNCT

*MOT:     yeah ?

%mor:      co|yeah ?

%gra:       1|0|INCROOT 2|1|PUNCT

*MOT:     yeah ?

%mor:      co|yeah ?

%gra:       1|0|INCROOT 2|1|PUNCT

*CHI:       I want .

%mor:      pro:sub|I v|want .

%gra:       1|2|SUBJ 2|0|ROOT 3|2|PUNCT

*MOT:     now you want it ?

%mor:      adv|now pro:per|you v|want pro:per|it ?

%gra:       1|3|JCT 2|3|SUBJ 3|0|ROOT 4|3|OBJ 5|3|PUNCT

*CHI:       yeah .

%mor:      co|yeah .

%gra:       1|0|INCROOT 2|1|PUNCT

*MOT:     okay .

%mor:      co|okay .

%gra:       1|0|INCROOT 2|1|PUNCT

 

2;04 English


Leo using “I’s lion” and “I lion” instead of “my lion” to express possession creatively at 2;04

 

 

*CHI:       this is I’s lion .

%mor:      pro:dem|this cop|be&3S pro:sub|I~cop|be&3S n|lion .

%gra:       1|2|SUBJ 2|0|ROOT 3|4|SUBJ 4|2|PRED 5|4|PRED 6|2|PUNCT

*MOT:     oh , this is your lion .

%mor:      co|oh cm|cm pro:dem|this cop|be&3S det:poss|your n|lion .

%gra:       1|4|COM 2|1|LP 3|4|SUBJ 4|0|ROOT 5|6|MOD 6|4|PRED 7|4|PUNCT

*CHI:       yeah .

%mor:      co|yeah .

%gra:       1|0|INCROOT 2|1|PUNCT

*MOT:     oh , okay .

%mor:      co|oh cm|cm adj|okay .

%gra:       1|0|INCROOT 2|1|LP 3|1|ENUM 4|1|PUNCT

*MOT:     so this is your lion ?

%mor:      adv|so pro:dem|this cop|be&3S det:poss|your n|lion ?

%gra:       1|3|JCT 2|3|SUBJ 3|0|ROOT 4|5|MOD 5|3|PRED 6|3|PUNCT

*CHI:       this (.) I lion .

%mor:      pro:dem|this pro:sub|I n|lion .

%gra:       1|3|DET 2|3|SUBJ 3|0|INCROOT 4|3|PUNCT

*MOT:     this ?

%mor:      pro:dem|this ?

%gra:       1|0|INCROOT 2|1|PUNCT

*MOT:     this is just a paper .

%mor:      pro:dem|this cop|be&3S adv|just det:art|a n|paper .

%gra:       1|2|SUBJ 2|0|ROOT 3|5|JCT 4|5|DET 5|2|PRED 6|2|PUNCT

*MOT:     where’s the +..?

%mor:      pro:rel|where~cop|be&3S det:art|the +..?

%gra:       1|2|LINK 2|0|ROOT 3|2|PRED 4|2|PUNCT

*CHI:       this [/] this is I [/] I lion .

%mor:      pro:dem|this cop|be&3S pro:sub|I n|lion .

%gra:       1|2|SUBJ 2|0|ROOT 3|4|MOD 4|2|PRED 5|2|PUNCT

*MOT:     oh , this is your lion .

%mor:      co|oh cm|cm pro:dem|this cop|be&3S det:poss|your n|lion .

%gra:       1|4|COM 2|1|LP 3|4|SUBJ 4|0|ROOT 5|6|MOD 6|4|PRED 7|4|PUNCT

*CHI:       yeah .

%mor:      co|yeah .

%gra:       1|0|INCROOT 2|1|PUNCT

 

2;05 English


Leo able to produce English articles, but rather inconsistently at 2;05

 

 

*CHI:       I am baby .

%mor:      pro:sub|I aux|be&1S n|baby .

%gra:       1|2|SUBJ 2|0|ROOT 3|2|PRED 4|2|PUNCT

*CHI:       you just eat orange .

%mor:      pro:per|you adv|just v|eat n|orange .

%gra:       1|3|SUBJ 2|3|JCT 3|0|ROOT 4|3|OBJ 5|3|PUNCT

*CHI:       in basket .

%mor:      prep|in n|basket .

%gra:       1|0|INCROOT 2|1|POBJ 3|1|PUNCT

*CHI:       I am a boy .

%mor:      pro:sub|I aux|be&1S det:art|a n|boy .

%gra:       1|4|SUBJ 2|4|AUX 3|4|DET 4|0|INCROOT 5|4|PUNCT

 

2;05 Mandarin


Leo articulating a Mandarin sentence with a Cantonese sentence-final particle (wo3, epistemic) at the end at 2;05

 

 

*CHI:       还有 我 这里 还 有 绿色 的 喎@s:yue .

%mor:      adv|hai2you3=moreover pro|wo3=I post|zhe4li3=here adv|hai2=still v|you3=have adj|lv4se4=green nom|de L2|喎 .

%gra:       1|2|JCT 2|5|SUBJ 3|5|SUBJ 4|5|JCT 5|0|ROOT 6|8|MOD 7|6|POSS 8|5|OBJ 9|5|PUNCT

 

2;07 Mandarin


Leo producing a quantificational adverb jiu indicating a relatively lower value on a scale in Mandarin at 2;07

 

 

*CHI:       三 个 副 [x 3] 队长 就 可以 了 .

%mor:      num|san1=three cl|ge4 cl|fu4 n|dui4zhang3=captain adv|jiu4=just v:aux|ke3yi3=may asp|le .

%gra:       1|2|QUANT 2|4|CLASS 3|4|MOD 4|6|SUBJ 5|6|JCT 6|7|AUX 7|0|INCROOT 8|7|PUNCT

 

2;08 Mandarin


Leo producing a Mandarin A-not-A questions with an existential shenme (something) at 2;08

 

 

*CHI:       你 想 不 想 放 什 [/] 什么 进 去 啊 ?

%mor:      pro|ni3=you v:aux|xiang3=want neg|bu4=not v:aux|xiang3=want v|fang4=put pro:wh|shen2me=what v:dirc|jin4=enter v:dirc|qu4=go sfp|a1 ?

%gra:       1|5|SUBJ 2|5|AUX 3|4|NEG 4|5|AUX 5|0|ROOT 6|5|OBJ 7|5|VD 8|5|SRL 9|5|SFP 10|5|PUNCT

 

2;09 English


Leo attempting a relative clause in English at 2;09

 

 

*CHI:    but there’s something else (.) that have in [/] in there burnt .

%mor:   conj|but pro:exist|there~cop|be&3S pro:indef|something post|else pro:rel|that v|have adv|in adv|there v|burn&PAST .

%gra:    1|3|LINK 2|3|SUBJ 3|0|ROOT 4|3|PRED 5|4|PQ 6|7|LINK 7|4|CMOD 8|7|JCT 9|10|SUBJ 10|7|COMP 11|3|PUNCT

 

2;10 Mandarin


Leo commenting on parents’ language abilities based on the languages they used to address him at 2;10

 

 

*CHI:    &en 你 就 [/] &hm &hm 就 是 普通话 的 .

%mor:   pro|ni3=you adv|jiu4=just v:cop|shi4=is n|pu3tong1hua4=Putonghua nom|de .

%gra:    1|4|SUBJ 2|4|JCT 3|4|MOD 4|0|INCROOT 5|4|POSS 6|4|PUNCT

*CHI:    <就 这> [/] 就 这么 吧 .

%mor:   adv|jiu4=just adv|zhe4me=like_this sfp|ba .

%gra:    1|2|JCT 2|0|INCROOT 3|2|SFP 4|2|PUNCT

*FAT:   好 [/] 好 .

%mor:   adj|hao3=good .

%gra:    1|0|INCROOT 2|1|PUNCT

*FAT:   我 是 普通话 老师 .

%mor:   pro|wo3=I v:cop|shi4=is n|pu3tong1hua4=Putonghua n|lao3shi1=teacher .

%gra:    1|4|SUBJ 2|4|MOD 3|4|MOD 4|0|INCROOT 5|4|PUNCT

*FAT:   那 妈妈 是 什么 老师 啊 ?

%mor:   pro|na4=that n:relat|ma1&DIM=mother v:cop|shi4=is pro:wh|shen2me=what n|lao3shi1=teacher sfp|a1 ?

%gra:    1|3|SUBJ 2|3|SUBJ 3|0|ROOT 4|5|MOD 5|3|OBJ 6|3|SFP 7|3|PUNCT

*CHI:    妈妈 是 广东话 老师 , 英语 老 [/] &s 老师 .

%mor:   n:relat|ma1&DIM=mother v:cop|shi4=is n|guang3dong1hua4=Cantonese_language n|lao3shi1=teacher cm|cm n|ying1yu3=English n|lao3shi1=teacher .

%gra:    1|4|MOD 2|4|MOD 3|4|MOD 4|7|MOD 5|7|MOD 6|7|MOD 7|0|INCROOT 8|7|PUNCT

*FAT:   哦 妈妈 会 教 这么 多 .

%mor:   co|o1=chant n:relat|ma1&DIM=mother v:aux|hui4=will v|jiao1=teach adv|zhe4me=like_this adv|duo1=far_more .

%gra:    1|4|COM 2|4|SUBJ 3|4|JCT 4|0|ROOT 5|6|JCT 6|4|OBJ 7|4|PUNCT

*FAT:   爸爸 只 会 讲 普通话 .

%mor:   n:relat|ba4&DIM=father adv|zhi3=only v:aux|hui4=will v|jiang3=speak n|pu3tong1hua4=putonghua .

%gra:    1|0|INCROOT 2|3|JCT 3|1|PREPO 4|5|MOD 5|1|OBJ 6|1|PUNCT

*FAT:   对不对 啊 ?

%mor:   tag|dui4bu2dui4=correct sfp|a1 ?

%gra:    1|0|INCROOT 2|1|SFP 3|1|PUNCT

 

2;10 Cantonese


Leo correctly producing a cluster of verb-final and sentence-final particles (zo2, perfective; laa3, persuasive; wo3, epistemic) in Cantonese at 2;10

 

 

*CHI:    然後 [<] [/] 然後 就 養 大 咗 喇 喎 .

%mor:   adv|jin4hau6=then adv|zau6=then v|joeng5=raise adj|daai6=big asp|zo2=perfective sfp|laa3=persuasive sfp|wo3=epistemic .

 
2;11 Cantonese


Leo producing a passive sentence spontaneously in Cantonese at 2;11

 

 

*CHI:    會 唔 會 俾 風 浪 吹 [/] 吹 (.) 咗 去 天 上 呀 ?

%mor:   v:aux|wui5=will adv:neg|m4=not v:aux|wui5=will prep|bei2=passive n|fung1=wind n|long6=wave v|ceoi1=blow asp|zo2=perfective v|heoi3=go n|tin1=sky v:dirc|soeng5=go_up sfp|aa3 ?

 

2;11 Mandarin


Leo producing a Mandarin sentence with postverbal preposition zai and localizer shangmian (top) at 2;11

 

 

*CHI:    我 停 在 消防车 的 上面 .

%mor:   pro|wo3=I v|ting2=stop prep|zai4=at n|xiao1fang2che1=fire_engine poss|de post|shang4mian4=above .

%gra:    1|2|SUBJ 2|0|ROOT 3|2|JCT 4|3|PREPO 5|2|POSS 6|2|OBJ 7|2|PUNCT

 

2;11 English


Leo producing an English sentence with the correct past-tense morpheme (lost) and plural marker (wheels) in English at 2;11

 

 

*CHI:    he lost his wheels .

%mor:   pro:sub|he v|lose&PAST det:poss|his n|wheel-PL .

%gra:    1|2|SUBJ 2|0|ROOT 3|4|MOD 4|2|OBJ 5|2|PUNCT

 
 

Acknowledgements

 

We would like to express our gratitude to Brian MacWhinney, Director of CHILDES for his expertise, advice and technical support in constructing the Leo Corpus. Our special thanks go to the research assistants and student helpers who participated in the project in one or more ways: recording the child’s speech data, transcribing the data, tagging and checking the transcripts: Sophia Zishu Yu, Riki Yuqi Wu, Hannah Lam, Joy Jieyu Zhou, Bill Wu, Christa Schmidt, Chang Wang, Michelle Tsz Hsing Au, especially Vaness Tsz Yan Law. We gratefully acknowledge the support and help of our lab members: Stephen Matthews, Xiangjun Deng, Gloria Yanhui Zhang, Elaine Lau, Emily Haoyan Ge and Jenny Jiangling Zhou. The research was supported by a CUHK Direct Grant awarded to Ziyin Mai (“Trilingualism in Early Childhood: Production and Comprehension”), and a start-up grant to set up the University of Cambridge-Chinese University of Hong Kong Joint Laboratory for Bilingualism at CUHK, co-directed by Virginia Yip and Boping Yuan.