Language Learning and Development ISSN: 1547-5441 (Print) 1547-3341 (Online) Journal homepage: http://www.tandfonline.com/loi/hlld20 Twenty-Four-Month-Olds’ Perception of WordMedial Onsets and Codas Yuanyuan Wang & Amanda Seidl To cite this article: Yuanyuan Wang & Amanda Seidl (2016) Twenty-Four-Month-Olds’ Perception of Word-Medial Onsets and Codas, Language Learning and Development, 12:4, 447-460, DOI: 10.1080/15475441.2016.1150185 To link to this article: http://dx.doi.org/10.1080/15475441.2016.1150185 Published online: 04 Apr 2016. Submit your article to this journal Article views: 54 View related articles View Crossmark data Full Terms & Conditions of access and use can be found at http://www.tandfonline.com/action/journalInformation?journalCode=hlld20 Download by: [Nationwide Childrens Hospital] Date: 20 October 2016, At: 10:44 LANGUAGE LEARNING AND DEVELOPMENT 2016, VOL. 12, NO. 4, 447–460 http://dx.doi.org/10.1080/15475441.2016.1150185 Twenty-Four-Month-Olds’ Perception of Word-Medial Onsets and Codas Yuanyuan Wanga and Amanda Seidlb a Linguistics, Purdue University; bSpeech, Language and Hearing Sciences, Purdue University ABSTRACT Recent work has shown that children have detailed phonological representations of consonants at both word-initial and word-final edges. Nonetheless, it remains unclear whether onsets and codas are equally represented by young learners since word edges are isomorphic with syllable edges in this work. The current study sought to explore toddler’s sensitivity to onsets and codas without this overlap. This question is of theoretical interest since many theories of phonology suggest that onsets may be weighted more heavily than codas. We examined English-learning 24-month-olds’ sensitivity to mispronunciations in word-medial onset and coda positions in a word learning task. We predicted that if children’s sensitivity to onsets and codas is positionally constrained, then they should be more tolerant of Coda mispronunciations than onset ones. The results supported this prediction: children were less sensitive to coda mispronunciations than to onset mispronunciations in word-medial position, suggesting positionally driven biases in word processing. Introduction A large literature documents adults’ rapid and robust sensitivity to phonological distinctions in word recognition (Magnuson, Dixon, Tanenhaus, & Aslin, 2007; McMurray, Tanenhaus, Aslin, & Spivey, 2003). This sensitivity enables adults to focus on the linguistic dimensions that are critical for distinguishing among highly similar words. For example, this enables adults to discriminate margin from Marvin by attending to a change in the word-medial onset and to discriminate grateful from graceful by attending to a change in the word-medial coda. Despite their well-specified representations of words, adults are also able to accommodate variability in the words produced by different talkers or at different speech rates. For example, after less than a minute of familiarization to an accent which significantly impacts the pronunciation of phonemes, English-speaking adults are able to recognize accented English words produced by Spanish and Chinese speakers (Clarke & Garrett, 2004). Similarly, prior studies show that children are also able to accommodate small acoustic differences due to accent in certain situations (e.g., Schmale, Cristia, & Seidl, 2012). However, these abilities are limited and may rely heavily on lexical knowledge (White & Morgan, 2008), and the exact nature and specificity of infant lexical representations is far from clear. We would ideally like to know if children bring certain phonological biases in encoding words such that while they accommodate some types of mispronunciations, they will not accommodate others. The primary purpose of this study is to investigate one such area of phonological structure, namely, syllable structure. Specifically, we examine whether young children show an asymmetrical pattern in processing word-medial onsets and codas while learning novel words. This is an important question as many phonological theories, motivated by surveys of language typology, CONTACT Yuanyuan Wang [email protected] Linguistics Program, Purdue University, West Lafayette, IN 47906. Color versions of one or more of the figures in the article can be found online at www.tandfonline.com/hlld. © 2016 Taylor & Francis 448 Y. WANG AND A. SEIDL suggest such biases exist because, cross-linguistically, languages favor onsets over codas (e.g., Blevins, 1996; Prince & Smolensky, 1993). In support of this idea, evidence from children’s production shows a clear developmental pattern regarding the acquisition of syllable types and positions, suggesting that onsets are acquired earlier than codas (e.g., Demuth, 1995; Levelt, 2012; C. C. Levelt, Schiller, & Levelt, 2000). In addition to the evidence from production, the speech perception literature has demonstrated that pre-literate children show different degrees of phonological awareness of onsets and rhymes (Goswami & Bryant, 1990); this evidence, to a certain degree, can be related to children’s different level of sensitivity to onsets and codas. Even as mature language learners, adults may still find codas to be more difficult to identify than onsets. The asymmetry between onsets and codas may be related to phonologically driven learning biases. Thus, investigation of children’s sensitivity to word-medial onset and coda mispronunciations will help to illuminate the nature of children’s early lexical representations and the presence or absence of positional biases. Much like adults, researchers have found that infants are able to distinguish small phonetic contrasts from the second half of their first year. For example, Jusczyk and Aslin (1995) showed that even 7.5-month-old infants recognized the difference between cup and tup, indicating that these young infants have a fairly well-specified representation of word-initial onset segments. Subsequent work exploring the level of detail in young children’s phonological representations has similarly found that they are able to distinguish between canonical word forms and mispronounced forms. This sensitivity to mispronunciations has been found for word-initial onset consonants at 12 months (Ballem & Plunkett, 2005; Fennell & Werker, 2003; Mani & Plunkett, 2010; Nazzi, 2005; Swingley, 2005, 2009; White & Morgan, 2008), vowels at 12 months (Mani & Plunkett, 2007, 2010), wordmedial onset consonants at 19 months (Swingley, 2003), and word-final coda consonants at 17 months (Levelt, 2012; Swingley, 2005, 2009) for both familiar and novel words. Nevertheless, up until now, only a few studies have looked at both word-initial onset and wordfinal coda and these have yielded mixed results. Some studies have shown that children are sensitive to segmental contrasts regardless of the word position (e.g., Nazzi, 2005; Swingley, 2009; Zamuner, 2013). For example, 14–22-month-old (mean age: 17;23) English-speaking children fixated named targets more when hearing correct pronunciations than hearing mispronunciations either when the mispronunciations involved word-initial or word-final position (Swingley, 2009). In a categorization task, 20-month-old French-learning children showed similar sensitivity to phonological distinctions in both word-initial and word-final positions (Nazzi, 2005). In addition, 29-month-old Dutchlearning children in Zamuner (2013)’s study did not show differential perception of segmental contrasts in word-initial and -final positions. In contrast, others studies have found that children are better at identifying differences in word-initial position than in word-final positions (e.g., Altvater-Mackensen & Fikkert, 2010). For example, Altvater-Mackensen and Fikkert (2010) found that 14-month-old Dutch learners showed earlier sensitivity to the stop-fricative contrast in wordinitial compared with word-final position in newly learned words. These conflicting results may due to the age differences in the children being tested. Children who showed recognition of differences in both word-initial and word-final positions in the above-mentioned studies were much older and may be at a more mature stage of language development. Indeed, the role of age or developmental stage on the perception of word-initial and word-final consonants has been found in numerous studies. For example, Levelt (2012)’s study demonstrated that 18-, but not 14-month-olds, are sensitive to word-final consonant omission in a novel-word learning task. In the same vein, Zamuner (2006) found negative results for word-final mispronunciations with 11-month-olds, but positive results for some phonological contrasts in this position with 16-month-olds. In summary, these studies revealed that up until 14 months of age, children showed a more developed ability to process contrasts in word-initial positions than in word-final positions. However, it is noteworthy that the majority of the previous studies mentioned have only focused on mispronunciations in monosyllabic words, in which word edges are isomorphic with syllable edges. It is, however, important to separate out word-edge effects from syllable position effects, which has not been done in any of these studies cited above. This is crucial because we LANGUAGE LEARNING AND DEVELOPMENT 449 know that word-edges are both psychologically (because of primacy and recency effects (Capitani, Della Sala, Logie, & Spinnler, 1992)) and phonologically special. For example, 7-month-olds identify words by focusing on the first and last syllables, but ignoring medial syllables (Benavides-Varela & Mehler, 2015). Similarly, adult show better learning of phonotactic regularities at word-edge than at word-medial positions. Specifically, they learned that C1 and C2 came from distinct sets in structures of C1VCCVC2, where the target consonants were at word-edge positions, but failed to learn this constraint in words of CVC1C2VC structure, where the target consonants were in word-medial positions (Endress & Mehler, 2010). In addition, children’s processing of word-initial and word-final positions is not necessarily a reflection of their ability in encoding onsets and codas, but could be rather a reflection of temporal processing. This is because, in monosyllabic words, onsets always come before codas. Thus, children’s better knowledge of onsets over codas (as suggested in the work previously mentioned from production) can also be interpreted as support for the fact that their interpretation of linguistic units is incremental. However, in word-medial positions, we can avoid such large temporal differences and even place coda constraints before onset ones. For example, in the monosyllabic word cat /kæt/, the word-initial sound /k/ occurs before the word-final sound /t/; however, in the bisyllabic word faithful /feθ.fʊl/, the word-medial onset /f/ occurs after the word-medial coda /θ/. Thus, if children’s representation of segments is incremental, then they should be more sensitive to word-medial codas than word-medial onsets. On the contrary, if children’s knowledge of onsets and codas were influenced by the phonological knowledge rather than a reflection of temporal effect, then we would expect them to be more sensitive to word-medial onsets than codas. Finally, it should also be noted that acoustic saliency differs as a function of word position. More specifically, in monosyllabic words, word-final consonants are acoustically less salient than wordinitial consonants. For example, in English, word-final stops are produced with an unreleased burst (Denes, 1955) while word-initial ones are produced with aspiration. Thus, the asymmetrical perceptual pattern in onset and coda position seen in the literature may be caused by acoustic differences. However, the acoustic advantage of onsets relative to codas may be largely reduced in word-medial position, especially in bisyllabic trochees where the word-medial coda is in the stressed syllable, while the word-medial onset is located in the unstressed syllable. Thus, for all the above reasons, it is necessary to look at word-medial position in order to explore the question of whether there are phonologically driven positional constraints on word representations. Up until now, only a few studies have examined children’s sensitivity to word-medial consonants at all. For example, Swingley (2003) found that 19-month-old Dutch-learning children showed better recognition of correct pronunciations over mispronunciations when the word-medial consonant was altered in familiar words (where for required lesser attention because it was familiar), however, he only tested children on familiar words with a CVCV structure (with the underscored consonant as the target mispronunciation). Thus, infants were only tested on mispronunciations of word-medial onsets, but not on word-medial codas. Using the Headturn Preference procedure (HPP; Jusczyk & Aslin, 1995), Wang and Seidl (2015) compared infants’ sensitivity to both onsets and codas in word-medial positions. They found that 12-month-olds learned the experimental pattern when the target fricatives were licit in the word-medial onset position, but not in the word-medial coda position. However, by 15 months of age, infants were also able to learn and generalize such patterns for word-medial codas. Similarly, in a recent study, Archer, Zamuner, Engel, Fais, and Curtin (2016) explored English-learning 12- and 20-month-olds’ discrimination of coda consonants in word-medial positions using a habituation paradigm. The results showed that 20-month-olds, but not 12-month-olds, discriminated this contrast for both voiced and voiceless stops. Nevertheless, these studies do not tell us much about how well the information in the onsets and codas are represented while learning novel words, because infants and children often demonstrate less sensitivity to phonological details in words, particularly when they need to simultaneously pay attention to meaning and form (Werker, Fennell, Corcoran, & Stager, 2002). 450 Y. WANG AND A. SEIDL The secondary goal of the research was to investigate the degree of phonological deviation in children’s ability to process segmental contrasts. Many studies have demonstrated that children are sensitive to the major dimensions of phonological variation, such as voicing, place, and manner, which define phonological features (e.g., Mani & Plunkett, 2010; Miller & Eimas, 1983; White & Morgan, 2008). This knowledge is of particular importance to infants and young children who are faced with the challenging task of building the lexicon of their native language, in which words with minimal phonetic differences may be abundant. For example, as mature language learners, they will have to distinguish between cat and pat, which differ only in the place of articulation (POA) of the first sound. Previous research has found that toddlers in the second year of life have demonstrated sensitivity to even one such featural change in familiar words (Ballem & Plunkett, 2005; Mani & Plunkett, 2010; Swingley & Aslin, 2000, 2002). Despite this seemingly fine-grained sensitivity to featural changes, literature concerning the degree of phonological deviation in children’s word recognition or processing has been inconclusive. Specifically, in some studies, children up to 23 months responded similarly to single-feature and multiple-feature mispronunciations (Bailey & Plunkett, 2002; Swingley & Aslin, 2002). In other studies this was not the case. For example, in White and Morgan (2008) 19-month-olds showed graded sensitivity to the varying degree of phonological mismatch, and this sensitivity can be generalized to a varying combinations of features. These conflicting results may arise from differences in task demands, age at test, as well as the very nature of tasks. In order to add to the existing body of literature on this standing issue, we incorporated another variable, namely 1-feature and 2-feature change from correct pronunciation, in our experimental design. In summary, the primary goal of the current research was to explore whether there is an asymmetry in children’s representation of word-medial onsets and codas in a word learning task. The secondary goal was to examine whether children’s ability to encode contrasts is affected by the degree of phonological deviation (1-feature vs. 2-feature change). Specifically, using the Preferential Looking paradigm (Golinkoff, Hirsh-Pasek, Cauley, & Gordon, 1987), we tested 24-month-old English-learning children’s response to correct pronunciations (CPs) and mispronunciations to newly learned words in both onset (MP-Onset) and coda (MP-Coda) positions. We varied the degree of phonological deviation such that half of the children received 1-feature change from target labels to the MPs, and the other half received 2-feature changes from the target labels to the MPs. Children’s eye movements in response to stimuli were monitored and analyzed. If children’s word representations were constrained by phonologically driven positional biases, then they should show better recognition of the target object when the label was mispronounced in coda over onset position. With regard to the secondary research question, we predicted that if children are sensitive to the degree of phonological deviation, they should display more sensitivity to MPs that deviate from CPs in two phonological features than to MPs that deviate from CPs in one phonological feature. Methods Participants Thirty-two healthy full-term monolingual English-learning children (12 females) participated in the study. These children were between the ages of 21.68 and 25.59 months (M = 23.76 months, SD = 0.79). An additional 11 children were tested whose results were not reported for the following reasons: 7 for fussing or crying; 3 for experimenter error; 1 for having looking times with difference scores (Target-Distractor) more than 2.5 standard deviations off the overall mean. Children were compensated with a book or a toy for their participation. Informed consent was given to the caregivers before testing was carried out. The caregivers were also asked to complete the short form A of the MacArthur-Bates Communicative Developmental Inventory: Level II Vocabulary Checklist (CDI, Fenson et al., 2000) in order to elicit participants’ productive vocabulary LANGUAGE LEARNING AND DEVELOPMENT 451 sizes. The number of words produced by these 32 children ranged from 14 to 100 (M = 55.41, SD = 26.24). Stimuli The visual stimuli were digitized pictures of novel objects on a white background, presented on a large screen TV monitor. Pictures were of similar sizes with bright colors. An example stimulus pair is shown in Figure 1. The auditory stimuli were novel bisyllabic trochees (CVC.CV) composed of English phonemes. These auditory stimuli were produced by a young adult female native speaker of American English in an infant-directed register. The speaker had experience in recording stimuli for previous infant studies. She was informed that all the pseudowords involved were trochaic words with a CVC.CV structure.1 The stimuli were recorded using a wireless Lavalier microphone (AKG WMS40) in a sound-shielded booth with a Marantz Professional Solid State Recorder (PMD660ENG). The auditory stimuli were digitized at a 44.1 kHz sampling rate and normalized at an amplitude of 75 dB. The target labels presented to infants in the training trials were: /gæz.bi/, /læz.di/, /dul.mi/, /ruv. ni/, /rin.va/, /din.la/, /lad.vu/, /dim.la/, and /gab.vu/. Each infant was taught two words, one from the Onset-group, and the other from the Coda-group. The word from the Onset-group was mispronounced in the word-medial onset position (MP-Onset; CVC.CV; the underlined segment was mispronounced), and the word from the Coda-group was mispronounced in the word-medial coda position (MP-Coda; CVC.CV, the underlined segment was mispronounced), in other words, they received MP-Onset and MP-Coda test trials for different words. In addition to the mispronunciations (MPs) for the two labels, children also received correct pronunciations (CPs) of the two target labels. For example, the CP and MP-Onset of the target word /gæz.bi/ were /gæz.bi/ and /gæz. mi/, and the CP and MP-Coda of the target word /lad.vu/ were /lad.vu/ and /lag.vu/, respectively. Figure 2 presents an example of experimental design. Since we also wanted to explore whether this effect is more salient when the MPs involved more phonological features, we manipulated the stimuli in this way so that half the participants received MPs whose mispronounced segments deviated from the correct pronunciations in only one phonological feature (1-feature change), and the other half in two phonological features (2-feature change). The number of feature change, duration, amplitude, and maximum pitch of target labels in the CPs, MP-Onset, and MP-Coda Test trials are given in Table 1 and Table 2. Separate One-way ANOVAs were conducted to compare these acoustic measures among the three types of labels. The results showed that there was no systematic difference in the duration, F(2, 22) = 2.45, p = .110; however, we found a significant main effect of Type (CP, MP-Onset, MP-Coda) in the amplitude, F(2, 22) = 33.08, p < .001, η2 = .182, and the maximal pitch, F(2, 22) = 46.14, p < .001, η2 = .808. Post hoc pairwise comparisons with Bonferroni correction (multiplied p-values by 3) were conducted. The results revealed that MP-Onset and MP-Coda labels were significantly higher in both measures of amplitude, t(14) = 6.18, p < .001, t(14) = 8.50, p < .001, Figure 1. Sample visual stimulus pair. 452 Y. WANG AND A. SEIDL Word Trial Visual Stimuli Salience No audio Training (3 trials) “Look, it’s the gæz.bi. The gæz.bi!” Test: MP-Onset “Where is the gæz.bi? Can you find the gæz.bi?” “Where is the gæz.mi? Can you find the gæz.mi?” “Where is the gæz.bi? Can you find the gæz.bi?” “Where is the gæz.mi? Can you find the gæz.mi?” Salience No audio Training (3 trials) “Look, it’s the lag.vu. The lag.vu!” Test: CP gæz.bi Test: MP-Onset Test: CP “Where is the lag.vu? Can you find the lag.vu?” “Where is the lad.vu? Can you find the lad.vu?” “Where is the lag.vu? Can you find the lag.vu?” “Where is the lad.vu? Can you find the lad.vu?” Test: CP lag.vu Auditory stimuli Test: MP-Coda Test: CP Test: MP-Coda Figure 2. Experimental design. Table 1. Number of feature changes, duration (ms), maximum pitch (Hz), and intensity (dB) of the CPs and MPs for words from the Onset-group. CPs Feature 1 1 1 1 2 2 2 2 Label gaz.bi laz.di dul.mi ruv.ni rin.va din.la lad.vu gab.vu Dur (ms) 612 676 546 577 530 532 621 610 MPs-Onset Amp (dB) 74 72 74 74 70 73 74 73 F0 (Hz) 346 363 335 488 318 331 378 323 Label gaz.mi laz.gi dul.bi ruv.di rin.da din.ba lad.gu gab.mu Dur (ms) 580 680 584 608 597 583 722 581 Amp (dB) 75 78 80 79 76 77 77 80 F0 (Hz) 410 595 581 623 543 573 549 630 and maximal pitch, t(14) = 6.77, p < .001, t(14) = 10.48, p < .001, than CP labels. However, importantly, there was no significant difference between MP-Onset and MP-Coda labels for both the measures of the amplitude, t(15) = 0.17, p = 1, and maximal pitch, t(15) = 0.99, p = 1. 2 Procedure Participants were trained and tested using the Preferential Looking Procedure (Golinkoff et al., 1987) in a single-walled sound booth. The participants sat on a caregiver’s lap and watched images projected on the display at eye level. Each caregiver wore darkened sunglasses to avoid looking at the display, and was asked to avoid interfering with their child’s performance. Speech stimuli were presented through a central speaker beneath the display. A camera below the display recorded toddlers’ looking patterns. LANGUAGE LEARNING AND DEVELOPMENT 453 Table 2. Number of feature changes, duration (ms), maximum pitch (Hz), and intensity (dB) of the CPs and MPs for words from the Coda-group. CP MPs-Coda Feature Label Dur (ms) Amp (dB) F0 (Hz) Label Dur (ms) Amp (dB) F0 (Hz) 1 1 1 1 2 2 2 2 lad.vu gab.vu rin.va dim.la dul.mi ruv.mi gaz.bi laz.di 621 610 530 534 546 613 612 676 74 73 70 73 74 73 74 72 378 323 318 383 335 451 346 363 lag.vu gam.vu rid.va dibla dub.mi rud.mi gam.bi lag.di 741 699 662 547 579 585 671 662 78 79 78 78 78 77 76 79 582 578 615 598 529 565 622 630 The toddlers were taught two novel target words. For each target word, they received 8 trials: 1 Salience trial, 3 Training trials, and 4 Test trials (2 CP trials, and 2 MP trials). Thus, each child received 16 trials (8 trials x 2 words) in total. Salience trials served to reduce the difference in exposure time to the two objects, one of which was presented repeatedly during the Training (the “Target”), while the other (the “Distractor”) only appeared at Test. An attention-getter (smiling baby’s face) appeared preceding each trial to attract toddlers’ attention. When the toddlers looked at the attention-getter, the experimental trials began. In Salience trials, the Target and the Distractor were shown on the right and the left sides of the screen with no auditory stimulus. During each of the three Training trials, participants saw a picture (Target) shown in the middle of the screen, while listening to the target label, recorded within a carrier sentence. The sentences presented during Training trials were: “Look, it’s the [Target]. The [Target]!” During the four Test trials, the Target and the Distractor were shown simultaneously on the screen, as in Salience trials. The sentences presented in the test trials were, “Where is the [CP]/ [MP]? Can you find the [CP]/[MP]?” The Test trials were always played in the order of CP-MP-CPMP. This is because we expected that the task of representing word-medial segments while learning new words might be challenging for young children, and to have CPs come before MPs might reinforce their learning of these words with a complex syllable structure. Since our primary question of interest was not in comparing CPs to MPs, but rather in exploring potential differences between the two types of MPs (MP-Onset, MP-Coda), we were not concerned about creating biases with this strict ordering. Speech stimuli began 500 ms after the visual stimuli were shown on the display; the speech stimuli for each trial was 3500 ms in total; the visual stimuli remained on the display for another 1500 ms after the speech stimuli were completed. Thus, each trial was 5.5 s. A total of 32 stimuli orders were created by balancing the target objects, sides, and the location of the mispronounciation (Onset or Coda). Coding Videos of the participants’ looking patterns to objects on the display were digitized at 30 frames per second and coded offline frame by frame by a highly trained coder using SuperCoder software (Hollich, 2008). Participants’ looking directions to the left, right, or away, were coded by a skilled coder. This procedure allowed us to determine the total amount of time and longest looking time children spent looking to the Target and Distractor. The Test trials were divided into two phases: a pre-naming phase and a post-naming phase. The time window before the onset of the target words in each Test trial was the pre-naming phase. Children’s looking patterns during the pre-naming phase provide a baseline of target preference. In the post-naming phase, following previous research (Swingley & Aslin, 2000), looking time to each object in each Test trial was measured in a time window from 367–2000 ms from the beginning of 454 Y. WANG AND A. SEIDL Table 3. Mean (SD) of PTL and PLL measures separated by naming and trial type. Trial Type Measure PTL PLL Phase Pre-naming Post-naming Difference Pre-naming Post-naming Difference CP 0.444 (0.18) 0.590 (0.14) 0.146 (0.25) 0.442 (0.18) 0.606 (0.14) 0.164 (0.25) MP-Onset 0.508 (0.33) 0.406 (0.24) −0.102 (0.40) 0.509 (0.33) 0.388 (0.26) −0.121 (0.39) MP-Coda 0.444 (0.32) 0.584 (0.26) 0.140 (0.41) 0.444 (0.32) 0.595 (0.28) 0.151 (0.45) the target segment, i.e., the beginning of the fourth segment of CVC.CV (MP-Onset), and the third segment of CVC.CV (MP-Coda). This ensured that any change in the looking pattern occurred as result of the response to the spoken word. For dependent measures, following previous research (Mani & Plunkett, 2008; Swingley, 2009), we calculated the proportion of time that participants looked at the named target picture relative to the total looking time (either at the Target or the Distractor)—Proportion of Target Looking measure (PTL). In addition, we also calculated children’s longest looking time to the Target and Distractor within the same time window. The Proportion of Longest Looking measure (PLL) was calculated by dividing the longest time children spent looking at the Target by the sum of the longest looking to the Target and the Distractor. Given that children were tested with correct pronunciations in both CP-Onset and CP-coda trials, the average CPs were computed over the CP-Onset and CP-Coda, thus resulting in three different types of Test trials (CP, MP-Onset, MP-Coda) in the analyses. Table 3 presents the PTL and PLL measures for these three types of test trials in both the pre-naming and post-naming phases. Results A repeated measures ANOVA on PTL with Feature (1-feature change, 2-feature change) as betweensubjects factor, Naming (pre-naming, post-naming) and Trial type (CP, MP-Onset, MP-Coda) as within-subjects factors revealed a significant interaction of Naming and Trial type, F(1.72, 51.46, Huynh-Feldt corrected values) = 4.96, p = .014, η2 = .142. The only factor that approached significance was Naming, F(1, 51.46, Huynh-Feldt corrected values) = 2.94, p = .097. However, all other main factors or interactions were not significant, Fs < 2.62, ps > .116. To explore the source of the interaction, post hoc paired-samples t-tests were conducted to compare children’s target looking between pre-naming and post-naming phases. Figure 3 presents the effect of Naming using the PTL measure for the CP, MP-Onset, and MP-Coda Test trials. In CP trials, the results showed a significant increase in target looking from the pre-naming to the post-naming phase, t(31) = 3.31, p = .002 (all reported t-tests were two-tailed). This significant target preference provided evidence that the toddlers learned the novel words used in the experiment. In MP-Coda trials, the PTL was marginally higher in the post-naming than in the pre-naming phase, t(31) = 1.94, p = .062, however, in MP-Onset trials, no significant difference was observed between the pre- and post-naming phases, t(31) = −1.44, p = .161. In order to corroborate these results using a different, potentially more sensitive measure, we also conducted a repeated measures ANOVA on the PLL measure with Feature (1-feature change, 2feature change) as between-subjects factor, Naming (pre-naming, post-naming) and Trial type (CP, MP-Onset, MP-Coda) as within-subjects factors. The results revealed a significant interaction of Naming and Trial type, F(1.71, 51.26, Huynh-Feldt corrected values) = 5.99, p = .007, η2 = .166. The main factor of Naming was marginally significant, F(1, 51.26, Huynh-Feldt corrected values) = 3.09, p = .089. All other main factors or interactions were not significant, Fs < 2.12, ps > .137. Post hoc paired-samples t-tests demonstrated that in CP trials, the PLL measure was significantly higher in the post-naming phase than in the pre-naming phase, t(31) = 3.70, p = .001; In MP-Coda trials, the PTL was marginally higher in the post-naming phase than in the pre-naming phase, t(31) = 1.91, LANGUAGE LEARNING AND DEVELOPMENT 0.25 0.2 * 455 + Effect of naming PTL measure 0.15 0.1 0.05 CP MP-Onset 0 MP-Coda –0.05 –0.1 –0.15 –0.2 * The effect of naming was significant, p < .05. + The effect of naming was marginally significant, .05 < p < .1. Figure 3. Effect of naming in CP, MP-Onset, and MP-Coda trials using PTL measure. Error bars indicate standard error. p = .066. However, no significant difference was observed between the pre- and post-naming phases in the MP-Onset trials, t(31) = −1.77, p = .086. Figure 4 presents the effect of naming using the PLL measure. It should be noted that the effect of naming (the magnitude of change from pre-naming to postnaming) could be influenced by the measures taken at the baseline, such that they determined how much room there was to move from the pre-naming phase to the post-naming phase. In order to address this potential confound, repeated measures ANOVAs on PTL and PLL with Feature (1feature change, 2-feature change) as between-subjects factor, and Trial type (CP, MP-Onset, MPCoda) as within-subjects factors were conducted for pre-naming phase. No significant main effects or interactions were found using either PTL, Fs <.94, p > .382, or PLL, Fs < .93, p > .387. Taken together, these results suggest that: (1) differences between pre- and post-naming phases following CP and MP-Coda that we found in the previous set of analyses were indeed attributable to the effect of naming;3 and (2) children recognized the target objects when hearing correctly pronounced labels and labels with mispronounced codas, however, they failed to recognize target objects when hearing labels with mispronounced onsets. Discussion The primary goal of the current study was to explore syllable positional effects on children’s sensitivity to the mispronunciation of recently learned words. Although a considerable body of research has documented infants’/children’s sensitivity to mispronunciations in both word-initial and word-final positions, no previous study has systematically examined whether children show asymmetrical patterns in recognizing mispronunciations in word-medial onset vs. coda positions. Our study suggests that children show a positional asymmetry while learning novel words that is unrelated to word edge effects. Specifically, children looked longer at the target object when the label was correctly pronounced or was mispronounced in word-medial coda, but failed to do so when the mispronunciation occurred in word-medial onset position. The secondary goal of this research was 456 Y. WANG AND A. SEIDL 0.25 * + 0.2 Effect of naming PLL measure 0.15 0.1 0.05 CP MP-Onset 0 MP-Coda –0.05 –0.1 –0.15 –0.2 * The effect of naming was significant, p < .05. + The effect of naming was marginally significant, .05 < p < .1. Figure 4. Effect of naming in CP, MP-Onset, and MP-Coda trials using PLL measure. Error bars indicate standard error. Error bars indicate standard error. to examine the degree of phonological deviation in children’s ability to process segmental contrasts. Previous research examining this question has yielded mixed results. Our study shows that children are equally sensitive to MPs that deviate from CPs in one phonological feature and two phonological features in lexical processing. With respect to our primary research question, the results are consistent with prior studies showing an asymmetrical learning of phonotactic patterns restricted to either word-medial onset or coda position in younger infants (Wang & Seidl, 2015). While this finding, at first glance, seems to contrast with Swingley (2009) and Nazzi (2005)’s studies reporting a symmetrical representation of word-edge onsets and codas in children at younger ages, there is of course a crucial difference: word edges are special. In addition, given the task of word learning is cognitively more demanding than the task of familiar word recognition, it is possible that children’s symmetrical sensitivity to wordinitial and -final mispronunciations in Nazzi (2005) and Swingley (2009)’s work is the result of a ceiling effect. However, while from the second year on children are able to represent detailed phonological information correctly in both word-initial and word-final positions of familiar words, when exposed to a more challenging word learning task in which mispronunciations occur in word-medial positions, children show onset biases. These biases may appear due to the greater cognitive demands of complex syllable structure in word learning or may simply reflect the lesser salience of word-medial positions, either way, it is clear that not all positions may be treated equally. One may argue that children’s better representation of sounds in onset position over coda position in our study might be due to the fact that onsets are acoustically more salient than codas. However, this is less likely in word-medial position in English trochees in which codas appear in the stressed syllable, and onsets in the unstressed syllable. Indeed, in a recent study, Wang and Seidl (2015) compared the acoustics of word-medial onsets and codas in bisyllabic trochees with the structure CVC.CVC, and found that the sounds in the word-medial codas were greater in both amplitude and F0 than the word-medial onsets. Thus, the acoustic advantage of onsets over codas LANGUAGE LEARNING AND DEVELOPMENT 457 that is typical at word-edges disappears in these word-medial positions, allowing us to disentangle positional effects with acoustic saliency. While disentangling the word-edge effects, sequence effects, as well as acoustic saliency of onsets and codas, this study reveals for the first time children’s asymmetrical sensitivity to onsets and codas in the word-medial position in newly learned words. One possible interpretation of this processing advantage of onset over coda position is that children may have built more detailed phonological representations of sounds appearing in onset position. Another possibility is that children may have built detailed phonological representations of sounds in both onset and coda positions, however, they are more willing to accommodate a change in coda position than in onset position, such that mispronunciations in onset, but not in coda position, hinder word recognition. Interestingly, the asymmetrical processing of word-medial onsets and codas may account for why the world’s languages are the way they are. Specifically, despite the range of syllable structures present in the languages of the world, typological investigations have demonstrated that CV is the only syllable type that is always licit across the world languages. Given the complexity of language structures to be learned by children, a potential role of onset biases are to direct learners’ attention to the kind of information that is needed to further develop their perceptual capacity. With respect to our secondary research questions, we do not find graded sensitivity to the degree of phonological deviation on word recognition. This result is in line with Bailey and Plunkett (2002) and Swingley and Aslin (2002)’s findings that children up to 23 months responded similarly to single-feature and multiple-feature mispronunciations. However, our results differ from White and Morgan (2008)’s findings that 19-month-olds showed linearly graded sensitivity to mispronunciations involving one, two and three phonological features. Specifically, children showed significant increases in looking toward the familiar object in the correct and one-feature conditions, but not in two- or three-feature conditions. The discrepancies between our studies may be due to the referential status of the visual competitors. Specifically, they presented children with a familiar and an unfamiliar object. In contrast, children were presented with novel objects and novel labels in our study. It is also likely that since the children in our study were older, their phonological representation of newly learned words is more detailed. Finally, another possibility is that the differences in the feature changes involved in the two studies result in different magnitudes of acoustic-phonetic mismatch; however, only future studies with systematic investigations can tease these explanations apart. Last but not least, this study adds to the existing body of literature explaining characteristics of children’s early speech production. When children begin to produce meaningful words, coda consonants are often omitted (e.g., Altvater-Mackensen & Fikkert, 2010; Demuth & Fee, 1995) and CV syllables are acquired earlier than CVC syllables in production (Levelt et al., 2000). Recent experimental evidence and theoretical models claim that children use the same lexical representation for speech perception and production (Altvater-Mackensen & Fikkert, 2010; Levelt et al., 1999), thus, the coda omission and later acquisition of CVC syllables as opposed to CV syllables may be the consequence of children’s better sensitivity to onset over coda position in their lexical representations. Consequently, we may also predict that word-medial onsets will be produced earlier than word-medial codas in children’s output in a way driven by the asymmetrical nature of their phonological representations of sounds in these two syllable positions. The current study is limited in several respects. First, a potential confound which may impact children’s sensitivity to word-medial onsets and codas is that in word-medial positions, codas are often undergo regressive assimilation to the place and voicing features of their following consonants. In contrast, word-medial onsets are less vulnerable to such changes. Thus, it is possible that children’s lesser sensitivity to coda mispronunciations may due to the fact that codas simply allow for more changes in the input. Second, as we noted earlier, codas were always in stressed syllables, while onsets were always in unstressed syllables. Thus, given that onsets were located at the locus of the change to the second syllable and as well as a change in stress, it is possible that these facts may make the onset easier to notice than coda, resulting in children’s higher sensitivity to onsets than 458 Y. WANG AND A. SEIDL codas. Unfortunately, current study is not able to address this issue. It would be thus interesting for future work to replicate the present study with other types of word forms such that the coda patterns are in the unstressed syllable and the onset patterns in the stressed syllable in order to examine possible interactions between syllable position and word stress. In addition, this confound could also be eliminated by examining trisyllabic or polysyllabic words with the target onset and coda restricted at the transition of two syllables with relatively equal stress.4 Despite the above-mentioned caveats, we have demonstrated that 24-month-old English-learning children have an asymmetrical sensitivity to word-medial onsets and codas, indicating a potential role of phonological biases in children’s word processing. The asymmetrical patterns that we see in speech perception may also predict children’s production patterns during the course of language acquisition. In addition, the research provides evidence that by the age of 24 months, children have very sophisticated phonological knowledge in words such that minimal phonetic deviation in onset position may affect word recognition. Notes 1. Although we cannot be sure how infants in our study have interpreted the stimuli, we think it is more likely that they have interpreted them as bisyllabic trochees as it is a more common pattern in English. With regard to how children syllabify the stimuli, we think it is more likely they interpreted the stimuli as CVC. CVC, due to the effect of Weight-Stress Principle and Maximal Onset Principle on syllabification (e.g., Hayes, 1995; Kahn, 1976). In addition, most of these consonant-consonant sequences in the word-medial position, e.g, zm, zg, vd, violate the phonotactic constraints for being a consonant cluster within one syllable position. 2. However, these differences may be less likely to affect the results. Indeed our results go in the opposite direction from the acoustic analyses: children show better recognition of target labels in CP trials than in MP trials. We suspect that the different acoustic measurements between CPs and MPs may due to the fact that the speaker emphasized the MPs because these were relatively new labels compared to the CPs. Crucially, however, MPonset and MP-coda measures were not distinct prosodically. 3. We also evaluated the possible role of children’s vocabulary size on their representation of segments in wordmedial onset and coda positions. The expressive vocabulary size of these 32 children ranged from 14–100 (M = 55.41, SD = 26.24), as calculated from parental CDI reports. Vocabulary size was strongly correlated with children’s age, r = 0.453, p = .009. However, we did not find any correlation between children’s vocabulary size and the effect of naming in any of the Test trials for either PTL measure (/r/s < .165, ps > 0.368) or PLL measure (/r/s < 0.187, ps > 0.306). These results are consistent with previous findings showing no evidence of a relationship between vocabulary size and infants’ sensitivity to mispronunciations (Ballem & Plunkett, 2005; Swingley, 2009). 4. We thank our anonymous reviewers for suggestions related to this discussion. References Altvater-Mackensen, N., & Fikkert, P. (2010). The acquisition of the stop-fricative contrast in perception and production. Lingua, 120(8), 1898–1909. doi:10.1016/j.lingua.2010.02.010 Archer, S., Zamuner, T. S., Engel, K., Fais, L., & Curtin, S. (2016). Infants’ discrimination of consonants: Interplay between word position and acoustic saliency. Language Learning and Development, 12(1), 60–78. Bailey, T. M., & Plunkett, K. (2002). Phonological specificity in early words. Cognitive Development, 17(2), 1265–1282. doi:10.1016/S0885-2014(02)00116-8 Ballem, K. D., & Plunkett, K. (2005). Phonological specificity in children at 1;2. Journal of Child Language, 32(1), 159– 173. doi:doi:10.1017/S0305000904006567 Benavides-Varela, S., & Mehler, J. (2015). Verbal positional memory in 7–olds. Child Development, 86(1), 209–223. doi:10.1111/cdev.12291 Blevins, J. (1996). The syllable in phonological theory. In J. A. Goldsmith (Ed.), The Handbook of Phonological Theory. Oxford, UK: Blackwell Publishing. Capitani, E., Della Sala, S., Logie, R. H., & Spinnler, H. (1992). Recency, primacy, and memory: Reappraising and standardising the serial position curve. Cortex, 28(3), 315–342. doi:10.1016/S0010-9452(13)80143-8 Clarke, C. M., & Garrett, M. F. (2004). Rapid adaptation to foreign-accented English. The Journal of the Acoustical Society of America, 116(6), 3647–3658. doi:10.1121/1.1815131 LANGUAGE LEARNING AND DEVELOPMENT 459 Demuth, K. (1995). Markedness and the development of prosodic structure. In J. N. Beckman (Ed.), NELS 25: Proceedings of the North East Linguistics Society (pp. 13–25). GLSA (Graduate Linguistic Student Association), Dept. of Linguistics, University of Massachusetts, Amherst, MA. Demuth, K., & Fee, E. J. (1995). Minimal prosodic words in early phonological development. Unpublished manuscript. Providence, RI: Brown University; Halifax, Canaca: Dalhousie University. Denes, P. (1955). Effect of duration on the perception of voicing. The Journal of the Acoustical Society of America, 27(4), 761–764. doi:10.1121/1.1908020 Endress, A. D., & Mehler, J. (2010). Perceptual constraints in phonotactic learning. Journal of Experimental Psychology: Human Perception and Performance, 36(1), 235–250. doi:10.1037/a0017164 Fennell, C. T., & Werker, J. F. (2003). Early word learners’ ability to access phonetic detail in well-known words. Language and Speech, 46(2–3), 245–264. doi:10.1177/00238309030460020901 Fenson, L., Pethick, S., Renda, C., Cox, J. L., Dale, P. S., & Reznick, J. S. (2000). Short-form versions of the MacArthur communicative development inventories. Applied Psycholinguistics, 21(01), 95–116. doi:10.1017/ S0142716400001053 Golinkoff, R. M., Hirsh-Pasek, K., Cauley, K. M., & Gordon, L. (1987). The eyes have it: Lexical and syntactic comprehension in a new paradigm. Journal of Child Language, 14(01), 23–45. doi:10.1017/S030500090001271X Goswami, U. C., & Bryant, P. (1990). Phonological skills and learning to read: Essays in developmental psychology. New York: Psychology Press. Hayes, B. (1995). Metrical stress theory: principles and case studies. Chicago, IL: Chicago University Press. Hollich, G. (2008). Supercoder: A program for coding preferential looking (Version 1.7.1). [Computer Software]. West Lafayette, IN: Purdue University. Jusczyk, P. W., & Aslin, R. N. (1995). Infants′ Detection of the Sound Patterns of Words in Fluent Speech. Cognitive Psychology, 29(1), 1–23. doi:10.1006/cogp.1995.1010 Kahn, D. (1976). Syllable-based generalizations in English phonology. (Ph.D.), Massachusetts Institute of Technology, Massachusetts, MA. Levelt, C. C. (2012). Perception mirrors production in 14- and 18-month-olds: The case of coda consonants. Cognition, 123(1), 174–179. doi:10.1016/j.cognition.2011.12.001 Levelt, C. C., Schiller, N. O., & Levelt, W. J. (2000). The acquisition of syllable types. Language Acquisition, 8(3), 237–264. doi:10.1207/S15327817LA0803_2 Levelt, W. J., Roelofs, A., & Meyer, A. S. (1999). A theory of lexical access in speech production. Behavioral Brain Sciences, 22(1), 1–38. doi:10.1017/S0140525X99001776 Magnuson, J. S., Dixon, J. A., Tanenhaus, M. K., & Aslin, R. N. (2007). The dynamics of lexical competition during spoken word recognition. Cognitive Science, 31(1), 133–156. doi:10.1080/03640210709336987 Mani, N., & Plunkett, K. (2007). Phonological specificity of vowels and consonants in early lexical representations. Journal of Memory and Language, 57(2), 252–272. doi:10.1016/j.jml.2007.03.005 Mani, N., & Plunkett, K. (2008). Fourteen-month-olds pay attention to vowels in novel words. Developmental Science, 11(1), 53–59. doi:10.1111/j.1467-7687.2007.00645.x Mani, N., & Plunkett, K. (2010). Twelve-month-olds know their cups From their Keps and Tups. Infancy, 15(5), 445–470. doi:10.1111/j.1532-7078.2009.00027.x McMurray, B., Tanenhaus, M. K., Aslin, R. N., & Spivey, M. J. (2003). Probabilistic constraint satisfaction at the lexical/phonetic interface: Evidence for gradient effects of within-category VOT on lexical access. Journal of Psycholinguistic Research, 32(1), 77–97. doi:10.1023/A:1021937116271 Miller, J. L., & Eimas, P. D. (1983). Studies on the categorization of speech by infants. Cognition, 13(2), 135–165. doi:10.1016/0010-0277(83)90020-3 Nazzi, T. (2005). Use of phonetic specificity during the acquisition of new words: Differences between consonants and vowels. Cognition, 98(1), 13–30. doi:10.1016/j.cognition.2004.10.005 Prince, A., & Smolensky, P. (1993). Optimality theory: Constraint interaction in generative grammar. Boulder, NJ: Rutgers University, New Brunswick and University of Colorado. Schmale, R., Cristia, A., & Seidl, A. (2012). Toddlers recognize words in an unfamiliar accent after brief exposure. Developmental Science, 15(6), 732–738. doi:10.1111/j.1467-7687.2012.01175.x Swingley, D. (2003). Phonetic detail in the developing lexicon. Language and Speech, 46(2–3), 265–294. doi:10.1177/ 00238309030460021001 Swingley, D. (2005). 11-month-old’s knowledge of how familiar words sound. Developmental Science, 8(5), 432–443. doi:10.1111/j.1467-7687.2005.00432.x Swingley, D. (2009). Onsets and codas in 1.5-year-olds’ word recognition. Journal of Memory and Language, 60(2), 252–269. doi:10.1016/j.jml.2008.11.003 Swingley, D., & Aslin, R. N. (2000). Spoken word recognition and lexical representation in very young children. Cognition, 76(2), 147–166. doi:10.1016/S0010-0277(00)00081-0 Swingley, D., & Aslin, R. N. (2002). Lexical neighborhoods and the word-form representations of 14-month-olds. Psychological Science, 13(5), 480–484. doi:10.1111/1467-9280.00485 460 Y. WANG AND A. SEIDL Wang, Y., & Seidl, A. (2015). The learnability of phonotactic patterns in onset and coda positions. Language Learning and Development, 11(1), 1–17. doi:10.1080/15475441.2013.876270 Werker, J. F., Fennell, C. T., Corcoran, K. M., & Stager, C. L. (2002). Infants’ ability to learn phonetically similar words: effects of age and vocabulary size. Infancy, 3(1), 1–30. doi:10.1207/S15327078IN0301_1 White, K. S., & Morgan, J. L. (2008). Sub-segmental detail in early lexical representations. Journal of Memory and Language, 59(1), 114–132. doi:10.1016/j.jml.2008.03.001 Zamuner, T. S. (2006). Sensitivity to word-final phonotactics in 9–16-month-old infants. Infancy, 10(1), 77–95. doi:10.1207/s15327078in1001_5 Zamuner, T. S. (2013). Perceptual evidence for young children’s developing knowledge of phonotactic probabilities. Language Acquisition, 20(3), 241–253. doi:10.1080/10489223.2013.796951
© Copyright 2026 Paperzz