THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA VOLUME 29, NUMBER 5 MAY, 1957 Perception of Vowels Heard in Noises of Various Spectra* J. M. PICKETT OperationalApplicationsLaboratory,Air ForceCambridgeResearchCenter,Bolling Air ForceBase25, D.C. (ReceivedDecember3, 1956) The perceptionof vowelsheard in noisesof variousspectrais analyzedby meansof stimulus-response matrices.The stimulusvowelswere spokenin PB-word lists and in syllablelists in which the vowelswere equally probable.The matricesshowshiftsin vowel confusions dependingon how differentnoisespectra mask the vowel formants. Vowel duration and intensity are measuredand related to vowel perception. Vowel guessingis related to past training. HISpaper presents ananalysis oftheintelligibilityreplacementand spokenin a short carrier phrase.The and confusionof vowelsheardat low signallevels bVb tests were presented through earphonesand the noises. The data are of interest for two PB-word tests were presentedwith the earphonesfor reasons'first, they provide a basisfor selectingvowels sometests, and over loudspeakersfor other tests. The for use in noise, and, second,they are relevant to the speechwas recordedfor the tests with bVb syllables; the PB wordswere read into a microphone,amplified, formant theory o[ vowel perception. The roleof the formantsin vowelperceptionhasbeen and presentedto listeners. Four noisesignalswere obtained by RC filtering of studied largely with synthesizedvowels rather than spokenvowelsin order that formant frequenciesmay the output of a random noise sourceproviding white be preciselymanipulatedJ Spokenformants may be noiserestrictedto the range0-20 000 cy. These signals studiedby maskingthem with noiseand constructing had spectrawith very nearly linear slopesof +6, 0, --6, stimulus-response matricesas did Miller and Nicely in and --10 db/octave between 200 and 7000 cy as analyzingthe perceivedfeaturesof consonants. 2 in our measuredper cyclewith a wave analyzer;the respective caselisteners'vowel responseswill be groupedin the acousticreproductionsof these noiseswill be referred matricesaccordingto the typical frequencyregionsof to belowashighfrequency,flat, low frequency,and very their formants. Spokenvowels also differ perceivably low frequency. These terms are descriptive of the in durationand intensitya and the contributionof these frequencyregionof speechwhich the noiseswill mask, cuesto vowelperceptionmay be isolatedwhenpatterns sincethe speechand noisesignalswere reproducedby the sametransducer.The noiseand speechsignalswere of formant frequencyare maskedby noise. mixed electrically, power-amplified,and then applied PROCEDURES to the earphonesor loudspeakers.The earphoneshad a real ear responsewhich was flat 4-3 db from 200 to Two setsof test syllableswereused:one artificial set 7000 cy. The loudspeakers'responsecurve, varying in which the vowelsoccurredwith equalprobability and 4-5 db, wasflat from 100 to 1200cy, then fell 4 db from anothersetin which the vowelsoccurredapproximately as they do in English.The artificialsyllableswereof the 1200 to 3000 cy and 8 db from 3000 to 7000 cy. The form bVb; lists of 60 syllableswere formed by using soundpressurelevel of the noisesat 2000 cy was 45 to 50 db per cyclere 0.0002dyne/cm•'.The soundpressure five of each of the twelve American vowels. 4 The set of level of speech,dependingon test material and noise English syllablesconsistedof the Harvard PB words, 1000 syllablesphonetically balanced in lists of 50 spectrum,rangedfrom 50 db to 83 db, over all longrms. ratio) was measuredwith a VU to match approximatelythe frequencyof occurrence of SIN (signal-to-noise meter at the output of the power amplifier.The microthe speechsoundsin English.In both setsof tests,the phone, recorder, and amplifiers introduced negligible syllableswere drawn randomlyfrom the lists without distortionand had flat responsecurves4-« db from 50 * This is AFCRC Technical Note 56-12 of the Operational to 15 000 cy. ApplicationsLaboratory, Air Force CambridgeResearchCenter, The talkers were college-educated men 23 to 33 years in broad-band Bolling Air Force Base 25, D.C. ASTIA Document No.: AD old. Three talkers were used for the bVb tests and four 110057. • Delattre, Liberman, Cooper, and Gerstman, Word 8, 195 (1952); R. K. Potter and J. C. Steinberg,J. Acoust.Soc.Am. 22, 807 (1950); G. E. Petersonand H. L. Barney, J. Acoust. Soc. Am. 24, 175 (1952); R. L. Miller, J. Acoust.Soc.Am. 25, 114-121 with (1953). the monitored 2 G. A. Miller and P. E. Nicely, J. Acoust.Soc.Am. 27, 338 PB words. Two talkers were common to both crews.They were trained to monitor their speechso that the averageintensity of the carrier phrasefell at level and the test words fell at their own natural levels. The listeners were made thoroughly 0955). aW. R. Tiffany, J. SpeechandHearingDisorders18, 289 (1953). familiar with the syllablesand other conditionsby pre4The vowelsusedare referredto belowby their symbolsin the InternationalPhoneticAlphabet and a key word for pronuncia- liminary tests affordingpractice at all rangesof inteltion follows each: u (boob); u (book), o (boat), o (bought), a (Bob), A (bud), , (Bert), ae (bat), e (bet), e (babe), I (bib), i (beet). ligibility. Five listenerswrote the wordsthey heardin the PB word tests. For the bVb tests, six listeners 613 Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 192.87.79.51 On: Tue, 21 Oct 2014 10:32:57 614 j.M. PICKETT recordedthe vowelsthey heard by a simplephonetic code.Theselistenersknew that the vowelswere equally probable,but they did not consciously keep countsof vowels during a test list. Listeners respondedto all probable.The vowelsof the matricesare arrangedin three groupsaccordingto whether they have generally low, medium,or high frequencyof the secondformant. Within thesegroupsthe vowelsare arrangedin order stimuli. of increasingfrequencyof the first formant.5 To make The vowel or diphthongof eachresponsewas tabu- it easierto seethe salient features of the matrices, the lated in a stimulus-response matrix witk rows for the confusionswere systematicallydegradedto give only spokenvowel stimuli and columnsfor the response four gradesof confusionfrequency:2 to 5%, 5 to 10%, vowels of the listeners. The entries in the final matrix 10 to 20%, and greater than 20% of all responses to gave the numberof timesthat eachresponse occurred each stimulus vowel. These were entered in new matrices to each stimulus. Results with different talkers, noise usingdots gradedin four sizesand eliminatingcorrect Larger sizeof dot representsmoreconfusion. spectra,andS/N's weretabulatedin separatematrices. responses. Reference to the "Formant Key" of Fig. 2 will These werethen pooledto showthe main effectof noise spectrumon confusion patterns.Effectsdueto different illustrate the following interpretations of confusion talkers were relatively minor. patterns' (1) Confusionswhich fall near the main diagonal indicate correct perceptionof the frequency RESULTS WITH VOWELS EQUALLY PROBABLE region of the secondformant. Large rectanglesare Table I showsthe pooledmatricesresultingfrom the formedby groupingthe vowelsinto our second-formant ;bVb tests where the stimulus vowels were equally categoriesand theseare shadedso that one can easily TABLe.I. Vowelsheard in variousnoisespectrawhen stimulusvowelsare equally probable. Part A' low-frequency noise,SIN =- 20 db.•' Responsevowel u u o 202 34 12 225 10 34 6 2 9 33 5 6 9 7 115 11 16 34 16 6 12 3 20 242 4 8 341 26 5 3 13 1 8 10 4 8 4 71 12 14 8 29 11 2 7 28 6 286 18 1 6 268 2 5 13 379 14 5 1 8 16 6 6 7 3 1 2 4 2 4 28 5 9 17 4 7 7 10 3 30 4 8 1 1 4 11 279 18 44 7 5 18 257 10 16 1 292 20 10 283 25 2 2 51 14 13 4 4 5 51 2 6 o •r A a Part B' Flat noise,S/N=- i I 3 7 3 8 11 3 10 2 5 2 3 2 15 3 31 1 5 16 8 11 19 11 5 5 3 24 309 11 and --15 db.b Responsevowel u U o 244 12 5 12 318 8 2 16 285 19 1 56 6 13 9 5 64 1 7 o , A a i 1 1 1 18 33 9 1 3 206 4 2 428 1 7 18 2 17 30 3 349 9 4 353 1 2 4 475 15 1 1 22 3 3 3 2 6 1 5 414 6 15 6 2 7 24 19 5 Each vowel spoken 405 times; vowel intelligibility, Each vowel spoken 495 times; vowel intelligibility, 1 10 4 2 4 50 4 2 1 33 1 13 10 106 4 7 6 1 141 1 10 5 32 6 5 2 35 1 4 14 67 6 15 7 388 2 13 1 5 427 4 2 2 16 1 395 3 9 ! 8 424 69.3%. 75.7%. 5R. K. Potter and J. C. Steinberg,J. Acoust.Soc.Am. 22, 807 (1950); G. E. Petersonand H. L. Barney,J. Acoust.Soc.Am. 24, 175 (1952). Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 192.87.79.51 On: Tue, 21 Oct 2014 10:32:57 PERCEPTION OF VOWELS IN NOISE 615 TABLe. I.--Continued. Part C' High-frequencynoise,S/N--- 30 db.e Responsevowel o U u o 132 1 2 162 1 12 5 El' o , 2 3 A 2 2 179 1 2 252 22 16 2 23 177 1 i a I e 124 1 6 80 12 7 65 3 5 3 195 1 1 8 1 19 5 62 1 266 31 2 4 2 239 51 218 1 1 13 1 11 1 1 34 1 1 250 227 23 Part D. High-frequencynoise,S/N----40 2 234 db.d Responsevowel u u u o • • o u 8 113 o o 88 12 5 1 • 12 16 A a '• i 2 2 36 2 o 2 6 87 3 , 1 145 48 26 9 1 21 9 2 1 I 4 60 7 2 e 16 3 26 1 e 1 7 4 16 m 1 1 3 42 Each vowel spoken 270 times; vowel intelligibility. Each vowel spoken 270 times; vowel intelligibility, A 3 12 17 6 17 2 16 81 4 7 123 i a 5 3 45 6 e g a• 1 20 148 6 7 3 7 85 3 2 12 5 139 4 1 11 4 18 1 3 2 52 5 4 4 2 19 5 36 5 14 72 2 23 3 1 6 2 19 2O3 7 19 3 8 6 163 1 11 9 5 4 187 3 9 5 7 3 155 15 4 1 0 15 130 229 1 9 9 2 4 i 5 1 2 8 42 78.1%. 52.6%. see the generalextent to which the secondformant is frequencyrange rather than in the middle. This may correctlyheard.(2) Whenconfusions fall in the unshaded be partially due to the more frequent occurrenceof rectangles,the secondformant is not heard correctly. low and high secondformantsin English. (3) When confusions fall alongthe parallel diagonalof When the confusionsin high- and low-frequency an unshadedrectangle,the first formant is correctly noiseare compared,the effectsof different amountsof heard, since within these categories,the vowels are maskingof the first and secondformants can be seen. arrangedin orderof increasingfirst formant frequency. First, with high-frequencyand flat noisethere is more For the testswith syllableshaving equal vowelprob- tendencyto confusethe secondformants,as shownby abilities, the simplified confusionmatrices are shown the greaternumberof confusions in the unshadedrecin Fig. 1. There is a strongtendencyfor the confusions tangles. With low-frequencynoise many more conto fall alongthe diagonalwithin categoriesof secondfusions occur within the shaded rectangles showing formant frequency. This is particularly so with the greaterhigh-frequency maskingprovidedby the high- grosslycorrectperceptionof the frequencyregionof the second formant and confusion of the first formant. ø frequency and flat noises. Thus, when the second formant is masked, confusionsoccur between vowels 6The effectsseenwith different masking spectraare analagous mainly distinguished by differences in this feature,and to resultsobtainedby Delattre et al.• with identificationof synthe response vowelsas heard tend rather closelyto be thetic vowels having but one formant. When low formant frequencieswere presentedback vowelswith corresponding first given the correctfirst formant. Confusionof the two formant frequencieswereheard.When the formant wasincreased extreme values of the second formant is much more commonthan confusionwith a middle frequencyposi- in frequencyfront vowelswere assignedand accordingto their secondformant frequencies.In the present study, however, it appearsthat the presence of the maskedsecondformantis always detected or assumedby the listener and hence assignmentsto tion. Apparently,whenthe second formantis not heard, the listener tends to assignit at the extremesof its front vowelsare always present. Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 192.87.79.51 On: Tue, 21 Oct 2014 10:32:57 616 j.M. LOW-FREQUENGY NOISE PICKETT FLAT NOISE , ß u u 'u' o o o A 0 i i e e Fro. 1. Simplified patterns of confusion among vowels heard in noisesof various spectra. The stimulus vowels (rows) occurred with equal frequency. Original data HIGH-FREOUENOY NOISE.S/N - -30 # HIGH-FREQUENCY NOISE,S/N ß- 40 ß i are shown in Table I. The formant key of Fig. 2 showshow the vowelswere groupedaccording to the typical frequencyregionsof B u their formants. Amount of con- fusion is indicated in four grades by the size of dot entered in the confusion cells. When the dots fall heavily in the shadedsquares, the first formant is masked by the noise and the second is heard; when they fall on the diagonalsof the large, unshadedrectangles,the o D A first formant is heard second is masked. and the i I VOWELS EQUALLY PROBABLE the confusedpairs i-u, I-U, e-o, e-a, and ae-Aform a series having rising frequency of the first formant, whereeachpair has a first formant frequencywhich is There is a rather remarkable constriction of the commonto the pair but differentfrom the next pair. vowel confusionsto the diagonals of the confusion However, in the utteranceof averagetalkers there is categories. This is probablydue to the joint operation overlapof formant frequenciesin this series.Therefore, of cuesof duration and formant frequency.Particularly, cuesof vowel duration must also be a factor in forcing confusions into the diagonalcells.8 The two rather long pairs i-u and e-o are separatedby a short pair, I-U. TA]rr.•. II. Intensities of vowels and their intelligibility in noise. Thesethree pairs showthe neatestconfusionrelations. The next pairs, ae-Aand e-a are respectivelylong-short Intelligibility, % correct and short-longand hencethey are mutually confused Mean relative --6/oct. Flat +6/oct. but not often with members of the first three pairs Vowel intensity, db noise noise nome The very commonconfusionof the vowel pairs i-u, I-U, and e-o was first demonstratedwith white noiseby Moser and I)reher 7 and is confirmedby our results. 93.6 96.0 91.6 --0.1 0.0 84.2 86.5 73.5 --0.6 66.2 71.3 58.8 --0.8 --0.9 -- 1.0 -- 1.2 70.6 55.6 59.8 69.9 70.5 64.2 57.6 79.8 47.7 50.9 49.2 70.7 --1.3 -- 1.7 --2.1 76.3 63.5 72.1 85.7 78.4 86.3 67.3 70.5 80.9 --3.3 49.9 49.3 40.7 --4.9 68.9 83.6 81.8 which have lower first formants. In order to see what effect the natural intensity differencesof different vowels might have on their intelligibility in noise, the intensitiesof all of the 8 Measurements of the duration of five of each stimulus vowel of eachof the three talkersgave the followingmeandurationsin seconds: •e, 0.328; a, 0.323; o, 0.322; e, 0.319; o, 0.298;•, 0.288; u, 0.285; i, 0.284; e, 0.238; 3_,0.227; u, 0.227; I, 0.207ßThese durations were surprisinglyinvariable among utterancesand talkers. The evidence that duration is used as a cue can be seen by collapsingconfusions into smallermatricesand lookingfor a long-short"axis"acrossthe matrixßThe threedurationcategories ?H. M. MoserandJ. J. Dreher,"Phonemic confusion vectors," of long vowels•e, a, o, e, medium vowelso, •, u, i, and short Air Force CambridgeResearchCenter, Report TR 54-84, 1954. vowels,I, e, 3_,U, showthe effect especiallywell. Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 192.87.79.51 On: Tue, 21 Oct 2014 10:32:57 PERCEPTION OF VOWELS IN NOISE 617 stimuls vowelsin bVb syllableswere measuredwith a the vowelsu and u are least intelligible.This result is VU meter. Table II showsthe mean relative intensity grosslyconsistentwith differencesamong vowels in of eachvowelalongwith its intelligibility in eachnoise. thresholdof detectabilityin quiet asfound by Tiffany? The relationsbetweenmeanintensityand intelligibility were evaluated by rank correlations.The relation is RESULTS WITH VOWELS OF PB WORDS statistically significant only in low-frequencynoise (p=0.05). Apparentlyvowelsof highnaturalintensity are more intelligible only when both formants are heard in some degree. When the secondformant is maskedby flat or high-frequencynoise,high intensity of the remaining first formant cannot increaseintelligibility. Insofar as general resistanceto noise is concerned, the vowelsa, •, and e are most intelligiblein noiseand In order to extendthe principleseento hold for the maskingof vowelformantsin bVb syllables,we undertook an identicalanalysisfor the vowelsand diphthongs of PB words.Ourtalkersmighthaveemphasized]the formantsto better distinguishamongthe bVbsyllables? Also the variety of consonantmovementsin the PB syllablesmight changevowel formantsin ways that TA..•. III. Vowelresponses to PB wordsheardin variousnoisespectra. Part A' Very low frequencynoise,S/N-- 16 and --18 dbP Responsevowel o 36 25 1 21 9 7 8 9 6 o , 2 2 58 27 10 53 4 14 11 1 3 8 A 1 1 4 23 5 1 4 1 ! 4 2 15 1 3 4 4 ai 1 3 ' 2 a 4 2 3 6 5 129 22 i 1 83 15 1 7 6 7 7 1 4 m 3 2 1 1 3 1 18 2 3 5 3 8 2 23 8 22 7 2 21 146 9 19 7 95 10 10 7 20 10 119 24 7 6 6 3 77 1 ! 1 4 2 1 2 1 e 1 1 4 5 e 5 5 5 1 2 4 1 I 1 1 2 au 1 1 1 1 2 10 2 4 2 4 1 14 5 8 3 4 3 14 205 1 1 2 2 8 11 2 118 1 iu 1 1 Part B- Low-frequencynoise,S/N-- ai 2 1 1 2 3 4 3 2 1 2 46 1 5 au iu 1 5 1 2 7 1 12 db.b Responsevowel o 43 4 13 2 g . o A a • i '• m 4 3 1 7 I 9 e 2 e 2 ae 2 ai au iu •r A a i I e 17 5 2 1 1 2 1 3 2 1 5 3 9 1 4 2 81 11 3 58 5 1 10 4 10 3 1 1 3 3 1 4 3 25 14 1 ae ai 1 4 2 1 3 2 13 7 1 4 2 3 134 6 4 11 3 5 3 7 1 8 91 1 7 2 3 4 9 1 2 1 3 2 2 3 1 7 5 5 1 5 7 85 9 13 3 3 28 139 13 22 9 17 17 99 16 6 4 17 3 94 12 2 2 1 6 214 9 2 1 2 1 2 4 1 4 3 2 6 4 3 1 1 2 7 2 2 2 1 2 1 e 9 4 4 1 2 o 5 3 12 5 1 1 2 1 3 3 3 142 1 1 1 1 1 1 1 4 34 Total stimuli, 1983; vowel intelligibility, 61.2%. Total stimuli, 1953; vowel intelligibility, 63.8%. 9 W. R. Tiffany, J. SpeechandHearingDisorders18, 379 (1953). x0SeePotterand Steinberg's paper(reference 3) wherevowelsspokenrepeatedlyby a trainedphoneticJan are shownto have remarkably distinct and invariableformant frequencies. Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 192.87.79.51 On: Tue, 21 Oct 2014 10:32:57 618 J. M. TABLE PICKETT III.--Continued. Part C' Flat noise,S/N=- 10 and --12 db.c Responsevowel • u A a 59 1 2 3 9 5 5 21 2 1 2 149 10 4 80 7 2 1 4 1 3 2 17 8 69 2 6 90 1 2 1 9 e ai 2 8 i I 41 3 1 5 16 11 3 1 1 7 4 1 11 114 lO 11 4 1 5 198 5 15 2 1 8 151 19 12 ai e 28 7 5 19 1 6 13 1 5 6 lOO lO 6 5 2 14 204 1 au iu 1 4 in an 1 3 12 12 1 5 117 14 8 56 3 2 1 14 Part D' High-frequency noise,SIN =--33 db.d Responsevowel • u A a 1 u 11 2 1 2 35 1 6 4 38 6 3 6 a• 10 1 2 2 1 3 ai 9 4 3 5 2 e au i 1 3 8 I e ai 14 4 6 1 2 6 4 1 13 3 1 8 1 4 10 1 4 2 2O 10 6 1 12 17 27 14 14 9 7 7 28 13 10 1 1 2 2 2 15 1 1 lO 11 2 3 2 24 11 1 2 2 4 36 12 3 2 au 4 13 11 27 5 9 11 iu Part E' High-frequencynoise,S/N--- 22 db.e Responsevowel u U o o o' A a i I e au in 13 u 69 2 7 37 3 9 2 17 lO 4 e 83 64 49 124 3 5 2 ai au 1 3 71 1 4 4 2 54 6 1 2 131 2 78 1 22 iu Total stimuli, 2055; vowel intelligibility, 69.6 %. Total stimuli, 747; vowel intelligibility, 32.8 %. Total stimuli, 986; vowel intelligibility, 84.3 %. Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 192.87.79.51 On: Tue, 21 Oct 2014 10:32:57 PERCEPTION OF VOWELS IN NOISE VERY LOW FREQUENGYNOISE SECOND FORMANT • LOW 619 LOW-FREQUENCY NOISE -'HIGH foFIrst .-•Hiah Low '---,' High rmon! . --Law '-- LoveHigh I LOW LOW LOw High , HIGH FORMANT FLAT NOISE KEY HIGH-FREQUENOY NOISE f,S/Nß-33 ,HIGH-FREQUENGY NOISE f S/N--E2 Fro. 2. Simplifiedpatterns of confusionamongthe vowelsand diphthongsin PB wordsheard in noisesof variousspectra.The stimuli are the rows. The frequencyof occurrenceof each stimulusdependedon the PB-word lists used. Original data are shown in Table III. The formant key showshow the vowelswere groupedaccordingto the typical frequencyregionsof their formants. Amount of confusionis indicated in four gradesby the size of dot entered in the confusioncells. When the dots fall heavily in the shadedsquares,the first formant is maskedby the noiseand the secondis heard; when they fall on the diagonalsof the large, unshadedrectangles,the first formant is heard and the secondis masked. complicatetheir relationswith vowel perception. n The data from PB wordsalsoofferedan opportunityto see whetherthe vowelfrequencydistributionof the stimulus set had a marked effecton confusionpatterns. The stimulus-response matricesfor the vowelsof the PB-word testsare shownin Table III and the simplified confusionmatrices appear in Fig. 2. In interpreting thesematrices,it shouldbe kept in mind that the different stimulus vowels and diphthongshave different probabilitiesof occurrence.Vowelswith relatively high values of the secondformant frequency occur about 2 timesmoreoften per vowel than thosewith low values main differencesbetweenconfusions amongthe syllables where the vowelsoccurredwith equal probability, and among the vowels of the P B words. The shift in confusion patterns with progressivelylower masking frequenciesis very striking. An analysiswas made of the effectof stimulusprobability on the frequencyof occurrenceof eachvowel as an error under the different noise conditions. The question was whether the listeners assigneduncertain vowel responsesat random or accordingto their frequencyof occurrencein the stimulusset, or, in the case of equalvowelprobability, accordingto their frequency and 1.5 times more often than the three middle vowels. of occurrencein past experience.The rank-order correSince the listenerswere made quite familiar with the lation betweendifferent vowel frequencydistributions wordswhich might occur,we expectedto seea greater was usedas a meansof comparingthem. These correconcentrationof "guesses"fall amongthe vowelswith lations are shown in Table IV. The correlation between the stimulus distribution and high secondformants.Essentially,this accountsfor the the error distributionfor PB wordsis significantin flat n See Moser, Dreher, and Harbold, "Three magnitudes of inter-phonemictransitionalinfluence," AF Cambridge Research and high-frequencynoise,showingthat listenersdo not Center, Report TN 55-74, 1955. This investigationshowsthat assign uncertain responsesat random but tend to listeners isolate and, to some extent, respondto the amount of match the stimulus set. For the equally probable formant frequency shift which occurs when the articulatory apparatus movesfrom consonantconstrictionto vowel position vowelsthe vowel error distributionsin high-frequency and v.v. noisealso correlatehighly with the distributionof the Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 192.87.79.51 On: Tue, 21 Oct 2014 10:32:57 620 J. M. PICKETT TABLE IV. Rank-order correlations between frequency distributions of vowels as errors and as stimuli. Vowel Vowel errors to PB words Noise spectnlm with PB-word stimulus vowels Very low frequency 40.47 errors to equally Vowel errors to equally probable vowels probable vowels with PB-word stimulus vowels -.. about 1.4 timesmoreoften per vowelin the PB lists than thosewith low first formants,i, I, u, u, e, and o. Perhapsthisrangeof stimulus probabilities is not large enoughto havemucheffecton uncertainresponses. with vowel errors to PB words SUMMARY The perceptionof vowelsspokenin syllablesand --0.09 --'•.•}4 40.42 heard in noisesof various spectrawas analyzedby meansof confusionmatrices.The followingconclusions +0.56 • +0.82 b were reached: Low frequency +0.48 Flat +0.61 a --0.40 High frequency +0.69 a (1) Significant shiftsin vowelconfusions occurwith Significantly different from zero at the $ % level. Significantly different from zero at the 1% level. changesin noisespectrum. (2) The shiftsin vowelconfusions areconsistent with stimulus vowels in the PB words and with the distribution of vowel errors to PB words heard in the same type of noise.Three of the six listenersin thesetests had had extensiveprior test experience with PB-word lists. Apparently a listener,when guessingat equally probablevowels,is still constrained to a distributionof guesses about the secondformant which conformsto somedegreewith pasttraining.On the otherhand,with low-frequency and flat noisewherethe secondformant is better heard but the first formant is masked,there is no consistent relation between vowel distribution in a formanttheoryof vowelperception. (3) When noisemasksprimarilyone formant,the resulting confusions showthat theunmasked formantis correctlyperceived. (4) Whenonlyoneformantis perceived, differences in durationamongthe vowelsare a strongfactor in restrictingthe listener'sresponse. (5) Vowelsspokenwith highernaturalintensityare significantly moreintelligiblein low-frequency noise, but not in high-frequency and flat noise. (6) Vowel-guessing produces frequency distributions whichconformin somedegreewith the stimulussetand that in errorresponses. The vowels of vowelresponses distribution of thevowels in pasttraining. with high first formants,ee,A, a, •, e, and• occuronly thefrequency Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 192.87.79.51 On: Tue, 21 Oct 2014 10:32:57
© Copyright 2026 Paperzz