Some Further Experiments upon the Recognition of

THE
JOURNAL
OF THE
ACOUSTICAL
SOCIETY
OF AMERICA
VOLUME
26, NUMBER
4
JULY,
1954
Some Further Experiments upon the Recognition of Speech,
with
One and with
Two Ears
E. COLIN CHERRY*AND W. K. TaYLoR•
Departmentof ElectricalEngineering,Imperial Collegeof Scienceand Technology,
London,England
(ReceivedJanuary 21, 1954)
The experimentsdescribedhere continuea seriesreportedin this Journalearlier. A numberof objective
testsare described,directedtowardsthe generalproblemconcerningthe faculty we possess
of perceiving
only oneacousticworldundernormalconditions(in spiteof havingtwo ears)and yet, if our earsbe stimulated by differentsignals,artificially, we can attend to one or the other of them. The testshave been made
with continuousspeech(readingsfrom light fiction) and the resultsassessed
statistically.The first tests
aim at measuringthe reactiontime r requiredto "switch the attention" from oneear to the other, as assessed
by perceptionof the wordsof the message.
A secondset of testsshowthat we perceiveonly onespeaker,as
a "gestalt," when the earsare stimulatedby similarmessages
but with a delay betweenthem exceeding20
times that ever experiencedin real life by virtue of binaural directivity.
I. INTRODUCTION:
EARLIER
MENTAL
CORELATION
EXPEIMENTS
OF SIGNALS
EARS
ON
relation of such test conditionsto those experienced
in real life are hereregardedasof secondaryimportance.
AT THE
TWO
The reader is referredto this earlier paper for further
experimental conditions. The work described here
that we have two of themmand yet we hear only forms a continuationof these earlier tests, especially
one acousticworld; only one voiceper speaker.
of those tests in which di•erent messagesare fed to
In an earlier issueof this Journal, one of the authors the two ears (samespeaker,two tape recordings).The
has reportedsomepreliminaryobjectiveexperiments presentobject is to exploretwo problems'
(a) How to measurethe averagedelay time r in
uponthefacultywepossess
of (a) listening
and"attending to" messages
in either our R.H. (right-hand) recognizingcontinuousspeechmessagesintothe extent
or our L.H. (left-hand) earsat will, when the messages of being able to repeat the words.
theyreceive
arearranged
to bedifferent,or (b) listening (b) To observewhat factors in spoken messages
to a single integrated message,when the ears are control our ability to "attend to" one ear only (i.e.,
stimulated identically.x Roughly speaking, our ears how "close" can the two messagesbe made, and in
appear to be independentunder certain conditions, what aspects,beforea "gestalt" operates,making them
but not under others,and theseconditionsmay readily into one field of speech?).
Again, our earlier tests had illustrated the different
be controlled experimentally; in all cases,the final
act of recognitioncan only be performedupon one levels at which aural recognitioncan take place. For
example, under certain conditions a subject can
message
at a time.
In theseearlier experiments,and in thosereported recognizeonly the existenceof "human speech" as
here, the human "subject" is regardedonly as a opposedto other types of continuousnoise; in other
transducer;the tests are essentiallybehavioristic. conditions he recognizesthis speech as being, say,
Messages,
consisting
of tape recordings
of continuousEnglish or French, male or female, but nothing more.
Such might be called "statistical recognition."Under
readingsfrom literary texts, are applied through
different conditionshe can recognizewords, in conheadphones
to the subject'sears;the R,H. and L.H. nected chains, but may be unaware of the semantic
messages
are arrangedto differ in variousspecified contentof the messages
(and it is at this level that our
ways.The listeneris requiredto respond
by repeating experimentshave been directed). Finally, full recogniwhat he hears,in one ear or the other as demanded. tion involvesinterpretationand responseactions,such
Success"scores" have been based upon percentage
as the carrying out of instructionsor orders."Recogwords correct. Thus the tests are all of a statistical
nition" is then not one simple thing, but is a multicharacter. However, subjectiveaspectsconcerning layered phenomenon.
NEofthe
most
striking
facts
about
our
ears
is
what the subject"feelslike," his difficulties,or the
II.
* D. Sc.,A.M. I. E. E. Readerin Telecommunication,
Imperial
College,
Universityof London.Part of thisworkwasperformed
SWITCHING
OF A MESSAGE
TO ANOTHER
FROM
ONE
EAR
at the Research Laboratory of Electronics, Massachusetts Our earlier experimentshad suggestedthat a finite
Institute of Technology,Cambridge,Massachusetts,
and was delay elapsesfor the attention to "catch onto" one ear
supported
inpartbytheSignal
Corps,
theAirMaterielCommand,or the other, when stimulated differently--a kind of
$ B. Sc.,M. Sc.ImperialCollege,
University
of London.
Now mental time constant r. It was suggestedthat the
NuffieldFoundationScholar,AnatomyDepartment,University average
value of r might be assessed
by periodicswitchCollege,Universityof London.
ing of the messagefrom one ear to the other; then, at
• E. ColinCherry,J. Acoust.
Soc.Am.25,975(1953).
and the Officeof Naval Research,of the United States.
554
Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 192.87.79.51 On: Tue, 14 Jul 2015 12:18:33
RECOGNITION
OF
SPEECH
WITH
low rates of switching, the attention also must be
alternated between the ears, but lagging in phase. At
a certain rate, one might expect the acousticswitching
and the mental switchingto becomeout of phase, so
that no recognitiontakes place. This was found to be
the case, but with the significantcomplicationthat
the syllabicrate of speakingaffectsthe results.
In the first experimentsa continuousreading from
contemporarylight literaturewasrecordedon magnetic
tape; a singlespeakerwas used throughout,and his
words/minute were observed and recorded.• This
message
wasappliedto oneear or the otherof a subject,
through headphones,being switchedelectronicallyat
periodswhichcouldbe varied between10 seconds
and
0.!
millisecond.
Thus
each ear received
sound and
silencealterr/ately,with 50 percenton-offratio; the
switchingwas periodicand in no way dependentupon
the speechsyllabicperiods.The subjectwas instructed
to repeat the wordsas he heard them and a count was
• ii01
I
I
,.,I00.
0
80
:-
60
I
ß
I
I
•
• 40
•
ONE
AND
TWO
EARS
555
• 16
•12
j
0 60
I
I
I
I0
I00
I000
I
ß
• •o
•,,• 20-0.1
1.0
I0.000
FREQUENCY- GPS
Fro. 2. Subject; J.K. Date; March 11, 1953. Articulation score
for continuousspeech,switchedperiodicallyfrom one ear to the
other. For each ear the proportion of the period occupiedby
speechis 50 percent, the remainder being silent. Voltage across
telephones0.087-volt rms when speechis uninterrupted.
starting with a complete switching cycle of !0 sec.
At the end of 120 words, a new textual passagewas
taken, and a secondtest made at a faster switching
rate ... and so on, with different passages,until a
wide range of switching rates had been covered.
Figures!-4 showa plot of the resultsfor somedifferent
subjects;here, since the words per minute differed
slightly for each successiveexperimental point, this
rate is indicatedat the top of eachcurve.The average
rate, about !30-!40 words per minute, is a normal
rate of speakingfor a southernEnglishman.
All the curves taken under such conditions showed
2o
the sharpdipping towardsa very low word-identification score, at a switching (complete cycle) period
between 0.3 and 0.2 sec, approximately, depending
Fro. 1. Subject;J.K. Date; March 4, 1953.Articulation score uponthe subject.When a test wasset at sucha point,
for continuousspeech,switchedperiodicallyfrom one ear to the the subjectwould be observedto be making syllabic
other. For each ear the proportion of the period occupiedby "mutterings," with only an occassional
word correct.
speechis 50 percent,the remainderbeingsilent.Voltageacross
The
second
dip,
which
is
seen
to
occur
at
about
!500-cps
telephones
0.087-voltrmswhenspeechis uninterrupted.
switchingfrequencyis due to a purely physicalcause'
made of the percentagecorrect.õAll questionsas to the severe distortion of the speechsound spectra at
his methodof executingthis task, whetherhe "guesses"
or not, are here regardedas subjectiveand irrelevant
_ 150
to our presentobject; so alsowas the questionas to
o
o.I
,
I.O
IO
IOO
FREQUENCY - GPS
IOOO
, ,,i
IO•OOO
whether he understood the "sense" of the text. All
subjects
werewithinthe agegroup25-40 andof similar
educationalbackground (University). Test runs of
!20 consecutivewords were used for assessingthe
results, but the first 20 words were ignored, being
•110
I
I
I
1
/"
80
regardedonly as a meansof gettingthe subjectover
his initial difficulties and starting him on to steady
conditions of behavior.
With a given subject,a seriesof tests was made,
:1:
It was thought that prior familiarity with the text might
influencea subject'sbehavior. All subjectswere questionedon
this point, but it wasfoundto exertno noticeableeffecton his
0
•1
I I Ill
1.0
I0
I00
FREOUENCY-
I0•
I0,•
CPS
Fro. 3. Subject; E.M.T. Date; March 5, 1953. Articulation
scorefor continuousspeech,switchedperiodically from one ear
õ As a matter of interest,all subjectsthoughtit was easierto to the other.For each ear the proportionof the periodoccupiedby
storeup a few wordsor phrases
and repeatthemquickly,before speechis 50 percent, the remainderbeing silent. Voltage across
0.087-voltrms whenspeechis uninterrupted.
proceeding
to storeup the next,ratherthanspeakcontinuously. telephones
responses.
Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 192.87.79.51 On: Tue, 14 Jul 2015 12:18:33
556
E.
C.
CHERRY
AND
W.
K.
TAYLOR
listenswith bothearsto what now appearsto be speech
interrupted with a square wave. But such a test
corresponds
somewhatto Miller and Licklider's test on
140
interrupted
speech,
and it is not surprisingthat their
I
I
1201
I
I
I
results correspondcloselywith ours, but only so far
I00 .• ............................. e..
as the R.H. flanks of these curves, Figs. 1-4, are
80 _
"-. ß
ß ß
concerned.An additional set of tests was made, simu60
lating Miller and Licklider's, but using continuous
40
speech.This was applied to both ears simultaneously,
and switchedon and off periodically;again the subject
\
was instructedto repeat what he heard, while listening,
0 i I ,,:1 ,; : i,I I , ,•1 I ,•,1 : , :•1
0.1
1.0
I0
I00
I000
I0,000
and a count was made of the percentof wordscorrect.
FREQUENCY- CPS
Figure 5 showsa typical result; the R.H. flank of the
Fro. 4. Subject; A.C.L. Date; March 5, 1953. Articulation main dip correspondsclosely to those in Figs. 1-4,
scorefor continuousspeech,switchedperiodicallyfrom one ear
to the other. For each ear the proportion of the period occupied while the L.H. flank, of course,differs. At very low
by speechis 50 percent,the remainderbeingsilent.Voltageacross interruption rate (0.! cps) only half the' speechis in
telephones0.087-voltrms when speechis uninterrupted.
fact availableto the subject,and so his scorerisesto
50 percent only. A further set of tests was made,
this high interruption rate. Miller and Licklider2 have applying interruptedspeechto one ear, with backward
published some results similar to ours as shown in interrupted speechto the other ear, the two being out
Figs. 1-4. However,their testshad a differentpurpose, of phase in their switching;[[this simulatesmore
and were based upon lists of separate words (not
continuoustext) repeated at unknown intervals at
Ii1100
unspecified
syllabicspeeds;factorswhichare important
o
o 8o
to our presentpurpose.These authors' resultsshow a
o 60
seconddip similar to that seenin our Figs. 1-4, caused
by spectrumdistortionthroughthe switchingsquare- 40
wave modulation.ô
•- 20
We are primarily interestedin the first sharpdip, at
0
3-5-cps switchingrate. It is suggestedthat this dip
0,1
1.0
I0
I00
I000
i0,000
FREQUENCY- CPS
is a goodobjectivemeasureof the recognitiontime r
for an individual subject, with continuous speech.
Fro.6. Subject;
E.M.T.Date;April24,19•3.
Articulation
On the L.H. flank of this dip, the subjectis switching score for continuous speech, switched periodically to the right
his attention from oneear to another synchronouslybut ear whilst continuousbackward speechis switched periodically
to the left ear. For eachear the proportionof the period occupied
laggingin phasewith the switchingcycle.At the dip by sound is 50 percent, one ear being switched on whilst the
itself he is frustrated. On the R.H. flank however, he other is silent. Voltage acrosstelephones0.087-volt rms when
z
signalsare uninterrupted.
closely the conditions of our first tests, but again
(Fig. 6) showedresultssimilarto Fig. 5.
• II0
I
I
III.
IOO
EFFECT
OF SYLLABIC
UPON
RATE
OF SPEAKING
lIECOGNITION
•: 80
o
z 60
• •o
C- 20
o
o.I
I.O
10
ioo
Iooo
io,ooo
FREQUENCY- CPS
The facultiesof speechand hearingnormallydevelop
together within one individual. When we speak we
hear and may monitor the syllabicflow of sound.Is it
unreasonableto assumethat the rate of productionof
syllables,by any one speaker,is set not only by anatomical dynamics, but by this monitoring process
also, at least as regardshis normal maximum rate of
speaking?Perhaps,then, the syllabicrate of speaking
and the rate of recognition are interdependentto
Fro. 5. Subject; K.R.F. Date; March 7, 1953. Articulation
scorefor continuousspeech,switched periodically to both ears
simultaneously.The proportionof eachperiod occupiedby speech some extent.
is 50 percent,the remainderbeingsilent.Voltageacrosstelephones
Although not intended as a direct test of such an
0.087-volt rms when speechis uninterrupted.
•' G. Miller and j. C. R. Licklider, j. Acoust. Soc.Am. 22, 167
(1950).
ô This hasbeenconfirmedby oneof the authors(W.K.T.) by
Fourier spectracalculationswhich will not be reproducedhere.
hypothesis,the followingexperimentwas suggested
by
it; this is to repeatthe earliertestsbut usingrecordings
I[Backwardspeech
washereusedasunintelligible
noisehaving
approximatelyLthestatisticalcharacteristics
of normalspeech.
Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 192.87.79.51 On: Tue, 14 Jul 2015 12:18:33
RECOGNITION
OF
SPEECH
WITH
ONE
AND
TWO
EARS
557
made at various average syllabic rates of speaking.
With somepractice it was found that it is possibleto
read texts at average speedsof 85 and 36 wordsper
minute, in a tolerably uniform way (normal speed
130-140 words per minute). The marked effect upon
•40
o o138
WORDS/MINUTE
the principal dip in the word-scorecurve is shownby
C-- 20
ß $6 WORDS/MINUTE
two examplesof Figs. 7 and 8 (the 138-words/min
curve in Fig. 7 correspondsto the curve of Fig. 2;
(•.!
•.o
•o
•oo •ooo •o,ooo
FREQUENCY- CPS
that in Fig. 8 correspondsto the curve of Fig. 3).
At 36 words/minute the subject is able to repeat
Fro. 8. Subject; E.M.T. Articulation scores for continuous
substantiallythe whole of the text; the dip virtually speech,switchedperiodicallyfrom one ear to the other. For each
•f
vanishes.
At normal rates of speaking(here 138 words/minute
or, roughly,6 syllables/sec)the sharpdip (e.g., Fig. 7)
occurs at a complete switching cycle (both ears)
period of 3 per sec, that is, when each ear receives
about 1 syllableat a time. But at 36 words/minute
(roughly 1.6 syllables/sec),and at the sameswitching
rate, each ear receivesat least 3 or 4 distinct samples
of each syllable, on an average. Such samples are
presumablymemorizedand associated,
therebyassisting
the faculty of syllable and word recognition.Notice
howeverthat the switchingcyclefrequencyof the dips
x85
WORDS/MIN
eartheproportion
of theperiodoccupied
by speech
is 50 percent,
the remainderbeingsilent.Resultsfor threeratesof speaking
word, but rather proceedsat many levels. Thus our
method of measuringr and our interpretationof this
quantity are essentiallystatisticalin character;they
are averagesfor the whole complex speech-hearing
process.
On the assumptionof sucha simplehypothesis,that
the time interval
r has the nature of a dead-time
constantduringwhichnothingof the signalis perceived
at eitherear,because
the "attentionis beingtransferred
from oneear to the other," a simplecalculationmay
be made.Thus r is a dead-timeinterval;at any switching frequency
f, eachear receivesthe signalfor periods
1/2fat a time;hence
theremainder
ofthetime(1/2f- r)
corresponds
to the time during which the signal is
_•6o
perceivedat each ear. The fraction of each cycle of
•
/
o I,.'38
WORDS/MINUTE
switching,duringwhichperceptiontakesplaceis then
2f(1/2f--r) or 100(1-- 2fr) percent.As we seefromthe
curvesFigs. 1-4, the sharp dip occurswhen f•3
o•.•
•.0
•0
•o0
•000
•0,000
cycles/sec,
when little perceptionoccurs.Hence we
FREQUENCY- CPS
assume,from the above formula, that (1- 2fr)-- 0
Fro. 7. Subject; J.K. Articulation scoresfor continuousspeech, givingr= • sec.
switchedperiodicallyfrom one ear to the other. For each ear
The curve100(1--2f/6) is plottedas a dottedline on
the proportionof the periodoccupiedby speechis 50 percent,
the remainder being silent. Results for three rates of speaking Fig. 4; it may be seento correspond
quite closelyto
are given.
the experimentalcurve (and those others, such as
Figs. 1-3) on theL.tt. flank. Sucha theoreticalcurve,
remainsnearlyconstant(Figs. 7 and 8) as the syllabic of course,takesno accountof the syllabicstructureof
rates of speakingare varied.
speech;it may be regardedasa limitingcurve,towards
which
the otherstend asymptoticallyas the syllabic
IV. RELATION BETWEEN "RECOGNITION TIME" •,
0
/
-
SWITCHING
ß woRs,.UT
RATE BETWEEN
PERCENT
ARTICULATION
THE EARS, AND
SCORE
We have earlier made the observation that the shard
periodsgo to zero.
As we have already remarked, the R.H. flanks of
such curves correspondto a different mental process
on behalf of the subjects.Here, the switchingrates
dip in the percent-articulation
scorecurvecorrespondshave become so fast that the attention cannot "transfer
to an average"word recognitiondelaytime," calledr, betweenthe ears"in synchronism;
the subjectrather
wherethis idea is given a rather naive interpretation. listensto one ear, but the syllabic rates are such that
It is beingassumedthat the time r we are measuring he is able to obtain a number of samplesof each
(about-} second,
asjudgedfrom the dip in the curves)
syllable, thereby ensuring an increasedperception
has the generalcharacterof a dead-timeconstant,
(and articulation) score. The dead-time r is then
as thoughit takesa listenera time r to "switchhis
assumed
to be ineffectiveover the R.H. flanks, and a
attention" from one ear to another. Of course,speech
different
hypothesis is required to account for this
itself does not consist of a chain of statistically inpart
of
the
perception(articulation) scorecurve. It
dependentsyllables,or words;redundancyarisesat
will be assumed that this score increases as the number
manylevels(syllableandwordtransitionprobabilities,
phrasestructure,syntax,etc.) Recognition
too, cannot of samples of each syllable, reaching the one ear,
be said to proceedsyllableby syllable,or word by increases.On an average,let d be the syllabicduration.
Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 192.87.79.51 On: Tue, 14 Jul 2015 12:18:33
558
E.
C.
CHERRY
AND
• 60
K.
TAYLOR
presentto the "rejected" ear, during each switching
half-cycle,not completesilencebut a low-levelsignal
havinga fractionk of the powerof that reachingthe
opposite ear. It might be of interest to see how r
changeswith k; that is, to seehow our reaction times
in switchingattentionfromoneear to the otherdepend
upon the extent to which the attention is "primed"
(3
TIME
00NSTANT--O.025
mSEC
e:: 20 -
o SWITCHIN(3 TIME CONSTANT=5.0
roSE(;
_
I
•.1
W.
I ,•1
1.0
I
I III
I
I0
I III
I
I00
I III
I000
•
I III
I0,000
in advance.
FREQUENCY- GPS
Fro. 9. Subject; E.M.T. Date; March 12, 1953. Articulation
scorefor continuousspeech,switched periodically from one ear
to the other. Voltage acrosstelephones0.087-volt rms when
speechis uninterrupted. Average rate of speakingis 146 words
per minute for both curves.
VI.
ON THE
FACULTY
OR
OF LISTENING
WITH
WITH
ONE EAR
BOTH
As stated earlier, our experimentsdescribedhere
formpart of a series,somehavingbeenreportedearlier,•
all directedtowardsthat generalproblem:how is it
Then at switchingrate f, this number of samplesisfd. we are able to listen with one ear (when the two are
From previous considerations,we have seen that the stimulatedwith different signals)or with both (when
perceptionscoreapproacheszero percent whenfd--•-l. stimulated similarly), at will? How closedo the two
In this case, the simplest function representingthe stimuli need to be beforewe correlatethem mentally
perceptionscorewill be 100(1-1/fd). This has also into a "gestalt"?In our earlierpaper,someexperiments
been plotted in Fig. 4, as a dotted line (taking were describedin which two unrelatedmessages
were
1/d=5.98), and resemblesthe R.H. flank of the fed to the two earsof a subject,who was requiredto
experimentalcurve,in an approximatemanner,but of
coursetakes no accountof the subsidiarydip due to
the speechspectrum interferenceby the high-speed
square-wavemodulationeffectof switching.
V. EFFECT
OF LESS-ABRUPT
REVERSALS
BETWEEN
THE EARS
It was thought that the abrupt switchingof speech
signalsfrom one ear to anothermight produceserious
noise, by virtue of the square-wavemodulation it
producesupon the signal reachingeach ear. Simple
Fourier calculationsshowedhowever,that the energy
spectra of the speechsignalsare not appreciably
affectedat switchingrates up to and well beyondthe
critical region of 1-10 cycles/sec.A further experimental checkwas devised.This consistedof fading out
the signalat one ear while fadingin that at the other
ear; exponentialmodulationenvelopeswereproduced,
suchthat the totalspeechenergy (reachingboth ears)
remained constant. Now there is always somesignal
reachingboth ears, but the ratio of the two signals
---0•
I00
t 0 -0-f
_
20
1e.
-o.4
-o.3
-o.2
-o.•
o
o.i
o.•,
c•3
o.,•
TIME OISPLAGEMENTOF RIGHTEAR MESSAGE,RELATIVE
TO LEFT EAR ME$$AGE,-otSEO
Fro. 10(a). Binauralperceptionof speech
direction(right or left).
repeat one, concurrentlywith hearingit; then we have
shownthat he is unawareof any singleword of the
messagereachingthe "rejected" ear--only statistical
properties of it are perceived. Secondly, the two
swingsup anddownexponentially,
ratherthanabruptly messages
were made the same,but with a long time
as before. The results with one subject, using two delay between them; this time delay was steadily
different exponential time constants, are shown in reduceduntil the subjectwas aware of the similarities
Fig. 9; comparingthis curvewith that of Fig. 3 (same of the messages,at a delay between 6--•2 seconds,
subject)showsa similarity,thoughthe dip appearsto roughly.It then appearsthat there is a memoryof this
haverisenslightlyalongthe switching-frequency
axisJ duration which is operative under such conditions;
(this frequencyfor this particular subjectwas rather that is, a word or phrase is memorizedat one ear and
higherthan average).The exponentialtime constants identified later at the other.
Continuing such tests, these earlier results were
of this switchingwere madevery short (<5 msec)so
that only the initial part of eachhalf-cyclewasaffected; confirmedby further trials and, subsequently,the
then the "unstimulated" ear, during each half-cycle behavior of a subject was observedwhile the delay
of switching, received energy very far below the betweenthe identical messages
reachinghis two ears
thresholdof hearing, exceptperhapsduring the first was steadilyreducedto zero. We wish to report one
few milliseconds.The slight increasein r may not be result of particular interest.
very significant,but it suggests
a further experiment, Continuouslyspoken, tape-recordedreadingsfrom
which has not yet been carriedout. This would be to light fiction were appliedto the two earsof a subject.
Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 192.87.79.51 On: Tue, 14 Jul 2015 12:18:33
RECOGNITION
OF
SPEECH
The only differencebetweenthe two signalswas a
time delay T, which couldbe varied from zero up to
50 msec.Naturally, suchconditionssetup some,though
not all, of the subjectiveeffectsof binauraldirectivity.a
The subjectwasinstructedto saywhetherthe "speaker"
appearedto be to his right- or left-hand side (i.e., a
logicaldecisionRH/LH). The subjectiveeffectof the
speaker'sappearingto move to the right-or left-hand
of the listeningsubject can be producedeither by
changingthe delay betweenthe aural signals,or by
changingtheir relative amplitudes.Over a wide range
the effectsare indistinguishable.But in our tests the
aural signalswere set to have equal powersand only
the delaywasvaried. The delay T waschangedsuccessively to random values and the results recordedas a
WITH
ONE
1400
-
1200
-
AND
TWO
EARS
559
-
•ooc -
800_
600-
400_
200-
o'• • f,,•:,
-OJ
I.O
, ,,-'• , •,il
I0
I00
FREQUENCY-
, • t,I ....
I000
•
I0,000
CPS
Fro. 11. Impedanceof onetelephoneearpiece
percent of correct guesseswith delays as high as
15 msec.
probability distribution. Figure 10(a) shows some
The surprising thing about such distributions is
typical results. Figure 10(b) showsthe same curve their widths. In normal,everyday binaural hearing, the
plotted on a much-compressedscale, showing the greatest delay we can experiencebetween our ears
correspondsto occasionswhen the sound source is
SIGNALS PERCEIVED
to our extreme right or left hand. In such casesthe
.
,/- ASONE
ACOUSTIC
delayis no greaterthan «-1 msec;how then have we
,oo
I
/ FIELD
(SAME
CURVE
learned to correlate the stimuli of our two ears, when
the delay is as great as 15 msecso as to perceivea
"gestalt"?If the delay T was increasedbeyond this
there would comea value suchthat the subjectwould
be unable to give a logical decisionR.H. or L.H.; he
would protest that he heard not one personspeaking,
but two. The delay is quite critical, with any one
subject;at onemomenthe hearsa speakerapparently
standing to one side of him, the next moment the
soundimage "tears apart" into two.** Figure 10(b)
: t••'•
• •'
F'G"ø(ø)
' ,/
SlGNALSPERCEIVED-x
illustrates
O0
5
I0
•5
80
25
30
TIMEDELAYOF RIGHTEARMESSAGE
RELATIVETOLEFT
Fig. 11 showsa plot of their impedance/frequency
EAR MESSAGE-mSEC
characteristics.The acousticphasingof the headphone
pair was found to make no observabledifferenceto
any of the results.
Fxo. 10(b). Binaural perceptionof one or of two acousticfields.
Positive delays are shown here; curves for negative delays are
quite similar.
a S.S.
this barrier.
On all the various diagramsan indication is given
of the rms signal voltage across the headphones;
Stevens and H. Davis, HearingsIts Psychologyand
Physiology
(JohnWiley and Sons,Inc., New York, 1938).
** Stevensand Davis report similar measurements,by themselvesand others, but basedon continuoustones or on "clicks";
such elementalsignalsdo not apparently show this remarkable
binaural coherencewhich is the casefor speech.
Redistribution subject to ASA license or copyright; see http://acousticalsociety.org/content/terms. Download to IP: 192.87.79.51 On: Tue, 14 Jul 2015 12:18:33