Hearing Musical Streams

Hearing Musical Streams
Author(s): Stephen McAdams and Albert Bregman
Source: Computer Music Journal, Vol. 3, No. 4 (Dec., 1979), pp. 26-43+60
Published by: The MIT Press
Stable URL: http://www.jstor.org/stable/4617866 .
Accessed: 27/09/2011 13:53
Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at .
http://www.jstor.org/page/info/about/policies/terms.jsp
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of
content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms
of scholarship. For more information about JSTOR, please contact [email protected].
The MIT Press is collaborating with JSTOR to digitize, preserve and extend access to Computer Music
Journal.
http://www.jstor.org
Hearing
Musical
Streams
AlbertBregman
Departmentof Psychology
McGillUniversity
Montreal,Quebec,Canada
Stephen McAdams
Hearingand Speech Sciences
Stanford University School of Medicine
Stanford, California 94305
Introduction
Theperceptual
effectsof a soundaredependentupon
the musicalcontextin whichthatsoundis imbedded.Thatis,
a givensound'sperceivedpitch,timbre,andloudnessareinfluencedby the soundsthat precedeit, coincidewith it,
andevenfollowit in time.Thus,thiscontextinfluencesthe
waya listenerwillassociatethe soundwithvariousmelodic,
rhythmic,dynamic,harmonic,and timbralstructureswithin
the musicalsequence.It thus behoovesthe composerand
to understand
the variousperceptualorganizing
interpreter
of musicalcontextfrom
principlesthataffectthe derivation
of
acoustic
We
events.
include
the interpreter
here
sequences
becauseseveralmusicaldimensions,suchas timbre,attack
and decay transients,and tempo, are often not specified
exactlyby the composerandarecontrolledby the performer.
In this articlewe shalldiscussprinciplesthat describe
affectthe perceivedcontinuihowvariousmusicaldimensions
ty of music.LeonvanNoordenhasstatedthat "insequences
wherethe tones follow one anotherin quicksuccession,
effectsareobservedwhichindicatethat the tones arenot
by the perceptionsystem.Onthe one
processedindividually
handwe find varioustypesof mutualinteractionbetween
successivetones, such as forwardand backwardmasking,
anddurationinteractions.
loudnessinteractions
Onthe other
hand,a kindof connectionis foundbetweenthe successive
sonicevents,
perceivedtones."[28, p.1 ]. As for simultaneous
has
that
different
areexsounds
9]
Bregman[5,
suggested
tractedaccordingto variousperceptualand cognitiveorganizationalmechanisms
fromthe superimposed
acousticvibrations.Whilesomeresearchers
wouldlike to attributethese
or earlycentral
to mechanisms
in the peripheral
phenomena
nervoussystem(whicharesurelyinvolvedto someextent,
see [24, 33]), prominentmembersof thismorepsychophysicallyoriented"school"arebeginningto thinkthe organizationis too complexto be so easilyexplained.However,rather
thandelveinto theoreticalexplanations
of thesephenomena,
it will sufficefor the presentpurposeto describethemin
that
generalandto brieflyquantifysomesalientparameters
andAlbertBregman
? 1979 by StephenMcAdams
Page 26
haveintrigueduswithcompositional
possibilities.Thisarticle
thuspresents,in a tutorialfashion,a reviewof research(includingour own) whichhas directimplicationsfor musicians,especiallyfor composers
workingwithcomputermusic.
andin mostotherresearchcited,comInall of ourresearch
wereusedto
PDP-11minicomputers)
puters(predominantly
usedfor
also
were
sounds
the
synthesize
presented.They
of soundstimuli,collectionof responsesfromthe
presentation
listeners,andanalysisof the data.For thoroughsummaries
of thisareaof research,see [2, 5,
andtheoreticaltreatments
9.] .
WhatIs An AuditoryStream?
Auditorystreamformationtheoryis concernedwith
how the auditorysystemdetermineswhethera sequenceof
acousticeventsresultsfromone, or morethanone, "source."
A physical"source"maybe consideredas somesequenceof
acousticeventsemanatingfromone location.A "stream"
such
thatmentallyrepresents
is a psychological
organization
a sequenceanddisplaysa certaininternalconsistency,or continuity,that allowsthat sequenceto be interpretedas a
"whole."By way of example,two possibleperceptualorin
of a repeatingsix-tonesequenceareillustrated
ganizations
axis
and
the
horizontal
is
on
1.
Time
represented
Figure
on the verticalaxis.Thedottedlines
frequencyis represented
connectingthe tonesin the figureindicatethe streampercepts.
six tones areheardone afterthe
In the firstconfiguration,
otherin a continuouslyrepeatingcycle (TapedIllustration
la)l; it is easy to followthe entiremelodicpattern.In the
secondpercept,though,onemightheartwo separatethreetone patternswhichappearto havelittle relationshipto
eachother(TapedIllustration1b).It is difficultin thiscase
to followthe originalsix-tonepattern.Notethatin the first
exampleone streamis heard,andin the second,two areheard.
(1) A tapecontainingsoundexamplesis availablefrom
Mr.McAdams.
Descriptionsof the tapedillustrations
arefoundinAppendix1.
Computer Music Journal, Box E, Menlo Park, CA 94025
Volume 3 Number 4
D
D
D
B'
/B
B"B
I
"
II
O
F
-
.
I
II
I
I
%
F
F
Cr
-
ST
%
(a)
%
(b)
Time-
Time -,
Two Streams
One Stream
Figure 1. A repeating6-tone sequencecomposed of interspersedhigh and low tones can result in different percepts. In Figure la,
with high and low tones alternatingat a tempo of 5 tones/sec., one perceptualstream is heard. In Figure1b, at a tempo of
10 tones/sec., the high tones perceptuallysegregatefrom the low tones to form two streams(cf. Taped Illustration1).
D
4
F
t
t
o
0
D
L.
C
A
C
C"=I
U-U
A
,/
u
F
/
Time -+
Time -+
I:JJ
Ji
HighStream
I:JJJ:
Low Stream
J
I
HighStream
l
JU
:I
Low Stream
Figure2. Due to the competition amongstreamorganizations,tone F may be perceivedas belongingto either the higherstream
or the lower stream but not to both. The organizationof the streamschanges, among other things, the perceived rhythmic
structure,as indicatedundereach diagram(cf. Taped Illustration2).
StephenMcAdamsand AlbertBregman:HearingMusicalStreams
Page27
In the rest of this section we will discusssome of the
propertiesexhibited by a stream.For example, it is possible
to focus one's attention on a stream and follow it through
time [6, 28]. In the first taped example one can follow the
six-tone pattern without any trouble. Thus a stream must
exhibit a certaincoherenceover time. However,in the second
example, it is difficult to follow the six-tone pattern,but it is
easy to follow either three-tone pattern. Notice that one can
pay attention to either the higheror lower stream,switching
between them at will, but that it is not possible to attend to
both simultaneously(Taped Illustration ib). Indeed, each
three-tone patternin Figure lb constitutes a separatestream
and maintainsits own temporalcoherence.Whilea listeneris
paying attention to one coherent stream, other acoustic information is perceptually relegated to the background.If
one group of sounds is distinct enough, the foreground-backgroundrelationmay be almost involuntaryand it may require
a greatdeal of attentionaleffort to focus on streamsinitially
relegatedto the background.
The information-processingnatureof the streamsegregation processis suggestedby the observationthat the segregation of a sequenceinto smallerstreamstakes time to occur
[4, 13]. In Figure lb, notice that one can hear a six-tone pattern for the first few cycles before it segregatesinto two
separatestreams(Taped Illustrationlb). It thus appearsthat
the perceptualsystem assumesthings are coming from one
source until it acquiresenough information to suggest an
alternateinterpretation.
When temporalcoherence is lost in a sequence, it becomes more difficult to orderthe events of that sequencein
time [6, 11, 14, 28] . In Figure la, it would be easy to tell the
orderof tones A, B, and C. But as this largerstreambreaks
down into the smallerstreamsof Figureib, i.e., as temporal
coherenceis lost, it becomes more difficult to judge the order
of these tones. In such a case one might notice that tone A
comes before tone C but it would be hard to tell whether
tone B came before A, between A and C, or after C. This
statementshould be qualifiedby noting that in the tone sequencesmentioned all of the tones are of equal loudnessand
timbre. The tone sequences are furthermorecontinually
recyclingand are faded in so that the listenercannot label tone
A as being the first tone in the sequence. Informationin a
musicalcontext, such as the first beat being emphasizedas a
downbeat,may give the listeneran anchorpoint againstwhich
to relatethe temporalpositions of other tones. Thus one can
judge the order of events in time within a given perceptual
streambut not necessarilyacrossstreams.Accompanyingthis
is the observationthat different streamscan appearto overlap
in time even when they do not. Again in Figure lb, notice
that one can hear two three-tone patterns apparentlygoing
on at the same time even though the tones are alternating
(Taped Illustrationib).
If an event is potentially a memberof more than one
"competing"stream,one may perceiveit as belongingto one
streamor anotherbut not to both simultaneously[3, 10, 11] .
This is not to say that a musiciancannot hear severalsimultaneous lines. The point is that it is impossibleto use several
parsing schemes at the same time. Figure 2 illustrates the
effect on our second example of moving the higher triplet
into closer frequencyproximity to the lower triplet. We can
find an intermediateposition where the lowest tone, tone F,
Page 28
can group with either the higher or lower stream (Taped
Illustration2). Notice that this regroupingresultsin a rhythmic transformationas shown under each configuration.
Some non-musicalexamplesof this phenomenonare the facevase illusion and the reversibleNecker cubes; Escherand
Vasarelyhave produced art works based on such principles
of perceptualorganization.
Finally, this bringsup the relationshipbetween "source"
and "stream."A streamis perceivedas emanatingfrom a single
source. So, in the first example, the pattern was fairly continuous and was easily recognizedas coming from one source;
but in Figure lb, the large frequency distance between the
two three-note groups introduced a sort of discontinuity
that caused the perceptualsystem to interpretthe sequence
as resultingfrom two sources. Since at any givenmoment the
composite pressurevariationsstimulatingthe ear result from
severalsources,the auditorysystem needs a battery of heuristics to parse, or segregate,the information into separate
streams.It thus needs to build a descriptionof the acoustic
environmentfrom separatedescriptionsof the variousstreams
and the relationshipsbetween them [2]. Factorswhich the
perceptualsystem uses to build descriptionsof streams,and
subsequentlysources, are frequency, rate of occurrenceof
events (or tempo), intensity, timbre, and attack/decay transients. In the rest of the paperthese will be discussedin some
detail. Of course it is obvious that sounds are assignedperceptually to different sources when the physical sources are at
different spatial positions. In this case, intensity, spectral,
and temporal cues are all utilized to parse the sound into
separate sources. However, we will primarilyconfine our
discussion to the illusion of many sources which occurs
due to the organizationswithin a single emanationof sound.
Frequencyand Tempo
Considerthat a repetitivecycle of tones spreadover a
certain frequency rangemay be temporally coherent, or integrated,at a particulartempo. It is possible to graduallyincrease the tempo until certain tones group together into
separatestreamson the basis of frequency,as discussedabove.
The faster the tempo, the greaterthe degreeof breakdownor
decomposition into narrowerstreams until ultimately every
given frequency might be beating along in its own stream.
This last possibility is dependent upon a number of other
factorswhich will be discussedlater.
Figure 3 illustratesthe possible stages of perceptual
decompositionfor a recyclingsix-tone patternas one gradually increases the tempo (Taped Illustration 3). Note that
streamsper se are not trackedbeyond a certainpoint, but a
texture or timbreis perceivedsince the ability to temporally
resolvethe individualtones degeneratesaltogether.Of course,
one could hold the tempo constant and graduallyexpand the
frequency relationshipsto achieve a similarstreamingeffect
[3, 6, 14, 17], but the musicalconsequenceswould be vastly
different as one can hear in TapedIllustration4. Dowling [17]
used simple melodies to illustrate this frequency-based
streamingprinciple.He interleavedtwo melodies in the same
frequencyrangethereby makingit very difficult, without prior
knowledge of the melodies, to separatethem perceptually.
But as they were pulled apartin frequency,i.e. when all the
tones of one of the melodies were transposedupward,each
melody becameapparent(Taped Illustration5).
Computer Music Journal, Box E, Menlo Park, CA 94025
Volume 3 Number 4
f
0
-
I ?
r
I
I
i
I
!
II
CL
.2-.
ccL
0)0
r-
I
•
,
,
=.
A
A
I
0
04
cm
%
Q/)
A&B
Absorbed
I
I
0.
" .
A&B
Isolated
r
IE
S
je.-"-x,0.1
Y
S
O"
c>
B
B
'O
a)
0
oo
(D
O
CD
E
------- CC
u,
L--
X
-------QI
-----,
--o,)=----------
x
Time
Time-
----------------------?
-I
I>
Time
Timbre/Texture
Figure 3. This figure illustrates the decomposition of an
acoustic sequence into smallerand smallerperceptualstreams
as the frequencyseparationbetween the tones or the tempo of
the sequence increases.In the latter case, a point is ultimately
reachedwhere one can no longer perceive individualtonal
events;a texture or timbre is heardinstead(cf. Taped Illustrations 3 and 4).
Thefrequencyrangewithinwhichthe perceptual
system
groupstones on the basisof frequencyproximityis not
constant;the groupingcan varywith the particular
pattern
of frequenciespresented.For example(see Figure4), two
tones, A and B, are arrangedwith givenfrequencyand
temporalseparationssuch that they will alwaysstream
togetherwhenno othertones arepresent.Wecancreatea
similarorganization
withtonesX andY in anotherfrequency
range.Thesetwo groupswill each forma streamas can be
heardin TapedIllustration
6a. By bringingthe pairsinto the
samefrequencyrange,new streamscan be formed,suchas
A-X and B-Y, on the basisof an alternativeproximity
organization
6b). Thus,the particular
[3] (TapedIllustration
betweenfrequencies
in a tonalpattern,andnot
relationships
just the frequencyseparationbetweenadjacenttones, plays
a vitalrolein the formationof streams.
It appearsfromourexamplesthatthereis anessentially
inverseand strictly interdependentrelationshipbetween
amongindividualtones
tempoand frequencyrelationships
the
faster
the
tones
follow
one another,the smaller
28]:
[26,
the frequencyseparation
at whichthey segregateinto separate
perceptualstreams.Conversely,the greaterthe frequency
the slowerthe tempoat whichsegregation
occurs.
separation,
Thisillustratesyet anotheraspectof musicwhich,with the
aidof the computer,cancomeunderthe composer'scontrol.
Supposewe makea graph,as in Figure5, whichrelates
of two alternating
toneson one axisto
frequencyseparation
StephenMcAdamsand AlbertBregman:HearingMusicalStreams
Figure4. This figure illustratesthe effect of frequencycontext
as opposed to frequencyseparationon streamformation. In
the first part,tones A and B form one perceptualstreamwhich
is unaffected by the simultaneousstreamcomposed of tones X
and Y. Whentones X and Y are moved into the proximity of
tones A and B, which remain unchanged, an alternate
perceptualinterpretationresultswhereby tones A and B are
assignedto separatestreamson the basisof a new frequency
context (cf. Taped Illustration6).
of the toneson the otheraxis.Note
the rateof alternation
that the horizontalaxisindicatesincreasingtone repetition
to decreasing
tempo.Onecandraw
time,whichcorresponds
boundarieson this graphindicatingthe frequency-tempo
regionsin whichthe tonescohereas a singlestreamandthose
streamsof
in whichthey segregateinto two simultaneous
differentfrequencies.Thereare two suchboundaries.Leon
VanNoorden[28] hastermedthesethefissionboundaryand
temporalcoherenceboundaryandnotedthatthey areslightly
differentfor eachperson.In Figure5 the uppercurveis the
temporalcoherenceboundary.Abovethis boundaryit is
tones into one
impossibleto integratethe two alternating
stream.Belowthis boundarylies the temporalcoherence
region,whereit is possibleto integrateeventsinto a single
perceptualstream.The lower boundaryis the fission
boundary.Belowthisit is impossibleto hearmorethanone
stream.Note that the regionsof fissionand coherencealso
overlap,creatingan ambiguousregionwhereeitherpercept
maybe heard.
As the tempo decreasestemporalcoherencecan be
maintainedat greaterfrequencyseparations.
However,the
frequencyseparationrequiredfor fission remainsfairly
constantwithdecreasing
tempo.Thetemporalcoherenceand
fission boundariesabove and below a given tone are
with respectto pitch.Thisindicatesthat the
symmetrical
phenomenaof temporalcoherenceand fissionmay occur
dependingon the absolutemusicalintervalbetweenthe tones
of the directionof pitchchange.Whilethere
but irrespective
are substantialquantitative
differencesin theseboundaries
Page29
FrequencyTrajectories
20
o
Another frequency-based effect involves frequency
trajectories.These are important on two levels. The first
involves trajectoriesbetween tones (see Figure 6). Bregman
and Dannenbring[7] have found that tones that are connected by glissandiare much less likely to segregateunder
given conditions of tempo and frequency separationthan
those which make abruptfrequency transitions.Intermediate
situations beget intermediateresults. Yet even when using a
sine wave for frequencymodulation of a sine tone, it is possible to discerna sort of streamingeffect of the higherand
3
15
10
Al
Alwaysays
Segregated,
/
O
c/
Tempo (Tones/Sec)
5
10
5
I
_I
a
Tone Repetition TiAm biguous
=
I
D
IRegion
I
I
I
Always
I
Fission BoundaryCoen
Ramped Semi- I
I
0
50
100
150
200
250
300
350
I
400
r-
Tone Repetition Time (msec)
I
a.,
Sta
a
Figure 5. Boundaries of temporal coherence (upper curve) and
fission (lower curve) define three perceptual regions in the
stream relationship between two alternating tones each lasting
40 msec. This relationship is a function of both the tempo of
alternation and the frequency separation between the tones
(after [28] ).
I
Discrete
between listeners, the qualitativetrends tend to be similar
[28]. Note that the difference between boundaries is
increasinglysubstantialfor tempi below about 10 tones/sec.
(tone repetition time = 100 msec.). This means that at very
slow tempi of, say, five tones/sec., a separationof more than
a minor 10th is necessaryto induce streaming.Below this it
becomes virtually impossible to induce streaming. In the
limiting case, if the temporal distance between tones is too
great, they do not seem to be connected at all but sound
as isolated events.
A musicallyrelevantaspect of these boundariesshould
be mentioned. The regionbetween the two boundariesmay be
consideredto be an ambiguousregionsince either a segregated
or an integrated percept may be heard. The primary
determiningfactor in this regionis attention. In other words,
it is possible to shift one's attention back and forth between
the two perceptswhen the sequence falls in this region. For
example, one may focus either on a whole streampercept or
on smallerindividualstreams. It appearsthat the closer the
sequencelies to one of the boundaries,the easierit is to focus
on the percept which is predominantbeyond that boundary.
Conversely,it is very difficult in this situation to shift one's
attention back to the other percept. Once the physicalvalues
go beyond either of these boundaries,attaining the complementarypercept may be consideredto be impossible.The role
of attention will be discussedin more detail later.
Page 30
1
a
Tas
rapeady-
SteadyState
__
_
__
_
aaTime-+
Transition
SteadyState
Figure 6. These are the 3 types of frequency transitions
between tones used by Bregman and Dannenbring (1973). In
the top section the tones are completely connected by a
frequency glissando. In the middle section an interrupted
glissando is directed towards the succeeding tone. No
frequency glide occurs in the bottom section; the first tone
ends on one frequency and the next tone begins on another
(after [71 ). The unconnected tones segregate more readily
than the others.
lower peaks of the modulation at certainmodulation frequencies (Taped Illustration7). At lower modulating frequencies
one can track the modulation, but higher modulatingfrequenciesresultin the effect of a texture or timbre. This continuum is found for modulation involving discrete changes
in frequencyas well.
The second type of trajectorymight be called a melodic
trajectory. The basic rule goes: large jumps and sudden
changes in direction of a melody produce discontinuity in
that melody. In terms of streamformation,one or two tones
in a melody that are removedfrom the melodic continuity of
the rest could be perceivedas coming from a differentsource
Computer Music Journal, Box E, Menlo Park, CA 94025
Volume 3 Number 4
and would not be integratedinto the melody as a whole,
perhapsleavinga rhythmic gap in the phrasedependingon
the natureof the main sequence. It has been reported,however, that the excluded tones are sometimes noticed in the
background,with their absencehavinglittle effect on the main
melody line [23]. Implied polyphony in a solo compound
melody line, as in the suites for solo instrumentsby Bach, is a
compositionaluse of this principle.
Schouten [30] reported that if an ascendingand descendingmajorscale fragmentplayed with sine tones is continually repeated, temporal coherence is maintainedup to a
tempo of about 20 tones/sec. However,if these same tones
are arrangedat random,the maximumtempo at which coherence still occursis reducedto about 5-10 tones/sec. It might
be inferredthat the reduction of predictabilityreduced the
pitch boundary within which the auditory system could
successfully integrate the incoming information. But van
Noorden [28] found that previousknowledge of the order
of tones had no effect on the coherenceboundary.It might
be that the small frequencyjumps in Schouten's demonstration are effective in holding the streamtogether.
Heise and Miller [23] investigatedfour melodic contours: a V-shaped contour, an invertedV, and risingand falling scale patterns;the V-shaped patternswhich change direction can be thought of as being less "predictable"than
the scale patternswhich move in only one direction. Each
patternwas eleven tones long and the frequencyof the middle
tone was variable.They found that the degreeto which the
variabletone could be separatedin frequency from the rest
of the patternbefore it segregatedinto its own streamwas a
function both of the shapeand of the steepnessof the pattern.
The steepnesswas variedby keepingthe tempo constant and
varyingthe intervalbetween successivetones. As the rate of
frequency change for the entire pattern increasesover time,
so does the amount of frequency separationof the middle
tone requiredto producesegregation.Less separationof the
middle tone is required to produce segregationwith the
V-shaped patterns than with the scale patterns, possibly
indicatingthat the perceptualsystem can follow "predictable" patterns more quickly than "unpredictable"ones.
(However,certainanomaliesin their data, which will not be
discussedhere, might suggestother interpretations.)
Van Noorden found similarresultsfor patternswith as
few as three tones [28] .He investigatedthe relativetemporal
coherence of so-called linear and angularthree-tone sequences. For linear sequences, i.e., those with two tone
intervalsin the samedirection,the temporalcoherenceboundary occurs at faster tempi than were found with angularsequences in which the melodic pattern changed direction at
the middle tone. In this latter case the first and third tones
are more contiguous in frequency than are the first and
second, or second and third tones, which facilitatesa loss of
temporal coherence similarto that found by Schouten. It
should be noted that while these melodic trajectoryeffects
do seem to play a role in musicalstreamformationbeyond
the simple frequencyseparationeffect, they are certainlyconfounded by other contextual organizations.
Both of these trajectoryexamplesillustratea principle
of perceptual organizationthat uses pattern continuity,
in some form or another,as a criterionfor "source"distinction. Figure7 illustrates,however, that frequencyproximity
may sometimescompete with trajectoryorganization;Deutsch
found that simultaneousascendingand descendingscale patStephenMcAdamsand AlbertBregman:HearingMusicalStreams
terns presentedto opposite ears segregateinto upright and
inverted V-shaped melodic contours [16]. Each contour
is heardas if being presentedto one ear. The same resultwas
found by Halpern [22] for simultaneousascendingand descendingsine tone glissandi,as can be heardin TapedIllustration 8; in this case both glissandiwere presentedto both ears.
In these two examples a stream boundaryis establishedat
the pitch where the two lines cross.
a
a
High
Contour
\
I
I
1
t
--I
L
/S
Low
a
Low
.Contour
'
_
Time-+
Figure 7. This stimulus, used by Deutsch (1975), is composed of ascending and descendingscales presentedsimultaneously to opposite ears. Two V-shaped patterns (high
and low contours outlined with dashes)are perceivedrather
than the complete ascendingand descendingscale patterns.
LoudnessandContinuityEffects
Fissioncan be obtainedby alternating
sequencesof
tonessimilarin frequencyandtimbrebut differingin intensity. Figure8 illustratesthe rangeof perceptsachievedas the
amplitudelevelof tone A is variedrelativeto thatof tone B
in the alternating
sequenceABAB... . The referencelevel
fortone B usedby vanNoordenin theseexperiments[28]
was35 db SPL;the frequencyof bothtoneswas1 KHzand
each tone lasted40 msec.If the level of tone A is below
6 db SPLat 1 KHz),
the auditorythreshold(approximately
only the B streamis heardat halftempo,asmightbe expected
(seeFigure8a). Whentone A is loudenoughto be heardand
is at least 5 db belowtone B, two separatestreamsof different loudnesscan be perceived,each at half tempo, with A
being the softer stream(see Figure8b). WhenA is within 5
db of B, a "pulsing"streamis heard and neither the A nor
the B streamcan be heardindependently,i.e., temporal
coherenceis inevitablein this range(see Figure8c). As the
levelof the A toneis increasedabovethatof the B tone,three
differentperceptsmay result,dependingon the alternation
tempoof A andB. If thistempois lessthanabout13 tones/
sec., fissionis the next perceptheard.ThistimeB is the softer
stream(see Figure8d). Thusa certaindegreeof loudness
differenceallowsus to focus on eitherstreamusingonly
thisinformation.If the tempois greaterthanabout12.5-13
tones/sec.,the perceptencounteredis the "roll"effect
discoveredby vanNoorden.It soundsas if streamA, the
louderstream,werepulsingat half tempoas in the fission
percept,but the B streamsoundsasif it werepulsingat full
Page31
A Below Threshold
a -K
B
A
(
B
time
Fission
b
fA
B
A
I
B
A
B
A
B
A
B
A
B
A
B
Coherence
c
A
Fission
d
Roll
e
Continuity
tempo (see Figure8e). Thus the A streammay be heardindependently,but not the B stream. In other words,it is as if
the A tones consistedof two parts:one that combineswith
the B stream to give a full tempo roll, and another, at half
tempo, which can be perceivedseparately.At a tempo of
about 13 tones/sec. another effect emergeswhen the level
of A is about 18-30 db above that of B. This is the continuity
effect shown in Figure 8f, so named because tone B is not
heardas pulsingbut ratheras a fairly soft, continuous tone
under the louder, pulsing A stream;this is an example of a
class of effects that will be discussed below. Finally, if the
level of the A streamis incrementedstill further,this stream
completely masksthe B stream(Figure 8g). Again, this set of
loudness-based phenomena exhibits an ambiguous region
between the coherence and fission boundarieswhere one
might pay attention to either the A or B streamsindividually.
or to the AB streamas a whole. There are thus three perceptual regions for alternatingtones at the same frequency
where the tempo is above about 12.5 tone/sec.: the roll region,
the continuity region, and the temporalcoherenceregion.
Van Noorden made quantitativemeasurementsof the
fission boundary (see Figure 9). For tempi of about 2.5
to 10 tones/sec., the fission boundary is more or less horizonal, i.e., the intensity difference (AL) necessary for a
segregatedpercept does not change with tempo over this
range,but lies about 2 to 4 db on either side of the reference
tone level. For tempi less than 2.5 tones/sec., the minimum
level difference for fission increaseswith decreasingtempo
(or longer inter-tone intervalsof silence) and is symmetric
about AL = 0 (no difference in level). For tempi greater
than 10 tones/sec. the level differenceat which fission occurs
increaseswith increasingtempo but the situation is not symTempo (Tones/sec)
20
0
5
3
2
Masking
SContinuity
"
r
Masking
08.,
g
A
B,
A
Roll
Inevitable
ev le
_Coherence
I
j3
?
Page 32
--* 35 db SPL
Fission
20
Figure 8. This figure illustratesthe range of possible percepts found by van Noorden (1975) for two alternatingsine
tones differing in each case only in their intensities.Tone B
was kept at 35 db SPL throughout; both tones were 40
msec. long and had a frequency of 1 KHz. a) The level of
tone A is below the auditory threshold. b) Tone A is at
least 5 db below tone B. c) Tone A is within 5 db of tone B.
d) Tone A is louder than tone B with an alternationtempo
less than about 13 tones/sec. e) Tone A is louderthan tone
B with more than 13 tones/sec. f) Tone A is about 18-30
db louder than tone B and the tempo is still above 13 tones
/sec. g) Tone A is more than 30 db louderthan tone B. The
arrows indicate the percepts reported; a more complete
descriptionis given in the text.
LA>
LA>LB
LB
Fission
20o
LA< LB
Sub-Threshold
0 40
200
400
60o
800
Tone RepetitionTime (msec)
Figure9. The perceptsshown in Figure8 fall into different
regionsof tempo-intensity combinations.The verticaldotted
line correspondsto the durationof each tone. The level differences between tones A and B are relativeto the level of
tone B at 35 db SPL. The horizontal axis is shown as tone
repetition time on the bottom and as its reciprocal,tempo,
on the top. It should be pointed out that Figure5 and Figure
9 cannot be directly compared,becausethe alternatingtones
in Figure5 were played at different frequencies.
Computer Music Journal, Box E, Menlo Park, CA 94025
Volume 3 Number 4
metrical about AL = 0; for tempi greaterthan this, there is
the complex interactionof level differenceand tempo which
resultsin the roll, continuity, and maskingregions.
A combinationof frequency separationand level difference was used to obtain the roll and pulsationthresholds
shown for two different tempi in Figure 10. A great deal of
psychoacousticresearchhas been devoted to the boundary
between the roll and continuity regions. The "pulsation
threshold,"as this boundaryhas been termed [24, 28, 33],
requiresa greaterloudness difference at slower tempi than
does the roll threshold,as can be seen in Figure 10.
The continuity effect may operate accordingto principles very similarto those which governocclusion in vision.
The interpretationthat one object occludes anotheris given
by informationderivedfrom the intersectingedges of the objects. In Figure 11, the fragmentshave no apparentstructure
since informationregardingthe shape of the occludingstructure is not present.However,in Figure 12, with the appropriate edge information, the fragmentscan be grouped into
meaningful chunks. The same process may be postulated
to occur in audition. That is, the "edges"of a louder sound
and a contiguous softer sound might be used to induce the
perceptionthat the less prominentsound is partiallyoccluded,
or masked,by the louder. In such a case, the illusion of continuity of the softer sound "behind" the louder sound may
result.This has been found to be true for sine tones masking
sine tones, noise maskingsine tones, and noise maskingnoise
[8, 14, 24, 28. 32, 33, 34].
Tempo - 21 Tones/sec
Tempo
a
EI
,
Figure 11. These fragments have little apparent structure and
meaning since information regarding the shape of the
occluding structure is not present.
r
1
16 Tones/sec
40
il/
I
g
i
,?
IC
+-
>
30"\-0continuity
0Qo\
1
_
ro
fission
fession
\
0-400
-200
LB = 35db SPL
Figure 12. With appropriate information about the edges of
the occluding structure, the fragments of Figure 11 can be
grouped into meaningful chunks.
0
200
-400
fB = 1 kHz
-200
0
200
fB = 1 kHz
Frequency Difference (Hz.)
Figure 10. The boundaries between the fission, roll, and
continuity regions already discussed in Figures 8 and 9 are
shown here as a function of frequency and level differences.
There is an increase in slope for roll and pulsation thresholds
and an upward shift in the latter at slower tempi. The
frequency and level differences are given relative to the
frequency and level of tone B, i.e. 1000 Hz. and 35 db SPL,
respectively. 0 KHz. frequency separation corresponds to the
vertical dashed line in Figure 9. (Note that Figure 9 cannot be
mapped directly onto Figure 10; in the former illustration,
listeners were asked to try to hear fission, but in Figure 10,
the test subjects were listening for roll. The different focus on
the part of the listener results in a shift in the boundaries
shown.)
Stephen McAdams and Albert Bregman: Hearing Musical Streams
That the nature of the temporal "edge" between the
two sounds is the importantfactor was illustratedby Bregman
and Dannenbring[8]. As Figure 13 illustrates,they varied
the transitionalamplitude of a tone both leading into the
attack and coming out of the decay of a noise masker. In
reality, the tone stopped the instant the noise beganand then
began againthe instant the noise stopped. The greatestcontinuity was found when there was no changein the amplitude
of the tone. Whenthe level of the tone was either increased
or decreasedbefore the attack of the noise, the illusion of
continuity was substantially diminished. The amount of
change in judged continuity was greater for the initially
decreasingamplitude ramp than for the initially increasing
ramp.The changein the amplitudeof the tone may suggesta
possible terminationof the tone sometime duringthe noise
burst. The length of the rampwas also found to have an effect
on perceivedcontinuity. The least continuity was found when
the amplitude rampslasted 50 msec., and for longer ramps
there was less of a break in perceivedcontinuity. This may
suggestthat a very long rampwould be interpretedas no ramp
since the rate of change of intensity would approachzero.
Page 33
Timbre
db
76
91
Tone
6
db
0Noise
db-ne
76 or 66 db Tone
91 db
Noise
-
-
76db
\
6
db
Tone
/
91 db
Noise
Time -+
Figure 13. The stimuli used by Bregmanand Dannenbring
(1977) consisted of a pure tone interrupted by noise.
The level of the tone leading into and coming out of the
noise burst was varied as shown. The greatest degree of
continuity was found in the center case where there was
no change in level.
It has also been reported that alternationspeed has almost
no effect on the perceivedcontinuity [14].
An important finding in relation to this phenomenon
was reportedby Warren,et al. [34]. They noted that there
must be some degree of spectraloverlapin the two signals
to obtain the continuity effect. Withsine tones, for example,
the effect is greatestwhen the tones are identicalin frequency,
and falls off monotonically as the frequency separationon
either side of the maskerincreases.The effect was even found
to occur througha burst of noise as long as 50 seconds [34] .
Though some people think the pulsation threshold
reflects only a neuralexcitation pattern that decays gradually, with the perceptionof continuity resultingfrom overlap of excitation at smallertime intervals[24, 33] , the following demonstrationillustrates that higher-level context-dependent predictivemechanismsmay be involved.These seem
to parallel the predictive nature of the stream formation
mechanismsmentioned previously.In Taped Illustration9,
one hearsspeech that has been gated;it is difficult to understand the spoken message.Then the gaps in the speech are
filled with white noise. This effect was first verified experimentally by Millerand Lickliderin 1950 [27]. The same
sort of phenomenonwith gated music and then gated-plusnoise-filled music is presented in Taped Illustration 10.
We have chosen a familiar composition to dramatizethe
effect. It may be very difficult to recognizeit in the gated
condition, but should become easily recognizablewhen the
gaps are filled with noise. In both of these demonstrations
the amount of informationremovedand then replaced,or
inducedby the perceptualsystem, is far too largeand far too
complex to be entirely attributedto interactionbetween localized excitation patterns.
Page 34
Another musically useful dimensionthat can play a
role in streamingeffects is timbre. Timbre tends to be the
psychoacoustician'smultidimensionalwaste-basket category
for everything that cannot be labeled pitch or loudness,
includingshort-termspectralchangessuch as onset transients,
long-term spectra,those dynamicqualitieswhich a musician
would term "texture," and so on. The important thing to
notice about streamformationis that any groupof simultaneous spectralcomponentswhich move roughlyparallelto each
other in the pitch (logarithmicfrequency) domain are most
likely to resultin a fused perceptand thus to be interpreted
as emanatingfrom one source [22, 25] . So even though the
individualspectralenvelopes of two instrumentsor sources
changeover time, they each follow patternswhich are sufficiently consistent internally to be fused and yet different
enough, in most cases, to distinguishthem one from the other.
Thereare, of course, many situationsin which especiallysensitive instrumentalistscan blend their tones and become as
one voice, but these situations usually involve very slow,
smooth changeswhere onset transientscannot give the listener
the informationneededto separatethe sources.As an example,
how often is a performanceof Ravel'sBolero preciseenough
so that the instrumentsplayingthe various"harmonic"parts
are not heardindividually,but fuse into one voice? The fact
that one typicallyhears them separatelypoints to the extreme
sensitivity of the auditory system to even minute variations
in temporaland spectralelements.
Let us examine one of the many aspectsof timbrelisted
above in more detail, using a simplistic steady-state timbre
derivedfrom summedsinusoids,as was done to test the effect
of timbreon streamsegregation[25]. If the frequenciesof a
sequence of sinusoidaltones, as shown in Figure 14, are within
close proximity (such that they would form one streamif
none of them were enriched),it is possible to cause a subset of
these tones to segregateinto its own stream by inducing a
timbredifference.In this figure,the verticalline indicatesthat
timbreis createdby, or at least co-exists with, the fusion of
components into a single percept (Taped Illustration11). We
will discussthe natureof perceptualfusion later.
A musicaluse of timbre-basedstreamingmight work as
follows: Figure 15 shows an isotimbralmelodic phrase containingthree sub-melodiesthat are slowly to emergeas contrapuntal melodies on their own. Each sub-melody is marked
with a differentsymbol under the notes that are to belong to
it. Now we might cause the "X" melody to slowly changein
timbreto a point whereits timbrediffers sufficiently from the
rest of the streamto induce separation.At this point there
would be two contrapuntallines distinguishedby their respective timbresas discussedby Wessel [35]. Alternatively,several
simultaneous timbral modifications might cause an equal
number of streams to emerge as separate contrapuntal
melodies, as is illustratedin the second part of Figure 15 by
the "X" and "O" streams.Note that as these two imbedded
melodies emerge,the originalmelody is changedsince several
of its elements have left and joined other groups. (It is
importantto note that certaintypes of inharmonicenrichment
of a tone might changethe pitch as well as the timbre,which
would be anothercase altogether).An understandingof stream
formation based on timbre modification is importantif one
intends to use multitimbralmelodies compositionally, as
Computer Music Journal, Box E, Menlo Park, CA 94025
Volume 3 Number 4
3f4
3f2
f4 "
" .f4"
Two Timbres
Streams
One
Stream
OneTimbre
Time-
Time-
Figure14. Timbredifferencesbetween groupsof tones have been shown by McAdams(1977) to influencesegregationof those
groups. A repeating4-tone sequence can be decomposed into two streamswhen two of the tones are enriched with their
respectivethird harmonics.The dotted lines indicatethe streamperceptsand the verticalsolid lines representthe fusion and
timbre of the complex tone (cf. Taped Illustration11).
Varise and Schoenberg,amongothers, have done. A problem
potentially ariseswhen largechangesin both timbreand pitch
occur simultaneously.It becomes very difficult for the listener
to follow the "melody" line if the tempo is above the listener's
fission boundaryfor that set of events. One's attention may
tend to drift from one timbre set to anotherand relegatethe
rest to unattendedstreamsratherthan following the sequence
the composerintended.
o
xx
-
I •
S
I/li
w'_
?vi
X
XA
x
I
-h
•
I/I
.IF
#t,•
#
l
Figure 15. This figure illustratesthe possibleeffect of changing
the timbre of different sub-groupsof a melody line. The notes
markedwith X's and O's in the originalstreamare shown on
separatestaves below to indicatethe rhythmictransformation
that takes place after segregationoccurs. (Taken from the
A MajorPrelude,Well-TemperedClavier,Book I, by J.S.
Bach).
Stephen McAdams and Albert Bregman: Hearing Musical Streams
TimbralTrajectories
This leads into a discussion of the effects of timbral
trajectorieson streamsegregation.As was mentioned before,
if two glissandiof identical spectralstructurecross in frequency, it is reportedlyeasierfor listenersto hearhigh and low Vshaped contours than to hear the ascendingand descending
paths actually taken by each glissando. However, if the
glissandido not cross, or if they differ in spectralstructure,
it becomes possible to segregatethem and hear the risingand
fallinglines separately.In Taped Illustration12 and in Figure
16, harmonics3, 4, and 5 of two different "missing"fundamentals are modulatedexponentiallyin opposite directions.It
is easy to hear them separatelysince neither the spectral
components nor the "missing"fundamentalscross. In the next
example (see Figure 17) two different spectralstructuresare
used in the same frequencyrange.The descendingglissandois
composed of harmonics 10, 11, and 12 and the ascending
glissando is composed of harmonics 3, 4, and 5 (Taped
Illustration 13). One might argue that it is not a timbre
differencethat allows one to distinguishthese patternsbut the
fact that the pitches do not cross, as is illustratedin Figure 17
by the dotted lines which show the "missing"fundamentals.
Halpern [22] tested for this by using the same harmonic
composition (see Figure 18) as in the last example and crossing
the pitches of the glissandi,which would correspondto the
"missing"fundamentals,rather than crossing the frequency
components. The risingand falling glissandiwere still perceived, which suggeststhat the segregatingeffect was based on
spectralcontent ratherthan on pitch (TapedIllustration14).
For comparison,listen again to the pure tone crossingin
Taped Illustration8. Some pre-tests done for both Halpern's
Page 35
4f,
3f,
2f
1
5f2
LL
0
4f
LL
3f,
0
Time
Time
Time
Time
-"
[22] and McAdams'[25] studies bore out the intuition that
it is necessary to use constant ratio relationshipsamong
components to attain fusion and constant timbre. Further
researchon the fusion of harmonicsfor the buildingof timbre
is currently in progressin Bregman'slaboratoryat McGill
University.
TimbreMovementVersusPitch Movement
One pitch phenomenonhas a scientifichistory which is
almost coextensivewith that of pitch researchin general.This
phenomenonhas been termedthe "missing"fundamentalfor
reasonsthat will become obvious. It is possible to construct
a tone from severalsinusoidalcomponentswhose frequencies
are integer multiples of some fundamentalfrequency which
may or may not be actually present in the final signal's
spectrum.For example, frequenciesof 1000, 1250, 1500, and
1750 Hz would representthe 4th, 5th, 6th and 7th harmonics,
respectively,of a fundamentalat 250 Hz. Whetheror not a
spectralcomponent is presentat the fundamentalfrequency,
the perceived pitch almost always corresponds to the
fundamental.Of interest in our discussionof timbre and
streamingis the fact that tonal events with variousspectral
structurescan elicit the same "missing"fundamentalas long as
the componentsare integermultiplesof that fundamentaland
are within a certain so-called "existence region" [29] of
harmonicsthat can actually contribute to the pitch. Thus
different tonal events can elicit the same pitch but with
different timbres.
Some of the first author'spersonalexperiencein working with the pitch of complex tones has led him to the conclusion that it is possible for changesin pitch and timbre to contribute competinginformationto the overallperceivedmovement in tone sequences.Figure 19 illustratesthat is is possible
for timbreto get "sharper"or "brighter"[see for example 1,
18, 20], i.e. the spectralcomponents move to a higher frequency range,while the pitch moves in the opposite direction.
In Figure 19, while both complexes will elicit pitches correspondingto the "missing"fundamental,i.e. Fl and F2, the
Page 36
f2
Figure 16 (above, left). Harmonics3, 4, and 5 of two different
"missing"fundamentalsare modulated exponentially in frequency in opposite directions. Neitherthe frequenciesnor the
fundamentalsof the two glissandi cross each other. Two
separate glissandi are heard (cf. Taped Illustration 12).
Figure 17 (above, right).An ascendingglissandocomposed of
harmonics3, 4, and 5 of one "missing"fundamentaland a
descendingglissandocomposed of harmonics10, 11, and 12 of
another "missing"fundamentalare crossed in frequency, but
the pitches of the "missing"fundamentalsdo not cross. Two
different glissandiare heard (cf. Taped Illustration 13).
Figure 18 (below). Here the same harmonicstructureas in
Figure 17 is used but now the pitches of the "missing"fundamentalscross each other and the partialsdo not. Two separate
streamsare still heard(cf. Taped Illustration14).
lof2
5f,
4 fi
C
3fl
0,
uL
0)
0
Ts
m
f
Tm-
Computer Music Journal, Box E, Menlo Park, CA 94025
Volume 3 Number 4
~T28f2
7f2
5f,
t TI4f,
3f,
U-
f0I
T-f2
Time
Figure19. It is possibleto construct two tones whose respective spectral "centersof gravity" (see text) and "missing"
fundamentalpitches move in opposite directionswhen these
tones are presentedin succession.
-
lOf9
"centerof gravity"of the spectralenvelope,TI, of the first
toneis lowerthanthatof the secondtone,T2. Spectral"center of gravity"maybe roughlydefinedas the weightedmean
over
of the spectralcomponentsin anacousticeventaveraged
the durationof the event,theirweightsbeingderivedfrom
theirrelativeamplitudes.
Van Noordeninvestigatedthe possibilityof streaming
basedon
basedon pitchcontiguity,as opposedto streaming
the
continued
due
to
i.e.
frequencycontiguity, continuity
presenceof spectralcomponents.Figure20 showsthe stimuli
he used.Thefirstconsistedof two differentthree-component
complextoneswithharmonics3, 4, and5 for the firsttone
andharmonics8, 9, and10 for the secondtone. Bothtones
havethe samefundamental
frequency.Herean attemptwas
to streamtogether.In the
fundamentals
madeto get "missing"
secondstimulusa puretone of frequencyf alternatedwitha
complexcontainingharmonics3 through10 of the same
fundamental
frequency;thiswasan attemptto streama pure
of the samefrequency.He
fundamental
tonewitha "missing"
foundthatit wasnot possiblewitheitherstimulusto obtain
a coherentstreampercept.However,it was necessarythat
if one
thesestimulihavevastlydifferentspectralstructures:
were to partiallyoverlapthe spectralstructuresof the
respectivetones, it would be impossibleto separatethe
basedon frequencycontiguityfromthat
perceptof streaming
basedon pitch contiguity.For example,if the firsttone
consistedof harmonics3f, 4f, and5f, andthe secondtone
consistedof harmonicsSf, 6f, and7f, howis one to knowif
lOf
-
f-••-
8ftC
c5fTu4f
-
5f
-
-
3fi-
Time -+
Time-
Figure20. Van Noorden (1975) tried alternatingtwo tone complexes of differingharmonicstructurebut identical"missing"
fundamentaland also tried alternatinga sine tone with an 8-tone complex whose "missing"fundamentalwas at the same pitch
as the puretone. He was not able to obtain streamingof the fundamentalsin the first example or of the sine tone and the
"missing"fundamentalin the second example.
andAlbertBregman:
Musical
Streams
StephenMcAdams
Hearing
Page37
the streamingeffects reportedare due to a repetition of 5f or
to the perceptionof a repeating"missing"fundamental,f. A
primaryresult of this arrangementof harmonicswas that
van Noorden used substantiallydifferent timbresin the two
tones, and this condition may have contributedsubstantially
to the preventionof streamformation.Equallylikely is the
possibilitythat streamformationmechanismsdo not use pitch
information but use frequency information. The following
findingsmay lend supportto the formernotion.
Our work on timbre effects in stream formation [25]
found an anomalousresult that may have been due to pitch
effects interactingwith timbreeffects. In Figure21, two of
the test cases used in that study are shown. In the first, the
higher tone pair of the repeating four-tone sequence is
enrichedwith the thirdharmonicand in the second case the
lower pairis similarlyenriched.The first case was found to
segregatesignificantlymore than the second. It is McAdams'
contention that this is the result of an interaction between
perceived pitch "height" and timbral "brightness."This
"brightness"seems to be relatedprimarilyto spectral"center
of gravity."In our examplewith the enrichedhigh pair, the
pitch "height"is alreadygreaterthan that of the low pairand
the addition of a highercomponent increasesthe "brightness"
of its timbre.The perceptualintegrationof these two factors
may facilitate the segregationof the tone pairsby resultingin
an overallincreasein perceived"height"of the tones in the
high pair. However,in the second condition where the low pair
was enriched, the increasein "brightness,"interactingwith
perceivedpitch "height," apparentlyserved to increasethe
overall perceived "height" of the lower pair. This would
bring it into closer perceptualproximity to the higher pair
and thus attenuate the effect of the difference in timbre
between the two pairs.But this condition still evoked more
segregationthan two isotimbralcontrol cases, where all of the
tones were composed of either one or two components. Of
interest here, again, is the fact that pitch and timbre movement may work againsteach other.
While these timbre changes have proven to be a
consistent irritant to the experimenterpursuingthe elusive
"missing"fundamental,the implied interrelationshipsbetween
timbral"brightness"and pitch "height"may proveinteresting
to investigatein terms of streamformationand to apply (both
experimentally and compositionally) to melodic sequence
construction. Grey [18] has delineated a three-dimensional
timbre space roughly defined by 1) spectral distribution,
2) spectralfluctuation and synchronicityof higherharmonic
transients, and 3) low-amplitude, high-frequency attack
transients.How does the "distance"between events in this
timbre space affect the temporal coherence and fission
boundaries?Do the effects obtained vary differently along
these different dimensionsassociated with timbre? Research
by Wessel [35] and Grey and Gordon [19, 20] has provided
informationwhich would allow the composer to deal with
timbre as a compositional parameter.This informationalso
allows a certaindegreeof predictionof the perceptualrelationships between sounds resultingfrom such timbralvariations.
Fusion,Timbre,andFrequencyStreaming
is an
Theideaof competingperceptualorganizations
and
excitingone formusiccompositionandtheory.Bregman
Pinker[10] investigated
the competitionbetweensequential
Low PairEnriched
WeakerSegregation
HighPairEnriched
StrongSegregation
C
0
u,
L
u.
t
Cr
ca
E
a)
LL
a)
L_
TmtTe
..-00:>
_____-
r
%No
00
0.0.
?
Time •
Time
-
Figure21. One resultof McAdams'study (1977) suggestedthat there is an interactionbetween pitch "height"and timbral"sharpness" (see text). In a repeating4-tone sequence,one of the pairsof tones was selectively enrichedby addingthe third harmonic.
A greaterdegreeof segregationof the high and low streamswas found for the formercase. The dashed lines indicatethe twostreamperceptsand the dotted lines indicatea potential one-streampercept.The verticalsolid lines representthe fusion and
timbreof the 2-tone complex.
Page 38
Computer Music Journal, Box E, Menlo Park, CA 94025
Volume 3 Number 4
(or frequency) organizationsand simultaneous(or timbral)
organizationsin the formation of auditory streams. The
stimulusused by Bregmanand Pinkerwas a sine tone alternatingwith a two-tone complex. In Figure22, tones A and
B would representthe sequentialorganizationand tones B and
C would represent the simultaneous organization. The
harmonicity and synchronicity of tones B and C in the
complex were variedas was the frequencyseparationbetween
the sine tone A and the uppercomponent B of the complex.
The rationalefor varyingthese two parameterswas as follows.
Tones with frequency relationshipsderived from simple
ratios, i.e. those that exhibit "consonance,"should tend to
fuse more readily than combinationsconsideredto be dissonant. While the evidence for this was very weak in the
Bregmanand Pinker study, work currently in progressin
Bregman'slaboratorystrongly suggeststhat this is indeed
the case. The new evidence further suggeststhat the effect
of harmonicityitself is relativelyweak and may be overridden by strongerfactors such as frequency contiguity and
synchronicity of attack and decay. Tones with synchronous
and identically shaped attack and decay ramps are more
likely to fuse than those with asynchronousor dissimilar
attacks and decays [12, 15]. This may be a major cue in
being able to parse out the different instrumentsplaying
together in an orchestra since they all have substantially
different attack characteristics.In addition, there is a very
low probabilityof severalpeople preciselysynchronizingtheir
attacks.
In light of this work, one might make the following
predictionsfor the perceptionof the stimuli used by Bregman
and Pinker: 1) Sequential streamingis favored by the
frequencyproximity of tones A and B (as we have illustrated
in the earlierexamples). 2) The simultaneous(or timbral)
fusion of tones B and C is favoredby the synchronyof their
attacks. 3) These two effects "compete"for tone B's membership in their respectiveperceptualorganizations.4) Finally,
when tone B is "captured"by tone A, it is removedfrom the
timbral structureand tone C sounds less rich. Thus, it is
reasonedthat if the two simultaneoussine tones B and C are
perceivedas belonging to separatestreams,they should be
heardas sine tones. But if they are heardas one stream,they
should sound like one rich tone.
It would be appropriateto introduce the notion of
"belongingness"at this point, since we talk of tones belonging
to streams,and of frequency components and the timbre
resultingfrom their interactionbelongingto a perceivedtonal
event. "Belongingness"(a term used in the perceptualliterature of Gestalt psychology) may be consideredas a principle
of sensory organizationwhich serves to reconstructphysical
"units"into perceptualevents by groupingsensoryattributes
Stimulus
C
B
LL
t
4-
Time-
Percepts
:3
A
A
Cr
B
C
\
B
,
C
A & B Stream
C Pure
B
C
B
C
A & B Segregate
C Rich
Time-
Time -
Figure22. The competition between sequentialand simultaneousorganizationsin the formationof auditorystreamsis shown
here. Tone B can belong either to the sequential organizationwith tone A or to the simultaneousorganizationwith tone C
but not to both at the same time. Bregmanand Pinker (1978) variedthe frequency separationsbetween tones A and B and
between tones B and C and also variedthe relativesynchronyof onset of tones B and C. The dotted lines in the figure indicate
the streamperceptsand the verticalsolid lines representthe fusion of tones B and C (cf. Taped Illustration15).
StephenMcAdamsand AlbertBregman:HearingMusicalStreams
Page39
of thoseeventsinto unifiedpercepts.As Bregman[2] points
is a necessaryoutcomeof any process
out, "belongingness
whichdecomposes
mixtures,sinceanysensoryeffectmustbe
source."In this case,whenthe
assignedto someparticular
tones B and C are segregated,the timbre
simultaneous
resultingfromtheirinteractionstillexistsandcanbe heardif
onelistensforit, but it is not perceptually
assignedto either
of the tonesB or C, andthusdoesnot affectthe perception
of them.Thenatureof a streamis suchthatits qualitiesare
dueto the perceptual
featuresassignedto it.
In TapedIllustration15 one canheara casein whichA
is closeto B in frequencyandC is asynchronous
with B. Then
a caseis heardwhereA is furtherawayfromB in frequency
andC is synchronous
with B. TonesB andC havethe same
in bothcases.Listenforboththe A-B streamand
frequencies
the richnessof toneC. Thelistenersin Bregman
andPinker's
C asbeingricherwhenB andCwere
studyreportedperceiving
andthisjudgedrichnessdroppedoff with an
synchronous,
increasein asynchrony,
i.e. as C eitherprecededor followed
B by 29 or 58 msec.As the frequencyseparationbetween
A andB wasincreased,Cwasreportedlyperceivedas being
rich.
increasingly
The Role of Context in DeterminingTimbre
Thesefindingsindicatethat the perceivedcomplexity
of a momentof soundis context-dependent
(see [19] as
anotherexampleof the trendtowardviewingtimbreas
dependingon context).Contextmaybe suppliedby a number
of alternative
thatcompeteformembership
of
organizations
elementsnot yet assigned.Timbreis a perceivedpropertyof a
streamorganization
ratherthanthe directresultof a particular
In otherwords,
waveform,and is thuscontext-dependent.
two frequencycomponents
whosesynchronous
andharmonic
wouldcausethemto fuseunderisolatedcondirelationships
tions may be perceivedas separatesine tones if another
organization
presentsstrongerevidencethat they belongto
separatesequentialstreams.
A verycompellingdemonstration
of the decomposition
of a timbreorganization
by alternatefrequencystreaming
is illustratedin Figure23. Whentone A is
organizations
presentedby itselfit elicitsa timbre,denotedTA.If thistone
is precededby tone B elicitingtimbreTB,andsucceededby
tone C elicitingtimbreTC,one notices that timbreTA
andis replacedby timbresTBandTC.
completelydisappears
Here the highestand lowest componentsof tone A are
streamedwith thoseof tone B andsubsequentlyassumea
timbreidenticalto thatof tone B. Also,the innercomponents
of tone A streamwith thoseof tone C and assumea like
timbre(TapedIllustration16).
An importantquestionconcerningthe assignment
of
timbreand pitchto a tonaleventarises.Bothtimbreand
pitchhavebeen foundto be context-dependent[10, 21,
respectively].But each may be determinedby different
as such they may be
ongoingcontextual-organizations;
consideredto be associatedperceptualdimensions
of a sound
butmaynot be inextricably
boundto one another.Howrelevantto musictheory,then,arestudiesthatdealwiththe perceivedpitchandtimbreof tonesin isolation?Thisquestionis
not meantto insinuatethatsensoryandpsychophysical
experimentationareuseless.Farfromit. If we thinkin termsof
the experienceof musicat differentlevelsof
investigating
processing(Figure24), we see how importantall of these
areasof researcharein buildingthe wholepicture.Theraw
physicalinputis modifiedby the limitsof the senseorgan
whoseoutputis stillfurthermodifiedby cognitiveprocesses.
Butby studyingsteady-state,or at leastrelativelysimple
signals,we can find the limitsand interactionsof the sense
organsandperipheral
processes,suchas temporalandspectral
resolution,lateralinhibition,andmasking,whichlimitsaffect
the final percept.Beyondthese, the centralperceptual
processessuch as pitch extraction,timbrebuildup,and
coherenceand fissionmodify the initialneuralresultof
stimulationof the sensorysystem.Furtherinteractionsin
higherbrainprocessessuchas attentionalprocesses,memory
andcomparison
of pitch,timbreandloudness,contextextrac-
One Timbre
:TA
Two Timbres
:1
A
versus I:
i.
B
11
12.
A
C
Figure23. The first partof this figureshows a repeatingtone (A) consistingof 4 harmonics.This tone would elicit a certain
timbre percept,TA. In the second part this tone is precededby tone B, consistingof the top and bottom harmonics,and is succeeded by tone C consistingof the two innerharmonics.Tone B elicits timbreTB and tone C elicits timbre
TC. However,due to
the streamingof tone A's components with those of tones B and C, TA totally disappearsand is replacedby
TB and TC (cf.
Taped Illustration16).
Page40
ComputerMusicJournal,Box E, MenloPark,CA 94025 Volume3 Number4
PhysicalEnvironment
Levelsof
Processing
Temporaland Spectral Resolution
LateralInhibition
Masking
"Sensory"
etc.
Pitch Extraction
Timbre Buildup
Coherenceand Fission Limits
"Perceptual"
etc.
sequenceof tonesmay segregateinto two or moreseparate
coherent.In thiscase,we would
streamswhichareindividually
perceiveseveralsimultaneousmelodiesratherthan one.
definitionof melodyis tenable,
If suchan operational
we mustthenquestionhowit is thatsomeof the extensionsof
elementsotherthanpitch for "melodic"materialarevalid
For example,whena composerusestimbreas
perceptually.
thematicmaterial,do we stillperceivethe sequenceasmaintaininga temporalcontinuity?Sometimescoherenceis mainandsometimesit canbe very
tainedby sensitiveperformers
difficultto perceive.Of course,we havenot includedother
elementswhichaffectthe perceptionof melody,suchas
butwe areonly attemptingto
harmonicstructure,
underlying
convey the notion that temporalcoherenceshouldbe
consideredessentialto melodyformation.Conversely,one
mayuse the principlesof fissionto developrulesfor creating
in sequencesof acousticevents.
polyphonyandcounterpoint
AttentionandMusicalStructure
Attention
Memory
Context Extraction
Form and Texture Integration
etc.
*
Figure24. This block diagramsuggestsa possiblearrangement
of the processingof acoustic informationat different interconnected levels of the auditorysystem.
tion, andformandtextureintegration,
sculptthe transduced
into meaningful
information
percepts.It is felt thatall of these
levelsof processingfeed into eachotherin a sort of heterarchical(asopposedto a hierarchical)
system.Thepointbeing
madeis that in the frameworkof musicwhereall of these
complexinteractionsareof greatimportance,the context
thatis createdmaybe the essentialdeterminant
of themusical
resultof a givensound.Onesoundis potentiallyperceivable
in
a greatnumberof ways,dependingon its context.
Melody
Thisleadsus to believethatthe fundamental
perceptual
elementin musicmay be the "melody"ratherthan the
isolatedtone.Orin the terminologyof auditoryperception,
the fundamental
structureis the auditorystream.Thisis not,
of course,a newnotion;but anempiricalapproachmayallow
us to clarifyand delimitthe conceptto the extentthat we
maypredictthe perceptualresults.That,in the mindof the
firstauthor,is the primaryconcernof the composer.Letus,
then,examinemelodyandits relationto attention.
For our purposes,we can think of melodyas a
connectedandorderedsuccessionof tones [28]. It follows
thenthattemporalcoherenceis necessaryfor a sequenceof
tonesto be perceivedas a whole.On the otherhand,a
Stephen McAdams
nindAlh"rt Rrtnmn-
oRa,;
.
...A--
It hasbecomeapparentto the firstauthorthatmusical
bound
structure,as it is perceivedin real-time,is inextricably
willrevealthat
to attentionalprocesses.A bit of introspection
thereareat leasttwo kindsof attentionalprocesses,which
we mightcallactiveor willfulattention,andpassiveor automaticattention.Onemightwillfullydirectone'sattentionto
some objector sequenceof events,such as listeningto
eventswithina pieceof music.Or, someunusual
particular
suchas the
eventmightattractone'sattentionunexpectedly,
honkinghornof an oncomingcaryou hadnot noticedasyou
steppedinto the streetabsorbeddeepin thought.Thathorn
demandsyour attentionandin all probabilitygetsit in a
bothkindsof attentionmayparticipate
hurry.In particular,
in the processof listeningto auditorystreams.Forinstance,
in Figure5 whena sequenceof tonesliesabovethe temporal
coherenceboundary,no amountof activeattentioncan
extractthe perceptof one coherentstream.Hereperception
is limitedby passiveattentionalprocesses[28]. Thishas
forcomposerswhointendto usefast
importantconsequences
melodicsequences,sinceit suggeststhat therearetempiat
whichthe listenermaynot be ableto followas a melodythe
of the attentional
regardless
sequenceyou haveconstructed,
willpowerinvoked.An exampleof theseeffectsmaybe found
in the sequencespresentedat differentratesin CharlesDodge's
Earth's MagneticFields.Withoutpretendingto know the
composer'sintent,it canbe amusingto listento the same
sequencedecomposeand re-integrateitself duringvarious
sequenceone can relax
tempochanges.In a multi-streamed
attentional effort with the result that attention might
randomlyalternateamongthe availablestreams.Or one might
selectively focus attention on any one of them individually
and even play them againstone another.
Van Noorden reported that the temporal coherence
boundary (the boundarybelow which all tones may belong
to one stream)is not affected by previousknowledge of the
sequence, and consideredit to be a function of a passive
attentionalmechanism,given that the listener "wantsto hear
coherence."However,what happensbetween the boundaries
of temporalcoherenceand fission dependsupon a numberof
factors, such as context, and seems to be underthe influence
of attention. Dowling [17], for example, reported that if
listeners knew beforehandwhi:h melodies were being interleaved, they could, with a bit of practice,extract the approPage 41
priate melody even at very small separationsof the ranges
traversedby each melody. It is currentlyassumedthat this
ability would degenerate at faster tempi. In addition,
van Noorden found an effect at very fast tempi where it was
virtuallyimpossibleto hear sequenceswithin a rangeof two
or three semitonesas other than temporallycoherent. At very
fast tempi (about 12.5 tones/ sec.) the tones of such narrow
patternsare not heardas separatemembersof a sequence,but
actually mergeinto a continuouslyripplingtexture, as one can
hear in Taped Illustration 17. There are thus attentional
limits in the ability of the auditorysystem to tracka sequence
of events. Whenevents occur too quickly in succession, the
system uses the variousorganizationalrulesdiscussedin this
article to reorganizethe events into smallergroups. It may
then track events within a particulargroup if the listener is
payingattention to it, but this narrowingof focus necessarily
causesa loss of information.One resultis the inability to make
fine temporalorderjudgmentsbetween streams.These organizationalmechanismsreflect the tendency of the auditorysystem to simplify things in the face of excessivecomplexity. In
the example where the fast sequenceof tones mergesinto a
continuous "ripple,"the auditorysystem is unable to successfully integrateall of the incominginformationinto a temporal
structureand simplifies the situation by interpretingit as
texture (see also [31] ). Thus the auditory system, beyond
certaintempi, may interpretthe sequenceas a single event and
assignto it the texture or timbre createdby its spectraland
temporalcharacteristics.
An understandingof (or intuition about) these organizational processescan lead to new dimensionsof control over
musicalstructure.For example, one might construct contrapuntal sequences that play across variousstream boundaries
and through different borderlineregions between temporal
coherenceand fission. (It may be that composerssuch as Bach
were alreadyusingperceptualambiguityconsciously in their
work.) Any or all of the relevantmusicalparametersmight be
used to accomplishthis. Then, an appropriateuse of events
that vie for or demandthe listener'sattention can be used by
the composer to "sculpt" the attentional processes of the
listener. Since some events seem more strikingto some persons
than to others, this attentionalsculpturein time would lead
different listeners through different paths of auditory
experience. Further,perceptionis bound to vary from time to
time within a single person, so the experience would be
different with each listening. A composition of sufficient,
controlled complexity might thus be perceptuallyinfinite for
a givenlistener.
Conclusion
An attempt has been made here to point out that
composersand music theorists should thoroughlyexamine the
relationshipbetween the "musical"principlesthey use and
espouse, and the principles of sensory, perceptual, and
cognitive organizationthat operate in the human auditory
system. Manyof the principlesdiscussedin this articleextend
to higher-levelperceptualanalysis of musical context and
structureand may well representa scientific counterpartto
some extant music theoretical principles. In other cases,
though, this groupof phenomenasuggestsperceptualorganizations which have little relation to methods currently used
to construct or analyze musical structure. To ignore the
evidence from the real life system in developinga theory of
Page42
musicor a musicalcompositionis to take the chanceof
one'sworkto the realmof whatmightbe termed
relegating
"papermusic."
Acknowledgments
This paperis basedon a workshopentitled"The
PerceptualFactoringof AcousticSequencesinto Musical
Streams"deliveredat the 1978 International
Computer
MusicConference,Evanston,Illinois,andpublishedin the
of the Conference.Theauthorswouldlike to
Proceedings
thankDr.LeonvanNoordenforhismanyhelpfulsuggestions
andvaluablecriticismsof the manuscript;
Figures5, 8, 9,
10 and 20 were redrawn,with permission,from
Dr. vanNoorden'sthesis.Thepresentversionof this article
was preparedwhileMr.McAdamswasa GraduateFellow
of the NationalScienceFoundation.Dr. Bregman's
research
has been supportedby grantsfromthe NationalResearch
Councilof Canada,the QuebecMinistryof Education,and
the McGillUniversityFacultyof GraduateStudiesand
Research.Muchof the researchusedthe facilitiesof the
Laboratoryof the McGillUniversity
Computer-Based
of
Music
Department Psychology.The editorsof Computer
Journalwouldliketo thankMr.McAdams
for preparing
the
in thisarticle.
illustrations
Appendix1
of TapedIllustrations
Description
1. A repeatingsequenceof threehightones(1600, 2000,
with threelow tones(350, 430, 550
2500 Hz.)is interspersed
andCampbell(1971). a) At a tempoof
Hz.)usedby Bregman
five tones/sec.the sequenceis perceivedas one streamof
alternating
highandlow tones(cf. Figurela). b) At a tempo
of ten tones/sec.the sequencesegregates
into one
perceptually
streamof high tones and one streamof low tones (cf.
Figurelb).
2. Anotherrepeatingsix-tone sequenceis playedat a
tempoof ten tones/sec.,but the highertripletis closerin
frequencyto the lower.ToneF maybe perceivedasbelonging
to eitherthe highstreamor the low streamdependingon the
listener'sfocus.Note that tone F cannotbelongto both
streamsat once(cf. Figure2).
3. A repeatingsix-tonesequencestartsat a slow tempo.
As the tempois gradually
increased,the sequenceis progresinto
smaller
sivelydecomposed
perceptualstreamsuntilit is
no longerpossibleto followthe tonaleventswhichmergeinto
the perceptof timbreor texture(cf Figure3).
4. Usingthe sameinitialsequenceasin TapedIllustration
betweentemporallyadjacenttones
3, the frequencyseparation
is graduallyincreased.The samesort of decomposition
into
smaller streams occurs. The limits in this example are
determinednot by temporalresolutionbut by the audible
frequencyrange(cf Figure3).
5.
The tones of two familiarmelodies are interleavedin the
same frequencyrange.As the frequencyrangeof one melody
is shifted away from that of the other, the melodies become
recognizable.
6.
a) In the first part of this example (see Figure4), tones
in one frequencyrange,forma separate
A andB, alternating
in anotherfrequency
streamfromtonesX andY, alternating
Y aremovedinto the
tones
X
the
and
In
second
part,
b)
range,
proximityof the frequenciesof tones A andB with a sub-
ComputerMusicJournal,Box E, MenloPark,CA 94025 Volume3 Number4
sequent perceptualchange.Now tones A and X stream,as do
tones B and Y. Note also that the rhythmsof the respective
streamschangebetween the two examples.
7.
This example demonstratesthe streamingpotential of
a sine wave used for frequency modulation of a sine tone.
The progressionheardis from a very slow, trackablemodulation, through a point where the high and low peaks of the
modulated sound segregatedand then to the point where a
timbre is perceived. The example then returnsto the initial
modulation frequency.
8.
Two sine waves are modulated exponentially in
frequency. One ascends in frequency, the other descends in
frequencyand they cross at one point. Most listenersreport
hearinghigh and low V-shaped contours as opposed to hearing
the full glissandi.
9.
First, gated speech is heard. Note the difficulty in
discerningthe message. However,when the gaps are filled
with white noise, the spoken message sounds continuous
and a pulsingnoise is heardin the background.
10. The same demonstrationas in Taped Illustration9 is
used on musicalmaterial.It is difficult to induce the information lost in the gaps until those gaps are filled with white
noise.
11. The first part of this example is a repeatingfour-tone
pattern,all of the tones of which have the same timbre. This
representsa pattern usually perceivedas being temporally
coherent. However,when the thirdharmonicis addedto the
second and fourth tones of the pattern, thus changingtheir
timbre, two streamsof differing timbre may be heard. The
frequencies of the fundamentals are not changed (cf.
Figure 14).
12. Two glissandiwith identical spectralstructuresmoving
in opposite directionsareheard separatelysince none of their
spectralcomponents cross in frequency(cf. Figure 16).
13. Two glissandiwith differingspectralstructuresmoving
in opposite directions are made to cross each other in
frequency. Theirdifferingtimbresstill allow the listener to
segregatethem perceptually(cf Figure 17).
14. The same timbres used in Taped Illustration 13 are
arrangedsuch that their pitches cross but none of the
frequencycomponents cross. They are still heardas separate
glissandi(cf. Figure 18).
15. In the first part of this example (see Figure22) tones A
and B are close to each other in frequencywhile tones B and
C are asynchronous(tone C leads tone B by 29 msec.). Tones
A and B form a sequentialstreamand tone C is perceivedas
being fairly pure. In the second part, tone A's frequency is
separatedfrom that of tone B and tones B and C are synchronousin attack and decay. Tone A forms a streamby itself
and tones B and C fuse into a single,richerpercept.
16. First, a repeatingfour-component tone is heard. Note
the timbreof this tone (see Figure23). Then this tone A (at
the same repetition rate) is precededby a tone B consisting
of the top and bottom components of tone A and is followed
by a tone C consistingof the two inner componentsof tone A.
Two streamsare formed and the components of A are split
into separatestreamswith tones B and C. Accompanyingthis
streamorganizationis a decompositionof tone A's timbreinto
the timbresof tones B and C.
17. In this example a sequenceof tones very close to each
other in frequency is slowly acceleratedin tempo until the
percept of one continuous, ripplingtone is achieved. Then
the tempo is decelerateduntil the individualtones can be
heard once again.
StephenMcAdamsand AlbertBregman:HearingMusicalStreams
References
1. Bismarck,G. von. (1974). "Sharpnessas an attribute
of the timbre of steady sounds."Acustica, Vol. 30,
pp. 159-172.
2. Bregman,A. S. (1978a). "Askingthe 'what for' question
in auditory perception," in Kubovy, M. and J.
Pomerantz, eds., Perceptual Organization. Hillsdale,
N. J.: LawrenceEarlbaumAssociates.
3. Bregman,A. S. (1978b). "Auditorystreaming:Competition among alternativeorganizations."Perc. and Psychophys., Vol. 23, pp. 391 -398.
4. Bregman,A. S. (1978c). "Auditorystreamingis cumulative." J. Exp. Psych./HumanPerc. Perf., Vol. 4, pp.
380-387.
5. Bregman,A. S. (1978d). "The formation of auditory
streams."in J. Requin,ed., Attention and Performance
VII. Hillsdale,N. J.: LawrenceEarlbaumAssociates.
6. Bregman,A. S. and J. Campbell(1971). "Primaryauditory streamsegregationand the perceptionof orderin
rapid sequences of tones." J. Exp. Psych, Vol. 89,
pp. 244-249.
7. Bregman,A. S. and G. L. Dannenbring(1973). "Theeffect
of continuity on auditory streamsegregation."Perc.
andPsychophys., Vol. 13, pp. 308-312.
8. Bregman,A. S. and G. L. Dannenbring(1977). "Auditory
continuity and amplitudeedges." Can.J. Psych./Rev.
Can.Psych.,Vol. 31, pp. 151-159.
9. Bregman,A. S. and G. L. Dannenbring(1978). "Auditory
streaming."Sci. Am., in press.
10. Bregman,A. S. and S. Pinker(1978). "Auditorystreaming
and the building of timbre." Can.J. Psych., Vol. 32,
pp. 19-31.
11. Bregman,A. S. and S. I. Rudnicky (1975). "Auditory
segregation:streamor streams?"J. Exp. Psych./Human
Perc. Perf., Vol. 1, pp. 263-267.
12. Bregman,A. S. and H. Steiger(1978). "Auditorystreaming and verticallocalization:interdependenceof 'what'
and 'where'decisionsin audition." Unpub. manuscript,
McGillUniv. Dept. of Psych., Montreal,Quebec,Canada.
13. Dannenbring,G. L. and A. S. Bregman(1976). "Effect
of silence between tones on auditory stream segregation." J. Acous. Soc. Am., Vol. 59, pp. 987-989.
14. Dannenbring,G.L. and A.S. Bregman(1976)). "Stream
segregationand the illusion of overlap."J. Exp. Psych./
HumanPerc.Perf., Vol. 2, pp. 544-555.
15. Dannenbring,G. L. and A. S. Bregman(1978). "Streaming vs. fusion of sinusoidalcomponents of complex
tones." Perc. and Psychophys.,Vol. 24, pp. 369-376.
16. Deutsch, D. (1975) "Two-channellistening to musical
scales."J. Acous. Soc. Arrt, Vol. 57, pp. 1156-1160.
17. Dowling, W. J. (1973). "The perception of interleaved
melodies." Cog. Psych., Vol. 5, pp. 322-337.
18. Grey, J. M. (1977) "Multidimensionalperceptualscaling
of musicaltimbres."J. Acous. Soc. Am., Vol. 61, pp.
1270-1277.
19. Grey, J. M. (1978), "Timbrediscriminationin musical
pattern."J. Acous. Soc. Am, Vol. 64, pp. 467-472.
20. Grey, J. M. and J. W.Gordon(1978). "Perceptualeffects
of spectralmodificationson musicaltimbres."J. Acous.
Soc. Am., Vol. 63, pp. 1493-1500.
21. Guilford,J. M. and H. M. Nelson (1936). "Changesin the
pitch of tones when melodies are repeated."J. Exp.
Psych., Vol. 19, pp. 193-202.
(continued on p. 60)
Page43
(Buxton article continued from p. 25)
Mathews, M. (1969) The Technology of Computer Music,
Cambridge:MITPress.
Murray,D., Beauchamp,J., and Loitz, G. (1978) "Usingthe
PLATO/TI980A Music Synthesis System: The
PLACOMP Language," Proceedings of the 1978 International Computer Music Conference, Roads (Comp.),
Evanston: Northwestern University Press, Vol. 1:
151-166.
New EnglandDigitalCorporation(1978). SynclavierInstruction Manual,Norwich, Vermont: New EnglandDigital
Corp.,PO Box 305.
Newman, W. and Sproull, R. (1979) Principles of Interactive
Computer Graphics (Second Edition), New York:
McGraw-Hill.
Patel, S. (1979) "Score Editor Design: The Foundations of
Scriva," Toronto: unpublished manuscript, SSSP/
CSRG,Universityof Toronto.
Pulfer, J. (1972) MUSICOMP Manual, Ottawa: N.R.C. Radio
and ElectricalEngineeringDivision.
Reeves, W., Buxton, W., Pike, R., and Baecker,R. (1978)
"Ludwig:an Exampleof InteractiveComputerGraphics
(McAdamsand Bregmanarticle continuedfrom p. 43)
22. Halpern,L. D. (1977). "The effect of harmonic ratio
relationshipson auditory stream segregation."Unpub.
undergrad.researchreport, McGill University Dept. of
Psych., Montreal,Quebec, Canada.
23. Heise, G. A. and G. A. Miller(1951). "An experimental
study of auditory patterns."Am. J. Psych., Vol. 64,
pp. 68-77.
24. Houtgast,T. (1972). "Psychophysicalevidence for lateral
inhibition in hearing."J. Acous. Soc. Am., Vol. 51,
pp. 1885-1894.
25. McAdamsS. E. (1977). "The effect of tone quality on
auditory stream segregation."Unpub. honours thesis,
McGillUniversityDept. of Psych., Montreal,Quebec,
Canada.
26. Miller,G.A. and G.A. Heise (1950). "The thrill threshold." J. Acous. Soc. Am., Vol. 22, pp. 637-638.
27. Miller,G. A. and J. C. R. Licklider(1950). "Intelligibility
of interruptedspeech." J. Acous. Soc. Am., Vol. 22,
pp. 167-173.
28. van Noorden, L. P. A. S. (1975). Temporalcoherencein
the perception of tone sequences. Ph.D. Diss., Tech.
Hogeschool,Eindhoven,The Netherlands,Pub. by The
Institute of Perception Research, Eindhoven, The
Netherlands.
29. Ritsma, R. J. (1962). "Existence region of the tonal
residue.PartI."J. Acous. Soc. Am., Vol. 34, pp. 12241229.
30. Schouten, J. F. (1962). "On the perceptionof sound and
speech: Subjectivetime analysis." Fourth Int'l. Cong.
Acous., Copenhagen.Reports II, pp. 201-203.
31. Steck, L. and P. Machotka (1975). "Preferencefor
musicalcomplexity: Effects of context." J. Exp. Psych./
HumanPerc.Perf., Vol. 104, pp. 170-174.
32. Thurlow, W. R. (1957). "An auditory figure-ground
effect." Am. J. Psych., Vol. 70, pp. 653-654.
33. Verschuure, J. (1978). Auditory excitation patterns:
(continued on p. 63)
Page 60
in a Score Editor,"Proceedingsof the 1978 International ComputerMusic Conference, Roads (Comp.),
Evanston: Northwestern University Press, Vol. 2:
392-409.
Ritchie, D. M. and ThompsonK. (1974) "The UNIX TimeSharingSystem," Communicationsof the ACM, 17.7:
365-375.
Smith, L. (1978) MUSIO UserManual, Stanford: work in
progress,CCRMA,Dept. of Music,StanfordUniversity.
Smith, L. (1972) "SCORE-A Musician's Approach to
ComputerMusic,"Journal of the Audio Engineering
Society, 20.1: 7-14.
An ExperimentalAid for the
Tanner,P. (1972) "MUSICOMP,
Composition and Production of Music," ERB-869,
Ottawa: N.R.C. Radio and Electrical Engineering
Division.
Tucker, W. H., Bates, R. H. T., Frykberg,S. D., Howrath,
R. J., Kennedy,W. K., Lamb,M. R. and Vaughan,R. G.
(1977) "An InteractiveAid for Musicians,"International
Journalof Man-MachineStudies, 9: 635-651.
Vercoe, B. (1975) "Man-ComputerInteractionin Creative
Applications," Cambridge:unpublished manuscript,
Studio for ExperimentalMusic,M.I.T.
U.S. POSTALSERVICE
STATEMENT OF OWNERSHIP, MANAGEMENT AND CIRCULATION
(Required by 39 U.S.C 3685)
1.TITLE OF PUBLICATION
COMPUTER MUSIC JOURNAL
i
NO
A. PUBLICATION
9
51 6 .
3
2. DATE OF
0
28 Sep 19779
.
F.LI.
four
quarterly
_
4.LOCATION
OFKNOWN
OFFICe
OFPUBLICATION
State
andZIPCode)
(Notprinters)
(Street.
City,
County,
1263 El Camino Real,
Menlo Park,
CA 94025
.
.20.00
$.
(San Mateo County)
ORGeNeRAL
5.LOCATION
OFTHE
BUSINESS
OFTHE
HeAD0UARTeRS
PUBLISHERS
(.%of
OFFICES
printer.;
same
6.
NJAMESAND COMPLETEADDRESSES OF PUBLISHER,EDITOR AND
MAOAGING EDITOR
PUBLISHER (Name and Address)
Willard J. Holden, 1263 El Camino Real,
(N'ame
andAddress,
EDITOR
Curtis
Roads
CA 94025
Menlo Park,
(same)
andAddress)
EDITOR
MANAGING
(NOme
Curtis
Roads (same)
7. OWNER (if owned hy a corporation its name and address must be stated and also immediatel? thereunder the namesand addreses of tock
holders owning or holding i percent or more of total amount of stokh If not owned bN a corporation, the names and addressesof the individual
owners must be given If owned by a partnership or other unincorporated firm. its name and address, as well as that of each individual must
be given.)
NAME
s
People
.
ADDRESS
o.
Lomputer
-I263
Lamino
Ei
Reai,
Tenino
yark,
LAS94U025
KNOWNBONDHOLDERS,MORTGAGEES AND OTHERSECURITY HOLDERSOWNINGOR HOLDING PERCENTOR MOREOF
TOTAL AMOUNTOF BONDS, MORTGAGESOR OTHER SECURITIES(If there are none. so state)
NAME
S
ADDRESS
none
9. FOR COMPLETIONBY NONPROFITORGANIZATIONSAUTHORIZEDTO MAIL AT SPECIAL RATES (Section 132.122, PSM)
The purpos, function, and nonprofit natus of this organization and the exempt status for Federal income tax purpoeis (Check one)
must
of
EDDURING
(Ithanged,
AVE
0TCHAG2
[]A
CHANGEDhDURINt
submit
change
explanation
PRECEDING
MONTHS
12MONTHSwith
PRECEDING
thispublisher
NO.
COPIESOF SINGLEI
tAVERAGE
statement.)
EXTENT AND NATURE OF CIRCULATION
ISSUE DURINGCOPIES
PRECEDING
NEAREST
TO
ISSUE PUBLISHED
EACH ACTUAL
NO.
12 MONTHS
FILING DATE
,2
___
______________
_
10.
4,0'
840,
3,300
B. PAID CIRCULATION
NDOR CANDCOIERUNT
2.
853
625
1,984
1,800
2,837
2,425
RE
MAILSUBSCRIPTOONS
. TOTAL
PADCIRCULATION
(SuM O10oand102)
BYMAIL.
CARRIER
OROTHER
O.FREE
DISTRIBUTION
MEANS
SA
E-. COMPLIMEN._TARY,*
ANDOT4EFREE
neo
.n
n.TOTAL
Da
OI (SumtoC aond
1.
OFCEt
2. CRETI
N
SPOILED
LEFTOVER.
UNACCOUNTED,
USE.
M EI AGENTS
CFRO
3,634
i
2,525
1,051
775
0
155
G. TOTAL
(Sum
n
ofE. FI and2--oouldequalnetpressrunhow n,300
in A)ue
II.I certify that the statementsmadeby met
above arecorrectand complete.
RT
100
797
COPlE
,840
84o
3,300
N
P
12, FOR COMPLETIONBY PUBLISHERSMAILINGAT THE REGULAR RATES (Section l
;.Postal Service Manual)
39 U. S C. 3626 provides in pertinent part: "No person who wouldhive been enttledto mail matter under formersitlon 4359 of this title
%hall
such
matter
under
mall
atthe
rates
this
unless
hefies
provided
with
the
Service
awritten
for
subsectlon
annually
Postal
permlsion
requesR
to mail matter at such rates,"
In accordance with the provisions of this statute I hereby requst prm ilsion to mall the publlcatlon named in Item 1 at the phased posts"
rates prently authorlzed by 39 U. S. C. 3626.
/
M-94?yl V
1sY =Ir
Computer Music Journal, Box E, Menlo Park, CA 94025
Volume 3 Number 4
(McAdamsand Bregman continuedfrom p. 60)
The significanceof the pulsation thresholdmethod for
the measurementof auditorynonlinearity.Ph.D. Diss.,
Erasmus University, Rotterdam, The Netherlands.
34. Warren,R. M., C. J. Obusek,and J. M. Ackroff (1972).
"Auditory induction: Perceptual synthesis of absent
sounds."Science, Vol. 176, pp. 1149-1151.
35. Wessel,David L. (1979). "TimbreSpace as a Musical
Control Structure."ComputerMusic Journal, Vol. 3,
No. 2, pp. 45-52.
MUSIC-11
Sound Synthesis System
Runs Under Unix, RT-11 Or RSX-11M
On Any PDP-11 Or LSI-11 With EIS
We are licensed to distribute
this software system developedby the
M.I.T. ExperimentalMusic Studio
ComjmirMusic
Features:
is
available
in
microform
Please send me additional information.
University Microfilms International
300 North Zeeb Road
Dept. P.R.
Ann Arbor,MI48106
U.S.A.
18 Bedford Row
Dept. P.R.
London, WC1R4EJ
England
Name
Institution
Street
City
State
-Very High ExecutionSpeed
-Powerful And Intuitive OrchestraLanguage
-DynamicMemoryAllocation
-VarietyOf Synthesis Methods,IncludingLinear
Predictionand FastAdditiveSynthesis
For Detailed Information And Rental Rates
Call Or Write
Digital Music
Systems, Inc.
306 Court Street
Brooklyn, New York 11231
(212) 643-1077
Zip
DMX-1000
SIGNAL PROCESSING COMPUTER
For Real-Time All-Digital Sound Synthesis
16-bit minicomputer specially
designed for audio signal
processing
Produces audio output with a
self-contained D/A converter
be controlled by any
.................May
general-purpose computer
Completely programmable
i0,l:::iii
Very
fast
Interfaces available for many
popular computers
For More Information
Call or Write
Inc.
Music
Digital
Systems,
306 Court Street, Brooklyn, New York 11231
(212) 643-1077