interocular correlation, luminance contrast and cyclopean processing

0042-6989/91$3.00+ 0.00
Copyright 0 1991Pergalnon FVessplc
Vision Res. Vol. 31, No. 12, pp. 2195-2207, 1991
Printed in Great Britain. All rights reserved
INTEROCULAR CORRELATION, LUMINANCE
AND CYCLOPEAN PROCESSING
CONTRAST
LAWRENCE K. CORMACK, Sco3-r B. STEVENSONand CLIFTON M. SCHOR
School of Optometry,
University of California, Berkeley, CA 94720, U.S.A.
(Received 27 August 1990; in revised form 17 January 1991)
Abstract-We
have investigated the nature and viability of interocular correlation as a measure of signal
strength in the cyclopean domain. Thresholds for the detection of interocular correlation in dynamic
random element stereograms were measured as a function of luminance contrast, a more traditional
measure of stimulus strength. At high contrasts, correlation thresholds were independent of contrast. At
low contrasts, correlation thresholds were inversely proportional
to the square of contrast.
Stereothresholds were also measured as a function of both contrast and interocular correlation. At low
contrasts, stereoacuity was inversely proportional to both interocular correlation and the square of
contrast. These results are consistent with an inherently multiplicative mechanism of binocular combination, such as a cross-correlation of the two eye’s inputs.
Interocular correlation
Contrast
Stereopsis
INTRODUCTION
correlation
can be
In general, interocular
thought of as the degree to which the images in
the two eyes match one another. Intuitively,
interocular correlation, if reasonably defined,
should provide a measure of signal strength in
the cyclopean domain. That is, if the two eye’s
views are almost identical, as is the case with
binocular fixation of a flat surface, interocular
correlation is very close to the maximum possible. On the contrary, if the two eye’s views of
the world are predominantly non-overlapping
then nothing can be predicted about the right
eye’s image given only the left eye’s image and
vice versa. In this case there would be zero
interocular correlation and, hence, no cyclopean
information available.
Formally,
interocular
correlation
can be
defined as the cross-correlation
of the image
pair comprising the right and left eye’s views of
the world. For the simple one-dimensional case
the interocular correlation at some disparity d is
given by:
IOC(d) =
f(x)h(x + d) dx
(1)
s
where f(x) and h(x) represent the intensity
profiles (or some derivative of them) along the
horizontal meridian of the right and left eye’s
retinae.
This definition is simple, quantitative, and
works for any given image pair. For “natural”
images, with their generous variety of colors,
luminances, etc. interocular correlation is somewhat difficult to intuit, regardless of the definition employed. It would be unclear, for
example, how to change one member of an image
pair in order to reduce the interocular correlation by some desired amount. In the laboratory however, where one can restrict the visual
environment to one-bit random dot stereograms
of 50% density, the notion of interocular correlation is quite intuitive. Under these conditions,
the interocular correlation for a given disparity
is simply a linear function of the proportion of
dots which match at that disparity, i.e.
IOC(d) = 2P, - 1
(2)
where Pd is the proportion of matching dots at
disparity d.
Examples of different interocular correlations
are shown in Fig. 1. The top stereo pair
illustrates an interocular correlation of + 1 (all
dots match at 0 disparity) and, when fused,
the percept is that of a flat plane. In the middle
and bottom panels, the interocular correlation
has been reduced to +OS and 0 respectively,
accompanied by a degradation of the perceived
quality of the plane. It should be noted that the
phenomenology of these static examples differs
somewhat from that of dynamic displays such as
were employed in our experiments. Specifically,
the dynamic displays of interocular correlations
less than unity give rise to a percept much like
2195
2196
K.
C.
Fig.
Examples of random noise with various amounts
of interocular
correlation.
These examples can
either be free-fused or viewed through a stereoscope.
In (a). the interocular
correlation
is 100% at 0 dn,
parity, and the percept is one of a Aat plane. In (b) the interocular
correlation
has been reduced to SO” ,I
while a flat plane is still perceived, it appears less robust and accompanied
by dots both in front of :III~!
behind the plane of fixation. In (c), the interocular
correlation
is 0, and no percept of a coherent plani.
extant. In dynamic versions of these stimuli, such as were employed in the experiments,
the phenomtr;
ology is somewhat
different.
Lower interocular
correlations
in particular
appear as semi-transparct!t
volumes when dynamic, whereas. when static, they appear as opaque surfaces with chaotic topograph?
dirty window embedded in fog; as correlation
increases, the window grows dirtier and the fop
grows thinner. Absent in the dynamic displays
a
was the appearance
of’s pstchwork
01 cc)-plan;lr
dots imbedded
in a ri~Hlrousllustr_c,ab:~l\
cv-ound, such as is seen
t‘
lnterocular correlation,
luminance contrast and cyclopean processing
Yet Fig. 1 does illustrate a basic point, which
is that as one decreases the interocular correlation of the stimulus, the salience of the flat
plane also decreases. In this sense, interocular
correlation could represent a metric of signal
amplitude in the cyclopean domain analogous
to the manner in which luminance contrast
provides a metric of signal amplitude in the
spatial domain. * The generation of a cyclopean
signal does not occur in parallel with the generation of a spatial (contrast) signal however; the
presence of a contrast signal is a necessary
precursor to the generation of a cyclopean
signal. Given this ordinal relationship, and the
fact that contrast is already known to influence
such hypercyclopean functions as stereoacuity
(Halpern & Blake, 1988; Legge & Gu, 1989;
Heckman & Schor, 1989), it might be possible
to control the amplitude of a cyclopean signal
by manipulating
luminance
contrast.
For
example, if the luminance contrast of a cyclopean stimulus is reduced, it might be possible
to compensate for the resulting decrease in
signal strength by increasing the interocular
correlation, thereby maintaining a constant (e.g.
threshold) level of performance on some task.
Thus, given a threshold level of performance on
said task, a trading relation would be expected
between contrast and interocular correlation.
Moreover, the form of this trading relation
would reflect the manner in which cyclopean
signals are derived from monocular contrast
signals. Accordingly, we measured the effect
of contrast on the detection of interocular
correlation in the first experiment. Based on
the results of this experiment, a model was
developed to generate predictions concerning
hypercyclopean functions such as stereoacuity.
Predictions of this model were then tested
in Experiment 2 by measuring stereoacuity as
a function of both contrast and interocular
correlation.
GENERAL METHODS
The experiments were performed using dynamic random-element stereograms of 50% element density. A diagram of the basic apparatus
is shown in Fig. 2. A random noise signal was
hardware generated via shift registers running at
*In this paper, we will use the term “cyclopean” to refer to
the site and/or processes of binocular combination itself
and the term “hypercyclopean” to refer to processes
occurring after or beyond the cyclopean stage.
2197
Fig. 2. Schematic illustration of the experimental apparatus.
Random bit streams were hardware generated and sent to
a pair of video monitors, which were viewed through a
mirror haploscope. Disparities were created by delaying the
sync to one monitor. Interocular correlation was manipulated by driving the monitors with a single dot generator,
independent dot generators, or a combination thereof (see
text for details). The psychophysics (stimulus presentation,
data aquisition, etc.) were all under computer control.
7 MHz and was displayed on a pair of matched
TSD monitors
(p4 phosphor,
60 Hz noninterlaced) viewed through a mirror haploscope.
The viewing distance was 53.7 cm and the
haploscope mirrors were adjusted for each
subject to the corresponding
convergence
angle, thus obviating any mismatch between
convergence and accommodation
or “higher
level” distance cues. Mean luminance was
80cd/m2. The displays were viewed through
7 deg circular apertures in an otherwise black
surround.
Horizontal disparities were produced by delaying the horizontal video sync to one monitor.
This was accomplished via a programmable
delay chip (Digital Delay Devices model PDU13256-0.5), which allowed us to delay the noise
stimulus to one eye in 0.5 nsec (corresponding to
2 arc set) increments.
Interocular correlation was simply the proportion of the dots which were “forced” to
match in the two images; the remainder of the
dots then had a 50% chance of matching. Thus,
in a display which had an interocular correlation of 0, half of the dots in the right image
were matched by dots in the left image. In a
display which had an interocular correlation of
- 1 (“anticorrelation”),
the right image was
simply the opposite contrast version of the left
image and no matches existed. For an interocular correlation of + 1, of course, the two
images were identical.
2198
K.
Thus, rearranging equation (2), the proportion of matching dots is, on average, simply
(IOC + 1)/2; the two are linearly related, as is
somewhat demanded by intuition.
Interocular correlation was varied through
the use of two independent noise generators. To
produce an interocular correlation of + 1, the
output of a single noise generator was sent
to both monitors. To produce an interocular
correlation of 0, each monitor was driven by a
separate noise generator. Intermediate interocular correlations were produced by switching
between the above two conditions at a sufficiently high rate (1 kHz) to allow the spatiotemporal integration of the visual system to
render the stimulus identical with one in which
interocular
correlation
is statistically determined on a dot-by-dot basis. It should be noted
that this switching pulse was not synched to the
video signal, nor was the switching rate sufficiently close to an even multiple of the frame
rate. lest stationary or drifting bands of correlation be visible. Using this method. the interocular correlation is simply the duty cycle of the
(rectangular wave) switching pulse. and could
*Two control experiments were done to Insure that this
method was adequate. First, we had subjects attempt to
distinguish between an intermediate correlation stimulus
produced by a 1 kHz switching pulse (as used in our
experiments) and a 100 kHz switching pulse. All subjects
failed to make the distinction. On the assumption that
100 kHz (100 switching cycles every msec) is fast enough.
then this experiment shows that
kHz is also fast
enough.
Second, we had subjects attempt to dtscriminate between “true” zero correlation (right and left eyes’ images
produced by independent random bit streams) and an
intermediate correlation as produced by a 1 kHz mixture
of
and - 1 interocular correlation. The independent
variable was the duty cycle of the switching pulse. As the
duty cycle of the switching pulse went to SO%, the ability
of the subjects to make the discrimination fell to chance
levels.
tlndividual monocular contrast detection thresholds were
measured for our dynamic random element stimuli on
the same apparatus using the same 2 aft/constant stimuli
paradigm. Contrast detection thresholds were between 2
and 4% (Michelson contrast for a single stimulus frame).
These relatively high thresholds are to be expected
because of the effective contrast reduction which the
temporal integration of the visual system imparts on
dynamic stimuli. The lowest contrast at which a particular subject could perform either the correlation detection
task (Experiment 1) or the stereoacuity task (Experiment
2) corresponded to the calculated contrast at which both
eyes were reliably detecting the stimulus [i.e. P(RE
detection AND LE detection) = 0.751, as obtained by
multiplying the psychometric functions for the right and
1ett eve&,
be placed under software (Experiment 1) or
hardware (Experiment 2) control.*
Thus, to create an interocular correlation of
0.75, the duty cycle of the rectangular wave
switching pulse was such that the subject was
viewing a fully correlated display 75% of the
space/time, and viewing an uncorrelated display
for the remainder. Given the 1 kHz switching
pulse frequency and the 60 Hz non-interlaced
frame rate, however, the perception was one of
a continuously present intermediate correlation;
no perception of the correlation switching was
present during the experiments.
Luminance contrast was controlled by adjusting the peak-to-trough value of the video signals
using customized
hardware
and software.
The response functions of the two monitors
were matched and calibrated prior to both
experiments using a Photo Research Spectra
Spotmeter photometer.
Experiment 1: The Detection of Interocular Correlation as a Function of Luminance Contrast
Method
The 3 authors served as subjects. All had
normal or corrected to normal acuity, normal
contrast sensitivity and good stereopsis.
A temporal 2AFC-method of constant stimuli
paradigm was employed. The subjects fixated a
12 arc min wide “ +” which, to insure accurate
convergence, was flanked above and below by
48 x 5 arc min nonius lines. A block of trials
consisted of 1 trial at each of 4-6 correlation
levels, chosen to bracket the expected threshold
based on pilot data. The order of trials was
randomized within blocks. A run was composed
of 30 blocks of trials at a single stimulus contrast. A threshold correlation for a particular
contrast level was defined as 75% correct on a
psychometric function fit to the data from 3
(subject CMS) or 5 (subjects SBS and LKC)
such runs. Each subject ran at between 9 and
11 contrast levels, which were even multiples
of that subject’s contrast threshold for our
dynamic random-element stimuli.7
A trial was composed of two stimulus intervals, 1.2 set in duration, which were delimited
by audible tones. The dynamic noise was continuously present but, during one of the two
intervals, switched from 0 interocular correlation to some positive interocular correlation
for 200msec. The plane of positive correlation
was always presented at 0 disparity. i.e. the
plane of stimulus presentation was coincident
Interocular correlation,
luminance contrast and cyclopean processing
with the plane of fixation as defined by the
nonius horopter. The subject’s task was to
signal, by means of a key press, which interval
contained some non-zero interocular
correlation. The subject’s response was followed by
auditory feedback and initiated the next trial.
Results
The results for the 3 subjects are shown
superimposed in Fig. 3. The axis of ordinates
represents correlation threshold while the axis
of abscissas represents contrast expressed in
threshold multiples. Both axes are logarithmic.
As can be seen from the figure, correlation
threshold is independent of contrast at relatively
high contrasts. At relatively low contrasts, however, correlation
thresholds decrease rapidly
as stimulus contrast increases. Thus, in the
low contrast region, a decrease in stimulus
effectiveness due to a contrast reduction can be
compensated for by an increase in interocular
correlation in order to restore a criterion (e.g.
threshold) level of performance.
Generally, the data can be described as consisting of two regions, a high contrast region in
which the data asymptote to a line of slope 0,
and a low contrast region in which they asymptote to a line of slope - 2. The solid line in Fig. 3
is of slope -2, plotted for reference. The
short vertical lines at the top of the figure
show representative error bars for the low (lefthand line) and high (right-hand line) contrast
regions.
e
+
u
z!
sbs
ems
10
Fig. 3. Threshold for the detection of interocular correlation
as a function of luminance contrast, expressed in threshold
multiples, for the 3 subjects. Both axes are logarithmic. The
data asymptote to a log-log slope of 0 at high contrasts and
- 2 at low contrasts. A line of slope - 2, indicating a trading
relation between interocular correlation and the square of
contrast, is plotted for reference. The left- and right-hand
vertical lines at the top of the figure represent typical SD for
low and high contrast judgments respectively.
2199
Discussion
can be reasonably assumed that the detection of a stimulus in a psychophysical task
occurs when a signal-to-noise ratio reaches
some critical value at the relevant site (or in
the relevant functional unit) of the visual system
(cf. Green SC Swets, 1966). The data from
this experiment indicate that, at higher contrasts, the relevant signal-to-noise ratio remains
constant as contrast changes; this is reflected
by the fact that correlation thresholds are independent of contrast at high stimulus contrasts.
At lower contrasts, however, this is not the
case. As stimulus contrast is reduced, eventually
contrasts are reached which render a previously
threshold level correlation
indistinguishable
from uncorrelated noise. Threshold level performance can be restored, however, by increasing the interocular correlation of the stimulus
by some amount. This indicates that, at low
contrasts,
the relevant signal-to-noise
level
is changing with stimulus contrast and that
this change can be compensated
for by
appropriate adjustment of the interocular correlation. Further, the relative effectiveness of
contrast and interocular correlation on the
signal-to-noise ratio (i.e. the trading relation
between the two variables) is given by the
log-log slope to which the data asymptote at
low contrast levels. From Fig. 3, it can be seen
that this slope is roughly -2, which indicates
that the relevant signal-to-noise ratio is proportional to the square of stimulus contrast. A
possible means of binocular combination by
which this behavior could be realized is a simple
cross-correlation
of edge information in the
stimulus.
A cross-correlation
of two signals produces
a cross-correlation
function, the height of
which at some relative displacement reflects the
degree to which the two input signals match at
that displacement. When a horizontal crosscorrelation
is done on two retinal images
(assuming correct vertical alignment of the
images), the resulting function can be thought of
as representing the number of matching stimulus elements as a function of retinal disparity. In
order to determine the behavior of such crosscorrelation functions in response to various
stimulus parameters,
particularly
luminance
contrast and interocular correlation, a model of
binocular combination was developed which
incorporated the operation of cross-correlation
as the “engine” of the model.
2200
A truly global cross-correlation
would be
undesirable because such a model would have
difficulty distinguishing between transparent
stimuli and stimuli with local depth variations.
For example, a transparent stimulus consisting
of one surface in front of a second surface and
a “checkerboard”
stimulus in which alternate
squares lie at different depths would both lead
to a double peaked cross-correlation function.
This problem can be avoided by doing “local”
cross-correlations
on smaller patches of the
visual scene. The result is a set of crosscorrelations functions whose members correspond to different visual directions. Thus,
transparent stimuli would give rise to double
peaked cross-correlation
functions at each
visual direction
along which transparency
exists, whereas depth variations across a visual
scene would give rise to single peaked crosscorrelation functions, with the location of the
peak depending on the visual direction to which
the function corresponds.
A cartoon representation
of the model is
shown in Fig. 4, which illustrates the various
stages of the model along with the physiological
properties which they were intended to mimic.
The input to the model was a series of image
arrays, representing video frames of the stimulus, the contrast and interocular correlation of
which could be varied. Other image parameters,
such as the size and density of the elements
:omprising the stereograms, were set to match
the conditions of our experiments. The input
image arrays were “blurred” by the optics of the
:ye via convolution with a gaussian representation of the point spread function of the
:mmetropic eye (0.84 arc min space constant).
The blurred image frames were then sampled by
“retinae” with a 30 arc set inter-receptor distance, and averaged over space-time to simulate
reasonable values for spatial and temporal integration of the visual system. The amount of
integration was varied within sensible limits to
insure that the general conclusions from the
modeling did not depend on the particular
values chosen; the primary effect of increased
spatio-temporal integration is simply to reduce
the relative amplitude of noise from spurious
matches [i.e. the “ghost” matches of Cogan
1978) or the “phantom” matches of Julesz
(1971)] in the cross-correlation
function. The
images were then differentiated with respect to
space (edge extraction) to yield an image representation assumed to exist at an immediately
pre-binocular stage in the visual system.
These filtered left and right eye image pairs
were then cross-correlated to produce a crosscorrelation function such as is illustrated in
Image
Fig. 4. An illustration representing the flow of processing in the model. The input to the model is a series
of one bit arrays, analogous to our stimuli. The arrays are then blurred by an amount typical of an
emmetropic eye. Spatio-temporal averaging is then done to simulate the spatial and temporal integration
of the visual system. Edge information is extracted by means of differentiation. Finally, the left and right
eye “images” are cross-correlated to yield a function on which the binocular operations of correlation
detection and localization can be performed. Not illustrated is the addition of noise, assumed to be present
and intrinsic to the visual system. to the cross-correlation function
Interocular correlation,
luminance contrast and cyclopean processing
2203
i
h
f
Fig.
the interocular correlation of an image pair with
some fixed contrast is reduced, the signal amplitude is
decreased proportionally while the noise level remains constant. Shown in this figure is a family of cross-correlation
functions, the members of which differ in the interocular
correlation of the input image pair. The particular interocular correlation of each input image pair is displayed to
the right of the corresponding function. The contrast of the
input image pair was 20% in all cases, and the functions are
displaced vertically for clarity.
If one reduces contrast, however, one eventually reaches a point where the intrinsic noise
(which is assumed not to vary with contrast) is
of much greater amplitude than the noise produced by spurious matches. At this point, overall noise level is effectively constant since its
amplitude is determined almost exclusively by
the amplitude of the intrinsic noise. Signal-tonoise ratio, then, will be determined solely
by the peak height, which varies linearly with
correlation but as the square of contrast. It
follows that the interocular correlation required
to produce a given signal-to-noise ratio (e.g.
that corresponding
to threshold) would be
proportional to the square of the stimulus contrast. For example, if one starts with a crosscorrelation function with a signal-to-noise ratio
corresponding to threshold, and then reduces
the contrast by a factor of 2, the signal-to-noise
ratio will decrease by a factor of 4. To restore
the threshold level signal-to-noise ratio, the
interocular correlation would then have to be
increased by a factor of 4. Thus, in the low
contrast region, we would expect a log-log
plot of correlation threshold as a function of
contrast to show a slope of -2.
Combining the above reasoning from the low
and high contrast regions, we expect the data to
take the form shown in Fig. 8. At high contrasts,
Fig. 8. Schematic diagram showing the effect which intrinsic
noise of constant amplitude would have on the detection of
interocular correlation as a function of stimulus contrast.
When contrast is sufftciently high, the amplitude of the false
target noise is much greater than that of the internal noise.
Thus the effective noise amplitude is simply the amplitude
of the false target noise. Since the signal height and the false
target noise vary conjointly as contrast is changed, the
signal-to-noise ratio of the cross-correlation function remains constant and no change in correlation detection
threshold is anticipated. As contrast is reduced, however,
eventually a point is reached where the amplitude of the
intrinsic noise is much greater than that of the false target
noise. Thus the effective noise amplitude is simply the
amplitude of the intrinsic noise, which is constant. Under
these conditions, signal-to-noise ratio will vary directly with
signal amplitude which, in turn, varies with the square of
contrast (Fig. 7). But since signal-to-noise ratio is linearly
related to interocular correlation (Fig. 8), one anticipates a
square relation to determine the correlation/contrast combinations which will yield a given (e.g. threshold level) signalto-noise ratio. This is reflected by the log-log slope of -2
in the low contrast region of the figure.
where the signal-to-noise ratio is independent of
contrast, we expect the data to asymptote to a
log-log slope of 0. At lower contrasts, where the
signal-to-noise ratio varies as the square of
contrast, we expect the data to asymptote to a
log-log slope of -2.
Since the data (Fig. 3) conform to this expectation, it would seem that for dynamic random
element stereograms, a cross-correlation of the
two eye’s inputs after accounting for such factors as optical low pass filtering (“blurring”)
and spatial and temporal summation, provides
an adequate description of the stimulus at
cyclopean levels of the visual system, i.e. the
earliest level of the visual system where information from the two eyes is integrated. But how
far past the stage of binocular combination
can this analysis be continued? That is, can
downstream binocular processes be treated as
an analysis of, or operations performed on, a
cross-correlation
function? With this question
in mind, we attempted to extend our experimentation into the hypercyclopean domain.
2204
In Experiment
2, we measured
stereothresholds (stereoacuity) as a function of both
luminance contrast and interocular correlation.
The predictions of the cross-correlation model
are as follows.
Consider a cross-correlation function such as
illustrated in Fig. 5. The certainty with which
the “true” position of the function can be
determined depends on the signal-to-noise ratio
of the function. For a given signal-to-noise ratio
however, a sharp peak can, in principle, be more
precisely localized than a broad peak, so it is
important to know how the second derivative of
the cross-correlation function behaves. Further,
the zero-crossings or loci of steepest slope
could be used, along with assumptions of symmetry, to determine the position of the function (cf. Legge & Gu, 1989) so it is also of
import to determine the behavior of the first
derivative.
As it turns out, the height of both the first and
second derivatives of the cross-correlation function is simply a linear function of the height of
the cross-correlation
function (as is the case
with sinusoids). This is convenient, because it
means that in order to express the precision with
which the position of the cross-correlation function can be localized (without invoking disparity
domain interactions), it is necessary only to
specify the signal-to-noise ratio of the crosscorrelation function.
As we already know, both the height of the
cross-correlation
function and the signal-tonoise ratio grow linearly with increasing interocular correlation (Fig. 7). Thus, we predict that
stereoacuity should simply be a linear function
of interocular correlation.
The predicted slope for stereoacuity as a
function of contrast however, depends on the
absolute contrast level. At higher contrasts,
where false target noise is the dominant source
of noise, both the noise amplitude and the peak
signal height are proportional
to the square
of contrast such that the signal-to-noise ratio
remains constant (Fig. 6). At lower contrasts,
where intrinsic noise is the dominant source
of noise, both the peak signal height and
the signal-to-noise ratio vary as the square of
contrast, since noise amplitude is effectively
constant. Thus, at low contrasts, the crosscorrelation
model predicts the stereoacuity
will vary in proportion
to the square of con-
trast, whereas at higher contrasts the model
predicts that stereoacuity will be independent of
contrast.
The methods of this experiment are essentially the same as those of the first experiment.
The 3 authors served as subjects and a temporal
ZAFC-method of constant stimuli paradigm
was employed. The observers task, however,
was to indicate which of the two temporal
intervals contained a horizontal step change in
disparity approximately
halfway down the
stimulus field. For determining stereoacuity as a
function of contrast, interocular correlation
set at 80%, and contrast was varied over a
broad range. For determining stereoacuity as a
function of interocular correlation, contrast was
set at 12% (Michelson definition, as measured
on a static pattern) and correlation
was
varied from either 30% (LKC and SBS) or 60%
(CMS) up to 90%. Thresholds for each run
(where a “run” is the same as defined in Experiment 1) were defined as 75% correct on the
psychometric function, and the thresholds from
5 runs per subject at each contrast/correlation
combination were obtained.
Stereothresholds as a function of contrast are
shown for all 3 subjects in Fig. 9. The high
contrast portion of the data (above 5 contrast
threshold multiples, say) is essentially a replication of both Halpern and Blake (1988) and
contrast
(threshold
Fig. 9. Stereoacuity as a function of contrast for the 3
subjects. Both axes are logarithmic and contrast is expressed
in threshold multiples. Over almost a full log unit of
contrast, the data can be described by a cube root law
contrast dependence, roughly in accordance with the data of
previous investigators. At lower contrasts however, the data
more closely follow a square law dependence
channels centered at higher spatial frequencies,
stands only to increase stereoacuity via probability summation. This possibility need not be
seriously entertained, however, given the second
line of evidence against a significant contribution of high spatial frequency information.
This is simply that both Halpern and Blake
(1988) and Legge and Gu (1989) found a square
root dependence of stereoacuity on contrast
using narrow band targets (the specific targets
employed were tenth derivatives of gaussians,
mercifully known as DlOs, and truncated sinusoids, respectively).
A second mechanism which could subserve
the improvement of stereoacuity with increasing
contrast is a compressive contrast response
function inherent in the stereopsis “mechanism”. In its current form, our model predicts the
same shape for the data in both the correlation
detection and stereoacuity tasks, because performance on both tasks is assumed to reflect
the signal-to-noise ratio of the same crosscorrelation function. If, however, an appropriate compressive contrast response function was
inserted at some cyclopean stage prior to stereo
localization, the high contrast branch of the
model would asymptote more gradually, yielding a curve resembling the data of Fig. 9. This
would indicate that either the mechanism of
correlation detection and stereoacuity each have
a different contrast response and operate in
parallel or that the cross-correlation
function
is passed through a compressive transducer
function after the stage at which correlation
detection occurs but before the stage at which
stereo localization occurs.
The later alternative is perhaps the more
plausible one on the grounds that stereolocalization is probably the primary reason that a
cross-correlation
operation
would be performed. A second cross-correlation
operation
occurring in parallel would, therefore, be superfluous in all but artificial tasks of correlation
detection such as in the present experiments
(it is conceivable
that a parallel crosscorrelation operation could be occurring in
the eye movement pathway, but it is unlikely
that this would be a factor in a psychophysical
task).
A third mechanism which could bring about
an improvement of stereoacuity with increasing
contrast is inhibition across disparity tuned
“units”, be they the individual cells of Poggio
and Poggio (1984), the channels of Stevenson
i’t nl. (1990).
the detectors of Tyler (1983).
Consider the possibility that cross-correlation
functions are encoded as relative activity in an
array of such units located along the disparity
axis. Without disparity domain inhibition, a
graph of unit activity vs peak disparity tuning of
the unit for various contrasts would simply look
like Fig. 6. With the addition of inhibition
between the units, however, the signal-to-noise
ratio would continue to improve as contrast
increased, rather than remaining constant. The
nature of this inhibition could be “tweeked” in
one’s model to yield whatever contrast dependence was desired. The argument at low stimulus contrasts, however, need not be altered by
the addition of inhibition. At low stimulus
contrasts, where intrinsic noise seems to effectively determine the overall noise amplitude, the
activity of the disparity-tuned units outside of
the stimulus plane could be so low that their
influence through inhibitory pathways would be
negligible.
The presence of some sort of disparity domain inhibition is also supported by the phenomenology of random element stereograms.
As illustrated in Fig. 7, noise in the crosscorrelation function due to false dot matches is
unaffected by stimulus correlation. Thus, when
disparity domain inhibition is not considered,
the activity of a unit tuned to a disparity other
than that of, or immediately adjacent to, the
stimulus plane, would be unaffected by changmg the interocular correlation. Perceptually.
however, this is not the case. When viewing
;+ random element stimulus of 0 interocular
correlation, a voluminous “swarm” of dots is
perceived. But when interocular correlation is
raised to 100% at some disparity, the only thing
that is seen is a flat plane at that disparity,
no false matches are seen at all. Therefore, it’
it is assumed that some process like a crosscorrelation occurs at the site of binocular combination, then there must also be some process
to “clean up” the spurious disparities before a
site is reached where the neural activity IS
reflected perceptually.
Since cross-correlation is an inherently multiplicative mathematical operation, our model
is somewhat at odds with additive models of
binocular contrast combination (e.g. Leg@ &
1989). Dynamic random element stimuli,
however, by their very nature isolate a highly
specialized subset of the visual system. It would
be premature, therefore, to place too much
weight on apparent contradictions with results
&tained with other sorts of stimuli
Interocular correlation,
luminance contrast and cyclopean processing
In conclusion, our psychophysical results indicate that interocular correlation is an important controlling variable at both the cyclopean
and hypercyclopean stages of visual processing.
Further, a model which utilizes the operation of
cross-correlation
as the means of binocular
combination provides a good description of the
stimulus in both the cyclopean and hypercyclopean domains insofar as it accounts for
and/or predicts psychophysical data from both
of these domains.
Acknowledgements-This
work was supported in part by
grant # EYO 06045 to SBS and CMS and grant # ROlEY03532 to CMS. The authors with to thank Gordon Legge
for his invaluable comments and J. Malik and D. Jones for
helpful conversation.
Barlow, H. B. (1964) The physical limits of visual discrimination. In Giese, A. C. (Ed.), Photophysiology (Vol. 2,
pp. 163-201). New York: Academic Press.
Cogan, A. I. (1978) Fusion at the site of the “ghosts”. Vision
Research, 18, 657-664.
DeValois, R. L. & DeValois, K. K. (1988) Spatial vision.
New York: Oxford University Press.
Green D. M. & Swets, J. A. (1966). Signal detection theory
and psychophysics. New York: Wiley.
2201
Halpem, D. L. & Blake, R. (1988) How contrast affects
stereoacuity. Perception, 17, 483-495.
Heckman, T. & Schor, C. M. (1989) Is edge information for
stereoacuity spatially channeled? Vision Research, 29,
593-607.
Julesz, B. (1971) Foundations of cyclopean perception.
Chicago: University of Chicago Press.
Legge, G. E. & Gu, Y. (1989). Stereopsis and contrast.
Vision Research, 29, 989-1004.
Nishihara, H. K. (1987). Hidden information in transparent
stereograms. Proceedings of the Twenty-first Asilomar
Conference on Signals, Systems & Computers, 21,695-700.
Poggio, G. F. & Poggio, T. (1984). The analysis of stereopsis. Annual Review of Neuroscience, 7, 379-412.
Rogers, B. J. & Anstis, S. M. (1975). Reversed depth from
positive and negative stereograms. Perception, 4, 193-201.
Schor, C. M., Wood, I. C. & Ogawa, J. (1984). Spatial
tuning of static and dynamic local stereopsis. Vision
Research, 24, 573-578.
Stevenson, S. B., Cormack, L. K. & Schor, C. M. (1991).
Depth
attraction
and repulsion in random
dot
stereograms. Vision Research, 31, 805-813.
Stevenson, S. B., Cormack, L. K., Schor, C. M. & Tyler, C.
W. (1990). Disparity tuned channels in human stereopsis.
Investigative Ophthalmology and Visual Science (Suppl.),
31, 95.
Tyler, C. W. (1983) Sensory processing of binocular disparity. In Schor, C. M. & Ciuffreda, K. J. (Eds), Vergence
eye movements: Basic and clinical aspects (pp. 199-295).
London: Butterworth.
Westheimer, G. & Levi, D. M. (1987). Depth attraction and
repulsion of disparate fovea1 stimuli. Vision Research, 27,
1361-1368.