Vision Res. Vol. 32, No. 9, pp. 1685-1694, 1992 Printed in Great Britain. All rights reserved 0042-6989/92 $5.00 + 0.00 Copyright 0 1992 Pergamon Press Ltd Disparity Tuning in Mechanisms Stereopsis SCOTT B. STEVENSON,*1 LAWRENCE CHRISTOPHER W. TYLER? K. CORMACK,* CLIFTON of Human M. SCHOR,* Received 30 December 1991; in revised form 19 February 1992 The change in sensitivity across some stimulus dimension which follows adaptation to a particular stimulus can reveal a great deal about the tuning characteristics of underlying sensory/perceptual mechanisms. In this study, a psychophysical adaptation paradigm was employed to characterize the disparity tuning of perceptual mechanisms involved in stereopsis. The stimulus was a dynamic random-dot stereogram (DRDS) portraying a surface which varied in interocular correlation (IOC) and retinal disparity. Adaptation to a fully correlated DRDS surface produced an elevation in IOC threshold over a relatively narrow range of disparities, with maximum effect at the disparity of the adapting stimulus. The width of these disparity tuning functions varied from 5 arc min for adaptation at the horopter to 20 arc min for adaptation at 20 arc min disparity. Frequently, IOC sensitivity was enhanced for disparities on either side of the adapted disparity, suggesting that an opponent center-surround organization operates at an early level of disparity processing. A model of underlying channel structure consistent with these data is presented. Stereopsis Disparity tuning Stereo matching INTRODUCTION A common finding in visual psychophysics is that stimulus dimensions are processed through “channels” or “filters”: perceptual mechanisms whose selectivity on a visual dimension is narrow relative to the overall sensitivity function for that dimension, but whose selectivity is often broad relative to the discrimination function for that dimension. The trichromacy of color vision provides the classic example of channeling in human vision. The photopic spectral sensitivity function reflects a weighted sum of somewhat narrower cone sensitivity functions, or wavelength channels. Comparison of the activity in these cone types in turn produces very precise wavelength discrimination. Evidence for channels has been found for a number of visual dimensions including grating spatial frequency and orientation (Blakemore & Campbell, 1969), lateral motion (Levinson & Sekuler, 1980), motion in depth (Beverley & Regan, 1975) and static retinal disparity (Felton, Richards & Smith, 1972). In a review of such evidence, Regan (1982) offers a list of defining characteristics of a “visual channel” which restricts the term to mechanisms that are tuned for one, and only one, visual *School of Optometry, University of California at Berkeley, Berkeley, CA 94720, U.S.A. tSmith-Kettlewell Eye Research Institute, 2232 Webster Street, San Francisco, CA 94115, U.S.A. $To whom all correspondence should be addressed. Channels dimension. This paper concerns the tuning characteristics of mechanisms sensitive to static retinal disparity, and while we refer to “channels,” it should be clear that we have not determined whether the mechanisms are tuned on other visual dimensions. Early evidence for disparity tuned mechanisms in human vision was reported by Blakemore and Julesz (1971), who found that adaptation to a random-dot stereogram produced shifts in the apparent depth of subsequently viewed stereograms. They argued that these shifts indicated that relatively narrow disparity tuned mechanisms had been adapted. Further psychophysical evidence for disparity tuning was provided by Felton et al. (1972) and by Blakemore and Hague (1972) in experiments that revealed disparity-specific elevation of contrast threshold after adaptation to a sine-wave grating. Thresholds were typically elevated the most at the disparity of adaptation, supporting the theory that the visual system includes a number of disparity-tuned mechanisms with peak sensitivities covering a range of disparities around the horopter. The “many narrow channels” view of disparity processing can be contrasted with Richards (1971) threepool hypothesis. He found, and Jones (1977) confirmed that certain “stereoanomalous” individuals were unable to make accurate disparity discriminations over a relatively broad range of disparities. Consistent with the pattern of deficits found, Richards proposed the existence of three pools of disparity detectors: a fine pool for 1685 1686 SCOTT B. STEVENSON small disparities, and near and far pools for larger crossed and uncrossed disparities, respectively. Recent evidence from single cell recordings in monkey (Poggio, Gonzalez & Krause, 1988) provides some support for both of the above models of disparity channeling. Some cortical neurons (“tuned”) exhibit narrow disparity tuning, with peaks distributed across a range of disparities. Others exhibit broad tuning functions (“Near/Far”), being excited only by either crossed or uncrossed disparity but not both. The difficulty in linking human stereopsis with the single cell results is that these disparity tuning profiles are found for both horizontal and vertical disparity. While sensory and motor fusion operates on both axes of disparity, stereoscopic depth occurs only for horizontal disparity. Therefore, it is difficult to know if all, or only a subset of these cell types might represent the neural basis of stereopsis. Our purpose in these experiments was to examine the disparity tuning of human perceptual mechanisms processing interocular correlation (i.e. “solving the stereo-matching problem”). We used an adaptation paradigm to study the characteristics of visual mechanisms tuned to disparities in the range of 0.5 deg crossed to 0.5 deg uncrossed disparity. In order to target dis- e/ ul. parity processing mechanisms specifically, we have used dynamic random dot stereogram stimuli that varied in interocular correlation (IOC): that is, the proportion of left and right image dots which match each other at the target disparity (Julesz & Tyler, 1976). We began by measuring thresholds for detecting IOC over a range of disparities, and then determined the elevation in these thresholds which occurred after adaptation at a particular disparity. Finally, we generated a multichannel model to simulate a set of underlying mechanisms consistent with our results. GENERAL Stimuli The dynamic random dot patterns were generated by special-purpose electronics that allowed for precise control of disparity and interocular correlation. Figure 1 is a schematic diagram of the stimulus generation system. Pseudo-random binary noise was generated by a 32 bit shift register with XOR feedback from bits 23 and 30. A second, uncorrelated noise source was produced with an XOR combination of bits 25 and 26. (Extensive computer simulation of the shift register system confirmed that these noise streams are uncorrelated.) f- ‘_ Delay circuit METHODS Video __) Hor. Sync -* FIGURE 1. Schematic of display apparatus. Subjects viewed display monitors in a mirror haploscope setup with accommodation and convergence set to a distance of 57.3 cm. A hardware device based on shift registers generated two independent streams of one-bit noise (Dot streams 1 and 2) and an AT computer controlled the routing of the noise sources to the displays (represented by a switch in the diagram). When the same noise source went to both displays, IGC was + 1. When independent noise was sent to each, IGC was 0. When the AT switched rapidly between the states, IGC was at an intermediate level determined by the duty cycle of the mixture. Disparity was controlled by routing one horizontal sync signal (solid lines) through a programmable digital delay device under control of the AT. Subjects made responses by pressing keys on the AT keyboard. Inset: Schematic of left and right views with 11 deg aperture, dark surround, fixation dot and nonius lines. Free fusion of these images yields a percept similar to that experienced in our setup under conditions of 100% correlationPnonius lines superimposed on a random-dot surface in depth. DISPARITY TUNING IN HUMAN Zero interocular correlation was produced by directing one noise source to the left monitor and the other to the right monitor and a positive 1.0 interocular correlation was produced by directing the same noise source to both monitors. Various inte~ediate levels of interocular correlation were produced by switching rapidly between the two sources of noise and sending the result to the left monitor’s image, while always directing the first source to the right monitor.* For example, at the lowest IOC level used (2%) one fiftieth of the dots were forced to be perfectly correlated in the two views (same noise source) while the remainder were randomly correlated or anti-correlated (independent noise sources). Although the percent correlation is small in this example, the actual number of matching dots is quite high since each frame of the display had over 18,000 dots visible to the subject. Disparity was produced and controlled by directing the horizontal sync signal going to the left monitor through a programmable digital delay device. Delay of the sync resulted in a lateral shift of the noise pattern, producing a disparity of the noise with respect to the fixation dot. In our configuration, each unit of delay (nominally 5 nsecs) resulted in a disparity change of about 0.27 arc min. The full 255 steps of delay allowed for a disparity range of about 70 arc min. All stimuli in these experiments were viewed in a mirror haploscope at a distance of 57 cm. Convergence was adjusted for normal viewing at this distance. A chin rest and forehead rest stabilized the subject’s head. The display luminance (measured through the haploscope mirrors) was 40cd/m2 and the contrast (measured on a static pattern) was 80%. A pair of 9 deg circular apertures concealed the edges of the display, producing a dark surround. The display dots were 5 arc min wide by 2 arc min tall. A black circle subtending 9 arc min served as a fixation dot. Vertical nonius lines subtending 3 by 40 arc min provided a reference for maintaining constant convergence. We estimate that convergence accuracy is kept to within 1 arc min by this method.7 The appearance of the display varied with IOC from a flat, opaque plane of dynamic dots at positive 1.OIOC *Creating intermediate correlations with this method is analogous to creating intermediate gray-levels with only black and white pixels by half-toning or dithering. As long as the mixture is beyond the spatial (and/or temporal) resolution of the eye, the viewer cannot discriminate between halftone grays and true grays. In random dot stereograms, individual dot pairs are always either + 1 or - 1 correlated, so intermediate levels of interocular correlation necessarily reflect a mixture of +l and -1 across space and time. Our mixture rate was chosen so as to exceed both the spatial and temporal resolution limits of correlation processing. Further treatment of this topic is given in Cormack et al. (1991b). $Sensitivity to nonius offset was measured by having subjects fixate the random noise with all other fusible contours removed. Dichoptic misalignment thresholds were at or below 30 arc set for our three subjects in agreement with McKee and Levi (1987). Assuming that subjects maintained nonius alignment to within twice the misalignment detection threshold, vergence fixation errors in our experiments never exceeded 1 arc min. In debriefing, subjects reported that the nonius lines always appeared aligned, supporting that assumption. STEREOPSIS 1687 to a dynamic, space-filling swarm of dots at many depths at 0.0 IOC. Intermediate levels appeared as a surface embedded in the swarm, with the apparent density of the surface being monotonically, if not linearly, related to the level of IOC. Except for correlations right at threshold, subjects agreed that when the surface was visible its depth relative to the fixation mark was also evident. Procedure Thresholds for detection of interocular correlation were measured in three subjects using a method of constant stimuli and a temporal two alternative forced choice procedure. Subjects viewed a zero correlation pattern at all times except during the signal presentation. Each trial began with a tone to signal the first interval, a 500 msec pause, a 200 msec signal (or blank), and another 500 msec pause. The second interval followed the same sequence. Subjects then indicated which of the two intervals contained non-zero correlation by pressing a key on the keyboard. The next trial followed immediately after each key press. When fixation wandered or a blank or saccade interrupted a trial, it could be repeated, with a new random interval order, by pressing the space bar. A set of 4--7 IOC levels presented in random order composed a block of trials and 30 blocks composed a single threshold dete~ination. Threshold was defined as the level producing 75% correct performance, as interpolated from a linear regression between stimulus level and Z-transform of proportion correct. Subjects Subjects for these experiments were three of the authors of this paper. All had corrected to optimal visual resolution acuity and excellent stereoacuity for this stimulus, as reported previously (Stevenson, Cormack & Schor, 1989). EXPERIMENT I: BASELINE CORRELATION SENSITIVITY We began our experiments by establishing baseline performance in the correlation detection task, for purposes of comparison to adapted performance. Five threshold measures for each of 17 disparity levels (0, 2.5, 5, 10, 15, 20, 25, 30 and 35 arc min for both crossed and uncrossed directions, and additionally 40 and 45 arc min uncrossed for SBS) composed a complete baseline correlation sensitivity function. Prior to each run, subjects were shown a fully correlated surface at the disparity to be tested in order to minimize uncertainty about where test stimuli would appear. Results The results from the baseline measures of correlation sensitivity are plotted in Fig. 2. Disparity of the stimulus relative to fixation is plotted on the horizontal axis 1688 SCOTT B. STEVENSON 0.0 0 correlation. We measured the UDL for both crossed and uncrossed disparity in all three subjects. Methods -0.5 -1.0 8 A et trl 1*- I -1.d -60 I LKC 8 -60 f -40 -20 Test 0 Disparity r 20 r 40 (arc min) 8 60 1 60 FIGURE 2. Baseline correlation thresholds for three subjects. The log of interocular correlation at threshold is plotted against horizontal disparity of the test surface relative to fixation. Negative values of disparity indicate near (crossed) disparity. A value of 0.0 on the vertical axis represents a correlation of 1.0, the maximum possible. The intersection of each subject’s curve with the line at 0.0 indicates the upper disparity limits for correlation detection in our task. Correlation thresholds were measured for each subject out to +35 arc min disparity, with each point representing the mean of 5 runs. Upper disparity limits were measured in a separate experiment. The general trend can be characterized as being symmetric about 0 disparity, with an exponential relationship between correlation threshold and disparity. Each subject showed idiosyncratic departures from this exponential trend, such as the bumps at 0 disparity for SBS, at + 5 arc min for CMS and at + 10 for LKC. against interocular correlation at threshold (log scale). Correlation detection thresholds vary from about 0.10 at the horopter to 1.O at about 1 deg of crossed or uncrossed disparity, the upper disparity limit (see below). In general, all three subjects show a logarithmic increase in correlation threshold with linear increases in disparity away from zero. However, each subject’s function shows localized irregularities and extensive baseline measurements made in the context of a followup study (Cormack, Stevenson & Schor, l99la; in preparation) reveal that the coarser features are stable over time (e.g. the plateau from + 10 to + 20 arc min the data for LKC). Of course, some of these features are due to normal statistical variation and were not replicated exactly in the follow-up (e.g. the bump at 0 arc min in the data for SBS). Figure 3 shows the baseline data replotted to compare sensitivity for near and far disparities. Data for the three subjects have been displaced vertically for clarity. Error bars represent f 1 SE. Subject CMS shows a small but consistent difference in sensitivity between near and far disparity values, being more sensitive to near (crossed) disparity over the range tested. The other two subjects show no overall difference between near and far sensitivity, although LKC tends to be more sensitive to far for disparities greater than 15 arc min. Upper disparity limits for correlation detection were determined with a variant on the baseline procedure in which the correlation was kept at 100% and the disparity was varied according to the method of constant stimuli. Otherwise, the trial sequence was the same as described above for baseline measures. The resulting psychometric functions were used to determine a threshold value reflecting the largest disparity for which a 1.O correlated surface is discriminable from 0.0 correlation. Results The upper disparity limits are plotted in Fig. 2 as the intersection between the sensitivity curves and the horizontal line at 0.0 on the vertical axis. They are between 45 and 65 arc min for all three subjects and both directions of disparity, and are consistent with the exponential trend in the baseline data for each subject. EXPERIMENT III: The effect of adaptation on correlation sensitivity was determined for all three subjects and compared to the baseline measures in order to reveal the extent to which adaptation at a particular disparity affects sensitivity over the visible disparity range revealed in the data of Fig. 2. Methods Adaptation trials followed the same sequence as baseline trials except that a 1 min adaptation interval preceded each threshold determination and a 500 msec “refresher” adaptation interval preceded each interval of each trial. The adaptation stimulus was a 1.0 correlated surface at some fixed disparity. Three threshold Near/Far Baseline Disparity EXPERIMENT II: UPPER DISPARITY LIMIT The upper disparity limit (UDL) for detecting correlation under our conditions is the disparity at which correlation threshold reaches 100% and beyond which even fully correlated surfaces are indistinguishable from ADAPTATION Comparison (arc min) FIGURE 3. Baseline correlation thresholds replotted for purposes of near/far comparison. Absolute value of the disparity of the test stimulus is plotted horizontally against the log of the correlation at threshold, with data from each subject displaced vertically for clarity (0.5 log units for LKC. 0.8 for CMS). Open symbols plot data for far (uncrossed) disparity, solid symbols for near (crossed) disparity. Error bars indicate i 1 SE. DISPARITY TUNING IN HUMAN 1689 STEREOPSIS SAMPLE PSYCHOMETRIC FUNCTIONS test and adapt both at +20 aranin adaptation, and the horizontal line at 0 on each vertical axis indicates a ratio of one (no effect of adaptation). Values above the line reflect a loss of sensitivity due to adaptation, and values below it reflect an enhanced sensitivity due to adaptation. In the bottom panel, results are shown for two subjects adapting at 20 arc min far and for one subject adapting at 20 arc min near disparity. 0.34---6=~ a . * f ’ In most cases, the tuning functions obtained show the a5 0.4 0.3 0.2 0.1 0.0 greatest threshold elevation at the disparity of adapCorrelation tation. An exception to this general rule occurred for FIGURE 4. Sample psychometric functions showing proportion subject LKC (dashed line): the tuning function for correct responses in a two alternative forced choice paradigm as a adaptation to 10 arc min far disparity showed maximum function of the ~nter~ular correlation of a DRDS surfaced presented at 20 arc min relative to fixation. Open symbols show performance threshold elevation at 20 arc min. In the same plot, without adaptation, solid symbols show performance after adapting subject CMS (dotted line) shows a smaller shift, with to a fully correlated surface at the same (20arcmin) disparity. threshold elevation peaking at 15 arc min. In most cases, Thresholds were determined by fitting the psychometric functions the tuning functions show facilitation for disparities on with a cumulative normal and inter~lating to find the correlation yielding 75% correct performance. either side of the adaptation locus. In some cases, where the data set is sufficiently extensive, there is an indication that performance returns to baseline for more distant measures for disparities at and surrounding the site disparity values. The full width at half height of these disparity tuning of adaptation composed a single adapted sensitivity function. Functions were measured for adaptation at functions was measured directly from these plots and the four values of disparity for each subject (0, + 10, - 10, results are shown in Fig. 7. The horizontal axis plots +20 arc min for SBS and LKC, and 0, + 10, - 10, the absolute value of the disparity at which threshold -20 arc min for CMS). Disparity tuning functions were elevation is maximum (which was not always the locus calculated by taking the log of the ratio of adapted to of adaptation, as noted above). The vertical axis plots baseline threshold for each test disparity. the full width at half height of each tuning function obtained. The best fitting line, its equation and R2 value Results Sample psychometric functions for correlation detection at 20 arc min far are shown in Fig. 4 for one subject before (open symbols) and after (solid symbols) adaptation to a fully correlated surface at the same disparity. The effect of adaptation on correlation sensitivity across the range of disparities used is shown in Fig. 5 for two adaptation conditions for subject SBS. The baseline data (open symbols) are replotted from Fig. 2, the solid symbols show correlation thresholds measured after adaptation to a zero disparity (panel A) and a 10 arc min near (panel B) surface. Error bars represent f 1 SE. In both panels of Fig. 5, threshold is elevated at the disparity of adaptation by about a factor of two, and the range of disparities for which threshold is elevated is relatively narrow. The adaptation curve crosses the baseline curve 5-10arc min on either side of the adaptation locus and is below baseline for relative disparities greater than IO-20 arc min in both directions. The principal difference between the results presented in panels A and B of Fig. 5 is that adaptation at 10 arc min near produces a somewhat broader and asymmetric threshold elevation. Disparity tuning functions were obtained from these data by plotting the log of the ratio of adapted to baseline threshold, and the results are shown in Fig. 6 for all three subjects and all values of adapting disparity. The horizontal axis plots disparity in arc min, the vertical axis plots threshold ratio on a log scale. The arrow on the horizontal axis indicates the disparity of -40 -140 0 20 40 -20 0 20 40 Test Disparity (arc -20 min) FIGURE 5. Effect of adaptation for two adapting disparities for subject SBS. Axes are the same as for Fig. 2, but expanded to show more detail. Open symbols show baseline data replotted from Fig. 2. Solid symbols show correlation sensitivity after adaptation to a 100% correlated surface. Error bars indicate &I SE. Panel A: adaptation at 0 disparity. Panel B: adaptation at -10 arc min. Note that in both panels, thresholds are elevated at the disparity of adaptation, and lowered for most disparities outside a relatively narrow region surrounding it. A plot of the ratio of adapted to baseline threshold yields the disparity tuning functions shown in Fig. 6. I690 SCOTT B. STEVENSON 0.6 1 - SBS LKC as -----..,....”...... -40 -20 Test disparity 0 20 ( arc min 40 ) FIGURE 6. Disparity tuning functions for three subjects after adaptation at (A) 0, (B) -10, (C) + 10, (D) + 20 (two subjects) and -20 (one subject) arc min disparity. The log of threshold ratio (adapt/baseline) is plotted against the disparity of the test surface in each panel. Arrows on disparity axis indicate the disparity of the adapting surface. The solid curve in panel A plots the difference between the two functions shown in Fig. 5(a) for SBS, the solid curve in panel B plots the difference between the two functions shown in Fig. 5(b). In general, the disparity tuning functions show the greatest elevation at the locus of adaptation, with facilitation for more distal disparities. Note, however, the peak at +20 for LKC after adaptation to + 10 arc min (dashed curve in panel C). 6’1(I/ follow an exponential relationship to the magnitude of the pedestal (but see also Badcock & Schor. 1985; Regan & Beverley, 1973). Since the fixation reference was a pair of nonius lines, a disparity value of /ero on the horizontal axis of Fig. 2 indicates the nonius horopter and sensitivity to correlation was greatest at or near this point. One might alternatively measure a correlation horopter, i.e. the locus of disparities for which correlation threshold is at a minimum. For our subjects, this appears to be within 2- 3 arc min of the nonius horopter. The bumps and dips in the baseline data of Fig. 3 produce some local asymmetries between crossed and uncrossed disparities of the same value: for example, a comparison of 20 Near vs 20 Far for subject LKC in Fig. 3 shows about a 0.2 log unit difference in threshold. Subject CMS shows an asymmetry in the opposite direction over most of the disparity range tested, but the magnitude of the difference varies considerably. Considering all three subjects, these data indicate no overall difference in sensitivity between near and far mechanisms, particularly if the correlation horopter is used as the reference. This localized, mild form of “stereoanomaly” is more consistent with a model of many narrow band disparity channels, some of which may be relatively insensitive, than with the Richards (1971) pooling hypothesis. The effects are not robust enough in these subjects to be conclusive. but it suggests that further study may turn up individuals with pronounced deficits in correlation sensitivity at a particular disparity--a kind of correlation scotoma. For each subject, the triangular area in Fig. 2 bounded by the baseline sensitivity curve and the horizontal line at 0.0 on the vertical axis (top of Fig. 2) represents those combinations of disparity and correlation values which are distinguishable from 0.0 correlation. Everything outside that area has the appearance of 0.0 correlation: a dynamic swarm of dots filling a volume in depth. Values inside the triangle produce the percept of a stable surface in depth, floating amidst a less dense swarm. The baseline values of Fig. 2 are specific to the conditions we tested under and should not be generalized beyond them. In particular, correlation threshold at 0 E are shown in the figure. The tuning widths are well fit by a line with intercept at about 5 arc min (indicating the average tuning width at the horopter) and slope of about 0.8. DISCUSSION Disparity range of interocular correlation detection The results of the baseline and upper disparity limit experiments indicate that, in general, correlation sensitivity follows an exponential relationship to disparity. This is reminiscent of the finding by Blakemore (1970) that relative depth judgments on a disparity pedestal .P 20 I"= r 'E 'g G - 10 AZ 5 3 y I = I: 0 4.5584 0 Location + 0.80779X R”2 10 of Tuning = 0.846 20 Peak (arc mln) FIGURE 7. Width of disparity tuning functions plotted against locus of peak elevation. Each point represents the full width at half height of a tuning function, measured directly from the curves plotted in Fig. 6. Tuning width is approximately a linear function of peak disparity. with an intercept of about 5 arc min. DISPARITY TUNING IN HUMAN zero disparity varies with stimulus area, duration (Tyler & Julesz, 1978) and contrast (Cormack, Stevenson & Schor, 199lb). One would also expect the upper disparity limit to change with these variables, so that the figure of 1 deg might be exceeded under different conditions. In general, we chose values that optimize performance in the tasks so most alterations in the configuration of our stimuli would result in lowered correlation sensitivity. Disparity -speciJic adaptation The results of the adaptation experiments indicate that those visual mechanisms which process interocular correlation are sharply tuned for retinal disparity, both at and away from the horopter. While our data set is not sufficiently extensive to make a definitive statement, it appears that a relatively large number of “channels” exist based on the fact that threshold was usually most elevated at the locus of adaptation. Had there been only a few (e.g. 3 or 4) channels mediating correlation detection, adaptation would have produced the same elevation profile for a range of adapted disparities. Our data indicate that there are a large number, if not a continuum of channels, but a precise estimate of how many and where they peak would require an order of magnitude more data than we have presented here. The finding of facilitation for disparity values distant from the locus of adaptation suggests that adaptation weakened an inhibitory mechanism that affects sensitivity under baseline conditions. One explanation, which has been used to explain facilitory effects in spatial frequency channels (DeValois, 1977) is that channels are mutually inhibitory and that adaptation of a channel releases neighboring channels from inhibition. This assumes that surrounding channels which are not excited by test stimuli are significantly inhibiting those which are, thus implying a relatively high, constant level of activity in all channels. However, in the current context, the spurious matches which occur at random at all disparities in a DRDS stimulus might excite surrounding channels. A somewhat different explanation holds that each channel has an opponent, center-surround organization due to its inputs from earlier levels of processing rather than from inhibitory interactions at the same level. In this case, adaptation in the surround region produces enhanced sensitivity of the channel more directly, perhaps by adaptation of subunits in the inhibitory surround. This puts the inhibitory mechanism at an earlier stage in visual processing and requires no spontaneous activity in surround channels, but retains the intuitive notion that facilitation comes from a release of inhibition. Both models yield the same opponent center-surround characteristics and may be equivalent at the computational level despite differences in implementation. *Wiebull fits to our psychometric functions exactly 2.0, so a vector sum is equivalent across channels. show that Beta is almost to probability summation STEREOPSIS 1691 Disparity channels model: assumptions It is important to note that the tuning functions shown in Fig. 6 are not necessarily plots of disparity-tuned channels, but may be the result of adapting many overlapping channels, each with sensitivity to the disparity of the adapting surface. To understand better the implications of our results, we have developed a model of underlying channel structure which produces similar results and have explored some boundary conditions to limit the possibilities. Our search is by no means exhaustive and our model is by no means definitive, but we have been able to rule out some general schemes that have been proposed elsewhere. We constructed the model according to the general form of the results presented above, with a number of tuned mechnisms whose sensitivity decreases with disparity and whose tuning width increases with disparity such that the area under the channel profile is constant for all channels. We chose a difference of Gaussians as the response profile of each mechanism, with an excitatory central region and a broader, inibitory surround that exactly balances the overall function. The reponse of the visual system to a pulse of interocular correlation was modeled as the vector sum of the individual responses.* The effect of adaptation on an individual mechanism was assumed to be proportional to its sensitivity to the adapting stimulus. Figure 8 shows the channel profiles used in the modeling, after several channels have been desensitized by adaptation to a stimulus at f 10 arc min. One channel has been plotted more darkly to illustrate the form of the profile. The stimulus is represented as being at a single disparity value in the modeling. In actuality, the energy of the stimulus is distributed across disparity by some combination of optical, neural and oculomotor factors (Stevenson, Cormack & Schor, 1989; 1991). To simplify the model, we consider the channel profiles to be the response across disparity of each channel to the stimulus, without specifying the stimulus profile exactly. Quantitative agreement between the human data and the modeling results was achieved by adjusting the number of channels and eight constants in relatively simple formulas which determined the location, sensitivity, breadth and inhibitory sidelobe extent of each channel. In the configuration presented here, there are 21 channels, peak sensitivity falls off as the square root of disparity from the horopter, the space constant is inversely proportional to sensitivity, the inhibitory surround has six times the spatial extent of the excitatory center and the peak separation increases exponentially with disparity. The use of formulas to specify these variables served to reduce the number of free parameters considerably, given the large number of channels used. Disparity channels model: results The results of our simulations are shown in Figs 9, 10 and 11. Figure 9 shows simulated pre-adaptation performance for this set of mechanisms (solid curve), with 16Y2 SCOTT B. STEVENSON c/ al. Disparity Channel Profiles (after adaptation to 0.6 -20 0 Disparity 20 40 (arc min) FIGURE 8. Sensitivity profiles for hypothetical disparity tuned correlation detectors in visual system with difference of Gaussians profiles. Sensitivity to correlation is plotted vertically against retinal disparity. Plots reflect channel sensitivity after adaptation to a fully correlated surface at + 10 arc min (indicated by arrow), causing a depression of sensitivity for several channels peaking near that value. The effect of adaptation on each channel is assumed to be proportional to its unadapted sensitivity to the adapting stimulus. Before adaptation, the channel profiles are arranged symmetrically about zero disparity and have sensitivity which is inversely proportional to tuning width. log threshold plotted vertically against disparity. For comparison, the baseline data from two subjects have been replotted from Fig. 2. The model baseline captures the basic form of the human data, absent the idiosyncratic bumps and dips. These variations can be reproduced in the model by adjustment of individual channel sensitivities. Figure 10 shows the same model baseline data (solid curve) along with the postadaptation model baseline (dashed curve), obtained after adaptation to a stimulus at + 10 arc min. Note that the adapted baseline drops below the unadapted baseline for disparities that are distant from the locus of adaptation. Figure 11 shows the difference between these curves (dotted curve in Fig. 1I), plotted in the same way as the tuning functions of Fig. 6, with the log of Comparison of baseline data subiects with multlchannel -40 -20 0 20 from two model the threshold ratio plotted vertically against the disparity of the test stimulus. Also plotted in Fig. 11 are model threshold elevation curves for adaptation at 0 and 20 arc min disparity. While the profiles do not match exactly to those found experimentally, they capture four important characteristics: they peak at the disparity of adaptation, are narrowly tuned, show facilitation for surrounding disparities and the offhoropter functions are slightly asymmetric in form. Note that the asymmetry is not a property of the individual channels, but of the channel distribution overall. Note also that the facilitation observed at large disparities after adaptation to zero disparity (solid curve in Fig. 11) is produced by the sidelobes of channels peaking at large disparities not by the sidelobe of the channel peaking at zero disparity. Thus, the disparity tuning profile produced by adaptation to a particular disparity does not accurately reflect the channel profile of a single mechanism which peaks at that disparity. 40 DISPARITY FIGURE 9. Baseline data for two subjects and model. Model baseline reflects the vector-summed output of the channel array shown in Fig. 8 when presented with correlation stimuli. Log of correlation required to reach a criterion (“threshold”) response is plotted vertically against disparity of the test pulse. The pattern of detector sensitivity and tuning width was adjusted in order to produce results like those exhibited by the human subjects. 8 2 -1.4 -40 -20 0 Disparity 20 40 FIGURE 10. Model baseline replotted along with model correlation thresholds after adaptation to stimulus at + 10 arc min. Note that threshold is elevated most at the disparity of adaptation. returns to baseline on either side and drops below baseline for more distant disparities. Axes are the same as in Fig. 9 DISPARITY TUNING IN NUMAN FIGURE 11. Model tuning functions, obtained by taking the threshold ratio from the model baseline and post adaptation model sensitivity. Log of the threshold ratio is plotted vertically against the disparity of the test p&e. Functions are shown for adaptation at 0 (solid curve), + IO arc min (dotted curve) and +20 arc min (dashed curve). Note the similarity of these functions to the tuning functions plotted in Fig. 6(b) and 6(c). Rather, it is a reflection of the aggregate activity of many channels. This configuration of the model we present was chosen to so as to capture both the baseline and the tuning behavior of the human data and some parameters are more critical than others. For exampie, as few as 10 channels could be used to match the overall baseline and tuning characteristics, but then the baseline data showed considerable bumpiness due to gaps between channels. ftroadening these 10 channels removed the bumps but then made the tuning functions broader than the human data. Using more than 21 channels does not significantly alter the behavior of the model, so our configuration is close to the minimum needed to get the idealized baselines and tuning functions shown. The baseline irregularities and the peak shift from the human data (seen for LKC adapting at 10 arc min) suggest that the mechanisms tapped by our experiments depart from these ideals. These were successfuly simulated by simply adjusting the sensitivity of individual channels to make localized regions of higher or lower sensitivity, resulting in an idiosyncratic set of channels. Thus the model could be easily tuned to capture the in~vidual differences seen in the data. The results of our modeling serve to confirm what was implied by our data concerning the narrow tuning of multiple underlying mechanisms subserving correlation and disparity processing. Wowever, much is also revealed by the models we constructed that didn’t work, Unfortunately, there is always the possibility that a more creative design might have made these alternatives work. Our conclusions from these negative results are therefore tentative. That being said, we conclude that narrowly tuned threshold elevation profiles cannot arise from a scheme based solely on broadly tuned underlying mechanisms such as the Near and Far neural mechanisms of Foggio et aE. (1988) or the Near and Far pools of Richards (1971) The models which produce the kind of functions we obtained require narrowly tuned mechanisms off the horopter, more like the Tuned Near and Tuned Far neural mechanisms of Poggio et al. (1988), the STEREOPSIS 1693 continuum of disparity tuned mechanisms proposed by LeVay and Voigt (1988) or the disparity tuned mechanisms proposed by Lehky and Sejnowski (1990). In particular, one cannot account for the fact that tuning functions return to baseiine on both sides of their peak without invoking the existence of a tuned mechanism, off the horopter, with a relatively narrow profile. We found that models based on channels with broad sensitivity profiles extending out to large disparity values (Near/Far channels) always produced very broad threshold elevation functions. It is still possible that very broadly tuned channels are operative in processing correlation but their contribution is not apparent in our data due to the ove~helming influence of more narrowly tuned channels. The magnitude of side-lobe facilitation in Fig. 11 is somewhat less than in the data. It is likely that the data of Fig. 6 could be more faithfully reproduced by varying the profiles of the individual mechanisms. Additionally~ as discussed above, it is possible that the oppanent center/surround behavior could be captured by using mechanisms without side lobes and then invoking mutual inhibition at a later stage. However it is implemented, we conclude from our simulations that inhibitory interactions between test and surrounding disparities are required to account for the facilitation effects. Finally, while the model we present has accounted for the data by assuming many channels along a line of sight, it is possible that these data resulted fram the aggregate response of many channels distributed across the visual field, with only a small number representing any given visual direction. The data we present show that there are a number of channels available to process correlation over the 9 deg field size we used, but the distribution of these channels within that field is unknown. The luminance spatiaf frequency tuning of these channels is likewise open to speculation, since the stimulus we used was spatially broad band. Thresholds for detecting pulses of positive interocular correlation against a background of zero correlation were found to increase logarithmically with linear increases in both crossed and uncrossed retinal disparity. While signi~~nt irregularities occurred in the functions for our individual subjects, only one subject shawed an overall difference between the two directions of disparity. Adaptation to 100% correlation at several different disparity levels revealed threshold elevation tuning functions that were relatively narrow and usually peaked at the locus of adaptation. More distant disparities showed a facilitation effect after adaptation. Efforts to model these results suggest that they are most likely the result of a large number of disparity channels, with relatively narrow tuning, broad inhibitory sidelobes and peak sensitivities that are distributed about the horopter. 1694 SCOTT B. STEVENSON REFERENCES Badcock, D. R. & Schor, C. M. (1985). Depth increment detection function for individual spatial channels. Journal of The Optical Society of America, A, 2, 121 l-1215. Beverly, K. I. & Regan, D. (1975). The relation between discrimination and sensitivity in the perception of motion in depth. Journal of Physiology, 249, 387-398. Blakemore, C. (1970). The range and scope of binocular depth discrimination in man. Journal of Physiology, 21 I, 599622. Blakemore, C. & Campbell, F. W. (1969). On the existence of neurones in the human visual system selectivity sensitive to the orientation and size of retinal images. Journal of Physiology, 203, 237-260. Blakemore, C. & Hague, B. (1972). Evidence for disparity detecting neurons in the human visual system. Journnl of Physiology, 225, 437455. Blakemore, C. & Julesz, B. (1971). Stereoscopic depth aftereffect produced without monocular cues. Science, 271, 286-288. Cormack, L. K., Stevenson, S. B. & Schor, C. M. (1991a). An investigation of disparity tuned mechanisms using transparent dynamic random element stereograms. Investigative Ophthalmology and Visual Science (Suppl.) 32, 710. Cormack, L. K., Stevenson, S. B. & Schor, C. M. (1991b). Interocular correlation, luminance contrast and cyclopean processing. Vision Research, 31, 2195-2207. De Valois, K. K. (1977). Spatial frequency adaptation can enhance contrast sensitivity. Vision Research, 17, 1057-1065. Felton, T. P., Richards, W. & Smith, R. A. (1972) Disparity processing of spatial frequencies in man. Journal of Physiology, 225, 349-362. Jones, R. (1977). Anomalies of disparity detection in the human visual system. Journal of Physiology, 264, 621-640. Julesz, B. & Tyler, C. (1976). Neurontropy, an entropy-like measure et ~1. of neural correlation, in binocular fusion and rivalr!. B~dqqicd Cybernetics 23, 25 32. Lehky, S. R. & Sejnowski, T. J. (1990). Neural model of stereoacultl and depth interpolation based on a distributed representation of stereo disparity. Journal of Neuroscience, IO, 2281 2299. LeVay, S. & Voigt, T. (1988). Ocular dominance and disparity coding in cat visual cortex. Visual Neuroscience, 1, 395 414. Levinson, E. & Sekuler, R. (1980). A two-dimensional analysis of direction-specific adaptation. Vision Research, 20, 103 107. McKee, S. P. & Levi, D. M. (1987). Dichoptic hyperacuity: The precision of nonius alignment. Journal of rhe Opfical .Socrery of America, A, 4, 1104~1108. Poggio, G. F., Gonzalez, F. & Krause, F. (1988). Stereoscopic mechanisms in monkey visual cortex: Binocular correlation and disparity selectivity. Journal of Neuroscience, 8, 453 I 4550. Regan, D. (1982). Visual information channeling in normal and disordered vision. Psychological Review. 89, 407-444. Regan, D. & Beverly, K. I. (1973) Some dynamic features of depth perception. Vision Research, 13, 2369-2379. Richards, W. (1971). Anomalous stereoscopic depth perception. Journal of The Optical Society qf Americu. hi, 410 414. Stevenson, S. B., Cormack, L. K. & Schor, C. M. (1989). Hyperacuity, superresolution and gap resolution in human stereopsis. Vision Research, 29, 1597- 1605. Stevenson, S. B., Cormack, L. K. & Schor, C. M. (1991). Depth attraction and repulsion in random dot stereograms. Vision Research, 31, 805-813. Tyler, C. W. & Julesz, B. (1978). Binocular cross-correlation in time and space. Vision Research, 18, 101-105. Acknowledgement-Supported EY-08883 from the National by NRSA Eye Institute. grant EY-06045 and
© Copyright 2026 Paperzz