Professional Documents
Culture Documents
Abstract
A contrast-modulated (CM) pattern is formed when a modulating or envelope function imposes local contrast variations on a
higher-frequency carrier. Motion may be seen when the envelope drifts across a stationary carrier and this has been attributed to
a second-order pathway for motion. However, an early compressive response to luminance (e.g. in the photoreceptors) would
introduce a distortion product at the modulating frequency. We used a nulling method to measure the distortion product, and
then asked whether this early distortion could account for perception of second-order motion. The first stimulus sequence
consisted of alternate frames of CM (100% modulation) and luminance-modulated (LM) patterns. Carriers were either 2-D binary
noise (4 ×4 min arc dots) or a 4 c/deg grating, both modulated at 0.6 c/deg. The carrier was stationary while the phase of the
modulating signal (LM alternating with CM) stepped successively through 90° to the left or right. Motion was seen in a direction
opposite to the phase stepping, consistent with early compressive distortion that induces an out-of-phase LM component into the
CM stimulus. We measured distortion amplitude by adding LM to the CM frames to null the perceived motion. Distortion
increased as the square of carrier contrast, as predicted by the compressive transducer. It also increased with modulation drift rate,
implying that the transducer is time-dependent, not static. Thus early compressive non-linearity does induce first-order artefacts
into second-order stimuli. Nevertheless this does not account for second-order motion, since perceived motion of second-order
sequences (CM in every frame) could in general not be nulled by adding LM components. We conclude that two pathways for
motion do exist. © 1999 Elsevier Science Ltd. All rights reserved.
Keywords: Motion; Motion energy; Motion detection; Human vision; Gratings; Contrast; Direction discrimination; Contrast modulation;
Distortion; Second-order motion
1. Introduction the way that some other local property of the image—
such as local contrast or local element size—varies
Recent theoretical and experimental analyses have across space, or space-time (Chubb & Sperling, 1988).
led to two important ideas about spatial vision and This paper is concerned with the perception of second-
motion perception: that images can contain two differ- order motion, and specifically asks whether responses
ent kinds of spatial or spatio-temporal structure —first- to second-order motion arise from signal distortion
order and second-order — and that the visual system early in the visual system—in which case later process-
may contain separate, perhaps parallel, mechanisms for ing of first- and second-order motion information
analysing the two kinds of information (Chubb & would share a common mechanism—or whether in-
Sperling, 1988; Cavanagh & Mather, 1989; Lu & Sper- stead a distinct mechanism exists for the analysis of
second-order motion.
ling, 1995). First-order structure is captured by a de-
We investigated second-order structure defined by
scription of the way that image intensity varies across
contrast modulation of a carrier pattern. The contrast
space, or space-time. Second-order structure describes
of the carrier is modulated (multiplied) by a relatively
low spatial frequency component (the envelope). In the
* Corresponding author. Present address: Department of Psychol- corresponding first-order or luminance-modulated im-
ogy, Royal Holloway, University of London, Egham TW20 0EX,
UK.
age, the low frequency sinusoid is added to the carrier,
E-mail addresses: n.scott-samuel@rhbnc.ac.uk (N.E. Scott-Samuel), not multiplied with it. To create moving images the
m.a.georgeson@bham.ac.uk (M.A. Georgeson) sinusoid changed position with time, but the carrier was
0042-6989/99/$ - see front matter © 1999 Elsevier Science Ltd. All rights reserved.
PII: S 0 0 4 2 - 6 9 8 9 ( 9 8 ) 0 0 3 1 6 - 2
2854 N.E. Scott-Samuel, M.A. Georgeson / Vision Research 39 (1999) 2853–2865
stationary. Luminance modulation (LM) is shown in quency cannot detect the CM structure. This is con-
Fig. 1(a), contrast modulation (CM) in Fig. 1(b). In firmed by low-pass filtering the CM waveform (Fig. 1c)
these figures both the spatial image and its luminance to reveal no output at low frequencies.
profile are stationary snapshots of the drifting stimuli. Chubb and Sperling (1988) showed that there exists a
The space-time Fourier spectrum is that of the moving whole class of second-order stimuli for which no net
stimulus. directional Fourier energy signal exists, and which are
In terms of its Fourier components, the contrast- therefore invisible to models of motion detection based
modulated grating consists of three sinusoidal compo- on Reichardt-type detectors (e.g. Adelson & Bergen,
nents, one at the carrier spatial frequency, flanked by 1985; van Santen & Sperling, 1985; Watson & Ahu-
two sideband components (see Appendix A for deriva- mada, 1985). It has been argued (Cavanagh & Mather,
tion). The difference in spatial frequency between the 1989) that this distinction between first- and second-or-
carrier and either of the sideband spatial frequencies is der stimuli is a useful means of categorising input to the
equal to the modulating spatial frequency. visual system, and should supersede the long-range/
It is clear that in the spatial domain both the LM and short-range dichotomy introduced by Braddick (1974).
CM image contain visible structure at coarse and fine It has also been suggested that visual detection of
scales corresponding to the modulating and carrier second-order motion is handled by distinct second-or-
frequencies, respectively. However, in the Fourier do- der pathways in the brain (e.g. Wilson, Ferrera & Yo,
main, it is equally obvious that while the luminance- 1992; Ledgeway & Smith, 1994; Solomon & Sperling,
modulated carrier (Fig. 1a) contains energy at the 1994; Wilson, 1994a,b; Lu & Sperling, 1995). Some
modulating spatial frequency (filled circles), the con- neurons in cat visual vortex are certainly responsive to
trast-modulated carrier (Fig. 1b) has no such energy CM gratings (Zhou & Baker, 1993, 1994), but these
peaks. Thus linear filters tuned to the modulating fre- neurons also respond to luminance gratings. The idea
of a separate second-order motion system has generated
a large amount of empirical data dealing with differ-
ences in the visual processing of first- and second-order
stimuli, but it has proved difficult to exclude the possi-
bility that early, non-specific nonlinearity may enable
the subsequent encoding of first- and second-order
structure via the same mechanisms.
Holliday and Anderson (1994), for example, de-
scribed evidence from adaptation experiments that sec-
ond-order and first-order motions share a common
mechanism at high speeds. They measured sensitivity
for detecting either moving luminance gratings, or mov-
ing beats—which are a form of contrast modulation
where the sidebands are present (as in Fig. 1b), but the
carrier frequency is absent from the spectrum (Ap-
pendix A). Adaptation to drifting beats increased detec-
tion thresholds for luminance gratings, and vice versa,
but only at high drift rates (] 8 Hz). This mutual
transfer of adaptation suggests that the same Fourier-
type mechanism is used for the detection of both sec-
ond-order and first-order stimuli at high speeds. Such
Fig. 1. Luminance and contrast modulation. (a) Luminance modula-
tion (LM) and (b) contrast modulation (CM) of a stationary high-fre-
cross-adaptation did not occur at low temporal fre-
quency carrier grating. The spatial image and luminance profiles quencies (5 4 Hz).The detection of fast non-Fourier
represent stationary frames of the sequence, whilst the space-time stimuli by Fourier mechanisms is consistent with the
Fourier spectrum represents the moving image. Variable u is spatial hypothesis that a non-linearity in the visual pathway
frequency and w is temporal frequency. Both (a) and (b) have peaks gives rise to first-order distortion products at the modu-
of energy corresponding to the stationary carrier (open circles), but
only the LM carrier (a) has energy at its modulating frequency (filled
lation frequency of a second-order stimulus (Burton,
circles close to the origin). In (b) CM produces sideband frequencies 1973; Henning, Hertz & Broadbent, 1975). These first-
(filled circles) around the carrier, but no energy at the modulating order by-products of second-order input could behave
frequency. This is confirmed in (c) where low-pass filtering a CM like, and interact with, first-order input, thus enabling
waveform (upper trace) reveals no low frequency content (lower the cross-adaptation found by Holliday and Anderson
trace). Compressive distortion of the CM waveform (d, upper trace)
introduces a low frequency component that is 180° out-of-phase with
(1994). The lack of cross-adaptation between first- and
the modulating waveform, revealed by low-pass filtering (d, lower second-order stimuli at low temporal frequencies sug-
trace). gests that specific non-Fourier mechanisms may be
N.E. Scott-Samuel, M.A. Georgeson / Vision Research 39 (1999) 2853–2865 2855
should be visible, and the measured nulling contrast 2.2. Quantifying the display non-linearity
will be an estimate of the distortion magnitude. In
Experiment 1 we therefore added a luminance grating Clearly, when attempting to measure distortion in-
LM2 (Fig. 2d) at various contrasts to determine the duced by the visual system, it is important to know that
nulling contrast, at which reports of motion direction the experimental equipment is not responsible for any
were at chance. part of the non-linearity observed. It was therefore
necessary to ensure that the display monitor’s gamma
2.1. Method correction had successfully eliminated any screen non-
linearity.
Images were generated by a PC with a VSG 2/2 8-bit To carry out this control, the stimulus sequences
graphics card (Cambridge Research Systems Ltd.) and (Fig. 2d) were viewed through a diffuser (low-pass
displayed on an Eizo 6050S 15 inch RGB monitor in filter). This removed the higher spatial frequency (car-
white mode. The linearity of the display was carefully rier and sideband) components of the stimuli, which
corrected by calibration with a digital photometer fol- meant that there was no contrast modulation in the
lowed by construction of the appropriate lookup tables. viewed image and so no visual distortion product was
The two-palette system of the VSG allows the equiva- possible. Any low frequency distortion detected via the
lent of 12-bit greyscale resolution at low contrasts. nulling method (above) must then be due to the screen
Each frame was 512 ×512 pixels (4.5 ×4.5°) at a view- and not the visual system. To ensure that the diffuser
ing distance of 214 cm, with mean luminance 70.5 had effectively removed the high spatial frequency com-
cd/m2. The stationary carrier, present in each frame of ponents, observers attempted a 2AFC discrimination
the sequence, was either a 4.0 c/deg vertical grating, or task with a contrast-modulated carrier in one interval
2-D binary noise with 4× 4 min arc dots (8× 8 pixels). and an unmodulated carrier in the other, viewed
Stimulus sequences consisted of consecutive frames in through the diffuser. No reliable difference was de-
which the carrier remained stationary and the vertical tected between the two intervals for any experimental
contrast level or temporal frequency.
modulating grating stepped successively in phase
Measured psychophysically in this way, the residual
through 90° to the left or right. Alternate frames con-
screen non-linearity for grating carriers was slight, and
tained LM or CM at a spatial frequency of 0.6 c/deg.
expansive. Typical values of nulling contrast were
The modulation depth in the CM frames was 100%, so
around − 0.8, − 0.4, −0.1, and 0% for peak contrasts
the local peak contrast was twice the mean carrier
of 76.4, 54.0, 27.0 and 13.5%, respectively. The data for
contrast. These peak values of Michelson contrast for
the highest peak contrast (76.4%) are shown in Fig. 3
CM were 13.5, 27.0, 54.0 or 76.4%, and each of these
(squares). Although the screen distortion varied with
carrier contrast levels was tested at drift temporal fre-
stimulus contrast, it remained constant across temporal
quencies of 3.75, 7.5 and 15.0 Hz. The LM frames had frequency variations. This slight residual non-linearity
a modulation depth (LM1) of 3% for the 27.0% peak
contrast condition, and this ratio of 1:9 was maintained
for the other peak contrast levels used. As in Fig. 2(d),
the CM frames had a luminance grating (LM2) added
to them, and its contrast varied from 0 to 3% in the
27.0% peak contrast condition, and was scaled accord-
ingly at other peak contrast levels. LM2 was always
in-phase with the CM component. The stimulus se-
quence duration was 266 ms for all conditions, thus
displaying four cycles of drift at 15 Hz, two at 7.5 Hz
and one at 3.75 Hz. Within any one session, the con-
trast, drift rate and carrier type had fixed values, whilst
the luminance contrast (LM2) added to the CM frames
was varied from trial to trial. The procedure was a
two-interval forced choice, in which one observation
interval contained phases stepping leftward, the other
rightward. A mean luminance field was displayed for Fig. 3. Experiment 1 — sample psychometric functions. Filled circles
666 ms between the two intervals in each trial. Observ- show the percentage of 2AFC trials on which the rightward (or
ers viewed the screen binocularly with their usual spec- leftward) stepping sequence (Fig. 2d) was correctly identified as
rightward (or leftward) by the observer. 0% implies motion reversal,
tacle correction, and pressed one of two mouse buttons 50% is chance — corresponding to the nulling contrast that the exper-
to indicate in which interval the sequence was seen to iment aimed to measure. Squares represent a control condition that
move to the right. No feedback was given. checked for residual non-linearity in the display screen (see text).
N.E. Scott-Samuel, M.A. Georgeson / Vision Research 39 (1999) 2853–2865 2857
Fig. 4. Experiment 1 — distortion as a function of carrier contrast. Nulling contrasts (see Fig. 3) estimate the visual distortion introduced into CM
waveforms. Results shown for both observers at all temporal frequencies and contrasts, for grating carriers (left) and noise carriers (right). Power
functions are fitted for those sets of data with at least three points (solid symbols). An absence of data for any condition indicates that the nulling
contrast was unmeasurably small.
was confirmed by photometric measurements taken motion. As the contrast of LM2 increased, so the
with a Minolta LS-110 digital photometer, but the perceived direction of motion fell to chance, as shown
effects were so small that the psychophysical method at about 4% contrast in Fig. 3. At this null point, the
proved to be more sensitive and robust than the physi- amplitude of the added luminance (LM2) must equal
cal measurements. For the noise carrier conditions, that of the distortion product, and so this nulling
screen distortion was found to be insignificantly small. contrast measures the amount of distortion. With still
higher levels of added luminance contrast, standard
with-phase first-order motion (the sequence of LM1 and
2.3. Results: nulling the distortion
LM2) was observed.
With no added luminance (LM2 =0; Fig. 2d), motion Logistic functions were fitted to the experimental
was perceived in the opposite direction to the phase data, and the nulling contrasts, at 50% direction dis-
shift for higher contrasts and temporal frequencies. As crimination performance, were determined at four car-
an example of the psychometric functions obtained, rier contrasts and three drift speeds. For grating carrier
Fig. 3 shows data for one observer (NSS; 15 Hz, 76.4% conditions, a small correction was made to the data by
peak contrast). Filled circles show that for low values subtracting the values for screen distortion, whilst for
of LM2 all trials were seen in the reversed direction (0% the noise carrier no adjustment was needed.
responses with-phase). As described above, this motion The nulling contrasts for two observers are sum-
derives from the sequence of first-order signals and marised in Fig. 4 as a function of carrier contrast, for
first-order artefacts induced in second-order stimuli. We both grating and noise carriers. Four carrier contrasts
shall refer to motion involving the use of these first-or- were tested in all cases; the absence of a data point
der distortion products from second-order stimuli as indicates that the distortion product was unmeasurably
pseudo second-order motion. As predicted by this ap- small in that condition. Where three or more points
proach, the luminance contrast (LM2) had to be added were available, power functions were fitted to the data
in-phase with the CM in order to cancel the perceived as shown (linear on a log–log plot). The average gradi-
2858 N.E. Scott-Samuel, M.A. Georgeson / Vision Research 39 (1999) 2853–2865
Fig. 5. Modelling the non-linear response to luminance. (a) Empirical data (from Fig. 4) and fitted model for two observers at 15 Hz with a grating
carrier. The model assumed a transducer of the form shown in (b), took an LM +CM waveform as input, and adjusted the contrast of the LM
component until the transducer output showed zero amplitude at the modulating frequency, thus nulling the distortion. Higher distortion
amplitudes for one observer (NSS) imply a more compressive luminance response (lower value of S) than the other observer (MAG), as illustrated
in (b).
ent across observers and conditions was 1.7, implying convenience in Fig. 5 we let Rmax = 2+S, so that both
that distortion contrast rises rapidly, almost as the the input and output ranges are 0 to 2, irrespective of S.
square of carrier contrast. To simulate the experiment properly, the CM grating
The measured distortion increased with both peak and nulling LM grating must be passed through the
contrast level and temporal frequency for both observ- transducer together. The response R(x) was calculated,
ers. The distortion values for the binary noise carrier and the Fourier amplitude at the modulation spatial
were consistently lower than those for the grating car- frequency was computed. The nulling contrast was
rier (when compared at the same Michelson contrast) obtained by adjusting it iteratively until the Fourier
and those measured for observer MAG were lower than amplitude at the modulation frequency was zero. We
those of observer NSS. found that this compressive transducer model predicts
in general that the nulling contrast must be in-phase
2.4. Discussion: modelling the non-linearity with the contrast modulation, and that it will always
increase as the square of the stimulus contrast—a slope
We have suggested that the distortion of CM wave- of two on the log–log plot. Fig. 5(a) shows that this
forms arises from an early compressive response to gives quite a good fit to the data for a sinusoidal
luminance. To evaluate and quantify this proposal, the grating carrier with a CM drift frequency of 15 Hz. Fig.
empirical data were modelled using a form of the 5(b) shows that observer NSS was more compressive
Naka–Rushton equation (Naka & Rushton, 1966) that (S=1) than observer MAG (S= 2.8), and this individ-
has been widely used to model the intensity-response ual difference appears to be true more generally (Fig.
function of photoreceptors and early retinal processes 4).
(Geisler, 1981): Several studies on the perceived location of blurred
luminance edges have modelled their results using com-
I(x)n
R(x) =Rmax (1) pressive transducers similar to the one described here.
I(x)n +S n
Different authors have used different expressions for
where R(x) is the transduced spatial waveform, Rmax is the transducer, but converting their descriptions to ours
its maximum value at large I, I(x) is normalised lumi- we find values of S ranging from 0.5 to 3.5, a range
nance (I(x)= L(x)/L0, where L(x) =luminance at posi- which brackets the estimates (1.0, 2.8) obtained here at
tion x; L0 = mean luminance), exponent n is a constant, 15 Hz (Fig. 5). Table 1 lists the S-values from the
and S is the semi-saturation constant — the value of studies examined. For the one from Naiman and Mak-
I(x) at which R(x) reaches half its maximum value ous (1993) we fitted Eq. (1) (n= 1) to their second order
(Fig. 5b). We let n=1, so that the degree of compres- polynomial to derive an equivalent value for S.
sion is controlled entirely by the parameter, S; lower Fig. 4 also shows that distortion decreases with de-
values of S yield a more compressive function. For our creasing drift rate, and so the transducer must become
purpose the value of Rmax is unimportant, but for less compressive (S must increase) as drift rate de-
N.E. Scott-Samuel, M.A. Georgeson / Vision Research 39 (1999) 2853–2865 2859
Fig. 7. Experiment 2 — no cancellation of second-order motion at slower speeds. Results for observers NSS (left) and AR (right) for both grating
and noise carriers at a peak contrast of 76.4%. At 15 Hz, for NSS but not AR, the grating and noise carrier results show a motion reversal at
higher levels of added luminance contrast. At lower speeds CM motion was seen consistently, and could not be cancelled by added LM.
Thus if such cancellation and reversal is possible then both grating and noise carriers. Considering first the
the perception of CM motion must be mediated by the lower temporal frequencies (3.75 and 7.5 Hz), it is clear
distortion product. If no cancellation is obtained, and that motion was seen consistently in the direction of the
motion is always in the forward direction, then distor- phase shift, even with high levels of added luminance
tion cannot be the sole basis for second order motion. contrast, for both observers. This finding indicates that
at these lower drift rates it was impossible to cancel
3.1. Method CM motion by adding luminance contrast to alternate
frames of the sequence, and therefore that in these
The experimental conditions and procedure were conditions the perceived second-order motion is not
identical to those of Experiment 1, except that all mediated by a first-order mechanism.
frames of the apparent motion sequence were contrast- These conclusions were confirmed by further results
modulated grating carriers (CM; 100% contrast modu- obtained at a slightly lower contrast level (54.0% peak
lation depth as before), and alternate frames had a contrast) for three experienced observers (NSS, MAG
luminance grating (LM2) added to them, in-phase with and TCAF; Fig. 8). For observers AR (Fig. 7) and
the modulating envelope (Fig. 2e). The amplitude of TCAF (Fig. 8), there is evidence of a slight decline in
LM2 was varied over the same range as in Experiment perception of with-phase motion in both the 3.75 and
1, in order to cancel out the distortion product pro- 7.5 Hz conditions. However, in both cases it remained
duced in the CM stimulus. Direction discrimination true that the added luminance contrast did not cancel
performance was again measured as a function of the the second-order motion.
added grating’s contrast. At 15 Hz the results were much more variable across
observers, and appear to reflect a mixture of several
3.2. Results: no cancellation of second-order motion at effects. (i) Pseudo second-order motion: for observer
lower temporal frequencies NSS, Fig. 7 shows that at the highest temporal fre-
quency and contrast level used (15 Hz and 76.4%), the
Fig. 7 shows results for two observers (NSS, and added luminance grating first cancelled and then re-
naive observer AR) at a peak contrast of 76.4%, for versed the perceived motion as the added contrast
N.E. Scott-Samuel, M.A. Georgeson / Vision Research 39 (1999) 2853–2865 2861
Fig. 8. Experiment 2 — no cancellation of second-order motion at slower speeds. Results for observers NSS (grating and noise carrier), MAG
(grating carrier) and TCAF (grating carrier) at a peak contrast of 54.0%. At 15 Hz, there were large individual differences, but at lower speeds
CM motion was seen consistently, and could not be cancelled by added LM.
increased. This reversal at 15 Hz also occurred for the lower carrier contrast (13% peak contrast; not shown).
grating carrier at a peak contrast of 54.0% (NSS, Fig. (iv) Miscellany: at 15 Hz, observers AR and TCAF
8). The LM contrast at which nulling of CM motion were not far above chance, but in all three datasets
occurred was just slightly higher than the nulling con- (Figs. 7 and 8) performance declined from about 75 to
trast from Experiment 1. The implication is that in 50% with increasing LM contrast. Such performance
these particular cases perception of CM motion was could arise in many ways, but could reflect a mixture of
based almost entirely on the distortion product. (ii) effects (i)–(iii) above.
Mixture of genuine and pseudo second-order motion:
for the noise carrier, the nulling contrast for NSS was 3.3. A complication: component motion
much higher in Experiment 2 than Experiment 1, and
this suggests that genuine second-order motion played a The general reversal of CM motion seen in Experi-
larger role in this case, where the distortion products ment 2 with grating carriers at 15 Hz for observers NSS
were smaller. In the limit, if performance stayed at (13.0% peak contrast) and MAG (54.0% peak contrast)
100% for all levels of added LM we should conclude cannot be explained by either second-order or pseudo
that genuine second order motion was the dominant second-order processing. Derrington, Badcock and
factor, as it was at lower speeds. (iii) Component Henning (1993) have suggested an explanation for this
motion: for MAG at 15 Hz (Fig. 8, grating carrier) phenomenon. At high temporal frequency, high spatial
perceived motion was always reversed, even with no frequencies can be attenuated, or even eliminated, be-
LM added. This cannot be second-order motion (which cause they are near or beyond the resolution limit of
would be with-phase regardless of the added contrast the visual system. Fig. 9 shows the space-time spectrum
level), nor pseudo second-order (which would reverse of a stationary grating carrier modulated by a drifting
from with-phase to against-phase as the level of con- envelope. When the higher spatial frequency sideband
trast was increased). Instead it is more likely to reflect falls outside the ‘‘window of visibility’’ (Watson, Ahu-
perception of ‘‘component motion’’ — the motion of the mada & Farrell, 1986) there is an imbalance in the
lower sideband frequency component, discussed below. first-order motion energy in favour of the lower side-
Such a comprehensive reversal was also seen by NSS at band frequency.
2862 N.E. Scott-Samuel, M.A. Georgeson / Vision Research 39 (1999) 2853–2865
This first-order signal, termed component motion (be- dow of visibility; however, when the viewing distance
cause it is the result of the motion of only one of the was reduced by a factor of four (thus decreasing the
two components making up the sidebands in the con- spatial frequency by a corresponding ratio, and pre-
trast-modulated carrier) moves against the direction of sumably bringing the higher spatial frequency into the
the phase shift of the envelope, and could therefore be window of visibility), the reversal persisted. Further-
responsible for the reversals seen in conditions where more, observer NSS also reported consistent reversal
the higher sideband is likely to be around the resolution across all values of added luminance contrast at this
limit. One way to eliminate component motion would much lower spatial frequency.
be to increase the contrast of the higher-frequency Thus it appears that component motion is not always
sideband; this would have the effect of increasing the due to the effects of the window of visibility. An
size of the window of visibility (at higher contrast alternative explanation might be masking of the higher
spatial frequency sideband by the carrier; despite the
levels, higher spatial frequencies are more visible),
differences in temporal and spatial frequency between
which would then include the errant component.
the sideband and carrier, some masking may still occur
In an additional experiment this action did, indeed,
(Anderson & Burr, 1985). However, this idea was sub-
have the desired effect of eliminating the reversed mo-
sequently rejected when a replication of the near view-
tion; with-phase motion was restored around the point ing distance experiments with a beat stimulus (which
where the higher spatial frequency sideband had a has no carrier, and therefore no masking) yielded the
contrast level which was twice that of the lower spatial same reversed motion that was seen with a contrast-
frequency sideband. Unfortunately, it is difficult to say modulated grating.
whether the resulting with-phase motion was due to a A second possible explanation draws upon the fact
balancing of the visibility of the low and high spatial that the spatial frequency range of masking increases
frequency sidebands or whether it was simply due to with decreasing spatial frequency (Anderson & Burr,
dominance of the higher spatial frequency sideband 1985). in other words, a lower spatial frequency masks
(which moves with the phase shift instead of against it). over a greater range than a higher one. Given this, it is
This window-of-visibility explanation of component possible that the lower spatial frequency sideband
motion does not account for all the data collected. In masks the higher one, but not vice versa. Opposing this
Experiment 2, observer MAG consistently reported re- hypothesis is the fact that the sidebands move in oppo-
versed motion for all values of added luminance con- site directions and, although still present in such cases,
trast. The parameters in this experiment were 15 Hz, the masking effect is much reduced (Anderson & Burr,
76.4% peak contrast, and a grating carrier. This seemed 1985).
to imply that even at this high contrast level, the higher A third possibility is that the higher visibility of the
spatial frequency sideband was falling outside the win- low spatial frequency sideband might be due to its
greater velocity. Unfortunately, equating the velocities
of the sidebands would result in a drifting carrier,
which would be an obvious cue in any direction dis-
crimination task, and so this possibility is not readily
testable.
A final possible explanation for the effective attenua-
tion of the higher spatial frequency sideband might be
motion capture (Ramachandran & Cavanagh, 1987),
where motion of the higher spatial frequency is cap-
tured by the lower. Regardless of which explanation, if
any are correct, the reversal may be characterised as
component motion, since the only element moving
against the phase shift with no added luminance con-
trast was the lower spatial frequency sideband.
An obvious solution to this component motion prob-
lem is to change the carrier type from grating to noise.
As mentioned above, there was no reversal of perceived
Fig. 9. The origin of component motion. A moving contrast-modu- direction of motion at high temporal frequencies and
lated grating represented in the spatio-temporal (u– w) frequency low contrasts with a noise carrier. This is because the
domain. The window of visibility, that part of the spectrum that is
carrier, and thus the sidebands, are splattered across
visible to a human observer at a given contrast level, is shown by the
shaded region. Because the higher spatial frequency sideband falls the entire spectrum, which avoids the obvious imbal-
outside this area, only the stationary carrier and drifting lower ance due to attenuation of components near or beyond
sideband are visible; as a result, the only motion seen is first-order the edge of the window of visibility that was found with
and reversed in direction. grating carriers.
N.E. Scott-Samuel, M.A. Georgeson / Vision Research 39 (1999) 2853–2865 2863
3.4. Discussion
These findings suggest a modest alteration to current order stimuli and first-order distortion products in sec-
models of motion processing. Wilson et al.’s (1992) ond-order stimuli.
model (Fig. 10) is one of the better known, and is More positively, our demonstration that genuine sec-
representative of those which posit two pathways for ond-order motion exists (that which cannot be can-
motion processing, one for Fourier signals, one for celled by adding first-order components) lends
non-Fourier. The addition of an early non-linearity considerable weight to the idea that there are separate
means that first-order distortion products from second- mechanisms for the interpretation of first- and second-
order input can now leak into the first-order processing order input (as suggested by, e.g., Wilson et al., 1992;
pathway under certain conditions, yielding pseudo sec- Ledgeway & Smith, 1994; Lu & Sperling, 1995). The
ond-order motion. fact that under some conditions these two classes of
The existence of such a path for interaction between stimulus do not interact to produce a coherent impres-
first- and second-order signals has some serious impli- sion of motion (Ledgeway & Smith, 1994; Edwards &
cations. Most obviously, it is no longer clear that data Badcock, 1995) is a strong indication that there are at
which claim to deal with second-order processing can least two pathways for motion processing.
be taken at face value; those stimuli which move too Thus our experiments demonstrate that the visual
quickly, or are displayed at too high a contrast, run the processing of some second-order stimuli can induce
risk of containing first-order contamination. Many sim- first-order artefacts, and that these artefacts can, in
ilarities between first- and second-order processing have some circumstances, mediate the processing of second-
been reported, and it is possible that some of these order motion. The fact that this is not generally the
similarities could reflect the common processing of first- case offers strong evidence for the existence of a sec-
2864 N.E. Scott-Samuel, M.A. Georgeson / Vision Research 39 (1999) 2853–2865
Acknowledgements
C(x,t)=L0 1+ c sin ( fcx)
form (Scott-Samuel & Georgeson, 1995). It was sup- 2
ported by a BBSRC studentship to NSS and BBSRC
grants SO6946 and SO3969 to MAG. − cos ( fcx+ fmx+ v t)] (9)
Derrington, A. M., Badcock, D. R., & Henning, G. B. (1993). Mulligan, J. B., & MacLeod, D. I. A. (1991). Visual sensitivity to
Discriminating the direction of 2nd-order motion at short stimu- spatially sampled modulation in human observers. Vision Re-
lus durations. Vision Research, 33, 1785–1794. search, 31, 895 – 905.
Edwards, M., & Badcock, D. R. (1995). Global motion perception: Naiman, A. C., & Makous, W. (1993). Undetected grey strips dis-
no interaction between the first- and second-order motion path- place perceived edges nonlinearly. Journal of the Optical Society of
ways. Vision Research, 35, 2589–2602. America A, 10, 794 – 803.
Geisler, W. S. (1981). Effects of bleaching and backgrounds on the Naka, K. I., & Rushton, W. A. H. (1966). S-potentials from colour
flash response of the cone system. Journal of Physiology, 312, units in the retina of fish (Cyprinaidae). Journal of Physiology
413 – 434. (London), 185, 536 – 555.
Georgeson, M. A., & Freeman, T. C. A. (1997). Perceived location of Ramachandran, V. S., & Cavanagh, P. (1987). Motion capture an-
bars and edges in one-dimensional images: computational models isotropy. Vision Research, 27, 97 – 106.
and human vision. Vision Research, 37, 127–142. Scott-Samuel, N. E., & Georgeson, M. A. (1995). Does early nonlin-
Hayhoe, M. M. (1990). Spatial interactions and models of adapta- earity account for second-order motion? Perception (supplement),
tion. Vision Research, 30, 957–965. 24, 104.
Hayhoe, M. M., Benimoff, N. I., & Hood, D. C. (1987). The Solomon, J. A., & Sperling, G. (1994). Full-wave and half-wave
time-course of multiplicative and subtractive adaptation process. rectification in second-order motion perception. Vision Research,
Vision Research, 27, 1981–1986. 34, 2237 – 2257.
He, S., & MacLeod, D. I. A. (1998). Contrast-modulation flicker: van Santen, J. P. H., & Sperling, G. (1985). Elaborated Reichardt
dynamics and spatial resolution of the light adaptation process. detectors. Journal of the Optical Society of America A, 2, 300–321.
Vision Research, 38, 985–1000. Watson, A. B., & Ahumada, A. J. (1985). Model of human visual-
Henning, G. B., Hertz, B. G., & Broadbent, D. E. (1975). Some motion sensing. Journal of the Optical Society of America A, 2,
experiments bearing on the hypothesis that the visual system 322 – 341.
analyses spatial patterns in independent bands of spatial fre- Watson, A. B., Ahumada, A. J., & Farrell, J. (1986). Window of
quency. Vision Research, 15, 887–897. visibility: psychophysical theory of fidelity in time-sampled visual
Holliday, I. E., & Anderson, S. J. (1994). Different processes underlie motion displays. Journal of the Optical Society of America A, 3,
the detection of second-order motion at low and high temporal 300 – 307.
frequencies. Proceedings of the Royal Society London B, 257, Wilson, H.R. (1994a). Models of two-dimensional motion perception.
165 – 173. In A.T. Smith, R.J. Snowden, Visual detection of motion (pp.
Ledgeway, T., & Smith, A. T. (1994). Evidence for separate motion- 219 – 251). London: Academic Press.
detecting mechanisms for first- and second-order motion in hu- Wilson, H.R. (1994b). The role of second-order motion signals in
man vision. Vision Research, 34, 2727–2740. coherence and transparency. In Higher order processing in the
Lu, Z., & Sperling, G. (1995). The functional architecture of human visual system (CIBA Foundation Symposium 184) (pp. 227–244).
visual motion perception. Vision Research, 35, 2697–2722. Chichester: Wiley.
MacLeod, D. I. A., & He, S. (1993). Visible flicker from invisible Wilson, H. R., Ferrera, V. P., & Yo, C. (1992). A psychophysically
patterns. Nature, 361, 256–258. motivated model for two-dimensional motion perception. Visual
MacLeod, D. I. A., Williams, D. R., & Makous, W. (1992). A visual Neuroscience, 9, 79 – 97.
nonlinearity fed by single cones. Vision Research, 32, 347– 363. Zhou, Y.-X., & Baker, C. L. (1993). A processing stream in mam-
Mather, G., & Morgan, M. J. (1986). Irradiation: implications for malian visual cortex neurons for non-Fourier responses. Science,
theories of edge localization. Vision Research, 26, 1007–1015. 261, 98 – 101.
Morgan, M. J., Mather, G., Moulden, B., & Watt, R. J. (1984). Zhou, Y.-X., & Baker, C. L. (1994). Envelope-responsive neurons in
Intensity-response nonlinearities and the theory of edge localiza- areas 17 and 18 of cat. Journal of Neurophysiology, 72, 2134–
tion. Vision Research, 24, 713–719. 2150.