Professional Documents
Culture Documents
3D 𝐿"
display
𝐿 Human
observer
3D 3D
scene image
Why human vision?
• A brute-force way of imaging: capture & reconstruct everything but it
requires infinite resources
• The end-user of 2D/3D displays is a human
– For efficient display design, we should understand how visual content
affects the perception
– Perceptual artifacts in 3D images can cause general dizziness and
nausea = may impair visual experience and usability.
• Perceptual coding
– Perceptual data loss: not all sensory input is processed
– Cognitive data loss: we do not pay attention to all events registered by
the brain
– Then why to store/transmit/process such data?
– One can store/process only the perceptually “important” part
Vision and light
What is Light?
• Light: electromagnetic wave or photons or rays?
• Wave properties:
– Amplitude/intensity
– Frequency/wavelength
– (Propagation) direction
– Phase
– Polarization (vectoral)
Spectrum of light
• Intensity [W]
• Wavelength [nm]
• Visible light
– 380-750 nm
What is Light?
• Ray parametrization: intensity, wavelength, direction etc.
• Plenoptic function (light field):
𝑃(𝑥, 𝑦, 𝑧, 𝜃, 𝜙, 𝜆, 𝑡)
𝜃!
(𝑥!, 𝑦!, 𝑧!) 𝜙!
The physiology of the visual system
The visual system
• Layered (but bi-directional) neural system
– Receive and process light from the world
– Control visual motor functions
– Extract & reconstruct information about the world
• Anatomically consists of
– Eyes
• Refractive structure
• Retina
• Optic nerve
– LGN
– Visual cortex (= “brain”)
• V1, V2, …, MT, MST, …
Anatomy of the eye
• Cornea: responsible for ~70% of entire refractive power
• Iris: circular opening (= pupil) at the center, surrounded by muscles
so that the pupil size can be controlled around 2–8 mm
• Crystalline lens + zonules + ciliary muscles: focus-tunable lens,
main role in accommodation
• Aqueous humor: water-like liquid
that works like blood
to maintain lens and cornea
Anatomy of the eye
• Vitreous body: jelly-like filling between the lens and retina
• Retina: membrane with layers of neurons, which detects light and
processes the signals
Accommodation
• Accommodation: function to increase the refractive power of the eye
• Unaccommodated (= resting) for a distant object
– Ciliary muscle is relaxed & stretched, zonules flatten the lens
• Accommodative effort for a near object causes
– Contraction of ciliary muscle
– Release of zonular tension
– Lens tries to be “more spherical”
– Increase in the refractive power
Retina: structure
• Retina is layers of photoreceptors and other neurons.
• 4 types of photoreceptors: L-,M-,S-cones and rods = “duplex retinas”
• Various neurons relay signals from photoreceptors
Outside of eyeball
Photoreceptors
Neural signals
Inside eyeball
Retina: photoreceptors
• Rods
– Extremely sensitive to the light
– Functionally work only under very dim environment (scotopic vision)
• Cones
– Less sensitive to light; function under bright environment (photopic vision)
– Three types of cones (L-,M-,S-) with different spectral response curves
= supports color vision of humans
+-
Fovea (gazed)
+-
Small
reseptive field: Periphery
no exic./inhib. +-
+-
+-
+-
https://www.illusionsindex.org/i/hermann-grid
The Visual Cortex
• Primary visual cortex (V1) receives signals
from retina via the LGN
• There is a clear mapping of input from retina
to V1 neurons (retinotopy)
– Foveal region has very large area in V1
(cortical magnification)
• Visual input is further processed in V1
– Selectivity to a line stimulus’s orientation
– The differences between two retinas are
detected
• Visual information is then handed to the next
regions, where retinotopy is no longer kept,
having more complex selectivity and larger
receptive fields, etc.
http://what-when-how.com/neuroscience
Dynamic range of human vision
• Human vision works over a (very) wide range
From faint starlight (< 10-4 cd/m2) to intense sunlight (> 105 cd/m2)
• The mechanisms
– Physical regulation by the pupil
– Two photoreceptor classes, two modes of vision: photopia and scotopia
– Changes in cone sensitivity (biochemical adaptation)
– Retinal processing of signals
– Cortical gain control
Visual angle
• The angle subtended by an object
(at the retina)
• Directly dictates the apparent size
Spatial frequency
• Number of periodic spatial modulations
wikipedia (en) “visual angle”
per unit angle (or length)
• “cycles/degree” (cpd) is commonly used in studying vision
Visual acuity
Visual acuity
• The smallest visual feature size (in angle)
that a patient can reliably distinguish
– In Snellen letters, the width of the strokes
– Typical good vision: VA of ~1 arcmin (= 1/60 deg)
• Usually tested for foveal vision
= VA is related well to the foveal cone spacing
Snellen chart
Contrast sensitivity
Contrast sensitivity
• “Extension” of visual acuity
• Lowest contrast (Michelson contrast) that an observer can reliably
distinguish at various spatial frequencies of gratings
– Contrast thresholds as a function of spatial frequency
= Contrast sensitivity function
– Highly dependent on location in the visual field and other factors
Spatial frequency
Contrast Thresholds
Contrast sensitivity function
Color vision
“Colors you see are really in you”
Color is not a physical property
• Spectral radiance & reflectance are physical properties
• Color is what a human observer come up in mind seeing it under
each condition
• Yet color is related to a physical property
“The dress”
One image, two different percepts
Blue-black vs. white-gold
Subjectivity of color
Physically identical “color” (= a spectrum of light) may be perceived as
different colors
“Colors”
https://en.wikipedia.org/wiki/Color_vision
Radiance, Luminance and Brightness
Radiance
Radiant emitted, reflected, tranmitted or received by a surface,
measured by W·sr-1·m-2
https://www.cs.utah.edu/~gk/papers/vis02/talk/slide005.html
Radiance, Luminance and Brightness
Luminance
• Radiance “corrected” by how visible each wavelength is for the
“average human observer”
• cd/m2 = nit
https://www.cs.utah.edu/~gk/papers/vis02/talk/slide005.html
Radiance, Luminance and Brightness
Luminance
• Radiance “corrected” by how visible each wavelength is for the
“average human observer”
• cd/m2 = nit
Radiance, Luminance and Brightness
Brightness
• Perception elicited by the luminance of a visual target
• Brightness may vary even for patches with the same luminance
https://www.cs.utah.edu/~gk/papers/vis02/talk/slide005.html
Radiance, Luminance and Brightness
Brightness
• Perception elicited by the luminance of a visual target
• Brightness may vary even for patches with the same luminance
3 cone types, 3 primaries
For any (radiant) spectrum, we can calculate how much it stimulates
each cone type by computing its inner product with the cone sensitivity:
𝐿 = ∫ 𝑑𝜆 𝐴! 𝜆 𝐼 𝜆 , 𝑀 = ∫ 𝑑𝜆 𝐴" 𝜆 𝐼 𝜆 , 𝑆 = ∫ 𝑑𝜆 𝐴# 𝜆 𝐼 𝜆
Y=luminance
Z
wavelength in nm
X
Color Gamut
Color “blindness”
http://www.colourblindawareness.org/colour-blindness/
Color “blindness”
• Not that rare (especially for males)
• Difference in red-green axis may be difficult to distinguish for
those who don’t have “perfect” cones.
• Color universal design
Depth perception
Perception of 3D world
• Ability to perceive the structure of space is a fundamental goal of
the visual system
• Depth perception: the functions to reconstruct (or rather guess)
the 3D structure of the world from 2D retinal image(s)
• Depth cues: visual information sources about the 3D structure of
the world
– Many are from retinal image(s)
– Extra-retinal cues are also supplementarily used
https://giphy.com/mlb/
Depth cues: classification
• Monocular cues can be obtained even from one retinal image
– Pictorial cues
• Occlusion
• Size and position of objects
• Shading and shadows
• Linear perspective
• Texture gradient
• Aerial perspective
• Defocus blur
– Motion cues
• Head-motion parallax
• Kinetic depth effects / Structure from motion
• Binocular cues are from the difference between two retinal images
– Retinal binocular disparity
– Monocular occlusions (Da Vinci stereopsis)
Pictorial cue: linear perspective
Held, Robert T. and Cooper, Emily A. and Banks, Martin S., 2012
Neural link between accommodation and vergence
Fixated
point Divergence
Eyes rotating outward
Fixated
point
Convergence
Eyes rotating inward
Fixated point is imaged at fovea (the origin in retina) in both eyes, thus zero disparity
Geometry of binocular disparity
• Retinal position of image is measured by angle
• A feature that is imaged at the same position in retina (including
fixated point) has zero disparity
• Horopter: the set of points in the world that have zero disparity (in
each convergence state)
– Theoretically it is the circle that runs through the eyes and the fixated
point (the Vieth-Müller circle) Empirical
– Points on the VM circle should be perceived horopter
as single points (because of the same
retinal position)
– Horopter that is obtained in experiments
= empirical horopter
– Points on the empirical horopter are
actually perceived as single points VM circle