Analog VSI

CALIFORNIA INSTITUTE OF TECHNOLOGY
COMPUTATION AND NEURAL SYSTEMS PROGRAM
CNS Memo No. 30, April 2, 1996
ANALOG VLSI PHOTOTRANSDUCTION

by continuous-time, adaptive, logarithmic
photoreceptor circuits
Comparison
C1
Learning T. Delbrück & C.A. Mead
Qp Vb
Qfb
Model Adaptive
Vf element
Vo
I bg+i 3.4
Transient
C2 Qcas Vcas Responses
3.2
Peak
Vp Output
Qn (V) 3
Steady-state
Input response
2.8
2.6
2.4
-3 -2 -1 0 1 2 3 4 5
10 10 10 10 10 10 10 10 10
Irradiance (mW/m2)
Adaptive
element
Log Intensity
Signal Amplitude (dBV/√Hz)
-100 -2
Feedback
capacitor
(C )
2 -1
Amplifier 1/f2
(C ) 0
o -120
Storage Cascode
transistor 1/f Instrumentation
Capacitor
(C )
1
-140
6 µm
-1 0 1 2 3 4
10 10 10 10 10 10
Frequency (Hz)
Feedback transistor
Photodiode (C )
p
(also hole in overlying metal)
PASADENA, CALIFORNIA 91125

2 ABSTRACT 4/2/96
This paper describes an adaptive photorecep- which the bandwidth of the receptor is 60 Hz,
tor circuit that can be used in massively paral- is at approximately 1 lux, which is the border
lel analog VLSI silicon chips. The receptor between rod and cone vision and also the
provides a continuous-time output that has limit of current consumer video cameras.
low gain for static signals (including circuit We describe an adaptive element that is
mismatches), and high gain for transient sig- resistant to excess minority carrier diffusion.
nals that are centered around the adaptation We show measurements of the effectiveness
point. The response is logarithmic with illu- of guard structures, and of the spectral sensi-
mination, which makes the response to a tivities of devices that can be built in a BiC-
fixed image contrast invariant to absolute MOS process.
light intensity. The logarithmic transduction process
The 5-transistor receptor can be fabricated makes the time constant scale inversely with
in an area of about 70 by 70 µm2 in a 2-µm intensity. As a result, the total A.C. RMS
single-poly CMOS technology. It has a receptor noise is constant, independent of
dynamic range of 1–2 decades at a single intensity. The spectral density of the noise is
adaptation level, and a total dynamic range of within a factor of two of photon shot noise
more than 6 decades. Several technical and varies inversely with intensity. The con-
improvements in the circuit yield an addi- nection between shot and thermal noise is
tional 1–2 decades dynamic range over previ- beautifully illustrated.
ous designs without sacrificing signal quality.
The lower limit of the dynamic range,
defined arbitrarily as the illuminance at
CONTENTS
Continuous-Time vs. Sampled Receptors...............1 Total Noise in Adaptive Receptor.................. 12
Biological Motivation ...............................................1 Using Shot Noise Statistics to Compute the
Gain Control .....................................................2 Noise Power............................................... 12
Time-Constant Control .....................................2 Shot vs.Thermal Noise?................................. 13
The Goal of Phototransduction................................2 Effect of Temperature.................................... 14
Simple Logarithmic Receptors................................2 Noise Spectral Density .................................. 14
ADAPTIVE RECEPTOR CIRCUIT....................................3 The Essence.................................................. 14
The Feedback Loop ................................................3 Measurement and Theory..................................... 14
Cascode...........................................................4 Assumptions of Difficulty ............................... 14
Photoreceptor Gain .................................................4 Photodiode vs. Phototransistor:
Advantages of Active Feedback.............................5 Noise Behavior.................................................. 15
Speedup..................................................................5 Noise Advantage of Continuous-Time Receptors
Miller Capacitances ................................................5 over Sampled Detectors ................................... 15
Rise Time & Bandwidth ...........................................6 MINORITY CARRIER DIFFUSION & GUARD STRUC-
Gain–Bandwidth Product ........................................6 TURES ...................................................................... 15
photodiode area? ....................................................6 Summary ....................................................... 16
SMALL-SIGNAL ANALYSIS............................................6 SPECTRAL SENSITIVITY ............................................. 16
Second-Order Temporal Behavior ..........................7 The Spectral Responses ...................................... 18
ADAPTIVE ELEMENT .....................................................8 Absolute current level........................................... 19
Adaptation Rate ......................................................9 PREVIOUS WORK........................................................ 19
Other Adaptive Elements ........................................9 RELATION TO BIOLOGICAL PHOTOTRANSDUCTION..
PHOTODIODE VS. PHOTOTRANSISTOR ....................10 20
RECEPTOR LAYOUT....................................................10 SUMMARY ................................................................... 20
THE ILLUMINATION LIMIT (SPEED) ............................10 Technical Innovations........................................... 20
Illumination Limit: High End...................................11 CONCLUSION ............................................................. 21
THE DETECTION LIMIT (NOISE)..................................11 ACKNOWLEDGEMENTS ............................................. 21
Empirical Observations .........................................11 REFERENCES .............................................................. 21
Theory of Logarithmic Receptor Noise .................12 INDEX .......................................................................... 23
Using Equipartition to Compute the Noise
Power..........................................................12
Analog VLSI Phototransduction CNS Memo #30 T. Delbrück & C.A. Mead
Apr 2, 1996 1
ver the last few years, people have The transduction process seems mun- led to wide-spread use in machine vision
O built a number of neuromorphic

analog vision chips that do focal-
plane time-domain computation. These
dane, but it is important—GIGO comes to
mind. Subsequent computation relies on the
information. We don’t know of any contem-
applications.
However, their use has hindered investi-
gation of vision algorithms and architec-
chips do local, continuous-time, spatiotem- porary (VLSI-era) literature that compre- tures that use time in an natural and efficient
poral processing that takes place before hensively explore the subject. Previous manner, because is it difficult to couple
any sampling or long-range communica- results are lacking in some aspect, either in time-domain information from a serial
tion, for example, motion process- the circuit itself, or in the understanding of stream of sampled imager outputs to analog
i n g ,2 . 5 , 6 , 9 , 2 9 c h a n g e d e t e c t i o n , 7 the physics, or in the realistic measurement circuits. If we want to do time-domain ana-
neuromorphic retinal preprocess- of limitations on behavior.6,8,15,19,17 log visual processing, it makes sense to
ing,10,11,12,13,17,18 stereo image match- We’ll focus on one highly-evolved adap- build analog continuous-time photoreceptor
ing,10,11,14 and synthesis of auditory images tive receptor circuit to understand how it circuits and couple their outputs locally to
from visual scenes18,22. operates, what are the limitations on its dy- the circuits that do the computation.
namic range, and what is the physics of the Photoreceptors have not received much
This processing requires photoreceptor
noise behavior. The receptor has new and commercial attention because the system
circuits that transduce from light falling on
previously unpublished technical improve- requirements for analog visual computation
the chip to an electrical signal. If we want and imaging are so different. Sampled im-
to build analog vision chips that do high- ments, and we understand the noise proper-
ties and illumination limits much better agers are designed to go with serial televi-
quality focal plane processing, then we sion displays, and they must faithfully
need good photoreceptors. It’s not enough than we did before. We’ll also discuss the
practical aspects of the interaction of light reproduce the visual scene. No computation
to just demonstrate a concept; ultimate use- is necessary (except for gain control) nor
with silicon: What are the spectral respons-
fulness will be determined by market forc- particularly desirable, since the output is
es of various devices? How far do light-gen-
es, which, among other factors, depend a supposed to look like what we see. Also,
erated minority carriers diffuse and how do
lot on raw performance. The receptor cir- they affect circuit operation? How effective CCD cameras need 106 pixels, because hu-
cuits we discuss here have not been used in are guard bars to protect against them? Fi- mans who look at the TV picture want near-
any commercial product, so they have not nally, we’ll talk about biological receptors: foveal resolution everywhere. (There is no
yet passed that most crucial test, but by ev- How do their functional characteristics in- way the camera can know where in the
ery performance metric we can come up spire the electronic model? How are the scene people are going to look.) The fact
with, including successful fabrication and mechanisms of gain and adaptation related? that CCD devices naturally produce a serial
test of demonstration systems, they match stream of data is perfectly suited for display
performance criteria met by other pho- CONTINUOUS-TIME VS. SAMPLED and transmission to television monitors.
totransduction techniques that are used in RECEPTORS On the other hand, it is obvious that bio-
end-product consumer electronic devices. The photoreceptors we’ll discuss produce a logical visual systems are massively paral-
We hope that this article will serve sever- continuous analog output that can be direct- lel systems—at least in the preprocessing
al purposes: We want people to have a ref- ly coupled to adjacent analog circuits—for stages—and not TV cameras. Flying in-
erence where they can look to see the example, circuits that compute image mo- sects, for example, with less than 104 pixels,
functioning and practical problems of pho- tion. This characteristic contrasts with the are existence proofs that interesting vision
totransducers built in a typical CMOS or vast majority of imaging devices used com- problems well beyond any current technol-
BiCMOS process. We want to inspire peo- mercially. ogy are doable with 100 times fewer pixels
ple to build low-power, integrated commer- than the cheapest Sony Handycam.
CCD imagers, for example, have become
cial vision devices for practical purposes. dominant in commercial cameras for many BIOLOGICAL MOTIVATION
We want to provide a photoreceptor that reasons—high density, low noise, minimal How do biological photoreceptors deal with
can be used as a front end transducer in nonuniformity, high sensitivity, and rela- two basic requirements: the simultaneous
more advanced research on neuromorphic tively simple manufacturing process. Their need for high sensitivity and large dynamic
systems. easy availability and reliable operation have range, and the requirement of rapid re-
FIGURE 1 Gain adjustment in turtle cones (recorded intracellularly) caused by

background illumination. The stimuli are 0.5 s increments or decrements on a steady
-15
background (except for the curve for the dark adapted cone which only is for
Peak Potential (mV)
increments). The stimulus spot is 3.2 mm in diameter on the retina. Peak responses
measured from the dark-adapted resting potential (dotted line) are plotted as a -10 Steady-State
Dark
function of test illumination. The thin curves connect the measured points. The thick -3.2 Response
-4.4
curve is the steady membrane potential measured at least two minutes after -5 -2.1 -1.0
background onset. The average slope of the transient responses is 9.5 mV/decade,
Transient
and the slope of the steady-state, adapted response is about 1.8 mV/decade. The 0 Responses
ratio of transient gain to the steady-state gain is about 5. The total dynamic range is
about 15 mV. The illuminations are given as log attenuation from a baseline value. The -6 -5 -4 -3 -2 -1 0
unattenuated test stimulus (0 log) is 6.4 1015 quanta(640 nm)(cm2s)-1 on the retina, Log intensity
equivalent to an irradiance of about 20 W/m2. (Direct office fluorescent lighting is
about 1 W/m2.) The unattenuated background illumination is 9.1 1015
quanta(640 nm) (cm2s)-1. Adapted from Normann and Perlman (1979).23
Authors may be contacted by email at tobi@pcmp.caltech.edu. These circuits have been patented as T. Delbruck, C.A. Mead, "Adaptive photoreceptor including
adaptive element for long-time-constant continuous adaptation with low offset and inensitivity to light," U.S. Patent 5,376,813.This document (and other related
ones) exist in electronic form available via the World Wide Web; the URL for the Physics of Computation Group is http://www.pcmp.caltech.edu/
2 Apr 2, 1996
10 Vb
FIGURE 2 Time to
peak response in
Vo
response to a flash of
light, vs. gain, in toad
response (s)
rod receptors.
Time to peak
n++ p- n++
Replotted from
1 Baylor et al., (1980).1
Vb
60 mV/decade
Vo
0.1
0.01 0.1 1 10
Gain (arbitrary units) log Intensity
sponse time, invariant to lighting condi- FIGURE 4 The source-follower

tions? The approaches to phototransduction Vo logarithmic photoreceptor. Vo decreases
and amplification we use in our silicon pho- logarithmically with light intensity,
starting with zero intensity at
toreceptor are inspired by biology, although approximately Vb.
the detailed implementation is quite differ- n++
p-
ent. We shall only discuss functional char-
acteristics here; in the discussion we will because the change in the log intensity is
discuss biological mechanisms in relation log Intensity given by
to our silicon receptor. dI i
GAIN CONTROL. Figure 1 shows the re- d log I = ----- = ------- (1)
Vo 60 mV/decade I I bg
lationship between input and output in typi-
cal cone photoreceptors. The curves show below The simplest logarithmic receptor circuit is
the membrane voltage of the cone in re- ground a single junction formed between the lightly
sponse to a given illumination. The shallow doped substrate and a piece of heavily
curve shows the voltage in response to doped source–drain diffusion (Figure 3).
steady intensity. The steep curves show the FIGURE 3 A single junction—the When light shines on the silicon, it makes
response to small variations in illumination simplest logarithmic photoreceptor. Vo electron–hole pairs. When electrons freed
sits below the substrate voltage,
around a steady value. Two characteristics in the p– substrate diffuse to the junction,
decreasing logarithmically with intensity.
stand out: The receptor has a larger re- they are swept home by the junction’s elec-
sponse to changes in illumination than to tion, as least as rapid as possible. (We can tric field into the n++ region. The same for
steady illumination—it adapts , and the always lowpass-filter the response after- holes created in the n++ region—these are
slope of the responses is constant on a log- wards if we don’t care about high frequen- swept home to the p– region. The result of
illumination scale, meaning it has a con- cies, but there is no way to recover the this photocurrent flowing from n++ to p– is
stant response to image contrast, indepen- information if the photoreceptor filters it that the n++ region becomes negatively
dent of illumination level. The adaptation away first.) charged with respect to the substrate. This
means that the receptor can respond with negative voltage sets up a forward current in
high gain over a wide operating range with- THE GOAL OF the junction to compensate for the photo-
out saturating, and perhaps more important, PHOTOTRANSDUCTION current. Since the forward current is expo-
the receptor output reflects actual changes Inspired by biological transduction and nential in the junction voltage, the voltage
in the illumination and not offsets that can common sense, we shall assume that the on the n++ region is logarithmic in the in-
build up due to various biochemical imbal- primary goal of phototransduction is to tensity. However, this signal is not very use-
ances in a living system. (We don’t know if compute image contrast invariant to abso- ful, because it is below the substrate
the last supposition is correct, since no one lute illumination. We can think of the total voltage.
has shown that biological system compo- intensity as a sum Ibg + i of a steady state The simplest logarithmic receptor that
nents have offsets, but it seems likely.) background component Ibg, and a varying produces a useful output voltage range is
TIME-CONSTANT CONTROL. Biolog- small signal component i. The contrast of shown in Figure 4. It consists of a single
ical photoreceptors have a bandwidth that is the signal is defined as the ratio i/Ibg, and MOS transistor, where the source of the
practically invariant to the light intensity, the receptor response should be proportion- transistor forms the photodiode shown in
over a wide range of intensities. In toad al to this ratio independent of Ibg, at least for Figure 3. The MOS transistor channel
rods, for example, the bandwidth goes as small ratios. The rationale for this assump- forms the barrier that results in a logarith-
the fourth root of intensity, as shown in tion is that objects reflect a fixed fraction of mic response to intensity. The voltage at the
Figure 2. This behavior means that over a the light that hits them. A receptor respond-
factor of 200 in background intensity, the ing to a dynamically varying scene, such as
response latency to a flash of light varies would result from ego motion or from a † This goal of contrast-invariant response makes
over only a factor of about 4. It seems rea- moving object, will produce an output in- sense for all but the lowest intensities, where con-
trast becomes an impractical quantification. When
sonable to speculate that this invariance is variant to absolute illumination.† only a few photons hit the detector during an inte-
useful for dynamic visual processing of gration time, the fractional variation in the number
moving images, and it suggests that we SIMPLE LOGARITHMIC RECEPTORS
becomes so large that it makes more sense simply
should at least build a receptor circuit with A receptor with logarithmic response to il- to try to count every photon,24 a problem that is
response speed, if not invariant to illumina- lumination has the right kind of response, beyond the scope of this paper.
ADAPTIVE RECEPTOR CIRCUIT Apr 2, 1996 3
source decreases as the light intensity in-

creases. The decrease is a thermal voltage, Comparison
VT, for each e -fold intensity change, or
about 60 mV per decade.† The DC operat- Learning
ing point is determined by the bias voltage C1
Qp Vb
Vb. Qfb
This circuit would be a fine logarithmic Model Adaptive
Vf element
photoreceptor in an ideal world. The gain
of about 60 mV/decade results in a typical Vo
range of output voltages of perhaps 20 mV I bg+i
from natural scenes, which would in turn C2 Qcas Vcas
affect later circuits with current variations
of factors of two, which is sufficient for
computation. Vp
Qn
There are two important problems: mis-
matches and slow response. The differenc-
Input
es between supposedly identical receptor
outputs are as large as the typical signals
variations produced by real scenes. In a
system context, we know from practical
experience that this circuit is unusable ex- FIGURE 5 Adaptive receptor circuit.
cept under demonstration conditions.
The other problem is that under low illu- The comparison between input and mod- goes up Aamp times as much. The output
mination, the response is too slow. The rea- el is performed by the inverting amplifier change is coupled back to the gate of Qfb
son is that the junction capacitance, per consisting of Qn and Qp. An additional cas- through the capacitive divider, with a gain
unit area, in typical fabrication processes is code transistor, Qcas, is discussed on page 4. of perhaps 0.1. Pulling up the gate of Qfb
too large. Making the photodiode larger The input voltage Vp controls the current pulls up on the source, which is where we
doesn’t help, because the capacitance sunk by Qn. The current sourced by Qp is started. So, instead of pulling down on the
scales up linearly with photodiode area. fixed by the bias voltage Vb. The voltage source of Qfb, we end up raising the gate.
This problem will only grow worse as fea- gain –Aamp of this amplifier is determined The feedback amplifier and the input fight
ture size decreases, because substrate dop- by the ratio of the transconductance of Qn to to control the source voltage of Qfb, but the
ing density and junction capacitance will the output conductance of the amplifier, feedback amplifier wins because it has
be increased. which in turn is determined by the length of much higher gain. The input voltage v p
Hence the necessity for adaptation, to the transistors in the amplifier. For typical moves enough so that the output voltage vo
deal with the circuit mismatch problem, layout, the gain is several hundred to several moves enough so that vf moves enough so
and active feedback, to deal with the prob- thousand. Bias voltage Vb determines the that vp is held nearly clamped.
lem of slow response. cutoff frequency for the receptor, by setting On long time scales, the gain of the re-
the bias current in the inverting amplifier. ceptor is low, because the feedback is a
We often use this control to filter out flicker short circuit across the adaptive element,
from artificial lighting. and vo does not need to move much to hold
The feedback loop is completed when the vp clamped. On short time scales, no charge
ADAPTIVE RECEPTOR output Vo is fed back to V f through the flows through the adaptive element, but
CIRCUIT adaptive element and through the capaci- changes in vo are coupled to vf through the
tive divider formed from C1 and C2. The capacitive divider. The transient gain of the
The adaptive receptor circuit is formed by adaptation state is stored on C1, which is receptor is set by the capacitive-divider ra-
delayed feedback to the gate of the feed- shown hooked up to Vdd. (It makes no dif- tio. The larger C1 is relative to C2, the larger
back transistor in the source follower reference if Vdd or ground is used—all that is the gain of the circuit.
ceptor, as shown in Figure 5. Conceptually, required is a fixed potential.††)
the circuit uses an internal model to make a Figure 6 illustrates the receptor’s adap-
prediction about the input signal. The out- THE FEEDBACK LOOP tive behavior and the invariance of the re-
put comes from a comparison of the input A small increase i of the photocurrent tries sponse to absolute intensity. The traces
and the prediction. The loop is completed to pull down Vp by (i/Ibg)VT. In response, Vo compare the response of the nonadaptive
by using learning to refine the model so source-follower receptor with the response
that predictions are more accurate.16 The of the adaptive receptor. The incident signal
adaptive receptor, with its level adaptation, † V T = kT ⁄ q = 25 mV at room temperature. is a small intensity variation sitting on a
uses perhaps the simplest type of learning. An e-fold is a factor of e=2.72...The subthreshold steady background. The small variations
transistor drain–source current is represent the type of signal arising from ob-
The input stage of the adaptive receptor jects in a real scene, while the steady back-
consists of the source-follower receptor κV g –V s
I ds = I 0 e (e – e –V d ) +( I ds ⁄ V e )V ds ground represents the ambient lighting
shown in Figure 4. The feedback transis- level. The contrast of the signal is a fixed
tor Qfb, for typical intensities, operates in where g means gate, s means source, d means
drain, and all voltages are in units of VT measured percentage, independent of the absolute in-
subthreshold, so the source voltage Vp is tensity, as would be produced by reflective
relative to the bulk. I0 is the preexponential con-
logarithmic in the photocurrent. Vp sits be-
stant and Ve is the Early voltage characterizing
low Vf at whatever voltage it takes to turn
on Qfb to supply the photocurrent. Concep- drain conductance . κ ≈ 0.7 – 0.9 is the back- †† Mahowald’s adaptive retina hooks up C 1 to a
tually, the voltage stored on C1 acts as a gate coefficient describing the effectiveness of gate reference voltage that is computed by a resistive
model of the input intensity. voltage changes on channel potential.18 network, leading to interesting behavior.
4 Apr 2, 1996 ADAPTIVE RECEPTOR CIRCUIT
FIGURE 6 Responses of the source- addition of this single cascode transistor in-
Voltage follower and adaptive receptor over 7
-6 creases the dynamic range of the receptor
decades of background. Stimulus is a by about a decade.
200 mV
square-wave variation in the intensity,
centered around a mean value. The
Figure 7 shows the effect of the cascode
-5 on the small-signal time response of the re-
numbers by each section are the log
intensity of the mean value; 0 log is ceptor. Under these operating conditions
2.9 W/m2, about the level of direct office the cascode speeds up the response by a fac-
-4 5s Time fluorescent light. The source-follower tor of about 6.
receptor begins to smooth out the 1 Hz
PHOTORECEPTOR GAIN
square wave input at the lowest
intensities, while the adaptive receptor still It’s simple to compute the steady state and
-3 responds. The 1.8 Hz square wave transient gain of the receptor if we assume
stimulus is from a red LED (635 nm). The that Aamp is large. When the input current
irradiance varies by a factor of about 2, or changes an e-fold, the gate of the feedback
-2
about 0.33 decades or 0.8 e-folds. capacitor must change by VT/κ to hold vp
clamped. That means in steady state the
Log small-signal gain is given by
-1 clearly useful in systems that care about the
Intensities vo ⁄ V T 1
contrast changes in the image, and not the ------------------- = ---
absolute intensities. i ⁄ I bg κ
0
The receptor adapts very rapidly in re- (linearized steady-state closed-loop gain) (2)
sponse to the decade changes of intensity. and for transient signals, where the output
Source- Adaptive This rapid adaptation is due to the use of an must go through the capacitive divider, the
receptor
follower adaptive element with an expansive nonlin- gain is
receptor earity. Large changes in the output adapt
rapidly, while small signals around an adap- vo ⁄ V T 1 C1 + C2
A cl ≡ ------------------ = --- --------------------
0 tation point only adapt slowly. i ⁄ I bg κ C2
CASCODE. Qcas has two effects:
(linearized transient closed-loop gain) (3)
1. It shields the drain of Qn from the large
where κ ≈ 0.7 is the back-gate coefficient
voltage swings of Vo. Because the
-1 describing the effectiveness of gate voltage
source conductance of Qcas is larger
changes on channel surface potential. We
than the drain conductance by a factor
shall call the gain for transient signals the
of approximately Aamp, the drain of Qn
-2 closed loop gain Acl from now on.
moves only about as much as vp. With-
out Qcas, the large voltage swings across Writing the gain in dimensionless form
the gate–drain capacitance of Qn load displays the logarithmic, contrast-sensitive
-3 response properties. The response to a con-
down the input node. They make fF
gate-drain capacitance appear to the stant i/Ibg is independent of the background
-4 input node to be on the pF scale, a phe- current. The ratio between transient and
nomenon called the Miller effect. steady-state gain is the capacitive-divider
ratio (C1+C2)/C2. We generally use a capac-
2. Qcas also multiplies the drain resistance itive divider ratio of about 10, which is the
-5 of Qn by a factor of approximately
ratio of poly-poly to metal-poly capaci-
Aamp, through a cascode action. This
tance. Figure 8 shows measured transfer
increase in drain resistance increases the
-6 curves. The plots illustrate the shifting of
gain of the amplifier by a factor of
the adaptation point to the ambient intensity
about 2.
and that the transient gain is much larger
Both the reduction in effective input capaci- than the steady-state gain.
objects. We varied the overall intensity level tance and the increased gain translate into
by interposing neutral density filters (i.e., speedup. The additional speed of the recep-
sunglasses) with various attenuation factors tor makes it usable at lower intensities. The
between the light source and the receptor,
while recording the receptor outputs. These (a) Input to LED
changes in the overall intensity level repre-
sent changes in the ambient lighting, as
would be caused by passing from shadow FIGURE 7 Effect of cascode on time- (b) Dim light
into sunlight or vice versa. response and noise. (a) shows the without cascode
The amplitude of the response to the small-signal input signal. (b) shows the
response of the adaptive receptor when
small contrast variation is almost invariant the cascode is shorted. (c) shows the
to the absolute intensity, owing to the loga- (c) Dim light with cascode
response with the cascode activated,
rithmic response property. The adaptation (along with averaged curve)
along with a time-averaged response.
makes the receptor have high gain for rapid- (d) shows the response of the receptor
ly varying intensities and low gain for slow- when the light is 10 times brighter. The
ly varying intensities. Hence, the response noise properties are discussed in the (d) Bright light (10x)
50 mV
to an intensity change of a decade, after ad- text (see page 11). without cascode
aptation, is almost the same as the response
to the 15 % variation. A receptor like this is 10 ms
ADAPTIVE RECEPTOR CIRCUIT Apr 2, 1996 5
3.4
FIGURE 8 Step response operating curves. Each s- Transient
shaped curve shows the peak value of the response to a Responses
step change of irradiance, starting at the intensity marked 3.2
with a circle. The ordinate shows the peak value of the Peak
response to the step, and the abscissa shows the Output
absolute incident irradiance. The total range we tested (V) 3
Steady-state
spans 6.5 decades. The receptor was allowed to adapt response
back to its steady-state value for 5 s before each step
2.8
stimulus. The steady-state gain is between 30 and
40 mV/decade; the transient gain is approximately
1.5 V/decade, decreasing at the lowest intensities due to 2.6
interaction between rise time and adaptation time
constants.
2.4
-3 -2 -1 0 1 2 3 4 5
10 10 10 10 10 10 10 10 10
Irradiance (mW/m2 )
Equation 2 is the small-signal equivalent line speed of the source-follower receptor. gate to the drain of Qn and Cfb from the gate
of the large signal expression for the Its time constant is to the source of Qfb have a substantial effect
steady-state output voltage, CV T on the time-response of the receptor, even
C
Vo I bg + i τ in = --- = ------------- (baseline speed) (5) though the capacitance is only 0.1 fF/µm of
1 g I bg channel width. We can compute the time
-------- = V p + --- log ---------------
VT κ I0 The adaptive receptor is shown again in constant of the receptor response including
(large-signal steady-state response) (4) Figure 9, this time with relevant capacitors, these parasitics by using the fact that the
both explicit and parasitic. In the absence of Miller effect increases the relevant
where Vp is the clamped voltage at the gate
Cfb and Cn, the speedup obtained by using gate–drain and gate–source capacitances by
of Qn and I0 is the preexponential in the
the active feedback to clamp the input node a factor of A, and the feedback increases the
subthreshold transistor law.
is given by effective conductance at the source of Qfb
ADVANTAGES OF ACTIVE by a factor of the total loop gain. (In this
A amp
FEEDBACK A loop = ------------- (speedup) (6) analysis we shall imagine that we have left
An active feedback circuit has three advan- A cl out Qcas.) The time constant including these
tages over a passive feedback circuit like The speedup is equal to the total loop gain, effects is given by
the one used in many early Mead lab obtained by following the gain all the way C
projects: The bias current in the output leg τ = ---
around the loop. However, this result is na- g
of the receptor is capable of driving arbi- ive, because it ignores the important para- A amp
trary capacitive loads. The bias control al- sitics Cfb and Cn. C p + A amp C n + ------------- C fb
lows us to low-pass filter at a chosen κA cl
frequency, which lets us filter out flicker MILLER CAPACITANCES = -----------------------------------------------------------------
The Miller effect occurs when a capacitor A amp I bg
from artificial lighting. Most important, the ------------- -------- (7)
feedback, by clamping the vp node, extends feeds back the output of an inverting, high- A cl V T with cascode
the usable dynamic range of the receptor gain amplifier back to the input. If the input
by speeding it up. The small photocurrents needs to move a certain amount, it must V T  A cl 1 
charge not only its own side of the capaci- --------  ------------- C + A cl C n + --- C fb
need only charge and discharge the small κ
= I bg  amp p 
A
changes in vp, rather than the large swings tor, but also but charge the other side of the













of vo. capacitor which moves A times as much in
the opposite direction. Hence a small capac- C eff
SPEEDUP itance C looks like a capacitance (A+1)C to We can think of this result as implying an
When we computed the closed loop gain, the input. Since A >> 1, we usually ignore effective capacitance Ceff, as shown above,
we assumed that the gain of the feedback the 1. The Miller capacitors Cn from the at the input node, which can be combined
amplifier formed from Qn and Qp is infi-
nite, which of course is only true in the
sense that it is much larger than the closed
loop gain. Our assumption meant that the
input node is perfectly clamped, which is Qfb Qp Vb
Vf C2
also not true—the input has to move a little
bit to change the output voltage by the re-
quired amount. The larger the gain Aamp of C1
the feedback amplifier, the less the input Vo
node needs to move, and the more speedup I bg+i Cfb
we obtain. On the other hand, for a given Qcas Vcas
feedback amplifier, the more closed-loop FIGURE 9 Adaptive receptor Cn
gain Acl we design in the receptor (by ad- circuit with important parasitic Vp
justing the capacitive divider ratio), the capacitors. (Adaptive element is left Qn
more the input node must move, and the out.) Cp
slower the response. We’ll compute the
speedup of the receptor relative to the base-
6 Apr 2, 1996 SMALL-SIGNAL ANALYSIS
with an input conductance of Ibg/VT to com- RISE TIME & BANDWIDTH

pute the time constant. This way of thinking Equation 7 shows that the response time of b(s)
of the capacitance has the virtue that it the receptor is inversely proportional to in-
clearly shows how the effective Cfb is unaf- tensity. Figure 10 shows measurements of –
fected by either the closed-loop gain or the the small-signal rise time plotted against the
amplifier gain. When a cascode is used in absolute intensity for receptors built with x(s) + a(s) y(s)
the receptor, the Cn term essentially disap- different types of photodiodes. The inverse
+
pears. When the total loop gain is large, the relationship between rise time and intensity
Cp term also essentially disappears, leaving FIGURE 11 Generic feedback model.
is only violated at intensities above 1 W/m2
only Cfb to ultimately limit the response with light from a red LED, where the recep-
time. Clearly, the receptor should be de- tor starts to be limited by minority carrier
signed to minimize Cfb by using a narrow PHOTODIODE AREA?
diffusion lifetime to a minimum response How big to make the diode? Often the
Qfb. time of about 1 µs. This limit is unimportant
We tested this theory on a fully-instru- choice is dictated by design requirements.
for most vision applications, but specialized We’ve built receptors with areas ranging
mented receptor by turning the cascode off applications requiring very rapid response
and on and measuring the resulting speed- from 100 µm2 (10x10) up to 4000 µm 2
can use a junction with limited collection (65x65). The big wins from using a bigger
up. Capacitance and gain values are shown volume to speed up the response, at the cost
in Table 1. photodiode are that it is faster (as long as
o f l ow e r q u a n t u m e ffi c i e n c y. you some loop gain to knock out the photo-
Effective Effective GAIN–BANDWIDTH PRODUCT diode capacitance), and it is electrically qui-
value, value, eter. The penalty is lower resolution.
A feedback amplifier whose closed-loop
without with
gain is determined by the feedback element
True value cascode cascode
generally has a fixed gain–bandwidth prod-
Aamp 1000 3700 uct. The GB product is an invariant for a
Acl 11 11 particular design that can be used to com-
Aloop 91 336 pare different designs. We can compute the SMALL-SIGNAL ANALYSIS
GB product from Equation 7; the result is
Cp 100 fF 1.1 fF 0.3 fF To understand the second-order behavior of
I bg A amp the circuit, we’ll compute the general trans-
Cn 0.54 fF 5.9 fF 0.01 fF GBproduct = -------- ------------------------------------- (8)
VT A amp fer function including the time constant of
Cfb 2.7 fF 3.0 fF 3.0 fF the feedback amplifier. We’ll ignore Miller
C p + ------------- C fb
Ceff 10.0 fF 3.3 fF A cl effects in this analysis.
TABLE 1 Capacitance and gain values for In the limit where we can ignore Cfb (in re- The generic feedback model shown in
adaptive receptor. Measured κ is 0.89. ality never), the GB product is higher by a Figure 11 has the transfer function
Typical areal capacitance values are factor of Aamp in the adaptive receptor than y(s) a(s)
0.5 fF/µm2 for photodiode, 0.8 fF/µm2 for H ( s ) = ---------- = ------------------------------ (9)
in the source-follower receptor. In practice, x(s) 1 + a ( s )b ( s )
above-threshold gate, 0.1 fF/µm gate-drain or
we have measured increases of the GB where a(s) and b(s) are the feedforward and
gate-source width.
product of 500 to 2000. feedback transfer functions in the s-plane.
The ratio of Ceff without and with the cas-
code is 3.0. This value is in excellent agree-
ment with the measured speedup value 3.1, 10 -1
suggesting that we have correctly account-
ed for all the important parasitic capacitors.
slo
Table 1 is worth examining to see how Cfb 15 ms

pe
becomes totally dominant once the cascode 10 -2

=
and high loop gain take the other capacitors

1
n++/p-
out of the picture. With proper layout, Cfb
should be reducible to 0.6 fF, a factor of 5 p++/n
10–90 % Rise Time (s)
smaller than in our example. Hence, we 10 -3

n++/p- w/cas
should be able to achieve a speedup of 10
with the cascode. p++/n w/cas
FIGURE 10 Response-time measurements 10 -4 n++/p+

for photodiode receptors. Each curve shows n++/p+ w/cas
the 10–90 % rise time for a small step-intensity
change, versus irradiance by a red LED. The
different curves are for different photoreceptors 10 -5 1 lux
circuits, differing in the phototransducer and @ 555 nm
the use of the cascode. Keys: x/y means
junction between x and y, where x and y are as
follows: p- is bulk substrate, n is well, p+ is p-
type base layer, n++ is n-type source-drain 10 -4 10 -3 10 -2 10 -1 10 0 10 1
diffusion, and p++ is p-type source-drain
Irradiance (W/m2)
diffusion. w/cas means cascode is activated.
The effect of minority-carrier diffusion lifetime Limit
of direct border of direct direct
can be seen at the solid arrow ( ). This biological flourescent sunlight
moonlight rod and cone
effect is discussed on page 6. vision light
vision / consumer
CCD camera
SMALL-SIGNAL ANALYSIS Apr 2, 1996 7
In the photoreceptor circuit, the input and

output variables x and y are given by 1
x ( s ) = i ⁄ I bg and y ( s ) = v o ⁄ V T . 0
The feedforward gain element a(s) con- Gain -1.1
sists of the photodiode, the source of Qfb, (normalized) -2.2
and the amplifier consisting of Qn, Qcas,
and Qp. The transfer function is given by 10 -1 -3.26
A amp
a ( s ) = ---------------------------------------------------- (10)
( τ ins + 1 ) ( τ out s + 1 ) 10 -1 10 0 10 1 10 2 10 3 10 4 10 5
where τout is the time constant of the output Frequency (Hz)
node, set by the capacitance and output FIGURE 12 Measured amplitude transfer functions for the adaptive receptor using a
conductance in the amplifier, and τin is the photodiode constructed from native diffusion. The number by each curve is the log
time constant of the input node, set by the background irradiance, in decades. The highest irradiance is 19 W/m2, about 10 times
photocurrent and input capacitance, and direct office-fluorescent lighting. The intensity affects both the high and low frequency
cutoffs. The receptors have a constant gain over a range of 4–5 decades of frequency. At
given by Equation 5.
the highest background intensity, the gain is larger than at the other intensities, because
The feedback element b(s) consists of the feedback transistor comes out of subthreshold, reducing its transconductance. We
the capacitive divider and the gate of the normalized the curves to the mean gain for the median intensities, about 1.4 V/decade for
feedback transistor. In this analysis, we this receptor. This receptor has no Qcas to nullify Miller capacitance. The adaptation rate,
shall assume that no charge transfers given by the low-frequency cutoff, appears to scale with intensity.
through the adaptive element and hence
that we are operating the circuit in the high-
gain, transient-response mode. b(s) is given value of τout that makes the imaginary part From these expressions we can easily solve
by of the poles equal to zero. for the Q = 1/2 condition:
1 It is easiest to approach this problem
b ( s ) = -------- (11) 1 1
Q = --- when τ out = ----------------- τ in (15)
A cl from a canonical point of view for second- 2 4 A loop
order systems.18 A canonical form for the
Using Equation 9, we obtain the transfer transfer function of a second order system is For a nonringing, critically-damped re-
function: sponse, the amplifier must be faster than the
1
H ( s ) = ---------------------------------- (13) input node by about the total loop gain. This
2 2 τ
H ( s)= (12) τ s + ---- s + 1 restriction is severe, because the amplifier
Q
output is already a factor of Aamp slower
A cl where, in the case of an underdamped sys-
than it would be if the amplifier had unity
------------------------------------------------------------------------------------------------ tem, 1/τ is the radius of the circle on which
( A cl ⁄ A amp ) ( τ ins + 1 ) ( τ out s + 1 ) + 1 the poles sit, and Q, stated loosely, is the
gain. In other words, the amplifier generates
high gain by using a small output conduc-
The shape of this transfer function when number of cycles of ringing in response to a
tance, and this small output conductance
τout is small is shown in the measured step input. Q = 1/2 means a critically
makes the amplifier slow. Hence, for a criti-
c u r ve s i n F i g u r e 1 2 ( e x c e p t t h a t damped system. We can identify τ and Q in
cally-damped response, the transconduc-
Equation 12 doesn’t include the adapta- the transfer function for the photoreceptor,
tance of the input to the feedback amplifier,
tion). As s goes to zero, H(s) approaches Equation 12, as follows:
and the bias current, scales as the square of
Acl when Aamp >> Acl. As s goes to infinity, τ out τ in the desired speedup.
H(s) approaches zero. Assuming τ out is τ = -----------------
A loop We can also use Equation 12 to find the
zero results in first order system with condition for maximum Q. The result is
(14)
equivalent time constant τ in ( A cl ⁄ A amp) . τ out τ in 1
SECOND-ORDER TEMPORAL Q = A loop ----------------------- Q = --- A loop when τ out = τ in (16)
τ out + τ in 2
BEHAVIOR
If we bias the amplifier so that the amplifier
In our computation of the expected speed- output has a time constant equal to the time
up due to the active feedback clamping of constant of the input node, then we obtain
the input node, we assumed that the feed-
back amplifier is infinitely fast. The sec-
Decreasing τout
ond-order behavior when τout is not zero Im(s)
can be visualized in the root-locus plot
shown in Figure 13, which shows the loca- 10
tions of the poles of Equation 12 as τout is
decreased. In the infinitely-fast limit, the FIGURE 13 Root-locus
two poles of the second order system sepa- τ out = 0 5
plot for adaptive τ out = τin Acl /A amp
rate along the negative real axis. One pole receptor, showing the τ out = τin
shoots off to -∞, and the other ends up at poles of the transfer
the value derived earlier, corresponding to function in Equation 12, Re(s)
a speedup of Acl/Aamp over the open-loop parameterized by the -25 -20 -15 -10 -5
value. To achieve this speedup, the feed- output time constant τout
back must be very fast. If it is not, the poles of the feedback amplifier.
Parameters: -5
will have a nonzero imaginary part, and the
Aamp = 100, Acl = 10,
output will ring in response to a step input.
τin = 1.
We can derive the condition for a damped,
nonringing step response by finding the -10
8 Apr 2, 1996 ADAPTIVE ELEMENT
Vf x 10 -8
Vo Vf Vo 10
(a)
5
Vf Vo
Bipolar
p+ p+ n+ Current 0
(b) n-well Vo > Vf (A)
p-substrate MOS
-5
Vf Vo
p+ p+ n+
(c) Vo < Vf
-10
-0.5 0 0.5
Voltage (V)
FIGURE 14 Expansive adaptive element (a), shown in two
schematic forms, along with the capacitor that stores the FIGURE 15 Measured current–voltage relationship for the new
adaptation state. expansive adaptive element shown in Figure 14. The bipolar
(b) The mode of conduction when the output voltage is higher mode conduction e-folds every 28 mV, compared with 48 mV
than the capacitor voltage: The structure acts as a diode- for the MOS mode, leading to a quantitative difference in the
connected MOS transistor. voltage at which the current rapidly increases. At any scale of
(c) The opposite case: The p+/n junction is forward-biased, current, the curves have the same appearance; the voltage
and the device as a whole acts as a bipolar transistor with two scale changes logarithmically with the current scale. This data
collectors. was taken from a p-well chip.
the maximum possible amount of ringing. element acts like a pair of diodes, in paral- back gate (the well) is driven at the same
This ringing is not very severe, because the lel, with opposite polarity. The current in- time, the current e-folds every VT/κ.
maximum Q of the circuit is generally less creases exponentially with voltage for For opposite polarity, the bipolar is
than 5. We have labeled these conditions on either sign of voltage, and there is an ex- turned on. The driven side forward-biases
the root-locus plot in Figure 13. tremely high-resistance region around the the p++/n emitter–base junction. The bipo-
Usually we turn the bias current up origin, as shown in Figure 15. lar transistor has two collectors: the driven
enough to give a response that is fast The I–V relationship of the expansive el- side and the substrate. The current e-folds
enough for the situation at hand, but slow ement means that the effective resistance of every VT volts. (The back gate of the MOS
enough to filter out flicker from artificial the element is huge for small signals and transistor is also turned on, leading to a cur-
lighting. A nice feature of this mode of op- rent that e-folds every VT /(1- κ ), but this
small for large signals. Hence, the adapta-
eration is that the speedup is only effective small component is invisible relative to the
tion is slow for small signals and fast for
at low intensities, while at higher intensi- large bipolar current.)
large signals. This behavior is useful, be-
ties, the low pass filtering reduces the noise. These characteristics may be seen in the
cause it means that the receptor can quickly
adapt to a large change in conditions—say, data shown in Figure 16. This data was tak-
en with and without light shining on a near-
moving from shadow into sunlight—while
by hole in the metal covering of the adaptive
maintaining high sensitivity to small and
element, to illustrate that the currents in the
slowly varying signals. The adaptation time capacitor node (Vf) are unaffected by mi-
ADAPTIVE ELEMENT rate is proportional (in some sense) to the nority carriers generated in the substrate.
Adaptation occurs when charge is trans- signal amplitude.
The I–V relationship for the element is
ferred onto or off the storage capacitor. This For voltage polarity Vo > Vf across the el- given by
charge transfer happens through the adap- ement, the MOS transistor is turned on and
tive element. The adaptive-element is a re- the bipolar transistor is turned off. The driv- I = I m – I b + I par
sistor-like device that has a monotonic I-V en side (Vo ) in the MOS case acts as the
relationship. For analog VLSI circuits, source of the transistor, but because the ∆V ≡ V o – V f
however, true ohmic resistors available in a κ∆V ( 1 – κ ) ( – ∆V ) (17)
plain CMOS process are much too small for I m = I 0,m[ e –e ]
adaptation on the time scale of seconds.† In- † Assume we need an RC time constant of a sec-
–∆ V
stead, we use transistors in our adaptive ele- ond, and that C = 1 pF (a 50 µm by 50 µm poly-to- I b = I 0,b[ e – 1]
ment—a sacrifice with unanticipated poly capacitor). Then we need R = 1012 Ω. Polysil-
icon has a resistance of 20 Ω/square, so we would
benefits. We have developed two novel need 5x1010 squares—a 2 µm-wide poly resistor
where I is the current flowing onto the ca-
adaptive elements with dual nonlineari- with area 0.6 m2! Some DRAM processes have an pacitor. I consists of three components, the
ties—expansive and compressive.6 Here, extremely high-resistance undoped polysilicon MOS transistor current Im, the bipolar tran-
we shall discuss only the expansive ele- with ohmic properties, but it is unstable, with large sistor current Ib, and the parasitic photocur-
ment, shown in Figure 14. The expansive variations, and very temperature dependent. rent in the emitter–base junction Ipar. The
ADAPTIVE ELEMENT Apr 2, 1996 9
Gate node Well node

10 -3 10 -3
parasitic photocurrent flows out of the element. Voltages
are in units of VT. The preexponential constants I0,m and Bipolar
I0,b are for the MOS and bipolar transistors. The current- 10 -4 10 -4
gain factor β has been included in I0,b. Figure 17 shows Light and Dark
Curves
Equation 17 plotted near zero differential voltage. 10 -5 Superimposed 10 -5
ADAPTATION RATE Bipolar
The conductance of the adaptive element at the adapted 10 -6 10 -6
condition (I = 0) determines the time constant of adapta-
tion for small signals at the output. When Ipar = 0 we can MOS MOS
see by inspection of Equation 17 that the conductance is 10 -7 10 -7
e-fold
g = I 0,m + I 0,b (18) 28 mV
Current (A)
10 -8 10 -8
The detailed measurements in Figure 16 indicate that I0,m Light
and I0,b are both approximately equal to 0.1–1 fA. The
parasitic photocurrent Ipar in the emitter–base junction is 10 -9 10 -9
e-fold
small but nonetheless important, because it determines 48 mV
the imbalance across the element that must be maintained 10 -10 10 -10
in an adapted condition of zero net current. Of course, Ipar
is never zero, due to scattered light and dark light (thermal
photons). In any case, the conductance near the adapted 10 -11 10 -11
state is determined by the condition I = 0. Ipar is a positive Dark

current flowing onto the capacitor; hence it shifts the I-V 10 -12 I 0=10-16 – 10-15 A 10 -12
curve upwards, resulting in a conductance at I = 0 that is

approximately proportional to Ipar, when Ipar is somewhat
10 -13 10 -13
larger than I0,m + I0,b. 0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1
The rate of adaptation is proportional to intensity, judg-
|∆V| (V) |∆V| (V)
ing from the transfer curves shown in Figure 12. This be-
havior is to be expected if scattered light creates a FIGURE 16 Detailed I–V characteristics of the bipolar–MOS
parasitic photocurrent Ipar. However, the more careful adaptive element, taken in the dark and under illumination. At
measurement shown in Figure 18 indicates that only un- the gate node (left plot), the I-V characteristics are unaffected
der intensities above approximately 500 mW/m2 is the by light because the emitter/drain diffusion is shielded from
adaptation rate affected by light intensity; at low intensi- minority carriers by the well. At the well node (right plot), there
ties, the adaptation rate is constant, suggesting a conduc- is a large parasitic photocurrent from the well to ground.
tance of about 4 fA/VT in the adaptive element. This Current gain for the bipolar mode is visible as a gate node
conductance is consistent with the I0,m and I0,b from in- emitter current that is approximately 100 times larger than the
well node collector current.
spection of Figure 16.
OTHER ADAPTIVE ELEMENTS
The MOS–bipolar adaptive element is inherently resistant
to the deleterious effect of diffusing minority carriers at 10 1
the capacitor node. Earlier attempts to construct adaptive
elements purely from MOS transistors, such as the one Time
shown in Figure 19, suffer from collection of minority Constant Darkness
carriers by the parasitic photodiodes formed by (s)
source–drain diffusions. There are offsets in these other 10 0
elements of about a volt in an adapted condition (where
no net current flows onto the capacitor). The large offset
voltages arise from the huge back-gate voltage from the
bulk that tries to turn off the transistor. The new adaptive
10 -1
-4 -3 -2 -1 0
1
Log Intensity
Im FIGURE 18 Time constant of adaptation (5 W/m2)
node vs. intensity, using red LED.
-2
2 ∆V
Ib -1
I FIGURE 19 A bad choice
for an adaptive element.
Parasitic photodiode pulls
adaptation node towards
substrate voltage.
FIGURE 17 Theoretical plots of currents in Equation 17,
assuming κ=0.7, I0,m and I0,b both equal 1, and Ipar is zero.
10 Apr 2, 1996 PHOTODIODE VS. PHOTOTRANSISTOR
element has offsets of less than 100 mV in A nearby substrate contact sinks the photo-
an adapted state, owing mostly to the zero current to ground. The transistors in the p++ emitter
back-gate voltage. feedback amplifier are long, to maximize
the gain and hence the speedup. The adap- n well base
tive element is made from an isolated well
p– substrate collector
with a single MOS transistor. The capaci-
tive divider is formed from the two levels of FIGURE 20 Parasitic vertical bipolar
PHOTODIODE VS. polysilicon plus a metal plate, but a MOS transistor in an n-well process.
PHOTOTRANSISTOR capacitor may be used instead. The total
area in this conservative layout is about
Previous logarithmic photoreceptor designs 80x80 µm2 in a technology with a 2-µm the photocurrent, but the larger the capaci-
from the Mead lab all used the parasitic ver- feature size. tance. Since we are pretty much stuck with
tical bipolar transistor, shown in Figure 20, The capacitive-divider ratio determines a fixed technology, we regard this speed as a
instead of a photodiode. We used the bipo- the gain of the receptor for transient signals. constant of the problem. †
lar transistor because bipolars have much The capacitance of the adaptive element it- In the devices available in an ordinary
less 1/f noise than MOS surface channel de- self is usually not negligible compared with CMOS process, the maximum receptor
vices, and because the larger output current the explicit feedback capacitor C2 and must speed is obtained from one of the substrate-
is more capable of driving capacitive loads. be taken into account. It consists mostly of junction photodiodes, because these have
Later on, when we started to use active the active-well junction. A typical value is the largest quantum efficiency and the low-
feedback circuits, we continued using bipo- 15 fF, equivalent to a square metal–poly ca- est capacitance. We have chosen to use the
lars for these same reasons. pacitor with an edge length of about 17 µm. active–substrate diode rather than the
Once we started to characterize the noise, well–substrate diode in all of our designs
we immediately discovered that 1/f noise is because the layout is much more compact.
negligible compared with shot and thermal We expect that the well–substrate diode will
noise. In fact, there is a big disadvantage to
using bipolar phototransistors in the adap- THE ILLUMINATION LIMIT have a similar capacitance and quantum ef-
ficiency.
tive receptor. The active feedback speeds up
the response by clamping the input node.
(SPEED)
When we use a phototransistor instead of a From an evolutionary perspective, it is clear † The speed could be increased if we had access to
photodiode, the feedback clamps the emit- that animals that can see in the dark occupy PIN diodes 28, where the p and n regions are sepa-
ter node, but the base is left floating. Indeed, an important niche. The same is true for our rated by an intrinsic region that increases the vol-
the base must not be clamped if the bipolar silicon receptors. If they can only be used in ume, and hence the quantum efficiency, and
transistor is to function with its normal cur- bright sunlight, then they are not very use- simultaneously decreases the capacitance of the
ful. device—but we know of no CMOS process with
rent gain mechanism. The current available these characteristics.
to charge and discharge the base of the tran- The source follower receptor has a re- In a typical 2-µm n-well process, the active-sub-
sistor is approximately the same photocur- sponse speed that is determined primarily strate capacitance is around
rent available from the photodiode. As a by the ratio of quantum efficiency to capaci- 120 aF/µm2+200 aF/µm. In the more heavily
result, we obtain no speedup. The dynamic tance per unit area in the photodiode. The doped 1.2-µm process, these values jump to
range is at least 1–2 decades smaller with a larger we make the photodiode, the larger 500 aF/µm2 + 400 aF/µm.
phototransistor than with a photodiode. In
the discussion of receptor noise, we shall
see that the noise properties of photodiode
receptors and phototransistor receptors are
indistinguishable (page 15). In the context Adaptive
element
of an active feedback circuit, it makes no
sense to use bipolar phototransistors.
Feedback
capacitor
(C2)
Amplifier
RECEPTOR LAYOUT (Co)
Figure 21 shows the layout corresponding
to the schematic in Figure 5. The photo-
diode can be constructed from any of the pn Storage Cascode
Capacitor transistor
junctions that are part of a CMOS process,
(C1)
but the one that we use in practice is the
junction between native source–drain diffu- 6 µm
sion and substrate. In an n-well process, the
junction is between the p– substrate and the
n++ diffusion. We use this junction because
the quantum efficiency is high and the ca- Feedback transistor
pacitance per unit area is relatively small, Photodiode (Cp)
resulting in a fast response. This junction
(also hole in overlying metal)
also has the advantage that it may be con-
structed simply as an extension of the FIGURE 21 Photoreceptor layout. This layout is nonoptimal because the feedback
source of Qfb. All parts of the circuit except transistor is wider than it needs to be, leading to excessive Cfb (see page 5).
for the photodiode are covered with metal.
THE DETECTION LIMIT (NOISE) Apr 2, 1996 11
A reasonable definition of the lower lim- illumination the current to ground that they The important observation is that the to-
iting intensity for operation of the receptor produce may exceed the bias current sup- tal noise power, integrated over the entire
is the intensity at which the photoreceptor plied by Qp , pulling the output node to passband of the receptor, is a constant inde-
has a rise time of 15 ms—corresponding to ground. This situation may be ameliorated pendent of intensity. The lower the intensity,
the time for a single field of a video camera by shielding the native-type transistors in the smaller the bandwidth of the receptor,
that scans at 60 Hz, and approximately the the amplifier from light using a metal wir- but the larger the noise level within the
same as the cutoff frequency for human vi- ing layer, and by surrounding them with a passband. The responses of the logarithmic
sion under photopic† conditions. We can guard bar made from native diffusion that is receptor to white noise stimulation show
see from Figure 10 that the fastest photore- preferably tied to V dd . (It doesn’t make that the shape of the noise spectrum is the
ceptor circuit that we have tested has a rise much difference if you tie the guard bar to same as the shape of the receptor transfer
time of 15 ms at about 1 mW/m2 irradi- ground or Vdd.) Guard structures are dis- function, at each intensity level. But while
ance—equivalent to an illuminance of cussed further starting on page 15. the transfer function measurement shows
about 1 lux††, or approximately the light- that the receptor contrast gain is constant,
ing of the full moon. This receptor is built independent of intensity, the underlying
with a feedback amplifier with a gain of noise behaves differently, becoming larger,
several hundred, a closed loop gain of the lower the intensity.
about 10, and a photodiode with an area of THE DETECTION LIMIT We know that the noise arises from with-
20 µm by 20 µm. (NOISE) in the receptor, and is not an artifact of the
CCD detectors are integrating devices; measurement (for instance, from noise in
every frame, they dump out all the charge What is the smallest signal that can be de- the LED light source). The reason is that the
they collected since the last frame. The tected? How is this value affected by light amplitude of the noise depends on the level
sensitivity limit is determined by the elec- intensity, and by detector area? How does of intensity. If the noise arose from the
tron counting noise in the charge-sensing the performance compare with commercial steady light source, then reducing the inten-
amplifier. Current commercial amplifiers, CCD cameras and biological rods and sity by interposing neutral density filters
in consumer end-product devices, function cones? What is the physical basis for the re- would result in a set of curves like the top
at a noise level of about 100 electrons, ceptor noise? set of curves in Figure 23 showing the re-
meaning that the RMS noise in the output We’ll investigate the noise properties of sponse to a white noise source. In other
is equivalent to 100 electrons in the charge the receptors and their detection ability, em- words, if the noise arose in the supposedly
bucket. If we assume that the cameras must pirically and theoretically. We’ll start em- steady source, then the bottom curves
have at least 4 bits resolution to be accept- pirically, with measurements of noise would duplicate the top set of curves but be
able, then the number of electrons collect- properties. These measurements show that shifted down by a constant amount. Since
ed must be 16 times the noise level, or the underlying behavior is very simple. The the behavior is clearly quite different, we
1600, each 1/60 s. A typical CCD pixel simplicity of this behavior motivates an are certain that the noise arises in the detec-
area is 100 µm2. The quantum efficiency is equally simple theory that intuitively and tor itself.
about 30 %. From these numbers, we com- Another important observation is that
quantitatively describes how noise works in
pute that the irradiance is 1.4 mW/m2, or flicker noise (1/f) in the receptor is negligi-
logarithmic photoreceptors.
1 lux at 555 nm, consistent with the adver-
tised ratings††† of a few lux. EMPIRICAL OBSERVATIONS Bias Light
The borderline between rod and cone vi- The observations shown in Figure 23 were
sion occurs at an illuminance of about 1 measured from the simple source-follower
lux, which is approximately the level of (a) Output
detector shown in Figure 22. We captured voltage
bright moonlight. In Figure 10, we have la- the noise power spectra using two types of
beled the rod-cone border and the moon- Photodiode
stimuli, a steady illumination from an LED
light irradiance. and a white-noise source. We used the white
In summary, the current photoreceptor noise source as a direct measurement of the
circuit functions down to about the same receptor transfer function. We did each
intensities as consumer CCD cameras and measurement at several levels of intensity,
human cone receptors. separated by decades. It is clear that all
ILLUMINATION LIMIT: HIGH END characteristics are well above the instru- (b)
MOS transistors Qn and Qcas that form the mentation noise.
bottom of the amplifier circuit in the adap-
tive receptor contain parasitic photodiodes e–
from their drains and sources to the sub- FIGURE 22 The simple logarithmic
strate. Ordinarily, these parasitic photo- photoreceptor used in the study of
diodes are irrelevant, but under intense receptor noise. This circuit forms the
input stage to the adaptive receptor.
(a) shows the schematic form. The bias
† Photopic means cones vision, mesopic means voltage sets a reference for the source
cones and rods are both used, and scotopic means
(c)
voltage, which is the output. (b) shows
rod vision. that the compact receptor consists of a
†† At 555 nm wavelength (the peak of human sen- single MOS transistor whose source Floats to
sitivity under photopic conditions), forms the photodiode. (c) shows the necessary
1 lux = 1.4 mW/m2. For white light (uniformly electron energy diagram. The source level
distributed over visible), voltage floats to whatever level is
1 lux=4 mW/m2=104 photons/µm2s. 21 required to spill the photocurrent over
††† Whatever these ratings mean—supposedly
Electron
the channel and into the drain. energy
each manufacturer makes up their own definition,
but none of them tell you what it is!
12 Apr 2, 1996 THE DETECTION LIMIT (NOISE)
FIGURE 23 Noise spectra for the source-follower logarithmic -60

receptor shown in Figure 22, at different intensities. The curve
labeled (a) is from an isolated follower pad, and shows that the
measured 1/f instrumentation noise in the follower pad and (c) 1/f2
spectrum analyzer is smaller than measured spectral noise from -3 -2 -1 0
the detector. The curves labeled (b) show the intrinsic noise -80
Signal Amplitude (dBV/√Hz)

spectra from the receptor at a given level of steady background
intensity. The number by each curve is the log background
irradiance. 0 log irradiance is 1.7 W/m2. The lower of each
smooth curve is the theoretical fit based on the theory given in -2
the text; the upper curve is twice the theoretical value. The -100
curves labeled (c) show the response of the receptor to small-
(b) -1
signal white-noise stimulation from an LED. The stimulus for
each curve has the same contrast, i.e., it is formed by
interposing neutral density filters between a white noise source 0
and the receptor. The straight lines have a slope of 1/f 2 , the -120
same as from a first-order low-pass filter. The number by each
curve is the log background irradiance. Definition of dBV units is 1/ f
the signal power, in dB, relative to a 1 V signal. Parameters used (a)
in the fits: node capacitance C=341.2 fF, temperature T=300°K, -140
and time constant τ chosen to make cutoff frequency correct at
-1 0 1 2 3 4
the brightest intensity. τ is scaled inversely with intensity for the 10 10 10 10 10 10
other curves. The capacitance consists of a 20x20 µm2
photodiode with areal capacitance of 0.122 fF/µm2 and edge Frequency (Hz)
capacitance 0.451 fF/µm (total 85 fF), a 6x6 µm2 gate with areal
capacitance 0.828 fF/µm2 (oxide thickness 417 Å, 29 fF), and a or
metal wire with total capacitance 92 fF. 2
∆Q = kTC (21)
Substituting this result in Equation 19, we
obtain the total dimensionless noise power
ble, although it is clearly dominant in the in- lower receptor, the gain of the receptor is for the source-follower receptor:
strumentation. One often gets the simply V T = kT ⁄ q per e-fold change in 2
impression from the literature that flicker the intensity, and hence the total dimension- kTC ⁄ C
2 qV T C ⁄ C q
noise dominates MOS transistor operation, less noise power P is given by P = --------------------- = --------------------------- = ------------- (22)
2 2 CV T
but here it clearly does not. VT VT
2 2 2 2
T h e s e c o n d s e t o f o b s e r va t i o n s ∆v ∆i ∆Q ⁄ C
P ≡ --------- = -------- = ---------------------- (19) The simplicity of this answer is quite re-
(Figure 24) compare the noise spectra of the VT 2 2 V T2 markable. It says that the dimensionless
I bg
simple source-follower receptor and the noise power—namely, the noise expressed
adaptive receptor. We injected a small test where ∆x2 means the mean-square variation in input units—is the ratio of the unit charge
signal to examine the SNR degradation by of x. The reason we write the noise in the to the “thermal charge,” CV T . In hind-
the adaptive feedback circuit. We can make form of Equation 19 is that we shall derive sight, what else could it have been?
the following observations: an expression for the mean square variation For a typical input capacitance of 100 fF,
1. The adaptive feedback circuit amplifies in the charge sitting on the output node of Equation 22 says that the total noise power
both signal and noise, but degrades the the source-follower receptor. We will use at room temperature is about 10-4, equiva-
SNR by less than 3 dB. that expression in Equation 19 to obtain the lent to an RMS variation of about 1 %.
2. The feedback circuit and the cascode receptor noise. TOTAL NOISE IN ADAPTIVE
widen the bandwidth; the extent of the There are two equivalent methods to RECEPTOR. The feedback circuit in the
widening, a factor of 1.5 to 2 decades, is compute the charge fluctuation. The first adaptive receptor adds minimal noise, but it
in agreement with earlier predictions way uses the principle of equipartition from does extend the bandwidth. The total recep-
given in the discussion of the adaptive statistical mechanics. The second way ana- tor noise is hence increased to
receptor, that were based on the para- lyzes the statistics of the individual charges.
sitic capacitances and gain measure- q
USING EQUIPARTITION TO P = ------------------- (23)
ments. C eff V T
COMPUTE THE NOISE POWER. T h e
The main conclusion of this measurement is principle of equipartition says that the aver- where Ceff, given in Equation 7 (page 5), is
that the degradation of the SNR by the age energy stored in each independent de- the effective input capacitance assuming a
adaptive feedback circuit is small enough gree of freedom of a system in thermal source conductance Ibg/VT at the source of
that our analysis can treat the feedback and equilibrium is kT ⁄ 2 . A degree of freedom Qfb. For a typical speedup of about 30
adaptation as a noiseless amplifier. If we is a parameter that appears quadratically in (Table 1, page 6) the total noise is increased
can understand what determines the noise the energy—for instance, each component by the same factor, leading to an RMS vari-
in the input stage of the adaptive receptor of the velocity of a free particle. Similarly, ation of about 5 %.
circuit, then understanding the noise behav- the charge on a capacitor is a degree of free- USING SHOT NOISE STATISTICS
ior of the complete adaptive receptor circuit dom, because the energy stored on the ca- TO COMPUTE THE NOISE POWER.
is t r iv i a l . 2
p a c i t o r i s g iv e n b y Q ⁄ 2C . U s i n g The equipartition computation may appear
THEORY OF LOGARITHMIC equipartition, we can write the following re- to be magic. Relying solely on this principle
RECEPTOR NOISE lation between the fluctuations in Q and the may give one an uncomfortable feeling that
In a logarithmic detector, the natural input temperature: something has been left out. That some-
units are fractions of the baseline signal, 2 thing is the intuition and understanding of
and the natural output units are fractions of ∆Q kT the origin of the noise, and why it takes the
----------- = ------ (20)
the e-folding parameter. In the source-fol- 2C 2 interesting form given by Equation 22.
THE DETECTION LIMIT (NOISE) Apr 2, 1996 13
FIGURE 24 Comparison of noise spectra for -40

simple source follower detector of Figure 22 and the
adaptive receptor shown in Figure 5, with the layout with
shown in Figure 21. The gain of the source follower -60 Cascode
Amplitude (dBV/√Hz)
receptor is about 60 mV per decade. The gain of
the adaptive receptor is about 1.3 V/decade, or Adaptive receptor
27 dB more than the gain of the source follower -80
receptor. The plots show the measured power without
spectra for each detector, along with the spectrum
of the follower pad instrumentation. The DC Source follower Cascode
-100
irradiance for this measurement is 0.2 W/m2, and
the injected signal is a combination of a 150 Hz 1/f 2
sinusoidal signal and a 1.5 kHz signal each with
-120 1/f
Rayleigh contrast of about 1 %. We can see from Instrumentation
the height of the signal spike relative to the
surrounding noise that the SNR for each detector
are nearly indistinguishable. For definition of dBV -140
units, see Figure 23. 10 0 10 1 10 2 10 3 10 4
Frequency (Hz)
2
The macroscopic flow of current through energy is given by q τ , where τ is the response of the node to a disturbance. If the
the receptor circuit consists of the micro- mean length of time that the charge is disturbance consists of a sudden excess
scopic movement of discrete charges. As present. Hence, the noise energy in a single amount of charge on the node, the system
shown in Figure 25(a), single charges event is given by returns to its equilibrium level after some
cause step changes in the voltage on the ca- 2 time—the time being the average lifetime
pacitor. The charges are collected by the E≡q τ (24) of the individual charges that make up the
photodiode and leave via the channel of the The current consists of a random stream of disturbance. The decay from the excess is
feedback transistor. The time at which a single events. All the events are indepen- exponential, and the e-folding time is once
charge appears, and the time that it stays on dent, so we can compute the mean square again τ . Hence, in Equation 25 we can re-
2
the capacitor, are both random. A given charge fluctuation ∆Q (the noise power) place τ with the time constant of the sub-
charge spends a random amount of time sit- in the signal by taking the noise energy in a threshold circuit, CV T ⁄ I bg , to obtain
ting on the output node. These times are typical event given by Equation 22 and CV T
multiplying by the average number of 2
distributed according to a Poisson distribu- ∆Q = qI bg ----------- = (26)
tion. events per unit time, given by I bg ⁄ q : I bg
Each charge is independent of all the 2 I bg 2 I bg kT
∆Q = E ------- = q τ ------- = qI bg τ (25) qCV T = qC ------ = kTC
others, which means that if we can com- q q q
pute the statistics of the average charge, We can think of the preceding computation which is the same result as Equation 21, ob-
then we can easily obtain the statistics of in two ways: Either as an average of a single tained using equipartition. Notice that here
the current as a whole. We shall first com- particle over many lifetimes, or as an aver- we have suddenly slipped thermal behavior
pute the average noise energy in the step age of many particles over a single lifetime. into a discussion that was strictly statistical.
change of charge caused by a single The two viewpoints lead to identical results; Boltzman statistics determines the time
charge. Using the independence of the the former is perhaps more rigorous in scale of the integration, which in turn deter-
charge events, we shall then compute the terms of obtaining the correct multiplier, mines the total noise.
noise power in a current that consists of a while the latter offers a better intuition of SHOT VS.THERMAL NOISE? W h e n
flow of single charges. what process physically governs the noise we initially attempted to compute the noise
The amplitude of the step change is the behavior—an average over a time window. in the receptor, our theory consistently dif-
unit charge q of a single electron. The noise Formally, the equivalence of the two ap- fered from the measured noise power by a
energy contained in the step is the integral proaches arises from the ergodicity of the factor of about two—our theory always pre-
over time of the squared deviation of the system. dicted a noise power twice what we mea-
charge from the mean value. The mean val- A crucial fact is that the average lifetime sured. This distressing situation was not
ue is zero, since this single charge has finite τ of a charge is the same as the time con- resolved until Rahul Sarpeshkar pointed out
duration. During the presence of the stant of the node—a fact that is obvious but that shot and thermal noise are alternative
charge, the value is simply q, the size of the hard to prove. The reason it is true is that the aspects of a single underlying phenomenon.
charge. Hence, the mean value of the noise time constant describes the time scale of the We had been under the impression that they
FIGURE 25 Shot noise computations. (a) The flow

of current in the source-follower receptor consists of
unit charges that appear and disappear, acting
independently. (b) A single charge appears at a Average
random moment, lasts a random amount of time, and
Charge
τ lifetime
disappears. The distribution of times that the impulse
lasts is described by a Poisson process. The mean q
charge level is zero. The average energy in the
impulse is q2τ. Time
(a) (b) Mean level
14 Apr 2, 1996 THE DETECTION LIMIT (NOISE)
-80
FIGURE 26 Photodiode and
Signal -80
phototransistor noise properties are
Signal + Noise (dBV/√Hz)

indistinguishable, and bandwidths are
identical with passive feedback. The 1 /f 2
stimulus is a steady background with a -100 -90
small AC component. The test signal has
the same SNR for each device.
-100
-120
were separate phenomena, with identical -110
1/f
magnitudes. We computed the shot noise,
and then insisted that the thermal noise was Instrumentation
something additional—which is wrong. -140 -120
1 2 3 500 600 700
Perhaps the easiest way to see how shot 10 10 10
and thermal noise are related is to consider Frequency (Hz) Frequency (Hz)
the flow of individual charges that make up
a current. Looking at the current, we ob-
serve an average arrival rate I b g ⁄ q of ele- noise—is interesting, because it says how to the transit of a single charge, then it
m e n t a r y c h a rg e s . T h e c h a r g e s a r e detectable a signal is if we know that it oc- makes sense that the noise is approximately
uncorrelated. Given an average arrival rate, curs within some particular frequency band. unity. At a current of 1 nA, the spectral
and the fact that all the charges are indepen- The noise fluctuations are spread over the noise density from Equation 31 is approxi-
dent, in some sense the statistics of the flow spectrum of a first-order lowpass filter. The mately 1 ns. The noise power is unity if we
are as random as they can possibly be. How power spectrum (the power per unit fre- observe the signal over a time scale of 1 ns,
could more noise be added? Only by intro- quency) is given by and it decreases linearly with the time over
ducing additional correlations. An example dP 4τP which the signal is averaged.
S ( f ) = ------- = ---------------------------- (28)
is 1/f noise, where modulatory fluctuations df 2
in the level of current contribute additional 1 + ( 2πfτ ) MEASUREMENT AND THEORY
noise power. Temperature has no effect on The funny normalization constant 4τP ap- We can compare the measured spectra of
these statistical fluctuations; it serves only pears so that the integral over all frequen- the source-follower receptor with the theo-
to set the bandwidth—the averaging cies is the total power P: retical spectrum. The three fit parameters
time—at which the system operates. ∞ are the temperature T, the capacitance C,
and the integration time constant τ—which
EFFECT OF TEMPERATURE. I t m a y
seem odd that the total noise power given by ∫ S( f )df = P (29) scales inversely with intensity. After ac-
counting for all node capacitance using pa-
Equation 22 is inversely proportional to the 0
temperature. Usually, we think that reduc- Within the passband of the lowpass filter, rameters supplied by MOSIS, assuming
ing the temperature reduces the noise. This the spectral density of the shot noise is room temperature operation, and choosing
odd behavior is due to a bandwidth effect. the time constant to match the cutoff fre-
 CV T  q
S = 4τP = 4  -------------   ------------- 
When the temperature is reduced, the ther- (30)
quency of the measured spectrum for the
mal voltage scale is also reduced. At a given  I bg   CV T  brightest intensity, we can see that the fit is
level of current, determined by the light in- very good. The lower of the two theoretical
or curves is the predicted result; the upper
tensity, the source conductance of the feed-
back transistor is increased, and the time
4q curve is twice the prediction. Another mea-
S = ------- (31)
constant is reduced. Hence, reducing the I bg sured receptor with a different capacitance
temperature increases the bandwidth. The shows comparable results, although we
We can attribute half of this noise power to don’t show the data here because the results
circuit integrates over a smaller time scale, each terminal connecting to the capacitor,
the current is observed with finer granulari- are so similar. Direct measurements of the
to obtain a well-known result given in all current noise spectrum in a single transistor
ty, the individual charges are more closely the standard texts:
resolved, and hence, the total dimensionless confirm these results with a single parame-
noise power is increased. We must recall d∆I
2 ter fit.26
------------ = 2qI (32) ASSUMPTIONS OF DIFFICULTY. A
that we are talking about the dimensionless df
noise, given by Equation 19, that measures Equation 32 gives the shot noise power on a common phenomenon for people who
noise in input-referred units, and not the node arising from a single source of current study noise is to start with a pessimistic atti-
noise voltage. We can see from Equation 19 onto that node. However, there are subtle- tude that assumes that noise, inherently a
and Equation 22 that the mean-square out- ties involved in the indiscriminate use of random phenomenon, is not quantifiable.
put noise voltage is given by this expression. Sarpeshkar analyzed the The statistics of noise are just as quantifi-
noise properties of MOS transistors for both able as the statistics of the steady-state flow
2 qV T
∆v = ------------ (27) saturated and nonsaturated operating condi- of current in a MOS transistor. In the case of
C tions in subthreshold operation, and showed steady-state current, we study statistics of
which is proportional to temperature, but that Equation 32 applies to a MOS transis- diffusion—first-order statistics, while in the
the output noise voltage is not a relevant tor only when the transistor is saturated.26 case of noise we study statistical
quantity by itself because it must be com- THE ESSENCE. From Equation 31, we fluctuations—second-order statistics. It is
pared with the natural voltage scale given see that S has units of time: The spectral satisfying to be able to quantify so precisely
by VT. density of noise is proportional to the time a phenomenon that initially seems intrinsi-
NOISE SPECTRAL DENSITY. T h e per unit charge. The less time between cally random and unpredictable.
amount of noise power per unit band- charges, the smaller the noise. If we observe For more insight and detailed discussion
width—the spectral density of the the signal over the time scale corresponding about electronic noise and photon counting,
MINORITY CARRIER DIFFUSION & GUARD STRUCTURES Apr 2, 1996 15
S a r p e s h k a r e t a l . 2 6 a n d R o s e ’s t wo The total noise is equivalent to a fraction is not good for short distances. The space
books24,25 are helpful. tional variation of 100% of the signal! constant is shorter, the closer the spot to the
PHOTODIODE VS. Under dim lighting conditions, shot sensor, probably because of geometrical ef-
PHOTOTRANSISTOR: noise dominates any detector system. fects due to the finite size of the test spot or
NOISE BEHAVIOR The limited integration time of a CCD three-dimensional effects of the interaction of
Earlier we saw that phototransistors have the detector means that shot noise is a real surface recombination with the pure expo-
disadvantage that the base cannot be limitation to the usable limit of opera- nential decay. With no guard bar, the mea-
voltage–clamped to speed up the response. tion for video imagers, where the inte- sured current is reduced by a factor of 10 in a
Figure 1 shows that the noise properties of gration time is at most 1/30 s. At distance of about 40 µm.
photodiodes and phototransistors are indis- moonlight illumination levels, the num- To test for the possibility that this relatively
tinguishable. The reason is obvious, given ber of charges collected per integration long diffusion length of 30 µm is due to scat-
that the analysis just given may be directly time is less than 1000. The shot fluctua- tering of light, and not minority carrier diffu-
applied to the base-emitter junction of the tion, 30, means that at best—with a sion, we compared the diffusion length
phototransistor. noiseless amplifier—the noise level is measured with a red and green LED. The ra-
3%, limiting the SNR to 30 dB. tio of absorption length for the red compared
NOISE ADVANTAGE OF CONTINUOUS- with the green LED is a factor of 2–3, so if
TIME RECEPTORS OVER SAMPLED light scattering is a significant effect in the
DETECTORS
measurement of diffusion length, we expect a
Continuous-time logarithmic photorecep- difference between the diffusion lengths mea-
tors have a natural advantage in noise per- MINORITY CARRIER sured with the two LEDs. There is no differ-
formance over sampled detector systems.
The advantage comes from the scaling of
DIFFUSION & GUARD ence. The decay with distance is nearly
bandwidth with intensity, so that circuit, in a STRUCTURES
sense, adapts its integration time to how
much signal is available. The less light there Turning now to an different topic, we’ll
is, the longer the integration time. The total discuss the diffusion of minority carri-
noise is held constant. ers and guard structures. We built the
simple structure shown in Figure 27 Probing light spot
In a CCD detector, the bandwidth is limit- 3 um
ed by the sampling rate, because the pixels specifically to measure lateral diffusion
cannot store state across samples. This limit- length of minority carriers, and to test 17 um
ed integration time means that the number of for the effectiveness of various guard Sensing
integrated photons goes as the intensity. The structures in blocking their diffusion. Phototransistor
total noise power goes inversely with the in- The structure consists of a central para-
tensity—it gets worse as the intensity de- sitic bipolar phototransistor surrounded
creases instead of staying constant. There is on three sides by guard bars with differ- 7 um
nothing magical about it: the CCD detector ent widths. The fourth side is left bare.
holds its bandwidth fixed while the log re- The bipolar transistor acts as a probe for
ceptor holds the number of integrated pho- minority carriers by collecting and am-
tons fixed. plifying the local concentration of ex-
cess minority carriers in the bulk. This Guard bars
If we look at the spectral density of the
chip was built in a 2 µm p-well technol- Path of light spot
noise power, we see that there is no differ-
ogy. We measured the emitter current
ence: in both types of detectors the spectrum 1Distance
9/16"
using a Keithley 617 picoammeter, with
of the noise comes from the quantal nature Phototransistor
a 2 V collector-emitter bias.
of light and charge; in both, the noise spec-
tral density goes inversely with the intensity. The guard bars consist of two rectan-
One additional complication is that a gles of ordinary p-type source–drain
CCD detector invariably uses a charge sens- diffusion and a rectangle of p-well dif- Light
ing amplifier to convert the photocharge to a fusion. All guard bars were grounded to spot Source-drain
diffusion
voltage—the signal that is actually used. the bulk potential. Well diffusion
The noise in the charge-sensing amplifier is We imaged a 12 µm spot of light onto

generally equivalent to a fixed number—on the test structure at various locations FIGURE 27 Layout used to test for
the order of a hundred—of electrons at the away from the sensing transistor using a light-generated minority carrier
input. As the intensity decreases, the number small pinhole and the 100x objective diffusion and for the efficacy of
of integrated photocharges becomes compa- lens on the microscope. We moved the various guard bars in blocking
rable and then smaller than this fixed ampli- chip under the spot using a two-dimen- minority carriers. The large square in
fier noise. For very low intensities, the sional motorized positioning system. the middle of the figure is a parasitic
The results of the measurements, bipolar transistor that probes for
mean-square charge fluctuation in the output minority carriers.We shined a probing
signal is fixed instead of decreasing linearly Figure 28, show the measured current as
light spot onto the structure at various
with intensity. Here is a specific example a function of the distance of the test spot distances and directions away from
that will make this idea clearer: Suppose the from the center of the sensing transistor. the phototransistor and measured the
amplifier noise—the total RMS fluctua- Each curve corresponds to moving current generated in the
tion—is equivalent to 100 photoelectrons. across a different guard structure. phototransistor. The guard structures
When the integrated number of photoelec- For distances greater than approxi- consist of p-type diffusion in the n-
trons is 100, the fluctuation in the signal it- mately 70 µm, the decay of carrier con- type substrate. The 3 and 7 micron-
self is 10—a 10% variation. In reality, centration is exponential and the wide structures are source-drain
however, this small fluctuation in the signal measured e-fold distance is about 30 diffusion, and the 17 micron-wide
structure is a p-well.
is totally swamped by the amplifier noise. µm. An exponential-decay approxima-
16 Apr 2, 1996 SPECTRAL SENSITIVITY
indistinguishable, as shown in Figure 28c. (a) Effect of guard bars

This result is not very surprising because 80
the absorption length for red and green light if no minority carriers
centers around 1 µm. 60 no guard
The guard bars were only moderately ef- 3 µm diff
Current (nA)
fective in reducing the minority carrier den- 7 µm diff
sity. The widest guard bar, a 17 µm well, 40 17 µm well
reduces the minority carrier density by up to
a factor of 10, particularly when the test 20
spot shines directly onto the guard bar. The
other two guard bars, which consist of
0
source-drain diffusion, are less effective, re- 0 100 200 300
ducing the carrier density by a factor of 2–3. Distance from center (um)
Interestingly, the 3 µm guard bar seems to
(c) Diffusion expt comparision
be more effective than the 7 µm guard bar. of Red and Green LED
We don’t know the reason for this result, but (b) Effect of guard bars 1
2
we can repeat it. no guard
SUMMARY. We measured a diffusion 3 µm diff
length of about 30 µm at distances of great- 7 µm diff Red LED
er than 70 µm from the source. A mini- 17 µm well response with
mum-size silicon retina pixel has about the superimposed
same dimension, which means that minori- 0
green
log Current (log nA)
log Current (log nA)

ty carriers can have a large effect on circuit- response
ry surrounding an opening in the overlying
metal. The use of guard bars can reduce the
number of minority carriers, but not by very
much. Even a 17 µm-wide well—the deep-
est available diffusion—can only reduce the
minority carrier concentration by a maxi- 0 -1
mum factor of about 10. Circuits must be
inherently resistant to the effect of excess
minority carriers—like the adaptive ele-
ment discussed earlier—if they are to func-
tion correctly with light shining on the chip. 0 100 200 300 0 100 200
Distance from center (um) Distance from center (um)
FIGURE 28 Results of measurements on diffusion test structure shown in
Figure 27. (a) shows the measured photocurrent due to minority carrier diffusion as
a function of the distance of the test spot from the sensing phototransistor. The
SPECTRAL SENSITIVITY different curves show the current for movement of the spot in different directions out
The absorption of light by silicon is wave- from the middle of the phototransistor. (b) shows the same results on a log(current)
length-dependent: Short wavelength pho- scale. (c) shows a comparison between illumination with red and green light. The
absorption for red light is much less than for green light, so if the results in (a) and
tons travel a shorter distance, on the (b) were due to scattered light and not minority carrier diffusion, we expect a
average, before being absorbed.† The ab- difference in the measured diffusion length that we do not observe.
sorption length, L(λ), as a function of pho-
ton wavelength λ, for bulk silicon, is shown
in Figure 29. For blue light (wavelength
475 nm) L is 0.3 µm, while for red photons CMOS process, there are a number of dif- Figure 30 shows a set of typical carrier
(650 nm) L is 3 µm—a ratio of approxi- ferent junctions with different doping and concentrations in the parasitic vertical bipo-
mately 10 in absorption length over the visi- depth. An ordinary CMOS process has two lar for different photon absorption lengths.
ble spectrum. This behavior is primarily complementary source–drain active diffu- We computed the curves from the diffusion
due to the available density of states, be- sions and a well diffusion. For concrete- equation. The carrier concentration goes to
cause there are many more available states ness, we consider an n-well process; the zero at the edge of a junction. At the sur-
at higher energies. well and the active diffusion for the native face, the concentration is reduced by sur-
The wavelength-dependent absorption transistors are n-type, and the substrate and face recombination. The junction current is
means that photodetectors formed from active diffusions for the well transistors are proportional to the slope of the carrier con-
junctions with different junction depths p-type.†† We can form four photodetectors centration at the edge of the junction. Long-
have different spectral responses. In a in a plain CMOS process, the three photo- wavelength light creates most carriers deep
diodes plus a single parasitic PNP bipolar in the substrate, so the current is mostly
transistor, whose emitter is active, base is found in the deepest junction. In contrast,
† This effect is known to degrade resolution in well, and collector is substrate. short-wavelength light causes a current
CCD imagers at longer wavelengths. The longer- mostly in the shallowest junction.
wavelength photons get absorbed deeper in the A BiCMOS process adds a medium-
substrate, diffuse less precisely to the correct
charge bucket, and take enough time to diffuse that
doped p-type diffusion intermediate in
they get collected into the wrong charge bucket. †† The complementary process, p-well, results in a depth between the active diffusion and the
Many CCD imagers are built in a shallow p-well to set of devices that are exactly complementary to well diffusion. This new implant is used to
reduce these effects and to better tailor the spectral the ones we describe here, and we expect that these form the p-type base for vertical NPN bipo-
response to match with human spectral efficiency. devices have similar characteristics. lar transistors.††† The emitter of the vertical
SPECTRAL SENSITIVITY Apr 2, 1996 17
10 5 FIGURE 29 The photon absorption length as a function of photon

wavelength at 300˚K. The absorption length is the distance over which 1/e
4 of the incident photons are absorbed. Approximate wavelength of primary
10
colors (B = blue, G = green, and R = red) from CIE color wheel, along with
10
3 associated absorption length are shown as dashed lines. Dashed line at
about 0.95 µm shows absorption length at peak of spectral response of
Absorption 10
2 deep, diffusion-limited junction (see page 18). (Adapted from Dash and
length L (µm)
Newman.3)
10 1
0
10
p++ n p–
-1
10
B G R IR
10 -2
0.4 0.6 0.8 1 1.2
Wavelength l (µm)
Light
FIGURE 30 Theoretical carrier concentration profiles for the illuminated slab of

silicon shown at the top of the figure. The plots show the excess minority carrier
concentration as a function of distance into the silicon for three different photon L = 50 µm (IR)
absorption lengths. The junction current is proportional to the slop of the carrier
Carrier
Concentration
concentration at the edge of the junction.
L = 5 µm (Red)
bipolar is the heavily-doped n-type L = 0.5 µm (Blue)
source–drain diffusion, and the collector is

the lightly-doped n-type well. In a BiC- 20 µm
MOS process, we can form an additional
four photodetectors: two photodiodes from
the base to the emitter and collector, plus
two vertical bipolars, one being the floating
NPN, the other the parasitic PNP that uses
the base diffusion as an emitter in a manner
similar to the parasitic PNP discussed earli- 1V 1V
er. well-substrate active-p base
Of these eight devices, we measured the
six shown in Figure 31. We did not test the p++ active n++ emitter
simplest photodiode from native diffusion
to substrate, because we neglected to fabri-
cate it on the same chip in the same config- p+ base
n well 3 µm
uration, making reliable results difficult to n well
obtain. We expect it to behave similarly to
the well–substrate photodiode. Also, we p– subs
p– subs
1V
††† Only n-well BiCMOS processes exist; for

technical reasons it is difficult to fabricate good active-well p base-well
floating vertical PNP bipolar transistors. Also, we
note that a BiCMOS process differs from a true
1V
bipolar process in that it lacks an additional low-
resistance implant in the collector (the well) to
reduce collector resistance. The missing collector
contact implant is not a concern for phototransis-
tors, at least in the range in which we are inter-
ested, because the generated photocurrents are
much too small to generate appreciable ohmic 1V
voltage drop.
1V
parasitic PNP vertical NPN
FIGURE 31 The structures and biasing setups
used to measure the quantum efficiencies of the
devices. The upper four devices are
photodiodes and the lower pair are
phototransistors. In the text, we refer to the
elements by the underlined names in the figure.
The active junctions are shaded for each
device. A scale bar is shows approximate
dimensions (horizontal dimension not to scale).
18 Apr 2, 1996 SPECTRAL SENSITIVITY
FIGURE 32 Measured spectral quantum 100

efficiencies Q(λ) versus photon
P
wavelength and energy. The quantum parasitic PN
efficiency is the number of collected
charges per incident photon; it is larger
10 Band
than one for the phototransistors because ver
they have built-in current gain. Each curve tica edge
Quantum efficiency Q(λ)

l NP
is labeled with the name of the device as N (1.12 eV)
shown in Figure 31. The small circles are
the absolute calibration points measured
1
(charges/photon)
with discrete LEDs. The photopic visibility well-substrate
curve shows the relative visibility of
photons under photopic conditions; this
curve is normalized to 1 at its maximum.
pb
The primary colors, according to the CIE 0.1 ase
color chart, are labeled on the wavelength -we
axis. A shelf, marked with a hollow arrow, acti ll
Photoptic ve-
appears around the band edge for both wel
the vertical NPN phototransistor and for l see text
Visibility
the p base-well photodiode that forms the 0.01 act
ive
base to collector junction (page 19). The -p b
monochromator calibration curves show ase
the spectral line width of the
monochromator and the calibration
wavelengths. 0.001 Monochromator
calibration
B G R
0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 1.1 1.2
did not test the parasitic PNP that uses Wavelength λ (µm)
p–base as emitter because we forgot about
it, although almost certainly its behavior is
similar to the usual parasitic PNP that uses 3.75 2.81 2.25 1.88 1.61 1.41 1.25 1.13 1.02 .94
active as emitter. Photon energy (eV)
A scale bar in Figure 31 shows the ap-
proximate vertical dimensions of the junc- es including the gain, so Q(λ) can be larger • The shallow junctions formed inside the
tions of this 2-µm feature size BiCMOS than 1. well are most sensitive to a wavelength
process, according to the MOSIS fabrica- We used a prism monochromator, in con- around 500 nm, while the deep well-
tion service. The n++ emitter is arsenic- junction with a tungsten incandescent substrate junction sensitivity peaks at
doped, with a junction depth of about source, to produce a continuously variable, about 900 nm, well outside the visible.
0.3 µm, and a surface concentration of nearly monochromatic source of light. We • The peak absolute quantum efficiency
1020 cm –3 . The p+ base is boron-doped, had to carefully calibrate several parame- for the photodiodes varies from a high
with a junction depth of 0.45–0.5 µm, and a ters to obtain a reliable measurement 6. of about 0.8 at near-infrared photon
surface concentration of 1–2·1017 cm –3 . energy, to a low of about 0.3 for the
The n-well is phosphorous-doped with a THE SPECTRAL RESPONSES shallow, volume-limited junctions.
junction depth of approximately 3 µm, and Figure 32 shows the absolute quantum effi-
a surface concentration of 3-4·1014 -cm–3. • All the quantum efficiencies are approx-
c i e n cy v e r s u s p h o t o n w a v e l e n g t h . imately 0.3 at the blue end of the spec-
The p-type substrate has a doping of Figure 33 shows the same data on a linear
3–4·1014 cm–3.† trum.
scale. Figure 32 also shows the photopic
We distinguish between the deep, diffu- (daylight) visibility curve, the band edge for • The phototransistor spectral responses
sion-limited detectors where the carrier silicon, the monochromator bandwidth, and are very close to constant multiples of
collection volume is defined mostly by the the wavelengths of the primary colors. We the deeper-junction responses contribut-
minority carrier diffusion length, and the can make several phenomenological obser- ing to the base current for the pho-
shallow, volume-limited detectors where va t i o n s . totransistor. For example, the parasitic
the carrier collection volume is mostly de- PNP response is very similar in shape to
fined by junction edges. • There is a clear distinction between the response of the well-substrate junc-
The aim of the measurement is to obtain photodetectors that collect light-gener- tion, but is completely unlike the
the absolute quantum efficiency Q(λ) for ated carriers from a junction-limited response of the shallow active-well
each of the devices, as a function of photon volume and the photodetectors that col- junction. A similar observation can be
wavelength λ, where Q(λ) is defined as: lect light-generated carriers from a dif- made for the vertical NPN. This obser-
fusion-limited volume defined by the vation simply means that most of the
Q(λ) = (33) diffusion length of minority carriers. base current in the phototransistors
The diffusion-limited detectors are more comes across the deep junction.
# collected charges s e n s i t ive t o l o n g e r wave l e n g t h s .
----------------------------------------------------------------------------------- • The current gain is about 100 in the par-
# incident photons at wavelength λ
The phototransistors have built-in current • The response spectra are broad band. asitic bipolar phototransistor, and is
gain; for them we count the collected charg- All the detectors cover more than the about 30 in the vertical bipolar pho-
visible spectrum. Also, all of the detec- totransistor. This gain is a soft function
tors have responses that are flat within a of the current level, and decreases at
† These parameters come from the specifications factor of 2 or 3 within the visible spec- both high and low intensities. Figure 34
for the MOSIS 2–µm n-well BiCMOS process. trum. shows the current gain for the bipolar
PREVIOUS WORK Apr 2, 1996 19
1
FIGURE 33 Absolute quantum efficiencies plotted on a linear
Photoptic scale. Data are a subset of Figure 32.
0.8 Visibility well-substrate
Quantum Efficiency
0.6 200
0.4
active-well 150 Parasitic PNP vertical bipolar
0.2 p base-well
ß-1 100
active- p base (IE/I B)
0
0.4 0.5 0.6 0.7 0.8 0.9 1 1.1 1.2 Vertical NPN bipolar
50
Wavelength (µm)
transistors as a function of emitter cur- 0 -10 -9 -8 -7 -6 -5 -4 -3 -2

10 10 10 10 10 10 10 10 10
rent, measured by base-current injec-
IE (A)
tion. These current gains are larger by a
factor of about 2 than the spectral mea- FIGURE 34 Bipolar transistor current gain as a function of emitter current. Ordinate is
surements. ratio of emitter current to base current. Data is from n-well, double-poly, 2 µm feature size,
BiCMOS MOSIS technology. Collector to emitter voltage was held at 1 V. Transistor
• All of the spectral responses show an dimensions in µm are given in the table below.
absolute cutoff around the band edge.
The cutoff is not perfectly sharp, and Base Collector Emitter
extends past the actual band edge. The Parasitic PNP 20x20 diffusion limited 10x10
quantum efficiency drops off at a rate of Vertical NPN 10x10 20x20 8x8
about one e-fold per 25 meV around
the theoretical band edge. We are confi-
dent of this result, because we used a
two point absolute calibration of wave- 25 fA (i.e. 25x10-15 A), or approximately photoreceptor that consists of a parasitic ver-
length, and we measured the mono- 2.5x105 charges/second, or 4000 in 1/60 s. tical bipolar phototransistor, with a series
chromator bandwidth to be much In sunlight, the irradiance can be as much as pair of diode-connected MOS transistors as a
smaller than the measured cutoff 1 kW/m2, corresponding to a photocurrent load 13,20. This simple design has the virtue
behavior. of 25 nA. Optics and scene reflectance that the layout is trivial and the receptor re-
• There is an interesting shelf in the spec- properties reduce these numbers by about a quires no bias controls. The signal produced
tral response of the well to p–base junc- factor of 10 under typical conditions. b y t h e r e c e p t o r h a s a ga i n o f
tion right around the band edge that can 200–300 mV/decade of intensity. The two
also be seen in the NPN vertical bipolar main problems with this receptor which led
response. It could be due to a shallow to the development of the adaptive devices
are
recombination-generation center PREVIOUS WORK 1. A poor matching between different
unique to the p-base implant that
stretches out the spectral response an The precursor for the work described here is receptors. Signals output from neigh-
additional fraction of an eV. an optical mouse system built by Dick Lyon boring receptors differ as much from
at Caltech. He used precharged photodiodes offsets as from true signals caused by
ABSOLUTE CURRENT LEVEL to produce a digital signal at a time inverse- the scene.
People frequently ask how much current to ly proportional to intensity. The system in- 2. A slow time-response, rendering the
expect from a given size of photodiode or corporated gain control by precharging of receptor useful only under bright light-
phototransistor. Since light intensity varies the photonode after it had discharged to a ing conditions and making all time-
more than 6 decades under photopic and given level, and not at after a fixed responses strongly intensity-dependent.
mesopic conditions, the answer obviously i n t eg r a t i o n t i m e . The SeeHear chip 22, designed by Neilson,
depends on the operating conditions. For Mead’s original logarithmic Mahowald, and Mead, was the first analog
reference we will compute a typical situa- photoreceptor 19 used a single parasitic verti- VLSI vision circuit that used the idea of ad-
tion. Office fluorescent lighting conditions cal phototransistor that feeds into a series of aptation. In that chip, the simple nonadapt-
are an irradiance of an exposed surface of Darlington-connected lateral bipolar tran- ing logarithmic receptors from the early
about 1 W/m 2 . Under these conditions, sistors. A feedback arrangement converts Mahowald-Mead silicon retinas were used
each 10 µ m by 10 µ m photodiode area, the final current into a voltage that is loga- as input to Mead’s hysteretic differentiator
with quantum efficiency 0.5, generates a rithmic in the intensity. This receptor was circuit 18. The problem with this arrange-
current used in the Tanner and Mead optical flow ment is that the uncoupling of the receptor
J⁄s eV quantum chip. There was no good reason for using and adaptation is inefficient in terms of
1 --------- × ---------------------------- × --------------------- (34)
2 – 19 2.5eV the lateral bipolar transistors. transistor count, and forgoes the speedup
m 1.6×10 J
2 8 quanta The large area required by the lateral bipo- advantages of the active feedback arrange-
× ( 10µm ) × 0.5 = 10 ---------------- = 25 pA lar transistors quickly led to more compact ment.
s
designs. The early Mahowald and Mead sili- Delbrück and Mead 8 built a preliminary
Typical moonlight is about 3 decades less con retinas, and subsequently many other version of an integrated adaptive receptor
light, and hence a photocurrent of only Mead lab projects, used a simple logarithmic that uses two stages of high-gain amplifica-
20 Apr 2, 1996 RELATION TO BIOLOGICAL PHOTOTRANSDUCTION
tion, one at the phototransducer, and the tricks like PIN photodiodes and avalanche stage, by the exponential I-V relationship at
other in the feedback loop. This design had multiplication. the source of the feedback transistor. The
a large transistor count, lacked any speedup logarithm gotten for free, courtesy of Boltz-
advantage of the active feedback, and used man, is apparently not used for this same
an inferior adaptive element. purpose in biological receptors. The obvi-
Delbrück and Mead 7 built an adaptive ous first guess, a membrane channel with
silicon retina for sensing time derivatives of RELATION TO diode rectifying characteristics, is not what
the image contrast, using the same feedback BIOLOGICAL the cones or rods use.
arrangement as described here, but lacking
the speedup advantage of using a photo- PHOTOTRANSDUCTION
diode and a cascode, and using an inferior In the introduction we discussed functional
adaptive element. This silicon-retina chip characteristics of biological photoreceptors SUMMARY
uses the scanning mechanism to reset the and how they inspired the silicon version.
pixel output every time the pixel output is We have discussed a photoreceptor circuit
Three characteristics stood out: The adap- that is useful for continuous-time pho-
sampled. tive properties, the gain control resulting in
Mead17 built an adaptive silicon retina totransduction. It consists of a logarithmic
illumination independent contrast response, input stage coupled to an adaptive feedback
that uses ultraviolet light to move charge and the invariance of the response time to il-
onto and off floating gates for storage of an- circuit. The feedback circuit produces an
lumination. output with high gain for transient—and
alog mismatch compensation. A pho-
totransistor with a drain-type load produces In the silicon receptor, the adaptation presumably interesting—signals, and has
high voltage gain at the input, and this high state is stored as a charge on a capacitor. In low gain for static signals, including circuit
gain point is used in the UV adaptation. Un- rods and cones, the adaptation state is (at offsets. The feedback also speeds up the re-
fortunately, making high gain at the input is least partially) stored as the internal calci- sponse by clamping the input node voltage.
incompatible with high speed operation, um concentration. The full story of photore- The receptor can be fabricated in an area
and inherently prevents the type of voltage- ceptor adaptation mechanisms is too of about 70 by 70 µm2 in a 2–µm single
clamping speedup we have discussed. Sub- complex for discussion here, but the idea poly CMOS process. The useful dynamic
sequent work has tended away from UV ad- that there is an independent state variable range, which we arbitrarily define as the
aptation, because of technical problems for adaptation state is similar in the silicon range over which the bandwidth is at least
with shining high-energy photons onto the and biological receptors. 60 Hz, extends down to moonlight illumi-
chip and making the circuit function cor- More germane is the invariance of re- nation levels, or at least 1.5 decades more
rectly at the same time. sponse time to illumination, because here than a plain logarithmic detector without
Mahowald12 incorporated the feedback we see some functional differences. How active feedback.
arrangement described here into a silicon does the response-time invariance come The receptor noise in a plain logarithmic
retina with interesting network feedback. about? The enormous gain generated in detector is almost exactly what is predicted
Her receptor does not have the speedup ad- rods comes from 3–4 amplification stages, by a simple noise theory that assumes that
vantage of the active feedback, and uses an each with modest gain. The gain of each the noise is purely shot noise, with a band-
inferior compressive adaptive element, stage is at most a few hundred, but the com- width set by the conductance of the feed-
leading to very unsymmetrical adaptation bination of all stages results in a maximum back and the capacitance of the input node.
rates for bright- and dark-going illumina- gain that closes 10 6 ion channels in re- The noise in the adaptive receptor is within
tion changes. Mahowald’s chip with the response to a single photon. a factor of two of this value.
ceptor improvements we have discussed The gain of the rod is inversely propor- TECHNICAL INNOVATIONS
here is in fabrication. tional to intensity. The mechanism for this The three technical improvements in the re-
Mann15 developed several adaptive pho- gain control probably lies in a modest gain ceptor are as follows:
toreceptor circuits that are more flexible reduction of each stage of the amplification. 1. Using a photodiode, instead of the pho-
than the ones described here, in that they A small change in the gain of each stage re- totransistor used in previous designs,
have an adjustable temporal passband. sults in a large change in the total gain. Each increases the dynamic range by at least
These receptors also use U.V. mismatch stage has a fixed gain–bandwidth product: a decade, without degrading signal qual-
compensation. They use a larger number of When the gain is reduced, the response time ity.
components than the receptor described decreases proportionally. The trick is that 2. Using a cascode transistor in the feed-
here. They also lack the advantage of active the total gain goes as the power of the num- back amplifier yields another 0.5–1
speedup and use inferior adaptive elements. ber of gain stages, and the total time re- decade dynamic range.
None of the previous photoreceptors sponse goes linearly with the number of 3. Using a new adaptive element increases
were satisfactorily characterized, in the gain stages. the adaptation time constant and the
sense that no one bothered to measure sim- This multistage gain mechanism con- symmetry of the receptor response to
ple engineering metrics like usable dynamic trasts with the silicon receptor, which has light- and dark-going transients, com-
range and sensitivity. only a two-stage amplifier, one of which has pared to previous designs, which used
Up until lately, continuous-time analog fixed gain. As shown earlier, this gain elements that were very susceptible to
photoreceptors have not received a great mechanism results in a fast response time, the effects of parasitic minority carriers.
deal of commercial attention. Nearly all ef- but the response time is also inversely pro-
fort is concentrated on sampled imagers portional to the intensity. We have not elect-
like CCD video cameras. Photoreceptors ed to try to build a silicon receptor with
are used in very specialized applications multistage gain, in analogy with the biolog-
like optical repeaters, where high integra- ical mechanism, because the response time
tion density and wide dynamic range are not is adequately fast already, and the log gen-
so important compared with response erating mechanism in subthreshold opera-
speed, and where special fabrication tech- tion is so convenient. All the gain control in
nologies are available that allow use of the silicon receptor is done right at the input
CONCLUSION Apr 2, 1996 21
We have also discussed two theoretical top- manium and silicon at 77 ˚K and 300 ˚K,” 16. C.A. Mead, “Neuromorphic Electronic
ics in which we have developed a much Physical Review, vol. 99, pp. 1151–1155, Systems,” Proceedings of the IEEE, vol.
clearer understanding: 1955. 78, pp. 1629–1636, 1990.
1. We quantitatively understand the physi- 4. D. Cohen and G. Lewicki, “MOSIS—the 17. C.A. Mead, “Adaptive Retina,” in Ana-
cal origin and dominant sources of ARPA silicon broker,” in Proceedings from log VLSI Implementation of Neural Sys-
the Second Caltech Conference on VLSI, tems, C. Mead and M. Ismail, Eds.,
electronic noise in continuous-time log- Boston: Kluwer Academic Pub., pp.
California Inst. of Tech., Pasadena CA,
arithmic receptors. 239–246, 1989.
pp. 29-44, 1981.
2. We quantitatively understand the limits 5. T. Delbrück, “Silicon retina with Correla- 18. C.A. Mead, Analog VLSI and Neural
on response speed that effectively limit tion-Based, Velocity-Tuned Pixels,” IEEE Systems. Reading, MA: Addison--Wesley,
the lower end of the receptor dynamic Transactions on Neural Networks, vol. 4, 1989.
range. no. 3, pp. 529–541, May 1993. 19. C.A. Mead, “A Sensitive Electronic
6. T. Delbrück, Investigations of Analog Photoreceptor,” In 1985 Chapel Hill Con-
VLSI Phototransduction and Visual ference on VLSI, H. Fuchs, Ed., Rockville:
Motion Processing, Ph.D. Thesis, Dept. of Computer Science Press, pp. 463–471,
Computation and Neural Systems, Cali- 1985.
CONCLUSION fornia Institute of Technology, Pasadena, 20. C.A. Mead and M.A. Mahowald, “A Sili-
Operations that require spatiotemporal CA, 91125, 1993. con Model of Early Visual Processing,”
computation on noisy input data in real 7. T. Delbrück and C.A. Mead, (1991), Sili- Neural Networks, vol. 1, pp. 91–97, 1988.
time are a natural fit to analog VLSI. We con adaptive photoreceptor array that 21. J.R. Meyer-Arendt, “Radiometry and
computes temporal intensity derivatives, Photometry: Units and conversion fac-
have shown how to build a practical photo-
in T.S. Jay Jayadev (ed.) Proc. SPIE, tors,” Applied Optics, vol. 7, pp.
receptor circuit that is suitable as a front- 2081–2084, 1968.
end transducer for analog VLSI vision sys- Infrared Sensors: Detectors, Electronics,
and Signal Processing., vol. 1541, pp 22. L. Nielson, M. Mahowald, and C.
tems. The receptor is usable under photo- Mead, “SeeHear,” in Analog VLSI and
pic vision conditions. The continuous 92–99.
Neural Systems, by C. Mead, Reading:
transduction process leads to intensity-in- 8. T. Delbrück and C.A. Mead, “An elec-
Addison-Wesley, chapter 13, pp.
tronic photoreceptor sensitive to small
variant total noise that is within a factor of 207–227, 1989. (adapted there from 1987
changes in intensity,” in Advances in
two of photon counting noise. Photorecep- Neural Information Processing Systems
International Association for Pattern Rec-
tors of this general type will undoubtedly ognition, 5th Scandinavian Conference
1, D.S. Touretzky, Ed., San Mateo: Mor-
be used in commercial vision systems, on Image Analysis.)
gan Kaufman, pp. 720–727, 1988.
where designers will not be able to resist 23. R.A. Normann and I. Perlman, “The
9. T. Horiuchi, J. Lazzaro, A. Moore and C. effects of background illumination on the
the capability to link low-power, continu- Koch, “A delay line based motion detec-
ous-time computation together with trans- photoresponses of red and green cones,”
tion chip,” in Advances in Neural Informa- J. Physiol., vol. 286, pp. 509–524, 1979.
duction. tion Processing Systems 3, R. Lippman, 24. A. Rose, Concepts in Photoconductiv-
J. Moody, D. Touretzky, Eds., San Mateo, ity and Allied Problems, InterScience
CA: Morgan Kaufmann, pp. 406–412, Publishers (div. of John Wiley), p. 104,
1991. 1963.
10. M.A. Mahowald, An Analog VLSI Sys-
ACKNOWLEDGEMENTS tem for Stereoscopic Vision, Kluwer Aca-
25. A. Rose, Vision: Human and Electronic,
Plenum Press: New York, 1973.
We acknowledge support from the ONR demic Publishers: Norwell, 26. R. Sarpeshkar, T. Delbrück, and C.A.
and by MOSIS, the ARPA silicon foundry.4 Massachusetts, ISBN 0-7923-9444-5, Mead, “White noise in MOS transistors
David Van Essen inspired this detailed 1994. and resistors,” IEEE Circuits and Devices,
study (especially of the noise properties). 11. M.A. Mahowald, VLSI Analogs of Neu- vol. 9, no. 6, pp. 23–29, 1993.
Rahul Sarpeshkar made a crucial observa- ronal Visual Processing: A Synthesis of 27. R. M. Shapley and C. Enroth-Cugell,
tion about the relationship between shot Form and Function, Ph.D. Thesis, Califor- “Visual adaptation and retinal gain con-
nia Inst. of Tech., Dept. of Computation trols,” in Progress in Retinal Research, N.
and thermal noise. David Standley and
and Neural Systems, Pasadena CA, Osborne and G. Chader, Eds., New York:
Dick Lyon provided helpful comments 1992.
about the manuscript. Micah Siegel and Pergamon Press, vol. 3, pp. 263–346,
12. M.A. Mahowald, “Silicon Retina with 1984.
Sanjoy Mahajan pointed out that κ varia- Adaptive Photoreceptor,” Proc. 28. S.M. Sze, Physics of Semiconductor
tions affect the gain vs. intensity character- SPIE/SPSE Symposium on Electronic Sci- Devices, Second Edition, New York: John
istics and their measurement discrepancies ence and Technology: from Neurons to Wiley & Sons, chapter 13 and appendix I,
inspired the discovery of the importance of Chips, vol. 1473, pp. 52–58, April 1991. 1981.
the Miller effect in Qfb. 13. M.A. Mahowald and C.A. Mead, “Sili- 29. J.E. Tanner and C. Mead, “An Inte-
con Retina,” in Analog VLSI and Neural grated Analog Optical Motion Sensor,” in
Systems, by C. Mead, Reading: Addison- VLSI Signal Processing, II, S.Y. Kung,
Wesley, pp. 257–278, 1989. Ed., New York: IEEE Press, pp. 59–76,
14. M.A. Mahowald and T. Delbrück, 1986.
REFERENCES “Cooperative stereo matching using
1. D.A. Baylor, G. Matthews, K.-W. Yau, static and dynamic image features,” in
“Two components of electrical dark Analog VLSI Implementation of Neural
noise in toad rod outer segments,” J. Systems, C. Mead and M. Ismail, Eds.,
Physiol. vol. 309, pp. 591–621, 1980. Boston: Kluwer Academic Pub., pp.
2. R. G. Benson and T. Delbrück, (1991). 213–238, 1989.
Direction-selective silicon retina that 15. J. Mann, “Implementing Early Visual
uses null inhibition, in D.S. Touretzky, Ed. Processing in Analog VLSI: Light Adapta-
Advances in Neural Information Process- tion,” Proc. SPIE/SPSE Symposium on
ing Systems 4. pp. 756–763. Electronic Science and Technology: from
3. W.C. Dash and R. Newman, “Intrinsic Neurons to Chips, vol. 1473, pp.
optical absorption in single-crystal ger- 128–136, April 1991.
22 Apr 2, 1996 REFERENCES
INDEX Apr 2, 1996 23
INDEX J S
junction capacitance, typical 10 Sarpeshkar, Rahul 13, 15
Numerics junction-limited volume 18 shelf in spectral response of phototransistor
1/f noise 11 K 19
60 Hz 11 signal to noise ratio in CCD video imager 15
κ(kappa) 3 signal to noise ratio of photodiode and pho-
A
L totransitor compared 14
absolute current level in photodiode 19 silicon retina 19
layout
absorption length of light 16, 17 single photon counting, relation to contrast 2
area of adaptive receptor 10
adaptive element 8–10 SNR, see receptor, signal to noise ratio
adaptation rate 9 minimizing Miller effect 6
of adaptive receptor 10 spectral density of noise 14
capacitance of 10 spectral sensitivity 16–19
conductance of 9 used to test minority carrier diffusion 15
lighting of the full moon 11 speedup
area of adaptive receptor 10 definition 5
artificial lighting 3, 8 lux, relation to W/m2 11
subthreshold transistor law 3
B M sunglasses 4
BiCMOS process Mahowald adaptive silicon retina 3
Mahowald, Misha 19
T
junction depths 18 thermal voltage 3
photodetectors 16 Mann, Jim 20
biological photoreceptors Miller effect 4, 5 V
adaptation state 20 minimizing 6 VT 3
gain and adaptation characteristics 2 minority carrier diffusion 15–16
gain mechanism 20 effect of diffusion on high illumination limit
invariance of response time 2, 20 11
bipolar transistor effect of lifetime on receptor response
current gain 18 speed 6
noise behavior 15 moonlight illumination level 11
structure 10 MOSIS 14, 18, 21
using in adaptive receptor 10 N
C neutral density filters 4
capacitance noise in receptor, see receptor
of adaptive element 10 noise, see receptor
of typical junction 10 P
typical values in receptor 6
capacitance to quantum efficiency ratio 10 photodiode, absolute current level 19
capacitive divider 3, 4 photometry 11
cascode photopic visibility 18
effect on time response and noise 4 PIN photodiode 10
functions in adaptive receptor 4 primary colors 17
CCD detector 11 Q
degradation by carrier diffusion 16 quantum efficiency
CMOS process, photodetectors can con- definition 18
struct in 16 discussed 18–19
D measured, plot of 18
Dash and Newman 17 peak 18
dBV units 12 R
density of states 16 ratio of photodiode quantum efficiency to
diffusion-limited volume 18 capacitance 10
doping of typical BiCMOS process 18 receptor
E effect of finite minority carrier lifetime 6
effective input capacitance 5 gain-bandwidth product 20
effective input conductance 5 logarithmic, technological context 19
e-fold 3 noise 11–15
ergodicity 13 noise in adaptive 12
F response time 6
feedback circuit second-order behavior 7–8
closed-loop gain 4 signal to noise ratio 13
total loop gain 5 signal to noise ratio degradation by feed-
voltage gain 3 back 12
feedback circuit, noise in 12 simplest logarithmic 2
flicker noise 11 source-follower 2
fluorescent lighting, typical level of 19 spectral density of noise 14
flying insects 1 speedup 7
foveal resolution, need for in imagers 1 speedup obtained by active feedback 5
full moon lighting 11 steady-state gain 4
transient gain 4
G
resistor, impracticality of making in CMOS
guard structures 15–16 process 8
I rod-cone border 11
image contrast 2 Rose, Al 15
24 Apr 2, 1996 INDEX

Analog VSI

Uploaded by

Document Information

Original Description:

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Analog VSI

Uploaded by

Copyright:

Available Formats

CALIFORNIA INSTITUTE OF TECHNOLOGY

COMPUTATION AND NEURAL SYSTEMS PROGRAM

CNS Memo No. 30, April 2, 1996

ANALOG VLSI PHOTOTRANSDUCTION

PASADENA, CALIFORNIA 91125

O built a number of neuromorphic

FIGURE 1 Gain adjustment in turtle cones (recorded intracellularly) caused by

sponse time, invariant to lighting condi- FIGURE 4 The source-follower

source decreases as the light intensity in-

with an input conductance of Ibg/VT to com- RISE TIME & BANDWIDTH

Table 1 is worth examining to see how Cfb 15 ms

becomes totally dominant once the cascode 10 -2

and high loop gain take the other capacitors

smaller than in our example. Hence, we 10 -3

FIGURE 10 Response-time measurements 10 -4 n++/p+

In the photoreceptor circuit, the input and

Gate node Well node

state is determined by the condition I = 0. Ipar is a positive Dark

curve upwards, resulting in a conductance at I = 0 that is

FIGURE 23 Noise spectra for the source-follower logarithmic -60

Signal Amplitude (dBV/√Hz)

FIGURE 24 Comparison of noise spectra for -40

FIGURE 25 Shot noise computations. (a) The flow

Signal + Noise (dBV/√Hz)

The noise in the charge-sensing amplifier is We imaged a 12 µm spot of light onto

indistinguishable, as shown in Figure 28c. (a) Effect of guard bars

log Current (log nA)

10 5 FIGURE 29 The photon absorption length as a function of photon

FIGURE 30 Theoretical carrier concentration profiles for the illuminated slab of

bipolar is the heavily-doped n-type L = 0.5 µm (Blue)

source–drain diffusion, and the collector is

††† Only n-well BiCMOS processes exist; for

FIGURE 32 Measured spectral quantum 100

Quantum efficiency Q(λ)

transistors as a function of emitter cur- 0 -10 -9 -8 -7 -6 -5 -4 -3 -2

You might also like