You are on page 1of 7

Available online at www.sciencedirect.

com

Computational models of decision making: integration, stability,


and noise
Nicholas Cain and Eric Shea-Brown

Decision making demands the accumulation of sensory Integration, optimality, and functionality
evidence over time. Questions remain about how this occurs, What is the optimal way to accumulate evidence over
but recent years have seen progress on several fronts. The first time? Many different definitions of ‘optimal’ — from
concerns when optimal accumulation of evidence coincides maximizing the rate at which correct responses are made
with the simplest method of accumulating neural activity: [8,1] to maximizing guaranteed payoff [9] to lossless
summation over time. The second involves what computations Bayesian decoding [10] — depend on the same key
the brain might perform when summation is difficult due to computation. This is to update the statistical likelihood
imprecision in neural circuits or is suboptimal due to uncertainty of each decision alternative being correct, relative to the
or variability in how evidence arrives. Finally, the third concerns other alternatives, when each increment of evidence
sources of noise in decision circuits. Empirical studies have arrives [8,11,12,13]. In general this is a tall order, requir-
better constrained the extent of this noise, and modeling work ing computation of an arbitrary function of the momen-
is helping to clarify its possible origins. tary neural activity that encodes incoming evidence, and
then combining this with the evidence already accumu-
lated. Making matters worse, even knowing what func-
Address
tion to compute depends on full knowledge of the
Department of Applied Mathematics, Program in Neurobiology and statistics of that activity [11].
Behavior, University of Washington, Seattle, WA, USA
Fortunately, in a number of circumstances, the likelihood
Corresponding author: Shea-Brown, Eric (etsb@amath.washington.edu) calculation is equivalent to directly summing, or integrat-
ing, the evidence stream over time (see Figure 1). In the
Current Opinion in Neurobiology 2012, 22:1047–1053 simplest case there are two task alternatives, and two
This review comes from a themed issue on Decision making independent streams of incoming evidence I1(t) and
Edited by Kenji Doya and Michael N Shadlen
I2(t) — for example, representing the output of neurons
selective for upward vs. downward motion. Perhaps the
For a complete overview see the Issue and the Editorial
best known result says that, if I1(t) and I2(t) are uncorre-
Available online 15th May 2012 lated Gaussian signals with identical variance, then the
0959-4388/$ – see front matter, # 2012 Elsevier Ltd. All rights relative likelihood of each alternative can be computed by
reserved. integrating the signals over time — [8,11,12,13], via the
http://dx.doi.org/10.1016/j.conb.2012.04.013 accumulated inputs

Z t
A1 ðtÞ ¼ I 1 ðt 0 Þdt 0 (1)
Introduction 0
For many sensory discrimination tasks, instantaneous Z t
snapshots of sensory input represent single, noisy samples A2 ðtÞ ¼ I 2 ðt 0 Þdt 0 (2)
that alone carry limited evidence. A prototypical example 0

is the random dots task, in which the subject must deter-


mine the average direction of motion in a field of ran- If I1(t) and I2(t) are Poisson processes — that is, spike
domly moving dots (Figure 1). Here, as in many more trains [10,14,15,16] — the optimal computation is
complex decisions, a sound judgment requires accumu- again integration over time. In fact, this setting is much
lating this sensory evidence over time [1–7]. more flexible, as there is no longer a requirement that all
inputs have the same variance ([15,16] identify limits in
We review recent progress on how this type of accumu- which the Poisson and Gaussian cases agree). Moreover,
lation could occur in the brain. We begin with how the for decisions involving multiple alternatives, where
neural encoding of this evidence determines optimal motion could follow any of the directions of the compass
mechanisms for integrating it over time. Next, we discuss rose, the same result extends. In this case, there will be N
the stability and variability of neural circuits that may different input streams Ii(t), whose integrals produce N
perform this computation, the function and adaptability accumulated variables Ai(t). Additionally, for the two
of these circuits in uncertain decision making environ- alternative cases, [17] extends these results to compute
ments, and how they might operate with irregular and (at estimates of the quality of evidence streams for each task
least apparently) noisy components. alternative.

www.sciencedirect.com Current Opinion in Neurobiology 2012, 22:1047–1053


1048 Decision making

Figure 1

I1(t)

time time

A1(t) ≈ I1(t) dt

A1(t)

N
tim
e

time time
Current Opinion in Neurobiology

Schematic of sensory evidence accumulation over time. In the random dots task, a field of irregularly moving dots is presented to the subject, with
motion biased toward one favored direction. While some neurons represent ‘instantaneous’ values of this motion evidence (top rastergram), other
neurons appear to represent its accumulated value over time (bottom rastergram). According to the equation shown, the relationship between the
average activity of these neural pools approximates temporal integration.

When the statistics of incoming evidence vary over the computed in ‘downstream’ areas, recurrent inhibition
course of a trial, in some cases the relative likelihood of within the same networks that integrate the evidence
task alternatives can still be computed based on the streams Ij(t) can often implement or closely approximate
integral of the inputs. For two Gaussian input streams, the required differencing operations [7,18,25,26,1].
this holds when the ratio of the ‘signal’ hI1(t)  I2(t)i in
the inputs to the variance of these inputs is constant over Beyond the integral sign: precision,
time [18]. For N Poisson input streams, there are stronger thresholding, and the neural ratchet
results: if the time dependence is via a gain term that While ‘pure’ integration of incoming evidence is optimal
uniformly scales up or scales down the firing rate of each in many settings, it poses a challenge for neural circuitry.
input stream, then integration once again enables a direct Evidence is integrated over hundreds of milliseconds or
computation of the relative likelihoods; similar consider- longer as decisions develop, but the activity of individual
ations hold for variation across trials [10]. neurons and synapses tend to decay with timescales that
are several orders of magnitude faster. How can this rapid
Can we measure such integration occurring in neural decay be countered? A classical solution is via feedback
circuits? Compelling evidence is presented in [19], connections tuned to balance — and hence cancel —
where the authors devise a novel method to measure passive voltage leak and synaptic decay [27,30]. This
the time-evolution of the variance of spike rates believed process is illustrated in Figure 2a via motion of a ball on an
to represent the integrals Aj(t). The key finding is that this energy surface. The position A(t) represents the total
variance grows linearly in time, consistent with what is activity of a circuit (say, average firing rate relative to a
expected from an integrating process. baseline marked 0). If decay dominates (upper-right),
then A(t) always has a tendency to ‘roll back’ to baseline
Nevertheless, computing an integral is not the end of the values; we say the integrator circuit is stable. Conversely, if
story: integrated evidence must trigger an explicit feedback connections are in excess, then activity will
decision. For the two alternative case, the highest grow away from the baseline value and thus the circuit
accuracy at any given speed is achieved by making a is unstable. Pure integration is achieved (upper-left) with
decision when the difference A1(t)  A2(t) crosses a preset feedback that is precisely tuned to match decay, so that
bound [13,8,11,20]. (We note that this still leaves open changes in A(t) represent integrated evidence alone
precisely what tradeoff between speed and accuracy the [27,31].
decision maker will select, which is a distinct and inter-
esting question [21–23,9,1,8].) For multiple alternatives, Biologically, this feedback could occur in many forms.
Ai(t) can be used to implement a ‘MSPRT’ rule which can Beyond excitatory connections, positive feedback can also
also optimize speed and accuracy [24,16], or to find occur through opponent inhibition [32,1,33,27,28,7,34];
the maximum likelihood alternatives at any given time recently, a generically valid ‘normal form’ equation that
[10]. While in principle these decision rules could be describes the resulting dynamics was presented and its

Current Opinion in Neurobiology 2012, 22:1047–1053 www.sciencedirect.com


Computational models of decision making Cain and Shea-Brown 1049

Figure 2

(a) Integration Instability Stability (b)


I(t) ?
Photon absorption: accumulate over space

A(t) A(t) A(t)


0 0 0

A(t) A(t) A(t) Motion direction: accumulate over time


0 0 0
Current Opinion in Neurobiology

(a) Visualizing integration via an energy surface [27,28,29], where the location of the ball indicates integrated evidence A(t). A robust integrator can
‘fixate’ at a range of discrete values, indicated by a sequence of potential wells, despite imprecise tuning of circuit feedback. Without these wells (the
non-robust case), activity in a mistuned integrator would either exponentially grow or decay, as in the top panels. Perturbing the robust integrator from
one well to the next, however, requires sufficiently strong momentary input I(t). (b) Relationship of robust integration to the detection of sparse visual
signals. Top: activity of five photoreceptors is shown as a dot, and the dotted line represents photon absorption. Light that arrives at only a tiny fraction
of photoreceptors (here receptor # 3) must be detected. Bottom: motion evidence fluctuates over time, and the direction of the time-average must be
discriminated among two possibilities — that is, up (dotted line) vs. down (solid line).

predictions favorably compared with behavioral data [35]. of reaction times and correct vs. incorrect responses better
Interestingly, unstructured networks also appear able to than an allied model without a threshold. Moreover, [46]
accumulate inputs over time: a single ‘integration mode’ develops a model with a threshold-like ‘gating’ effect,
may emerge naturally as the connection statistics are varied finding improved fits to single unit recordings while
in large, randomly coupled networks [36]. Recent studies simultaneously matching behavioral data (see also
have also raised the possibility of achieving integration [47]). Several key statistical and dynamical properties
without depending on feedback per se, rather by passing of robust integrators that contribute to the quality of this
inputs through extended sequences of ‘functionally feed- match are studied in [48]. Ongoing work asks how robust
forward’ states within a large network [37–39]. This mech- integration impacts the optimality of decision perform-
anism is one of several that produce high-dimensional ance. For example, consider the case introduced above
dynamics — that is, with time constants that differ from where instantaneous evidence is carried by Gaussian
cell to cell — by contrast to classical line attractor models. signals I1(t) and I2(t). Rather than accumulate the entire
Intriguingly, such cell-to-cell heterogeneity has recently signal, a robust integrator would ignore all samples below
been found in the zebrafish oculomotor integrator circuit a preset threshold. Intuitively, one might expect this to
[40]. However they are implemented, must feedback diminish the speed and accuracy with which decisions can
effects always be precisely tuned to produce integration be made. However, results to date [49] suggest that the
over time? The robust integrator mechanism of [41] presents loss is minimal, even when more than half of the signal is
an alternative. Here, the circuit as a whole ‘ratchets’ among thresholded away. This suggests that most of the evi-
many stable states, as illustrated in the bottom row of dence can be gleaned from the ‘tails’ of the signals.
Figure 2a by introducing a series of wells into the energy
surface [27,28,29]. Importantly, even with imprecisely Moving beyond the case of Gaussian inputs, analogy with
tuned feedback, the network now avoids the tendency other fields suggests some settings in which thresholding
to intrinsically decay or grow without incoming evidence. actually improves decision performance. For example,
Thus, the activity A(t) approximates the integral of I(t), consider the case of detecting photons that arrive rarely
without the effects of decay or runaway excitation. at noisy photoreceptors (see Figure 2b). If the output of
all photoreceptors is simply summed — that is, inte-
Nevertheless, such robustness impacts how the evidence grated — this sparse signal could easily be lost in noise.
streams Ij(t) are processed. As the energy wells illustrate, The solution demands summing receptor responses only
instantaneous inputs Ij(t) below a threshold value will fail after passing through a thresholding function [43], analo-
to perturb the state from one well to another [41,42], and gous to integrating only those inputs strong enough to
are therefore ignored by the integrator. The idea of move the robust integrator state from well to well. For
thresholding inputs has a long history [43,44], and several decisions based on accumulating temporally sparse evi-
studies explore its implications for decision making over dence over a noisy background — such as the auditory
time. In [45], the authors show that a closely related and visual impulses recently studied in rats and humans
‘interval of uncertainty’ model can match distributions [50] — thresholding integrators would then be expected

www.sciencedirect.com Current Opinion in Neurobiology 2012, 22:1047–1053


1050 Decision making

to improve decision performance. Taken together, these how the local composition of NE receptors determines
observations point to a potentially favorable role for which regime occurs in a given circuit. This opens the
robust integrators in decision making. Ignoring the door to circuit-specific effects of diffuse NE modulation
weaker parts of the input signal is sometimes the best [55] — a specificity that might be of use when distinct
strategy, and when it is not, the cost of doing so can be evidence streams must be simultaneously integrated over
surprisingly low. time, as has been recently demonstrated to occur [50].

Richer computations over time Related studies have shown how circuit dynamics should
Beyond sparseness, there are other conditions that also differ from pure integration when other aspects of the
demand circuit dynamics departing from pure integration. decision making environment are variable and uncertain.
For example, consider a version of the moving dots task in If the difficulty of a task varies from trial to trial — as in
which the stimulus (i.e. mostly upward or downward many experimental settings [12,52] — then an ‘urgency
moving dots) is prefaced by a period of purely random signal’ may be added to integrated evidence [51]. These
motion. Then, a simple integral of the inputs would authors note that such a signal is equivalent to a gradual
produce a mix of both noise and signal-carrying inputs. collapse of the threshold for making a decision. This
A better strategy is to put more emphasis on stimulus results in truncation of excessively long decisions,
periods that contain the signal. thereby concentrating net effort toward easier trials—a
strategy which can provide optimal decision performance
If the stimulus onset is explicitly cued, or accompanied by [57–60]. Models with time-dependent gain mechanisms
a large sensory change, then there are several straightfor- [61], or with instabilities in the dynamics that integrate
ward ways to accomplish this. For example, the integrator evidence [62], instantiate a similar effect. These results
circuit could be ‘reset’ when the stimulus occurs [51,52], may find synergy with the concept that instability and
erasing integrated noise samples. A recent physiology gain are modulated to adjust to a related type of uncer-
study provides support for this notion [19]: spike rate tainty, in which the relationship among stimuli, decisions,
variance in an area believed to represent integrated and rewards is not fully known. Here, different levels of
evidence drops strongly after stimulus onset. Beyond stability — possibly mediated by NE — could differently
the reset, other studies have asked what types of stable emphasize conservative, gradual accumulation of inputs
and unstable circuit dynamics could optimally process vs. rapid, runaway commitment to ‘exploratory’ decisions
such time-dependent stimuli (see Figure 2a). Not surpris- based on chance fluctuations in evidence streams
ingly, optimal solutions begin with stability in periods [54,55,56], cf. [63].
before the stimulus arrives, so that the effect of noisy
inputs remains contained. Once the stimulus arrives, the
best strategy is to transition to either pure integration or Integration, background, and circuit-driven
instability, thus letting evidence begin to build up over noise
time [7,18,53,39]. Above, we have treated the transformation of the incom-
ing evidence I(t) into its integral A(t) as a deterministic
What mechanisms could drive changes in integrator mathematical operation. However, A(t) is presumably
stability over time? One possibility is transient release computed by irregularly spiking neurons (Figure 1, bot-
of a neuromodulator such as norepinephrine (NE). One tom raster), perhaps distributed throughout the brain —
model of the impact of NE is that it controls the strength, and possibly incorporating ‘background’ neurons that are
or gain, of connections within an integrator circuit [54]. If untuned or poorly tuned to a specific decision. Do inte-
NE were to transiently increase the gain of feedback, for grator circuits with these noisy components contribute
example, it could switch a circuit from being stable to additional variability to decisions?
unstable. Network models have shown how this could
lead to improved decision performance [18,55,53,56]. Empirical work suggests two constraints. First, well-
Determining the timing of NE release, however, established patterns of accuracy and reaction time distri-
represents a decision in itself — and this ‘first’ decision butions [12,52,11] must be compatible with whatever
about the presence of a stimulus can only be made in time type of circuit-based noise arises. Second, for the moving
to improve the ‘main’ decision about the stimulus content dots task, a number of studies have precisely measured
under specific assumptions about the decision circuit [53]. the signal and noise properties of motion evidence at a
stage (area MT) that appears to precede its accumulation
More recently, NE transients have been analyzed within over time [64,65,66,67]. Recently, building on prior
biologically detailed models of decision making [55]. analyses, [66,68,69] showed that a noiseless accumu-
The authors show how NE can steer a network between lation, or sum, of MT spikes over an empirically esti-
stable, unstable, and integrating regimes, and develop mated distribution of reaction times predicts levels of
predictions for the density and location of NE receptors decision performance that closely match experiment.
that optimize decision performance. Moreover, they show This only leaves room for a relatively small amount of

Current Opinion in Neurobiology 2012, 22:1047–1053 www.sciencedirect.com


Computational models of decision making Cain and Shea-Brown 1051

additional variability to be contributed by the integrator ‘feature’ in unpredictable decision making environ-
circuit itself. ments. Adding to the possibilities, different neural inte-
grator circuits vary not only in how they process
Spiking neuron models of integrator circuits have met fluctuating evidence, but also in whether and how they
some of these constraints. A number of studies show that may contribute additional noise to the decision compu-
such circuits can reproduce behavioral statistics as well as tation. The future is bright for careful combinations of
key features of neural activity [10,70,68,71,34]. In experiment, modeling, and theory that will link increas-
addition, [10] shows that additional properties of noise- ingly realistic circuit models with optimal algorithms for
less integration — including how confidence in task altern- decision making in uncertain environments.
atives grows over time — are preserved in a simple
integrator circuit with Poisson-like firing (see also Acknowledgements
[68,72]). For biologically detailed models — especially We thank Rafal Bogacz, Anne Churchland, Philip Holmes, and the editors
those in which the long time constants of integration arise for their valuable comments on the manuscript, and these and many other
colleagues for their contributions to our understanding of this material over
through recurrent network interactions [34,73,70,74] — the years. We gratefully acknowledge the support of a Career Award at the
future work will be needed to understand how much Scientific Interface from the Burroughs Welcome Fund (E. S. B.).
variability arises from the fluctuating inputs I(t), from
background neurons, and from the irregular dynamics of
the circuits themselves. References
1. Bogacz R, Brown E, Holmes P, Cohen JD: The physics of optimal
Three recent advances in theoretical neuroscience lay decision making: a formal analysis of models of performance
important groundwork. The first is the model of [71], in in two-alternative forced-choice tasks. Psychol Rev 2006,
113:700-765.
which past evidence is represented by spikes that recur at
2. Laming DRJ: Information Theory of Choice-Reaction Times. New
randomly spaced times, as in a Poisson process — allow- York: Academic Press; 1968.
ing them to continue to contribute variability to the
3. Ratcliff R: A theory of memory retrieval. Psych Rev 1978,
decision process. While this is an abstract model, it does 85:59-108.
separate the variability driven by the evidence stream and
4. Ratcliff R, Van Zandt T, McKoon G: Connectionist and diffusion
that generated by mechanisms within the integrator cir- models of reaction time. Psych Rev 1999, 106:261-300.
cuit itself, a perspective that will remain important in 5. Shadlen MN, Newsome WT: Motion perception: seeing and
future studies. deciding. PNAS 1996, 93:628-633.
6. Stone M: Models for choice-reaction time. Psychometrika 1960,
The second, related development concerns how integra- 25:251-260.
tor circuits process both noise in incoming signals and 7. Usher M, McClelland JL: The time course of perceptual choice:
noise local to each ‘cell’ in the circuit [39,37,75]. In the leaky, competing accumulator model. Psychol Rev 2001,
108:550-592.
particular, functionally feedforward architectures amplify
incoming signals faster than internally occurring noise. 8. Gold J, Shadlen M: Banburismus and the brain: decoding the
relationship between sensory stimuli, decisions, and reward.
This could clearly improve decision making performance, Neuron 2002, 36:299-308.
although constraints such as saturation of firing rates may 9. Zacksenhouse M, Bogacz R, Holmes P: Robust versus optimal
limit when this strategy can be applied [39,37]. Finally, strategies for two-alternative forced choice tasks. J Math
[76] shows how lossless statistical inference over time can Psychol 2010, 54:230-246.
be performed by the (integrate-and-fire) dynamics of 10. Beck JM, Ma WJ, Kiani R, Hanks T, Churchland AK, Roitman J,
 Shadlen MN, Latham PE, Pouget A: Probabilistic population
spiking neural networks. Importantly, this study shows codes for Bayesian decision making. Neuron 2008,
how irregular, variable spiking and optimal neural integ- 60:1142-1152.
The authors show how Bayesian inference — that is, computing the
ration can coexist, and suggests that — at least in prin- relative likelihood that alternative decisions will be correct — can be
ciple — all of the truly detrimental noise in integrator performed by integrating spikes over time. In particular they show that
circuits could be sourced to variability upstream. this holds for broad classes of firing statistics within the ‘exponential
family,’ which includes Poisson statistics, and for many assumptions on
how the stimulus is encoded within a neural population
Conclusion 11. Smith PL, Ratcliff R: Psychology and neurobiology of simple
Models of decision making have an enduring theoretical decisions. Trends Neurosci 2004, 27:161-168.
foundation in the accumulation of evidence streams over 12. Gold JI, Shadlen MN: The neural basis of decision making. Annu
time. In more and more settings, we have learned that  Rev Neurosci 2007, 30:535-574.
This review covers important results that relate classical sequential
this computation can be implemented by simply inte- analysis theory to both behavioral electrophysiological observations
grating neural activity over time. We have also learned
13. Wald A, Wolfowitz J: Optimum character of the sequential
about when and why we need to reach beyond this probability ratio test. Annals Math Statistics 1948, 19:326-339.
concept. For example, exact integration over long time- 14. Link SW: The Wave Theory of Difference and Similarity. Lawrence
scales may be difficult to achieve in realistic circuits. Erlbaum; 1992.
However, this apparent ‘bug’ — and the richly varied 15. Smith PL: From Poisson shot noise to the integrated Ornstein-
circuit dynamics that could result — may become a Uhlenbeck process: neurally principled models of information

www.sciencedirect.com Current Opinion in Neurobiology 2012, 22:1047–1053


1052 Decision making

accumulation in decision-making and response time. J Math 34. Wang X-J: Probabilistic decision making by slow reverberation
Psychol 2010, 54:266-283. in cortical circuits. Neuron 2002, 36:955-968.
16. Zhang J, Bogacz R: Optimal decision making on the basis of 35. Roxin A, Ledberg A: Neurobiological models of two-choice
 evidence represented in spike trains. Neural Comput 2010, decision making can be reduced to a one-dimensional nonlinear
22:1113-1148. diffusion equation. PLoS Comput Biol 2008, 4:e1000046.
This paper applies classical sequential sampling procedures to the
setting of neural population coding. This yields concrete and useful 36. Ganguli S, Bisley J, Roitman J, Shadlen M, Goldberg M, Miller K:
results on when optimal algorithms for evidence accumulation can be One-dimensional dynamics of attention and decision making
accomplished by integrating inputs over time — and when this will fail. in lip. Neuron 2008, 58:15-28.
Results are given for decisions among two and multiple alternatives
37. Ganguli S, Huh D, Sompolinsky H: Memory traces in dynamical
17. Moreno-Bote R: Decision confidence and uncertainty in systems. PNAS 2008, 105:18970-18975.
diffusion models with partially correlated neuronal
integrators. Neural Comput 2010, 22:1786-1811. 38. Goldman MS: Memory without feedback in a neural network.
Neuron 2009, 61:621-634.
18. Brown E, Gao J, Holmes P, Bogacz R, Gilzenrat M, Cohen J:
39. Lim S, Goldman MS: Noise tolerance of attractor and
Simple networks that optimize decisions. Int J Bifurcation
feedforward memory models. Neural Comput 2012, 24:332-390.
Chaos 2005, 15:803-826.
40. Miri A, Daie K, Arrenberg AB, Baier H, Aksay E, Tank DW: Spatial
19. Churchland AK, Kiani R, Chaudhuri R, Wang X-J, Pouget A,
gradients and multidimensional dynamics in a neural
 Shadlen MN: Variance as a signature of neural computations
integrator circuit. Nat Neurosci 2011, 14:1150-1159.
during decision making. Neuron 2011, 69:818-831.
The authors develop a method to assign noise in single-cell responses to 41. Koulakov A, Raghavachari S, Kepecs A, Lisman J: Model for a
one of two sources: (1) variability in an underlying stochastic process that robust neural integrator. Nat Neurosci 2002, 5:775-782.
could represent the time integral of decision evidence, and (2) additional
‘point-process’ noise, such as that emitted from a neuron firing as a 42. Goldman MS, Levine JH, Major G, Tank DW, Seung HS: Robust
Poisson process. This shows a ‘reset’ of the underlying variability to lower persistent neural activity in a model integrator with multiple
values when task stimuli arrive, followed by a gradual increase of this hysteretic dendrites per neuron. Cerebral Cortex 2003,
variability while these stimuli are integrated over time. Both of these 1:1185-1195.
properties are consistent with optimal accumulation of evidence over
time 43. Field GD, Sampath AP, Rieke F: Retinal processing near
absolute threshold: from behavior to mechanism. Annu Rev
20. Kiani R, Hanks TD, Shadlen MN: Bounded integration in parietal Physiol 2005, 67:491-514.
cortex underlies decisions even when viewing duration is
dictated by the environment. J Neurosci 2008, 28:3017-3029. 44. Green D, Swets J: Signal Detection Theory and Psychophysics.
Peninsula Pub; 1966.
21. Bogacz R, Hu P, Holmes P, Cohen JD: Do humans produce the
speed-accuracy tradeoff that maximizes reward rate? Quart J 45. Smith PL, Vickers D: Modeling evidence accumulation with
Exp Psych 2010, 63:863-891. partial loss in expanded judgment. J Exp Psychol Human
Perception Performance 1989, 15:797.
22. Edwards W: Optimal strategies for seeking information:
models for statistics, choice reaction times, and human 46. Purcell BA, Heitz RP, Cohen JY, Schall JD, Logan GD, Palmeri TJ:
information processing. J Math Psychol 1965, 2:312-329. Neurally constrained modeling of perceptual decision making.
Psychol Rev 2010, 117:1113-1143.
23. Maddox WT, Bohil CJ: Base-rate and payoff effects in
multidimensional perceptual categorization. J Exp Psychol 47. Schall JD, Purcell BA, Heitz RP, Logan GD, Palmeri TJ: Neural
1998, 24:1459-1482. mechanisms of saccade target selection: gated accumulator
model of the visual-motor cascade. Eur J Neurosci 2011,
24. Bogacz R, Gurney K: The basal ganglia and cortex implement 33:1991-2002.
optimal decision making between alternative actions. Neural
Comput 2007, 19:442-477. 48. Miller P: Power–law neuronal fluctuations in a recurrent
network model of parametric working memory. J Neurophysiol
25. McMillen T, Holmes P: The dynamics of choice among multiple 2006, 95:1099-1114.
alternatives. J Math Psychol 2006, 50:30-57.
49. Cain N, Barreiro A, Shadlen M, Shea-Brown E: A favorable
26. Wong KF, Wang XJ: A recurrent network mechanism of tradeoff between robustness and performance in sequential
time integration in perceptual decisions. J Neurosci 2006, decision tasks. Cosyne Abstracts 2011.
26:1314-1328.
50. Raposo D, Sheppard JP, Schrater PR, Churchland AK:
27. Goldman MS, Compte A, Wang X-J: Neural integrator models. Multisensory decision-making in rats and humans. J Neurosci
 In: Squire LR, editor. Encyclopedia of Neuroscience, 6:165–178, 2012, 32:3726–3735.
2009.
An excellent review of neural mechanisms for signal integration that 51. Churchland AK, Kiani R, Shadlen MN: Decision-making with
bridges fields of neuroscience multiple alternatives. Nat Neurosci 2008, 11:693-702.

28. Machens CK, Romo R, Brody CD: Flexible dynamics of mutual 52. Roitman JD, Shadlen MN: Response of neurons in the lateral
inhibition: a neural model of two-interval discrimination. intraparietal area during a combined visual discrimination
Science 2005, 307:1121-1124. reaction time task. J Neurosci 2002, 22:9475-9489.

29. Pouget A, Latham P: Digitized neural networks: long-term 53. Shea-Brown E, Gilzenrat M, Cohen JD: Optimization of decision
stability from forgetful neurons. Nat Neurosci 2002, 5:709-710. making in multilayer networks: The role of locus coeruleus.
Neural Comput 2008, 20:2863-2894.
30. Robinson DA: Integrating with neurons. Ann Rev Neurosci 1989,
12:33-45. 54. Aston-Jones G, Cohen JD: An integrative theory of locus
coeruleus-norepinephrine function: adaptive gain and optimal
31. Seung HS: How the brain keeps the eyes still. Proc Natl Acad Sci performance. Annu Rev Neurosci 2005, 28:403-450.
1996, 93:13339-13344.
55. Eckhoff P, Wong-Lin KF, Holmes P: Optimality and robustness of
32. Bogacz R: Optimal decision-making theories: linking  a biophysical decision-making model under norepinephrine
neurobiology with behavior. Trends Cogn Sci 2007, modulation. J Neurosci 2009, 29:4301-4311.
11:118-125. The authors include norepinephrine (NE) receptors in a spiking neuron
model of decision making. They show how decision speed and accuracy
33. Brown E, Holmes P: Modeling a simple choice task: stochastic depends on receptor densities and NE levels in different cell populations,
dynamics of mutually inhibitory neural groups. Stochastics identifying ways in which NE can optimize performance for a given
Dynam 2001, 1:159-191. decision making environment and adapt to changes in that environment.

Current Opinion in Neurobiology 2012, 22:1047–1053 www.sciencedirect.com


Computational models of decision making Cain and Shea-Brown 1053

The authors also demonstrate how a given NE level can have disparate 66. Cohen M, Newsome W: Estimates of the contribution of single
effects of circuits with different receptor densities and configurations  neurons to perception depend on timescale and noise
correlation. J Neurosci 2009, 29:6635-6648.
56. Usher M, Cohen JD, Servan-Schreiber D, Rajkowsky J, Aston- This paper contributes new evidence for an important experimental
Jones G: The role of locus coeruleus in the regulation of finding, that the time-integral of outputs from MT neurons in the moving
cognitive performance. Science 1999, 283:549-554. dots task generally approximates the speed and accuracy of decision
making behavior
57. Drugowitsch J, Moreno-Bote R, Churchland AK, Shadlen MN,
Pouget A: The cost of accumulating evidence in perceptual 67. Zohary E, Shadlen MN, Newsome WT: Correlated neuronal
decision making. J Neurosci 2012, 32:3612-3628. discharge rate and its implication for psychophysical
performance. Nature 1994, 370:140.
58. Frazier P, Yu A: Sequential hypothesis testing under stochastic
deadlines. In Advances in Neural Information Processing 68. Mazurek ME, Roitman JD, Ditterich J, Shadlen MN: A role for
Systems, vol. 20. Edited by Platt JC, Koller D, Singer Y, Roweis S. neural integrators in perceptual decision making. Cerebral
Cambridge, MA: MIT Press; 2008:465-472. Cortex 2003, 13:1257-1269.
59. Huang Y, Friesen A, Rao R: How prior probability influences 69. Shadlen M, Britten K, Newsome W, Movshon J: A computational
decision making: a unifying probabilistic model. Cosyne analysis of the relationship between neuronal and behavioral
Abstracts 2012. responses to visual motion. J Neurosci 1996, 16:1486-1510.
60. Rao RPN: Decision making under uncertainty: a neural model 70. Deco G, Rolls ET, Romo R: Stochastic dynamics as a principle
based on partially observable markov decision processes. of brain function. Progr Neurobiol 2009, 88:1-16.
Front Comput Neurosci 2010, 4.
71. Smith PL, McKenzie CRL: Diffusive information accumulation
61. Ditterich J: Evidence for time-variant decision making. Eur J by minimal recurrent neural models of decision making. Neural
Neurosci 2006, 24:3628-3641. Comput 2011, 23.
62. Wong KF, Huk AC, Shadlen MN, Wang XJ: Neural circuit 72. Simen P, Balci F, deSouza L, Cohen JD, Holmes P: A model
dynamics underlying accumulation of time-varying evidence of interval timing by neural integration. J Neurosci 2011,
during perceptual decision making. Front Comput Neurosci 31:9238-9253.
2007, 1:1-11.
73. Wang X-J: Decision making in recurrent neuronal circuits.
63. Dayan P, Yu AJ: Phasic norepinephrine: A neural interrupt Neuron 2008, 60:215-234.
signal for unexpected events. Network: Comput Neural Syst
2006, 17:335-350. 74. Webb TJ, Rolls ET, Deco G, Feng J: Noise in attractor networks
in the brain produced by graded firing rate representations.
64. Britten KH, Shadlen MN, Newsome WT, Movshon JA: The PLoS One 2011, 6:e23630.
analysis of visual motion: a comparison of neuronal and
psychophysical performance. J Neurosci 1992, 75. Doiron B, Litwin-Kumar A, Polk A: Correlated variability
12:4745-4765. stabilizes persistent activity in networks of spiking neurons.
Abstract 451.23. Soc Neurosci 2011.
65. Britten KH, Shadlen MN, Newsome WT, Movshon JA: Responses
of neurons in macaque MT to stochastic motion signals. Visual 76. Boerlin M, Denève S: Spike-based population coding and
Neurosci 1993, 10:1157-1169. working memory.. PLoS Comput Biol 2011, 7:e1001080.

www.sciencedirect.com Current Opinion in Neurobiology 2012, 22:1047–1053

You might also like