
The Journal of Neuroscience, October 13, 2021 • 41(41):8577–8588

Systems/Circuits

Identification of Pattern Completion Neurons in Neuronal Ensembles Using Probabilistic Graphical Models

Luis Carrillo-Reid,1* Shuting Han,1* Darik O’Neil,1* Ekaterina Taralova,1,2 Tony Jebara,2 and Rafael Yuste1
1Department of Biological Sciences and 2Department of Computer Science, Columbia University, New York, New York 10027

Neuronal ensembles are groups of neurons with coordinated activity that could represent sensory, motor, or cognitive states. The study of how neuronal ensembles are built, recalled, and involved in the guiding of complex behaviors has been limited by the lack of experimental and analytical tools to reliably identify and manipulate neurons that have the ability to activate entire ensembles. Such pattern completion neurons have also been proposed as key elements of artificial and biological neural networks. Indeed, the relevance of pattern completion neurons is highlighted by growing evidence that targeting them can activate neuronal ensembles and trigger behavior. As a method to reliably detect pattern completion neurons, we use conditional random fields (CRFs), a type of probabilistic graphical model. We apply CRFs to identify pattern completion neurons in ensembles in experiments using in vivo two-photon calcium imaging from primary visual cortex of male mice and confirm the CRF predictions with two-photon optogenetics. To test the broader applicability of CRFs, we also analyze publicly available calcium imaging data (Allen Institute Brain Observatory dataset) and demonstrate that CRFs can reliably identify neurons that predict specific features of visual stimuli. Finally, to explore the scalability of CRFs, we apply them to in silico network simulations and show that CRF-identified pattern completion neurons have increased functional connectivity. These results demonstrate the potential of CRFs to characterize and selectively manipulate neural circuits.
Key words: Conditional random fields; graph theory; neuronal ensembles; pattern completion; probabilistic graphical
models; two-photon optogenetics

Significance Statement
We describe a graph theory method to identify and optically manipulate neurons with pattern completion capability in mouse
cortical circuits. Using calcium imaging and two-photon optogenetics in vivo we confirm that key neurons identified by this
method can recall entire neuronal ensembles. This method could be broadly applied to manipulate neuronal ensemble activity
to trigger behavior or for therapeutic applications in brain prostheses.

Received Jan. 9, 2021; revised July 6, 2021; accepted July 11, 2021.
Author contributions: L.C.-R., S.H., E.T., and R.Y. designed research; L.C.-R. and S.H. performed research; L.C.-R., S.H., D.O., E.T., and T.J. contributed unpublished reagents/analytic tools; R.Y. assembled and directed the team and secured funding and resources; L.C.-R., S.H., and D.O. analyzed data; L.C.-R. and R.Y. wrote the paper.
This work was supported by grants from the National Eye Institute (R01EY011787), National Institute of Neurological Disorders and Stroke (R01NS110422 and R34NS116740), National Institute of Mental Health (R01MH115900), National Science Foundation (CRCNS 1822550), and a Vannevar Bush Faculty Fellowship (ONR N000142012828) to R.Y.; Consejo Nacional de Ciencia y Tecnología (CF6653 and CF154039) and Programa de Apoyo a Proyectos de Investigación e Innovación Tecnológica (IA201421) to L.C.-R.; a Howard Hughes Medical Institute International Student Research Fellowship (S.H.); and a National Science Foundation Graduate Research Fellowship (D.O.). We thank Kui Tang for code; laboratory members for comments and virus injections; the Columbia University Yeti and Habanero shared High Performance Computing Cluster for computational resources; and the Stanford University Neuroscience Gene Vector and Virus Core for the AAVdj virus.
*L.C.-R., S.H., and D.O. contributed equally to this work.
L. Carrillo-Reid’s present address: Neurobiology Institute, National Autonomous University of Mexico, Juriquilla, 76230 Queretaro, Mexico.
The authors declare no competing financial interests.
Correspondence should be addressed to Luis Carrillo-Reid at carrillo.reid@comunidad.unam.mx.
https://doi.org/10.1523/JNEUROSCI.0051-21.2021
Copyright © 2021 the authors

Introduction
Following Lorente de Nó’s (1938) proposal that cortical circuits are designed to sustain recurrent reverberating chains of activity, Hebb (1949) postulated that recurrent neuronal activity could generate cell assemblies, functional multineuronal units, as building blocks within neuronal microcircuits. In addition, Hebb speculated that strong recurrent connections between a given set of neurons would increase the probability of reactivating the whole group when only one of its elements was activated. This property, known as pattern completion, represents an efficient mechanism by which the activation of a few neurons can retrieve previously stored information (Marr, 1971; Hopfield, 1982; Rolls and Treves, 1994). Acknowledging this tradition, we use the term “pattern completion” to define neurons whose activation can recall an entire ensemble (Carrillo-Reid et al., 2016, 2019).

A neuronal ensemble is a group of neurons with coordinated activity that represents an experimental or behavioral condition (Carrillo-Reid and Yuste, 2020a,b). Advances in two-photon calcium imaging and two-photon optogenetics have made possible the simultaneous reading and writing of cortical ensemble activity with single-cell resolution in awake animals (Rickgauer et al., 2014; Packer et al., 2015).
Coactivating sets of neurons can artificially imprint neuronal ensembles in vivo (Carrillo-Reid et al., 2016). Neuronal ensembles can also be recalled by targeting individual neurons with two-photon holographic optogenetics, demonstrating pattern completion and causality between neuronal ensemble activity and learned behaviors (Carrillo-Reid et al., 2019; Marshel et al., 2019; Robinson et al., 2020). The identification of pattern completion neurons (p.c. neurons) could enable the systematic manipulation of neuronal ensembles.

Graph theory has been used in neuroscience to describe the structural and functional organization of entire brains (Bullmore and Sporns, 2009). Graphs are usually constructed with nodes representing brain regions (Bettencourt et al., 2007) and edges representing information flow (Iturria-Medina et al., 2008). Many studies have constructed graphs for functional analysis of brain data from fMRI, EEG, and electrode arrays, taking brain regions (Achard and Bullmore, 2007; Fair et al., 2008; Hagmann et al., 2008), voxels (Eguíluz et al., 2005; van den Heuvel et al., 2008; Zuo et al., 2012), or electrode positions (Downes et al., 2012) as nodes, and cross-correlation, mutual information, or Granger causality as edges (Fair et al., 2008; Bullmore and Sporns, 2009; Micheloyannis et al., 2009; Wang et al., 2010; Khazaee et al., 2015). Graphs are usually associated with a restricted set of parameters that describe the weight and direction of edges obtained by pairwise metrics, underscoring network interactions. To understand the role of pattern completion neurons in awake animals, it is necessary to generate graphical models that capture the interactions between neurons and then to identify and individually manipulate neurons with pattern completion capability that could orchestrate network activity (Carrillo-Reid and Yuste, 2020a).

Neuronal ensembles form a network structure that can be naturally characterized with probabilistic graphical models, where nodes and edges are biologically meaningful, representing neurons and their functional connections, respectively. Here, we used conditional random fields (Koller and Friedman, 2009), a combination of graph theory and probabilistic modeling that uses graphs to simplify and express the conditional interactions among a collection of variables. We applied CRFs to existing data from a recent work that used two-photon optogenetics to investigate pattern completion properties of neurons from mouse primary visual cortex in vivo (Carrillo-Reid et al., 2016), and we demonstrate that the parameters obtained from CRFs can predict neurons with the ability to recall artificially imprinted ensembles. To test the broad applicability of CRFs, we also analyzed publicly available calcium imaging data (Allen Institute Brain Observatory dataset) and demonstrated that our approach can identify neurons that reliably predict different features of visual stimuli and could potentially be targeted to recall percepts. Finally, to test the scalability of CRFs, we used in silico network simulations and demonstrated that a few neurons from Hopfield-style networks have pattern completion capability and increased functional connectivity.

Materials and Methods
Animals and surgery. All experimental procedures were conducted in accordance with the National Institutes of Health and Columbia University Institutional Animal Care and Use Committee and have been described previously (Carrillo-Reid et al., 2016, 2019). Briefly, simultaneous two-photon imaging and two-photon optogenetic experiments were performed on C57BL/6 male mice. The viruses AAV1-syn-GCaMP6s-WPRE-SV40 and AAVdj-CaMKIIa-C1V1(E162T)-TS-P2A-mCherry-WPRE were injected simultaneously into layer 2/3 of left primary visual cortex (2.5 mm lateral and 0.3 mm anterior from lambda, 200 μm from pia). After 3 weeks, mice were anesthetized with isoflurane (1–2%), and a titanium head plate was attached to the skull using dental cement. Dexamethasone sodium phosphate (2 mg/kg) and enrofloxacin (4.47 mg/kg) were administered subcutaneously. Carprofen (5 mg/kg) was administered intraperitoneally. After surgery, animals received carprofen injections for 2 d as postoperative pain medication. A reinforced thinned-skull window for chronic imaging (2 mm in diameter) was made above the injection site using a dental drill. A 3 mm circular glass coverslip was placed and sealed using a cyanoacrylate adhesive (Drew et al., 2010). Imaging experiments were performed 7–28 d after head plate fixation. During recording sessions, mice were awake (head fixed) and could move freely on a circular treadmill.

Visual stimulation. Visual stimuli were generated using the MATLAB Psychophysics Toolbox and displayed on an LCD monitor positioned 15 cm from the right eye at 45° to the long axis of the animal. Population activity corresponding to two-photon stimulation of targeted neurons in layer 2/3 of visual cortex was recorded with the monitor displaying a gray screen with mean luminance similar to the drifting gratings. The imaging setup and the objective were completely enclosed with blackout fabric and black electrical tape. Visual stimuli consisted of full-field sine-wave drifting gratings (100% contrast, 0.035 cycles/°, 2 cycles/s) drifting in two orthogonal directions, presented for 4 s and followed by 6 s of mean luminance (Carrillo-Reid et al., 2016, 2019).

Simultaneous two-photon calcium imaging and photostimulation. Two-photon imaging and optogenetic photostimulation were performed with two different femtosecond-pulsed lasers attached to a commercial microscope. An imaging laser (λ = 940 nm) was used to excite a genetically encoded calcium indicator (GCaMP6s), and a photostimulation laser (λ = 1064 nm) was used to excite a red-shifted opsin (C1V1), which preferentially responds to longer wavelengths (Packer et al., 2012).

The two laser beams on the sample were individually controlled by two independent sets of galvanometric scanning mirrors. The imaged field of view was ∼240 × 240 μm (25×, numerical aperture 1.05, XLPlan N objective), comprising 60–100 neurons. We adjusted the power and duration of photostimulation so that the amplitude of calcium transients evoked by C1V1 activation mimicked the amplitude of calcium transients evoked by visual stimulation with drifting gratings. Single-cell photostimulation was performed with a spiral pattern delivered from the center of the cell to the boundaries of the soma, as previously published (Carrillo-Reid et al., 2016).

Image processing. Image processing was performed with ImageJ (version 1.42q, National Institutes of Health) and custom-made programs written in MATLAB, as previously described (Carrillo-Reid et al., 2016, 2019). Acquired images were processed to correct motion artifacts using TurboReg. Neuronal contours were automatically identified using independent component analysis and image segmentation (Mukamel et al., 2009). Calcium transients were computed as changes in fluorescence, (Fi − Fo)/Fo, where Fi denotes the fluorescence intensity at any frame and Fo denotes the basal fluorescence of each neuron. Spikes were inferred from the first time derivative of the calcium signals with a threshold of 3 SDs above noise level. We constructed an N × F binary matrix, where N denotes the number of active neurons and F represents the total number of frames for each movie.
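The event-detection step above can be summarized in a few lines of MATLAB; this is a minimal sketch assuming a raw fluorescence matrix F (neurons × frames), with the baseline estimate and noise measure simplified relative to the published pipeline (variable names are illustrative, not those of the authors' code).

```matlab
% F: raw fluorescence traces, N neurons x T frames
F0 = prctile(F, 10, 2);                                  % assumed estimate of basal fluorescence Fo
dff = bsxfun(@rdivide, bsxfun(@minus, F, F0), F0);       % (Fi - Fo)/Fo
d_dff = diff(dff, 1, 2);                                 % first time derivative of the calcium signals
noise_sd = std(d_dff, 0, 2);                             % per-neuron noise level (simplified)
spikes = [false(size(F,1),1), bsxfun(@gt, d_dff, 3*noise_sd)];  % threshold at 3 SDs above noise
% "spikes" is the N x F binary activity matrix used to build population vectors.
```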
Population vectors. We constructed multidimensional population vectors that represent the simultaneous activation of different neurons. Peaks of synchronous activity describe population vectors (Carrillo-Reid et al., 2008). Only high-activity frames were used. An activity threshold was determined by generating 1000 shuffled raster plots and comparing the distribution of the random peaks against the peaks of synchrony observed in the real data (Shmiel et al., 2006). We tested the significance of population vectors against the null hypothesis that the synchronous firing of neuronal pools is given by a random process (Shmiel et al., 2006; Carrillo-Reid et al., 2015a). Such population vectors can be used to describe the network activity as a function of time (Schreiber et al., 2003; Stopfer et al., 2003; Brown et al., 2005; Sasaki et al., 2007; Carrillo-Reid et al., 2008). The number of dimensions for each experiment is given by the total number of active cells (N). Neuronal ensembles are defined by the concomitant firing of neuronal groups at different times. The similarity between two population vectors was defined by their normalized inner product, which represents the cosine of the angle between the vectors. A similarity index close to 1 indicates that both vectors point in a similar direction in an N-dimensional space, and therefore the neurons that define each vector are almost the same. A similarity index of 0 denotes that the pair of vectors are orthogonal, so the neurons that define each vector are different (Carrillo-Reid et al., 2015a,b; Carrillo-Reid and Yuste, 2020b).
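A minimal MATLAB sketch of this vector construction, assuming the binary spikes matrix from the previous sketch (or any N × T raster); the surrogate threshold here uses a simple within-neuron shuffle and the 95th percentile of the shuffled synchrony peaks, which is a simplification of the significance test described above.

```matlab
% spikes: N x T binary matrix (neurons x frames)
[N, T] = size(spikes);
coactive = sum(spikes, 1);                     % number of coactive neurons per frame
peaks = zeros(1, 1000);
for k = 1:1000                                 % surrogate rasters preserving per-cell activity
    shuf = spikes;
    for n = 1:N
        shuf(n, :) = spikes(n, randperm(T));
    end
    peaks(k) = max(sum(shuf, 1));              % largest synchrony peak in each surrogate
end
thr = prctile(peaks, 95);                      % assumed significance level
popVecs = spikes(:, coactive > thr);           % high-activity population vectors (N x M)
% Cosine similarity between two population vectors (1 = same neurons, 0 = orthogonal)
v1 = double(popVecs(:, 1)); v2 = double(popVecs(:, 2));
sim = dot(v1, v2) / (norm(v1) * norm(v2));
```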
Allen Institute Brain Observatory dataset. To demonstrate the general applicability of our approach, we analyzed a publicly available dataset from the Allen Brain Observatory (http://observatory.brain-map.org/visualcoding), along with the software development kit (SDK) for extracting fluorescence and (Fi − Fo)/Fo (http://alleninstitute.github.io/AllenSDK/), by the Allen Institute for Brain Science (de Vries et al., 2020). Spikes were detected by first low-pass filtering the (Fi − Fo)/Fo traces and then applying a threshold of 5 SDs above noise level on the first derivative of the filtered (Fi − Fo)/Fo. The experiment identifiers used are 511507650, 511509529, 511510650, 511510670, 511510718, and 511510855.

Conditional random fields. We constructed CRFs as previously published (Tang et al., 2016), using indicator feature vectors x = (x^1, x^2, ..., x^M), where x^m ∈ X^N, for each edge and node, and target binary population activity vectors y = (y^1, y^2, ..., y^M), where y^m ∈ Y^N, for N neurons and M samples (time points). X^N and Y^N describe an N-dimensional space whose dimensionality corresponds to the total number of active neurons. For each sample, the conditional probability can be expressed as follows:

$$p(y^m \mid x^m, \theta) = \frac{\exp\left(\langle f(x^m, y^m), \theta\rangle\right)}{Z(x^m, \theta)},$$

where f is a vector of features of the distribution expressed in log-linear form, θ is a vector of parameters of the log-linear distribution, and Z is the partition function:

$$Z(x^m, \theta) = \sum_{y \in Y} \exp\left(\langle f(x^m, y), \theta\rangle\right).$$

The conditional probability can be factored over a graph structure G = (V, A), where V is the collection of nodes representing observation variables and target variables, and A is the collection of subsets of V. Based on the graph structure, the model parameters can be written separately for nodes and edges as f = {f_V, f_A} and θ = {θ_V, θ_A} ∈ ℝ^N. Given binary x and y, the node features f_V and parameters θ_V include two sets of values corresponding to the node states 0 and 1, whereas the edge features f_A and parameters θ_A include four sets of values corresponding to the edge states 00, 01, 10, and 11 (see Fig. 2). The conditional dependencies can then be written as follows:

$$p(Y \mid X, \theta) = \frac{\exp\left(\sum_{i \in V} \theta_i f_i(X, Y_i) + \sum_{a \in A} \theta_a f_a(X, Y_a)\right)}{Z(X, \theta)}.$$

This model is a generalized version of Ising models, which have been previously applied to model neuronal networks (Schaub and Schultz, 2012). The log likelihood of each observation can then be written as follows:

$$\ell(\theta; X^m, Y^m) = \langle f(X^m, Y^m), \theta\rangle - \log Z(X^m).$$

To reduce the complexity of the model, we first learn a sparse graph structure that represents variable dependencies, and then we construct a CRF on the learned structure. Given the inferred binary spikes from the raw imaging data, we constructed CRF models in the following two steps: (1) structure learning and (2) parameter learning.
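To make the role of the node and edge potentials concrete, the following MATLAB sketch scores one binary population vector under a learned model. The function name and data layout (node potentials as an N × 2 matrix, edge potentials as 2 × 2 tables per edge) are illustrative assumptions, not the format used in the authors' repository, and only the unnormalized log-score is computed because the partition function requires the approximate inference described below.

```matlab
function s = crf_logscore(y, nodePot, edgeList, edgePot)
% y:        N x 1 binary population vector (0 = silent, 1 = active)
% nodePot:  N x 2 log node potentials, columns for states 0 and 1
% edgeList: E x 2 list of connected node pairs; edgePot: 2 x 2 x E log edge potentials
idx = y(:) + 1;                                          % map states {0,1} to columns {1,2}
s = sum(nodePot(sub2ind(size(nodePot), (1:numel(y))', idx)));
for e = 1:size(edgeList, 1)
    i = edgeList(e, 1); j = edgeList(e, 2);
    s = s + edgePot(idx(i), idx(j), e);                  % selects the 00/01/10/11 term
end
end
```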


For structure learning, we learned a sparse graph structure G = (V, A) using ℓ1-regularized neighborhood-based logistic regression for each node r (Ravikumar et al., 2010) as follows:

$$\min_{\theta^{s}_{\setminus r}} \left\{ \ell(\theta^{s}; x) + \lambda_s \left\lVert \theta^{s}_{\setminus r} \right\rVert_1 \right\},$$

where

$$\ell(\theta^{s}; x) = -\frac{1}{n} \sum_{i=1}^{n} \log \frac{\exp\!\left(2 x_r \sum_{t \in V \setminus r} \theta^{s}_{rt}\, x_t\right)}{\exp\!\left(2 x_r \sum_{t \in V \setminus r} \theta^{s}_{rt}\, x_t\right) + 1},$$

$$\theta^{s}_{\setminus r} = \left\{ \theta^{s}_{ru},\; u \in V \setminus r \right\},$$

where ∖r denotes “except r,” and θ^s is the vector of regression parameters for structure learning. Here, λ_s is a regularization parameter that controls the sparsity (or, conversely, the density) of the constructed graph structure. This is essentially a logistic regression of variable X_r on the other variables X_{∖r}, with ℓ1-regularization. The regression coefficients thus represent the neighborhood structure and the sign pattern. We implemented the ℓ1 regularization using a highly efficient elastic-net algorithm that solves the penalized maximum likelihood problem

$$\min_{\beta_0, \beta} \frac{1}{N} \sum_{i=1}^{N} w_i\, \ell\!\left(y_i, \beta_0 + \beta^{T} x_i\right) + \lambda_s \left[ \frac{(1-\alpha)}{2} \lVert \beta \rVert_2^2 + \alpha \lVert \beta \rVert_1 \right],$$

where α = 1, which is equivalent to a standard LASSO (least absolute shrinkage and selection operator). We constructed a large number of structures (1001) across a broad range of λ_s, from which we stochastically selected a number of structures for subsequent parameter learning. We ensured proper bracketing by including the largest and smallest λ_s structures in this priming set. In subsequent iterations of parameter learning, the sets of structures were selected using the log likelihood of the learned models as feedback. The final graph structure was obtained by thresholding the edge potentials with a given density preference d: edges with potential values within the top d quantile were kept as the final structure. It is worth noting that although d could bias the result, varying d does not lead to density values that differ much, because of the sparsity induced by the ℓ1 regularizer.
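A minimal MATLAB sketch of this neighborhood-selection step, using lassoglm from the Statistics and Machine Learning Toolbox as a stand-in for the elastic-net solver used by the authors; the function name, the single regularization value, and the 0/1 coding of the raster are illustrative assumptions.

```matlab
function nbrs = neighborhood_lasso(spikes, r, lambda_s)
% spikes: N x T binary matrix; returns the estimated neighbors of node r
others = [1:r-1, r+1:size(spikes, 1)];
X = double(spikes(others, :))';               % predictors: activity of all other neurons
y = double(spikes(r, :))';                    % response: activity of neuron r
% L1-regularized logistic regression (Alpha = 1 corresponds to a standard LASSO)
B = lassoglm(X, y, 'binomial', 'Lambda', lambda_s, 'Alpha', 1);
nbrs = others(B' ~= 0);                       % nonzero coefficients define the neighborhood
end
```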


For parameter learning, we aim to learn the edge and node potential parameters θ = {θ_V, θ_A}. Based on the learned structure, we used the Bethe approximation to approximate the partition function and iterative Frank–Wolfe methods to perform parameter estimation by maximizing the log likelihood of the observations with a quadratic regularizer (Tang et al., 2016) as follows:

$$\ell(\theta; X, Y) = \sum_{m=1}^{M} \ell(\theta; X^m, Y^m) - \frac{\lambda_p}{2} \lVert \theta \rVert_2^2.$$

Here, λ_p is a regularization parameter that controls the learned parameters and helps prevent overfitting. Cross-validation was done to find the best λ_s, d, and λ_p via held-out model likelihood. We varied λ_s with six values between 0.002 and 0.5, d with six values between 0.25 and 0.3, and λ_p with five values between 10 and 10,000, all sampled uniformly. To obtain the best model parameters, 90% of the data were used for training and 10% of the data were withheld. Cross-validation was done using all possible combinations of the above parameters and calculating the likelihood of the withheld data. The best model parameters were then determined by selecting the parameter set with a locally maximum likelihood in the parameter space. This empirically results in models with a connectivity density of ∼10 to 40%, corresponding to the previously reported connectivity probability in layer 2/3 of V1 (Yoshimura et al., 2005; Ko et al., 2011).
Figure 1. Characterization of neuronal ensembles as multidimensional vectors. A, Population activity represented as population vectors. The activity from the entire population at each time point (t) is represented by a binary vector (v1 to vt). B, Schematic representation of population vectors in an N-dimensional space, where N denotes the total number of observed neurons. Each point in a multidimensional space represents a population vector defined by a coactive group of neurons (left). Dashed arrows indicate the dimensionality of the data (d1 to dn). For clarity, only population vectors v1 and v2 are depicted by solid arrows. A neuronal ensemble could be defined by a cluster of population vectors (right).

Node strength. As 0 and 1 on graph nodes represent the nonactive and active states of neurons, the edge potential term 11 represents the coactive state of connected neurons, whereas the 01 and 10 terms represent states in which only one of the two neurons is active, and the 00 term represents the state in which both nodes (neurons) are nonactive. Because the nodes are mostly inactive during recording, the 00 term is the strongest term, whereas only the 11 term captures neuronal ensemble coactivation. Therefore, we defined the node strength as the sum of the 11 terms of the edge potentials from all connecting edges as follows:

$$s(i) = \sum_{j=1}^{N_E(i)} \phi_{11}(i, j).$$

Here, N_E(i) denotes the number of connecting edges for node i. The defined node strength reflects the importance of a given cell in coactivating other cells.
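In MATLAB, if the 11 terms of the learned edge potentials are arranged in a symmetric N × N matrix with zeros where no edge exists (an assumption about how the model output is stored), the node strength is a single sum:

```matlab
% phi11: N x N symmetric matrix of phi_11 edge-potential terms (0 where no edge exists)
nodeStrength = sum(phi11, 2);     % s(i) = sum over connecting edges of phi_11(i, j)
```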
Shuffling method. To generate shuffled models, we first randomized the spike raster matrices while preserving the activity per cell and per frame. Then, we trained CRF models using the shuffled spike matrices, with the cross-validated λ_s, d, and λ_p from the real model. This procedure was repeated 100 times. The random level of node strengths was determined by averaging the node strengths from the shuffled models.
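Preserving both margins of a binary raster requires swap-based shuffling rather than simple permutation; the sketch below uses the classic checkerboard-swap approach, which is one way to randomize the raster while keeping the activity per cell and per frame fixed (an assumption, not necessarily the authors' implementation).

```matlab
function shuf = shuffle_raster(spikes, nSwaps)
% spikes: N x T binary matrix; nSwaps: number of accepted swaps to perform
shuf = spikes;
[N, T] = size(shuf);
done = 0;
while done < nSwaps
    r = randperm(N, 2); c = randperm(T, 2);          % pick two neurons and two frames
    sub = shuf(r, c);
    if isequal(sub, [1 0; 0 1]) || isequal(sub, [0 1; 1 0])
        shuf(r, c) = 1 - sub;                        % swap the 2 x 2 checkerboard
        done = done + 1;                             % row and column sums are unchanged
    end
end
end
```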
Figure 2. Classification of visual stimuli from calcium imaging data using CRFs. A, In the experimental setup, simultaneous two-photon imaging and two-photon optogenetics were performed in layer 2/3 of primary visual cortex in head-fixed mice that could move freely on a circular treadmill. B, Graphical representation of CRFs. Circles represent neurons, squares represent added nodes depicting visual stimuli, shaded nodes (x) represent observed data, and white nodes (y) represent true states of the neurons and are connected by edges that indicate their mutual dependencies. Node potentials are defined over the two possible states of each node, and edge potentials are defined over the four possible states of each existing edge, depending on the states of the adjacent nodes. C, Representation of the graphical structure with added stimulus nodes. Each stimulus introduces an additional node to the graph. D, Ratio of log likelihoods predicting horizontal (top, red) and vertical (bottom, blue) drifting gratings calculated by CRF models. Colored stripes indicate visual stimuli. Scale bar, 10 s. E, The accuracy of real models is higher compared with shuffled models (n = 8 mice, 16 models; p < 0.0001 (****); Mann–Whitney test).

Identifying putative pattern completion neurons. To find the most influential neurons for each condition, we iterated through all the neurons and identified their contribution to predicting the stimulus conditions in the population.
To this end, for the ith neuron in the population, we compared its activity or inactivity in all M frames for each model. With the two resulting population vectors in the mth frame among all samples, we calculated the log probability of each of them coming from the trained CRF model as follows:

$$p^m_{i,1} = p\!\left(y^m \mid x^m_{\setminus i},\, x^m_i = 1,\, \theta\right),$$

$$p^m_{i,0} = p\!\left(y^m \mid x^m_{\setminus i},\, x^m_i = 0,\, \theta\right).$$

Then, we computed the log likelihood ratio as follows:

$$\ell_{i,10} = \log p^m_{i,1} - \log p^m_{i,0}, \quad m = 1, \ldots, M,$$

and calculated the standard receiver operating characteristic (ROC) curve with the ground truth given by the timing of each presented visual stimulus. The prediction ability of all nodes for all presented stimuli is then represented by an area under the curve (AUC) matrix A, where A_{i,d} represents the AUC value of node i predicting stimulus d. Additionally, we calculated the node strength S = {s_i} of each neuron in the CRF model. We then computed the prediction AUC A^r and node strength S^r of each node r from 100 CRF models trained on shuffled data. The final ensemble for stimulus d is defined as follows: {i | A_{i,d} > mean(A^r_d) + std(A^r_d), S_i > mean(S^r) + std(S^r)}.
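The selection criterion can be written compactly in MATLAB; this sketch assumes that the per-frame log-likelihood ratios and the shuffled-model statistics have already been computed, and it uses perfcurve from the Statistics and Machine Learning Toolbox for the ROC analysis.

```matlab
function isPC = is_pattern_completion(llr, stim, s_i, aucShuf, sShuf)
% llr:  1 x M log-likelihood ratios log p_{i,1}^m - log p_{i,0}^m for neuron i
% stim: 1 x M logical ground truth (true when the stimulus of interest was on)
% s_i:  node strength of neuron i; aucShuf, sShuf: values from 100 shuffled models
[~, ~, ~, auc] = perfcurve(stim, llr, true);        % ROC with stimulus timing as ground truth
isPC = (auc > mean(aucShuf) + std(aucShuf)) && ...  % high AUC relative to shuffled models
       (s_i > mean(sShuf) + std(sShuf));            % and high node strength
end
```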
Figure 3. Identification of putative pattern completion neurons using CRFs. A, Schematic representation of the extraction of putative pattern completion neurons from CRF models with added nodes for two different visual stimuli. The activity of the ith neuron is set to 1 or 0 at each frame, and the log likelihoods p^m_{i,1} and p^m_{i,0} of the modified population vectors are calculated. B, Log-likelihood inference and prediction for cortical putative pattern completion neurons representing horizontal (top, red) or vertical (bottom, blue) drifting gratings. C, Graphical representation of node strength. D, Putative pattern completion neurons are identified by high AUC and high node strength (top right quadrant). Confidence levels were defined from CRF models of shuffled data (gray bars). E, AUC of putative pattern completion neurons with high node strength is higher than that of nonpattern completion neurons (n = 14 mice, 28 models; p < 0.0001 (****); Mann–Whitney test).

Neural network simulation to study pattern completion. The simulated neural network was generated using 10 Hopfield nets embedded in a larger matrix. Each Hopfield network was generated synchronously and deterministically (Gerstner et al., 2014). The binary state S of neuron i at the following time step (t + 1), given inputs indexed by j and weights w_ij, was described by the following:

$$S_i(t+1) = \sum_j w_{ij}\, S_j(t).$$

Every neuron was fully connected according to the weight matrix w, defined by the following equation:

$$w_{ij} = \frac{1}{N} \sum_u \mathrm{ens}^u_i\, \mathrm{ens}^u_j,$$

where ens^u_i is the state of neuron i in the uth ensemble. Only one ensemble was active at any given time. External stimuli were produced by initializing the network to a noisy ensemble state (20% noise), after which the network was permitted to iterate for 5 time steps to simulate the pattern completion of the ensemble. In total, the matrix contained 810 neurons and 100 ensembles. The probability of any given neuron being active during an ensemble external to its respective Hopfield net was fixed at 2.5%.
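A compact MATLAB sketch of one such embedded Hopfield net; the sizes, the random stored patterns, and the thresholded update are illustrative simplifications of the simulation described above, not the authors' simulation code.

```matlab
nNeurons = 81; nEns = 10;                           % one embedded net (assumed size)
ens = double(rand(nNeurons, nEns) > 0.5);           % stored binary ensemble patterns
w = (ens * ens') / nNeurons;                        % w_ij = (1/N) * sum_u ens_i^u * ens_j^u
w(1:nNeurons+1:end) = 0;                            % remove self-connections (assumption)
S = ens(:, 1);                                      % start from ensemble 1 ...
flipIdx = rand(nNeurons, 1) < 0.2;
S(flipIdx) = 1 - S(flipIdx);                        % ... corrupted with 20% noise
for t = 1:5                                         % iterate 5 time steps
    drive = w * S;
    S = double(drive > mean(drive));                % simplified threshold on the summed input
end
overlap = mean(S == ens(:, 1));                     % fraction of units matching the stored pattern
```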
Data analysis. CRF models were trained using the Columbia University Yeti and Habanero shared High Performance Computing Cluster (see Figs. 2–6). Benchmarking and simulation modeling were performed on a custom-built computer using an Intel Core i9-9900K processor and an NVIDIA Titan RTX (see Figs. 7, 8). MATLAB R2016a, R2019b, and R2020a (MathWorks) were used for data analysis.

We did not use any statistical method to predetermine sample size. Sample sizes are similar to those reported previously (Carrillo-Reid et al., 2016, 2019). Statistical tests were done in GraphPad Prism 5. Statistical details of each specific test can be found in the article and the figure legends.

Data availability. Step-by-step implementation of the code and associated notations can be found at the following: https://github.com/hanshuting/graph_ensemble for the code used to implement CRF models on a cluster, and https://github.com/darikoneil/Identification-of-Pattern-Completion-Neurons-in-Neuronal-Ensembles-using-Probabilistic-Graphical-Mod for the code used to run simulated data and CRF models on a single computer.

Results
Neuronal ensembles as multidimensional population vectors
As neuroscience moves from the study of individual neurons to neural ensembles as the substrate of brain functions, it is necessary to develop methods to identify neurons that are able to recall entire ensembles (Carrillo-Reid et al., 2017).
The development of optical techniques has allowed the simultaneous recording and manipulation of neurons with single-cell precision (Rickgauer et al., 2014; Packer et al., 2015). However, studies performing simultaneous recordings usually use pairwise correlations or the average of all recorded neurons as a measurement of network activity. As neuronal ensembles are related to specific functions that can be repeated at different times, they can be optimally defined by binary multidimensional population vectors (see above, Materials and Methods), where each vector represents the activity of all observed neurons at a given time window, and the dimensionality of the vectors is defined by the number of observed neurons (Fig. 1A). In this context, neuronal ensembles could be understood as clusters of population vectors that point in a similar direction in a high-dimensional space (Fig. 1B). The advantage of conceptualizing neuronal ensembles as population vectors, instead of pairwise correlations between neurons or averaged activity, is that large datasets can be visualized as changes in population vectors in real time, whereas pairwise correlations between neurons depend on the length of the recording (Carrillo-Reid and Yuste, 2020b). Thus, we built CRFs using population vectors that take into account all the recorded neurons simultaneously, allowing the comparison of network activity under different experimental conditions to highlight neurons with pattern completion capability that could be used to orchestrate entire functional networks.

Figure 4. Confirming pattern completion with two-photon optogenetics. A, Graphical representation of models obtained using CRFs from simultaneous two-photon imaging and two-photon optogenetic stimulation of a neuron with pattern completion capability before (left) and after (right) two-photon optogenetic ensemble imprinting. Ensemble connections are highlighted with blue lines. Scale bar, 50 μm. B, Circular graphs of CRF models before (pre-) and after (post-) ensemble imprinting. Connections between ensemble neurons are shown in blue. Red represents the photostimulated neuron (arrow). C, Node strength and AUC values showed network changes of neurons with pattern completion capability. The photostimulated neuron is represented in red before (left) and after (right) ensemble imprinting. Confidence levels calculated from random data are depicted by gray bars.

Building CRF models from population responses to oriented stimuli
Cortical neurons from layer 2/3 in primary visual cortex are organized as neuronal ensembles, where each ensemble depicts a given orientation of visual stimuli (Miller et al., 2014; Carrillo-Reid et al., 2015b). To investigate the ability of CRF models to identify groups of neurons that could recall the representation of a specific orientation of visual stimuli, we analyzed population responses to drifting gratings from layer 2/3 neurons of primary visual cortex in awake head-fixed mice from previously published data (Carrillo-Reid et al., 2016; Fig. 2A).

CRFs model the conditional distribution p(y|x) of a network, where x represents observations and y represents labels associated with a graphical structure (Sutton and McCallum, 2012). As no assumptions are made on x, CRFs can describe the conditional distribution with complex dependencies in the observation variables, associated with a graphical structure that is used to constrain the interdependencies between labels (Fig. 2B). CRFs have been successfully applied in diverse areas of machine learning that involve multidimensional data, such as analysis of texts (Peng et al., 2011), bioinformatics (Sato and Sakakibara, 2005; Liu et al., 2006), computer vision (Brecht et al., 2004; Sminchisescu et al., 2006), and natural language processing (Lafferty et al., 2001; Choi et al., 2005), but their application to identify neurons with pattern completion capability, which would allow the study of the properties and mechanisms of neuronal ensemble formation, has been lacking.

Population vectors representing the coordinated activity of neuronal groups were inferred from calcium imaging recordings and used as training data. We defined activity events from each neuron as nodes in an undirected graph, where each node can have two values, 0 corresponding to nonactivity and 1 corresponding to activity. In this way, nodes interact with each other through connecting edges, which have four possible configurations, 00, 01, 10, and 11, depending on the values of the two nodes adjacent to the edge. The two values associated with nodes and the four values associated with edges are characterized by a set of parameters called node potentials (φ_0, φ_1) and edge potentials (φ_00, φ_01, φ_10, φ_11), respectively (Fig. 2B). These parameters, also known as potential functions, reflect the scores of individual values on each node and edge. Using part of the observation data (e.g., training data), we estimated the model's structure (e.g., graph connectivity) and parameters (e.g., the potentials) via maximum likelihood (see above, Materials and Methods). We then performed cross-validation on held-out data to identify the optimal level of sparsity and regularization (see above, Materials and Methods). Once the model was learned from the data, the normalized product of the corresponding node and edge potentials describes a probability distribution that gives a scalar score when a given neuronal population becomes active. This is a normalized probability over all possible network states (e.g., all binary configurations of the nodes) and is estimated by maximizing likelihood on past observed population vectors (see above, Materials and Methods).

To integrate information about the external stimulus along with the observed neuronal data, we added an additional node for each type of stimulus that was presented to the animal. This node was set to 1 when the corresponding stimulus was on and 0 when the stimulus was off (Fig. 2B).
In this way, CRFs modeled the conditional probability of network states given the observations for different visual stimuli. Therefore, by treating visual stimuli as added nodes and comparing the output likelihood of observing each stimulus, CRFs were able to model different visual stimuli from observed data. The nodes directly connected to the added nodes generate a specific model for each experimental condition (Fig. 2C). Given two different visual stimuli (horizontal or vertical drifting gratings; ys1 and ys2, respectively), the log probability corresponding to observing each stimulus is defined by ps1 = p(x^m | y^m, ys1 = 1, ys2 = 0) and ps2 = p(x^m | y^m, ys1 = 0, ys2 = 1). Thus, assuming that the a priori probability of the visual stimuli is uniform, the higher probability value, obtained by comparing the log-likelihood ratio log(ps1) − log(ps2), can be used to classify each stimulus (Fig. 2D). To quantify the classification performance of CRFs, we calculated the accuracy percentage in experimental data and compared it with shuffled models in which the node potentials and edge potentials from the real model were randomly shuffled (Fig. 2E). The accuracy of the real models was significantly higher than that of the shuffled models (n = 8 mice; real model accuracy = 79.82 ± 8.34%; shuffled model accuracy = 61.21 ± 22.55%; mean ± SD; p < 0.0001; Mann–Whitney test), demonstrating that CRFs can model different orientations of drifting gratings from calcium imaging population recordings.
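A minimal MATLAB sketch of this frame-wise classification, assuming hypothetical per-frame log likelihoods obtained from the model with each stimulus node clamped to its "on" state (the inference step itself is part of the linked repository and is not shown):

```matlab
function accuracy = classify_stimuli(logp_s1, logp_s2, truth)
% logp_s1, logp_s2: 1 x M log likelihoods with stimulus node 1 or 2 set to 1
% truth: 1 x M ground-truth labels (1 = horizontal, 2 = vertical drifting gratings)
pred = 1 + (logp_s2 > logp_s1);           % classify each frame by the log-likelihood ratio
accuracy = 100 * mean(pred == truth);     % percentage of correctly classified frames
end
```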
Figure 5. Applicability of CRFs to public calcium imaging data. A, Calcium transients of representative pattern completion neurons identified from CRF models for different orientations of drifting gratings. Gray stripes indicate visual stimuli for the corresponding orientation of drifting gratings for each neuronal ensemble. B, Temporal course of neuronal ensemble activation. Each neuronal ensemble was defined by all putative pattern completion neurons with high AUC and high node strength for each orientation of drifting gratings. Each row shows the activation of a specific neuronal ensemble (black lines). Scale bar, 200 frames. C, ROC curves of neuronal ensembles predicting different temporal frequencies (TF: 1, 2, 4, 8, and 15 Hz). Dashed line represents random classification performance. D, AUC for different TFs (classification AUC: TF 1 = 0.7906 ± 0.0301, TF 2 = 0.7158 ± 0.0407, TF 4 = 0.7009 ± 0.0323, TF 8 = 0.6212 ± 0.0335, TF 15 = 0.5642 ± 0.0288; p1,2 = 0.2277, ns; p1,4 = 0.0423*; p1,8 = 0.001**; p1,15 = 4.2583e-05***). Note that the prediction performance of CRF models trained with TF = 1 decreases for data with different TFs. E, Preferred orientation selectivity of neuronal ensembles for TF = 1 Hz. The radius of each circle depicts AUC values from 0 (center) to 1 (border). Dotted inner circles represent random performance (AUC = 0.5; n = 6 mice, 24 ensembles; Wilcoxon rank sum test).

Identification of neurons with pattern completion properties
After the successful generation of CRF models that describe different experimental conditions (vertical or horizontal drifting gratings; Fig. 2D), we investigated whether the parameters obtained from such models could be used to identify neurons with putative pattern completion properties. To identify pattern completion neurons, we set the activity of each neuron to 1 or 0 and compared the output log probability difference (probability ratio), log p^m_{i,1} − log p^m_{i,0} = log p(y^m | x^m_{∖i}, x^m_i = 1) − log p(y^m | x^m_{∖i}, x^m_i = 0), using the inferred CRF models (Fig. 3A). In other words, we tested how well each neuron predicts each model. We calculated single-neuron performance by binarizing the log probability difference (Fig. 3B) and calculating the area under the ROC curve (AUC). Because pattern completion neurons are likely to have strong functional connectivity, we used the node strength obtained from the CRF models, which represents the summation of the edge potentials (φ_11 terms) from all connecting edges for each node in the graph. In this way, strongly connected neurons have high node strength, whereas weakly connected neurons have low node strength (Fig. 3C). Finally, we defined putative pattern completion neurons as the ones that can be used to predict each experimental condition with higher performance (high AUC) and that have high node strength (Fig. 3D). It is important to highlight that some neurons could take part in several ensembles simultaneously (Carrillo-Reid et al., 2015b). Such neurons that respond to different experimental conditions have high functional connectivity but are not able to recall a specific ensemble (Carrillo-Reid et al., 2019); therefore, the use of AUC is fundamental to distinguish the neurons with high functional connectivity that can recall a specific ensemble.
Our analysis shows that neurons with putative pattern completion capability have significantly higher AUC compared with nonpattern completion (non-p.c.) neurons that have high node strength (Fig. 3E; n = 14 mice; AUC p.c. neurons = 0.8063 ± 0.1277; AUC non-p.c. neurons = 0.2520 ± 0.1649; mean ± SD; p < 0.0001; Mann–Whitney test). Noticeably, not all the neurons with high node strength are able to predict the given stimuli.

Optogenetic confirmation of pattern completion prediction using CRF models
We then tested the CRF prediction of putative pattern completion neurons using previously published data, where artificially imprinted cortical ensembles were created by the repetitive activation of a group of neurons with two-photon optogenetics and recalled by activating specific members of the ensemble, a property known as pattern completion (Carrillo-Reid et al., 2016). To do so, we used the structural and classification parameters of CRFs learned from simultaneous two-photon imaging and two-photon optogenetic experiments with single-cell resolution before and after the imprinting protocol and demonstrated that pattern completion neurons increased their connectivity after the imprinting protocol (Fig. 4A). To clearly visualize the change in graph connectivity induced by the imprinting protocol, we constructed isomorphic graphs from the CRF models and arranged them in a circular visualization (Fig. 4B). After the artificial ensemble was imprinted, neurons with pattern completion capability that were able to recall an entire ensemble showed significantly increased AUC (Fig. 4C; n = 6 mice; AUC before = 0.4570 ± 0.1256; AUC after = 0.8568 ± 0.0952; mean ± SD; p = 0.0313; Wilcoxon matched-pairs signed-rank test). These analyses demonstrate that structural and classification parameters inferred with CRFs can be used not only to identify pattern completion neurons but also to study the mechanisms involved in the reconfiguration and recall of neuronal ensembles in optogenetically perturbed microcircuits.

Figure 6. CRF efficacy to model pattern completion neurons from cortical ensembles. A, Cosine similarity of putative pattern completion neurons extracted from CRFs that have been down-sampled or up-sampled (left) or randomly sampled (right). Black triangle indicates the original ensemble size. Note that removing or adding neurons decreases the ability to predict visual stimuli. B, AUC values of down-sampled or up-sampled putative pattern completion neurons (left) or randomly sampled neurons (right). C, ROC curves of down-sampled or up-sampled neurons (left) or randomly sampled neurons (right; n = 8 mice, 28 ensembles; Wilcoxon rank sum test).

Applicability of CRF models to publicly available datasets
One of the challenges for the identification of putative pattern completion neurons is the applicability of the methods to data obtained in different laboratories under diverse experimental conditions. To explore the general applicability of our method, we analyzed publicly available datasets (Allen Brain Observatory), which contain data from layer 2/3 of primary visual cortex for several visual stimulus types recorded under different experimental settings. We generated CRF models for each of four orientations of drifting gratings at a temporal frequency (TF) of 1 Hz and compared the performance of the model for the same orientation but with different TFs. For each model, neurons with high AUCs and high node strength showed concomitant responses to the corresponding orientation (Fig. 5A). Putative pattern completion neurons were also able to define specific neuronal ensembles for each orientation (Fig. 5B). The ROC curves comparing the performance of the model with TF = 1 Hz against data obtained with the same orientation but with different TFs showed that our approach could also be used to detect changes in specific features of visual stimuli in layer 2/3 of primary visual cortex, as different TFs showed a degraded classification performance (Fig. 5C,D; classification AUC: AUC_TF=1 = 0.79 ± 0.15, AUC_TF=2 = 0.71 ± 0.21, AUC_TF=4 = 0.69 ± 0.17, AUC_TF=8 = 0.62 ± 0.17, AUC_TF=15 = 0.56 ± 0.14; p1,4 < 0.05, p1,8 < 0.01, p1,15 < 0.001; n = 5 animals, 20 ensembles; mean ± SD; Wilcoxon rank sum test). Therefore, our approach was able to find neurons with putative pattern completion properties, from a publicly available dataset, that are tuned to a particular orientation of drifting gratings for a given TF (Fig. 5E), suggesting that CRFs could be used to recall percepts with varied features.

Quantifying CRF prediction performance for ensembles
We next investigated whether the neurons highlighted by CRFs were optimal for predicting visual stimuli.
To do so, we randomly modified the population vectors containing all putative pattern completion neurons from cortical ensembles by adding or removing elements from the group and examined the stimulus prediction performance. The similarity function (Fig. 6A, left), prediction performance (Fig. 6B, left), and ROC curves (Fig. 6C, left) of population vectors formed with the detected cortical ensembles had a maximum value when the neurons depicted by CRFs were unchanged (original neurons similarity 0.2147 ± 0.0206; prediction AUC 0.8109 ± 0.0242; mean ± SEM). These results showed that population vectors formed with neurons identified by CRFs represent an efficient population to predict external visual stimuli. This raises the question of whether such population vectors are a specific nonrandom subgroup. To answer this, we randomly sampled a subset from the whole population of neurons, ranging from 10 to 190% of the total size of the neuronal ensembles. Indeed, we observed that the similarity function (Fig. 6A, right), prediction performance (Fig. 6B, right), and ROC curves (Fig. 6C, right) from random groups of neurons were significantly lower than those of the CRF models (best similarity 0.1000 ± 0.0101, AUC 0.5045 ± 0.0071; mean ± SEM), indicating that neurons identified with CRF parameters achieved a classification performance significantly better than random sets of neurons. These results suggest that targeting all the neurons that were active at different trials of a given experimental condition could reduce the accuracy of recalled ensembles.

Figure 7. Pattern completion neurons in simulated networks. A, Raster plot of simulated data sorted by Hopfield networks (100 ensembles from 10 Hopfield nets). B, Probability distribution of the simulated network. The network's response to trained stimuli was comparable to the level of activity observed in high-activity frames in our in vivo datasets (dashed line). Population activity was defined as the percentage of neurons active during any given frame. C, The distribution of cosine similarities comparing the weights of the simulated network and the CRF model for each neuron. Dashed line denotes shuffled data. D, Top-ranked neurons according to their score in both the simulated network and CRFs. Correlation coefficient and linear regression of scores calculated from the simulated network and the CRF model (r = 0.8592; r² = 0.738). E, Comparison between putative pattern completion neurons identified in simulated networks and CRFs for different set sizes. F, Pattern completion neurons from each ensemble have higher scores than nonpattern completion neurons (n = 100 ensembles from 10 Hopfield networks; p.c. neurons score = 0.7891 ± 0.0452; non-p.c. neurons score = 0.4794 ± 0.0399; p < 0.0001 (****); mean ± SD; Mann–Whitney test).

Modeling CRFs with different parameters using simulated neural networks
To test the performance of CRFs in modeling the pattern completion properties of simulated neural networks, we embedded 10 Hopfield networks within a larger network of weaker connectivity (Fig. 7A; see above, Materials and Methods). To achieve biologically plausible population vectors, we constructed our network so that the mean of neuronal activity following stimulus presentation matched the mean of neuronal activity of high-activity frames in our in vivo datasets (Fig. 7B). To replicate a plausible level of underdetermination, we modeled this simulated network at eight values of the structure-learning regularization term λ_s (7.9767e-05 to 0.1793), which controls the sparsity of the graphical structure, and two values of the parameter-learning regularization term λ_p (10 and 10,000), which controls the learned parameters and helps prevent overfitting. First, we assessed how accurately our model could describe the functional connectivity of our simulated network. To accomplish this, we used cosine similarity to compare the synaptic weight matrix of the simulated network with the edge potentials of our model. We found a striking similarity between the synaptic weights of each neuron and the respective edge potentials in the CRFs (cosine similarity = 0.9615 ± 0.0017; mean ± SEM), which was not observed in a shuffled model (cosine similarity = 0.0052; Fig. 7C). Thereafter, we investigated whether using the real connectivity of the simulated network or the learned connectivity of the model would identify the same pattern completion neurons. To compare performance using one metric, we averaged the scores for prediction performance (AUC) and normalized node strength. We found the correlation between real and modeled ensembles to be very strong (r² = 0.7380, r = 0.8592; Fig. 7D). To determine whether both methods would identify the same neurons for targeted stimulation, we compared the sets of neurons selected by each method using the Jaccard index. We found that both methods targeted similar groups of pattern completion neurons, and this relationship was maintained across variations in set size (Fig. 7E). Our analysis of these targeted groups confirmed that a handful of neurons in each ensemble possessed pattern completion capability, demonstrated by significantly higher scores for pattern completion neurons than for nonpattern completion neurons (Fig. 7F; n = 100 ensembles; p.c. neurons score = 0.7891 ± 0.0452; non-p.c. neurons score = 0.4794 ± 0.0399; p < 0.0001; mean ± SD; Mann–Whitney test), supporting the hypothesis that neurons with pattern completion capability have increased functional connectivity (Carrillo-Reid et al., 2016, 2019).
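The Jaccard comparison between the two targeted sets reduces to one line of MATLAB (the sets are assumed to be vectors of neuron indices selected by each method):

```matlab
function J = jaccard_sets(setSim, setCRF)
% setSim, setCRF: neuron indices selected from the simulated weights and from the CRF model
J = numel(intersect(setSim, setCRF)) / numel(union(setSim, setCRF));
end
```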
8586 • J. Neurosci., October 13, 2021 • 41(41):8577–8588 Carrillo-Reid et al. · Identification of Pattern Completion Neurons

completion neurons (Fig. 7F; n = 100 ensembles; p.c. neurons score = 0.7891 ± 0.0452; non-p.c. neurons score = 0.4794 ± 0.0399; p < 0.0001; mean ± SD; Mann–Whitney test), supporting the hypothesis that neurons with pattern completion capability have increased functional connectivity (Carrillo-Reid et al., 2016, 2019).

Figure 8. Modeling CRFs with different parameters. Both panels plot CPU time (seconds) against the number of population vectors (frames) for small (n = 64), medium (n = 500), and large (n = 1000) datasets. A, Run time of structural learning; each run generated 100 structures using λs values between 1e-5 and 0.5. With datasets of 1000 neurons and 10,000 population vectors (prescreened for the inclusion of only high-activity frames), run times did not surpass 5 h. B, Mean run time of parameter learning for individual models. With datasets of 1000 neurons and 10,000 population vectors (prescreened for the inclusion of only high-activity frames), run times did not surpass 10 min per model. Approximately one-half of this run time comprised block-coordinate Frank–Wolfe iterations; the remainder can be attributed to the initialization of each model and frequent reporting of progress.

Discussion

Graph theory analysis of functional connectivity in cortical microcircuits
In this study, we use a computational method, CRFs, to identify pattern completion neurons in calcium imaging recordings of mouse visual cortex in vivo. As opposed to traditional descriptive approaches for neuronal ensemble identification and network analysis, which are based on correlations between pairs of neurons, machine learning methods represent a new, empirically grounded approach to create models that aim to capture the functional structure of neuronal circuits while providing structural information about the network properties of individual neurons within a population.

In the past decades, graph theory has been applied to characterize the structure and function of neuronal networks (Achard and Bullmore, 2007; Bettencourt et al., 2007; Fair et al., 2008; Hagmann et al., 2008; Iturria-Medina et al., 2008; Supekar et al., 2008; Yu et al., 2008; Downes et al., 2012; Zuo et al., 2012; Cunningham and Yu, 2014; Chiang et al., 2016). Most of these studies used recordings across multiple brain regions (Achard and Bullmore, 2007; Fair et al., 2008; Zuo et al., 2012; Hinne et al., 2013; Chiang et al., 2016), and only a few focused on the general network properties of cortical circuits with recordings from single neurons (Bonifazi et al., 2009; Stetter et al., 2012; Gururangan et al., 2014; Yatsenko et al., 2015). The majority of methods applied to infer network properties in brain slices (Mao et al., 2001; Cossart et al., 2003; Ikegaya et al., 2004; Stetter et al., 2012; Gururangan et al., 2014) or in vivo (Yatsenko et al., 2015) operate on the correlation matrix and aim to recover the functional dependencies between observed pairs of neurons. However, these methods are model-free and therefore incapable of describing the overall network dynamics based on the probability distribution of neuronal ensembles. Our method provides an alternative by directly modeling the statistical dependencies among all the neurons. The generation of a graphical model of the circuit provides a direct link to the functional connectivity structure of the circuit, enabling targeted manipulation of its components, as opposed to dimensionality reduction or encoding methods.

CRFs graphical models identify pattern completion neurons in neuronal ensembles
Cortical ensembles are coactive neuronal populations that could represent the computational unit of brain functions (Mao et al., 2001; Bonifazi et al., 2009; Miller et al., 2014; Carrillo-Reid et al., 2015a, 2016, 2019). The structural and functional organization of brain microcircuits could be characterized by these modular properties. For this reason, it is necessary to identify pattern completion neurons in these cortical modules that can efficiently recall different brain states (Gururangan et al., 2014; Carrillo-Reid et al., 2015a).

The computational difficulty in constructing CRFs lies in recovering the global normalizer (the partition function) and the corresponding gradients. With an arbitrary graph structure, this problem is often intractable. However, recent advances that combine Bethe free energy approximation and Frank–Wolfe methods for inference and learning of model parameters allow fast and relatively accurate construction of cyclic CRFs (Tang et al., 2016). Thus, CRFs could be applied to datasets with thousands of interconnected neurons. However, for datasets with more neurons (and therefore more random variables and larger networks), CRFs (like most machine learning approaches) would require an increasingly large number of samples (population vectors) in the training dataset.

Advantages and limitations of CRFs compared with other classification algorithms
Compared with fully generative models such as Markov random fields and Bayesian networks, which make assumptions about the dependencies among all the observed variables in the model, CRFs only model the hidden system states conditioned on the observed features. Because no independence assumptions are made among observed variables, CRFs avoid potential errors introduced by unobserved common inputs. Additionally, given the finite number of network states described by population activity, the conditional distribution is sufficient for making predictions and for identifying the neurons belonging to each state. Compared with other discriminative finite-state models such as the maximum entropy Markov model (MEMM), CRFs use global normalizers to overcome the local bias induced by the local normalizers of MEMMs, and achieve higher accuracy in diverse applications (Lafferty et al., 2001). Also, compared with encoders that adjust a series of constants associated with each neuron, CRFs can be used to explore changes in functional connectivity patterns (as we demonstrate here) and their relation to learning or behavioral states. Therefore, CRFs appear promising for modeling neuronal ensemble functional connectivity and for identifying members that could be easily manipulated by two-photon optogenetics.
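For readers unfamiliar with this model class, the contrast can be made explicit. A generic pairwise CRF defines the conditional probability of hidden states y (e.g., ensemble or stimulus labels) given the observed activity x, without modeling the marginal distribution of x; the expression below is a textbook sketch of that form (Lafferty et al., 2001; Sutton and McCallum, 2012), not the exact parameterization used in this study:

% Generic pairwise CRF (illustrative form only; V, nodes; E, learned edges)
p(\mathbf{y} \mid \mathbf{x}) \;=\; \frac{1}{Z(\mathbf{x})}\,
  \exp\!\Big( \sum_{i \in V} \phi_i(y_i, \mathbf{x})
  \;+\; \sum_{(i,j) \in E} \psi_{ij}(y_i, y_j, \mathbf{x}) \Big),
\qquad
Z(\mathbf{x}) \;=\; \sum_{\mathbf{y}'} \exp\!\Big( \sum_{i \in V} \phi_i(y'_i, \mathbf{x})
  \;+\; \sum_{(i,j) \in E} \psi_{ij}(y'_i, y'_j, \mathbf{x}) \Big).

The global normalizer Z(x) is the partition function whose evaluation over cyclic graphs motivates the Bethe free energy and Frank–Wolfe approximations discussed above.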
The overall activity of multiple cells in a given time window can be understood as a multidimensional array of population vectors, where vectors pointing to a similar location in space can be considered as a group. We previously showed that population vectors defining a group (i.e., a neuronal ensemble) can be extracted from such multidimensional arrays by performing singular value decomposition (SVD; Carrillo-Reid et al., 2015a). Although SVD can identify cortical ensembles reliably, it lacks a structured graphical model that allows the systematic study of changes in network properties and the identification of neurons with pattern completion capability. One limitation of the current CRF learning algorithm is that it requires the computation of a sparse graph structure, which is less prone to overfitting the data. Certainly, because of modeling assumptions, computational considerations, and the finiteness of the training data, the learned graphical structure and parameters may not be the globally best probabilistic model of the data. However, the model is recovered efficiently, rather than requiring an exhaustive and unrealistic exploration of all possible structure and parameter combinations. Additionally, approximations during the parameter learning step can sometimes compromise global optimality guarantees.
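As a purely illustrative sketch of this population vector framework (assuming a binarized activity matrix with neurons as rows and imaging frames as columns; the variable names and the single-step grouping are ours and omit the surrogate-based thresholds and similarity measures of the published pipeline), the grouping step could be prototyped as follows:

import numpy as np

def group_population_vectors(spikes, min_active=5, n_components=3):
    # spikes: binary array (n_neurons x n_frames) of inferred activity.
    # Keep only high-activity frames (enough coactive neurons).
    active = spikes.sum(axis=0) >= min_active
    vectors = spikes[:, active].astype(float)          # n_neurons x n_selected

    # SVD of the population-vector matrix: right singular vectors weight
    # frames, left singular vectors weight neurons.
    U, S, Vt = np.linalg.svd(vectors, full_matrices=False)

    # Crude grouping: assign each selected frame to the leading component
    # on which it projects most strongly.
    labels = np.argmax(np.abs(Vt[:n_components, :]), axis=0)
    return active, labels

# Toy usage with random data standing in for binarized calcium imaging.
rng = np.random.default_rng(0)
fake_spikes = (rng.random((64, 1000)) < 0.05).astype(int)
frames, ensembles = group_population_vectors(fake_spikes)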

Applicability of CRFs to broader datasets
To assess the scalability of CRFs, we also performed benchmarking tests using sample artificial networks of 64, 500, and 1000 neurons with up to 10,000 frames of activity. We found that 100 λs runs completed in <270 min of CPU time when using the largest dataset (Fig. 8A). Parameter estimation was trivial in comparison, requiring just 600 s of CPU time per model for the largest dataset (Fig. 8B). Both structural learning and parameter estimation can be easily parallelized, by neighborhood (structural learning) and by model (parameter estimation), leading to highly efficient wall-clock run times. Furthermore, given the sequential execution of structural learning and parameter estimation, it is also possible to substitute more efficient methods of structural learning, such as Pearson's correlation, to further prune wall-clock run times. Given the efficient wall-clock run times, it is feasible for future research to consider expanding our approach to incorporate multiframe ensembles and other temporal interactions, or to use continuous (instead of binarized) neural data by replacing the linear programming solver used during parameter estimation. Finally, given the efficiency of parameter estimation, the development of equally efficient structural learning methods will allow for the analysis of streaming network data.
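Because each λs value defines an independent structure-learning run, this parallelization is straightforward. The sketch below is a minimal illustration under stated assumptions: train_structure is a hypothetical stand-in (here a thresholded correlation, substituting for the regularized neighborhood estimation), not part of any published toolbox:

from concurrent.futures import ProcessPoolExecutor
from functools import partial
import numpy as np

def train_structure(lam, data):
    # Hypothetical stand-in for one regularized structure-learning run;
    # a simple thresholded correlation returns a boolean adjacency matrix,
    # with larger lam giving sparser graphs.
    corr = np.corrcoef(data.T)            # data: n_frames x n_neurons
    np.fill_diagonal(corr, 0.0)
    return np.abs(corr) > lam

def structure_grid(data, n_lambdas=100, n_workers=8):
    # 100 lambda values between 1e-5 and 0.5, mirroring the benchmark
    # settings described above; each run executes in its own process.
    lambdas = np.logspace(-5, np.log10(0.5), n_lambdas)
    with ProcessPoolExecutor(max_workers=n_workers) as pool:
        graphs = list(pool.map(partial(train_structure, data=data), lambdas))
    return lambdas, graphs

In practice, each λs run would call the actual structure-learning routine, and the resulting family of graphs could then be screened to select a final structure before parameter estimation.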
Identification of pattern completion neurons for optogenetic targeting
One key advantage of CRFs is the ability to model the circuit explicitly, in a manner that can be used for targeted manipulation. Indeed, the results from the present work (Fig. 4) and those from our previous publication (Carrillo-Reid et al., 2019) demonstrate that neurons with high node strength and AUC values are particularly prone to be pattern completion neurons. As one practical example of an application of this method, nontargeted electrical stimulation of visual cortex has been used for decades in attempts to provide useful visual sensations to patients who have lost the functionality of their eyes (Brindley and Lewin, 1968). To improve such prostheses, one could in principle train patients using devices with a large number of electrodes (Shepherd et al., 2013). Our results suggest that, after a given network has been imprinted (Carrillo-Reid et al., 2016), the identification of neurons with pattern completion capability could be used to reduce the number of active points that require stimulation and to pinpoint them with accuracy. The further development of network models based on population activity that can predict a given set of features embedded in visual stimuli will be crucial for the efficient manipulation of cortical ensembles.
cient manipulation of cortical ensembles. Canada, October.
Cossart R, Aronov D, Yuste R (2003) Attractor dynamics of network UP states in the neocortex. Nature 423:283–288.
Cunningham JP, Yu BM (2014) Dimensionality reduction for large-scale neural recordings. Nat Neurosci 17:1500–1509.
de Vries SEJ, Lecoq JA, Buice MA, Groblewski PA, Ocker GK, Oliver M, Feng D, Cain N, Ledochowitsch P, Millman D, Roll K, Garrett M, Keenan T, Kuan L, Mihalas S, Olsen S, Thompson C, Wakeman W, Waters J, Williams D, et al. (2020) A large-scale standardized physiological survey reveals functional organization of the mouse visual cortex. Nat Neurosci 23:138–151.
Downes JH, Hammond MW, Xydas D, Spencer MC, Becerra VM, Warwick K, Whalley BJ, Nasuto SJ (2012) Emergence of a small-world functional network in cultured neurons. PLoS Comput Biol 8:e1002522.
Drew PJ, Shih AY, Driscoll JD, Knutsen PM, Blinder P, Davalos D, Akassoglou K, Tsai PS, Kleinfeld D (2010) Chronic optical access through a polished and reinforced thinned skull. Nat Methods 7:981–984.
Eguíluz VM, Chialvo DR, Cecchi GA, Baliki M, Apkarian AV (2005) Scale-free brain functional networks. Phys Rev Lett 94:018102.
Fair DA, Cohen AL, Dosenbach NU, Church JA, Miezin FM, Barch DM, Raichle ME, Petersen SE, Schlaggar BL (2008) The maturing architecture of the brain's default network. Proc Natl Acad Sci U S A 105:4028–4032.
Gerstner W, Kistler WM, Naud R, Paninski L (2014) Memory and attractor networks. In: Neuronal dynamics: from single neurons to networks and models of cognition, pp 443–457. Cambridge: Cambridge UP.
Gururangan SS, Sadovsky AJ, MacLean JN (2014) Analysis of graph invariants in functional neocortical circuitry reveals generalized features common to three areas of sensory cortex. PLoS Comput Biol 10:e1003710.
Hagmann P, Cammoun L, Gigandet X, Meuli R, Honey CJ, Wedeen VJ, Sporns O (2008) Mapping the structural core of human cerebral cortex. PLoS Biol 6:e159.
Hebb DO (1949) The organization of behavior: a neuropsychological theory. New York: Wiley.
Hinne M, Heskes T, Beckmann CF, van Gerven MAJ (2013) Bayesian inference of structural brain networks. Neuroimage 66:543–552.
Hopfield JJ (1982) Neural networks and physical systems with emergent collective computational abilities. Proc Natl Acad Sci U S A 79:2554–2558.
Ikegaya Y, Aaron G, Cossart R, Aronov D, Lampl I, Ferster D, Yuste R (2004) Synfire chains and cortical songs: temporal modules of cortical activity. Science 304:559–564.
Iturria-Medina Y, Sotero RC, Canales-Rodríguez EJ, Alemán-Gómez Y, Melie-García L (2008) Studying the human brain anatomical network via diffusion-weighted MRI and graph theory. Neuroimage 40:1064–1076.
Khazaee A, Ebrahimzadeh A, Babajani-Feremi A (2015) Identifying patients with Alzheimer's disease using resting-state fMRI and graph theory. Clin Neurophysiol 126:2132–2141.
Ko H, Hofer SB, Pichler B, Buchanan KA, Sjöström PJ, Mrsic-Flogel TD (2011) Functional specificity of local synaptic connections in neocortical networks. Nature 473:87–91.
Koller D, Friedman N (2009) Probabilistic graphical models: principles and techniques. Cambridge, MA: MIT Press.
Lafferty J, McCallum A, Pereira F (2001) Conditional random fields: probabilistic models for segmenting and labeling sequence data. In: Proceedings of the 18th International Conference on Machine Learning (ICML), pp 282–289. San Francisco, CA: Morgan Kaufmann.
Liu Y, Carbonell J, Weigele P, Gopalakrishnan V (2006) Protein fold recognition using segmentation conditional random fields (SCRFs). J Comput Biol 13:394–406.
Lorente de Nó R (1938) Analysis of the activity of the chains of internuncial neurons. J Neurophysiol 1:207–244.
Mao BQ, Hamzei-Sichani F, Aronov D, Froemke RC, Yuste R (2001) Dynamics of spontaneous activity in neocortical slices. Neuron 32:883–898.
Marr D (1971) Simple memory: a theory for archicortex. Philos Trans R Soc Lond B Biol Sci 262:23–81.
Marshel JH, Kim YS, Machado TA, Quirin S, Benson B, Kadmon J, Raja C, Chibukhchyan A, Ramakrishnan C, Inoue M, Shane JC, McKnight DJ, Yoshizawa S, Kato HE, Ganguli S, Deisseroth K (2019) Cortical layer-specific critical dynamics triggering perception. Science 365:eaaw5202.
Micheloyannis S, Vourkas M, Tsirka V, Karakonstantaki E, Kanatsouli K, Stam CJ (2009) The influence of ageing on complex brain networks: a graph theoretical analysis. Hum Brain Mapp 30:200–208.
Miller JE, Ayzenshtat I, Carrillo-Reid L, Yuste R (2014) Visual stimuli recruit intrinsically generated cortical ensembles. Proc Natl Acad Sci U S A 111:E4053–E4061.
Mukamel EA, Nimmerjahn A, Schnitzer MJ (2009) Automated analysis of cellular signals from large-scale calcium imaging data. Neuron 63:747–760.
Packer AM, Peterka DS, Hirtz JJ, Prakash R, Deisseroth K, Yuste R (2012) Two-photon optogenetics of dendritic spines and neural circuits. Nat Methods 9:1202–1205.
Packer AM, Russell LE, Dalgleish HW, Häusser M (2015) Simultaneous all-optical manipulation and recording of neural circuit activity with cellular resolution in vivo. Nat Methods 12:140–146.
Peng H-K, Zhu J, Piao D, Yan R, Zhang Y (2011) Retweet modeling using conditional random fields. Paper presented at the IEEE 11th International Conference on Data Mining Workshops, Vancouver, British Columbia, Canada, December.
Ravikumar P, Wainwright MJ, Lafferty JD (2010) High-dimensional Ising model selection using ℓ1-regularized logistic regression. Ann Statist 38:1287–1319.
Rickgauer JP, Deisseroth K, Tank DW (2014) Simultaneous cellular-resolution optical perturbation and imaging of place cell firing fields. Nat Neurosci 17:1816–1824.
Robinson NTM, Descamps LAL, Russell LE, Buchholz MO, Bicknell BA, Antonov GK, Lau JYN, Nutbrown R, Schmidt-Hieber C, Häusser M (2020) Targeted activation of hippocampal place cells drives memory-guided spatial behavior. Cell 183:1586–1599.e10.
Rolls ET, Treves A (1994) Neural networks in the brain involved in memory and recall. Prog Brain Res 102:335–341.
Sasaki T, Matsuki N, Ikegaya Y (2007) Metastability of active CA3 networks. J Neurosci 27:517–528.
Sato K, Sakakibara Y (2005) RNA secondary structural alignment with conditional random fields. Bioinformatics 21 Suppl 2:ii237–ii242.
Schaub MT, Schultz SR (2012) The Ising decoder: reading out the activity of large neural ensembles. J Comput Neurosci 32:101–118.
Schreiber S, Fellous JM, Whitmer D, Tiesinga P, Sejnowski TJ (2003) A new correlation-based measure of spike timing reliability. Neurocomputing 52-54:925–931.
Shepherd RK, Shivdasani MF, Nayagam DF, Williams CF, Blamey PJ (2013) Visual prostheses for the blind. Trends Biotechnol 31:562–571.
Shmiel T, Drori R, Shmiel O, Ben-Shaul Y, Nadasdy Z, Shemesh M, Teicher M, Abeles M (2006) Temporally precise cortical firing patterns are associated with distinct action segments. J Neurophysiol 96:2645–2652.
Sminchisescu C, Kanaujia A, Metaxas D (2006) Conditional models for contextual human motion recognition. Comput Vis Image Underst 104:210–220.
Stetter O, Battaglia D, Soriano J, Geisel T (2012) Model-free reconstruction of excitatory neuronal connectivity from calcium imaging signals. PLoS Comput Biol 8:e1002653.
Stopfer M, Jayaraman V, Laurent G (2003) Intensity versus identity coding in an olfactory system. Neuron 39:991–1004.
Supekar K, Menon V, Rubin D, Musen M, Greicius MD (2008) Network analysis of intrinsic functional brain connectivity in Alzheimer's disease. PLoS Comput Biol 4:e1000100.
Sutton C, McCallum A (2012) An introduction to conditional random fields. Found Trends Mach Learn 4:267–373.
Tang K, Ruozzi N, Belanger D, Jebara T (2016) Bethe learning of graphical models via MAP decoding. In: Artificial Intelligence and Statistics, pp 1096–1104. PMLR.
van den Heuvel MP, Stam CJ, Boersma M, Hulshoff Pol HE (2008) Small-world and scale-free organization of voxel-based resting-state functional connectivity in the human brain. Neuroimage 43:528–539.
Wang J, Zuo X, He Y (2010) Graph-based network analysis of resting-state functional MRI. Front Syst Neurosci 4:16.
Yatsenko D, Josic K, Ecker AS, Froudarakis E, Cotton RJ, Tolias AS (2015) Improved estimation and interpretation of correlations in neural circuits. PLoS Comput Biol 11:e1004083.
Yoshimura Y, Dantzker JL, Callaway EM (2005) Excitatory cortical neurons form fine-scale functional networks. Nature 433:868–873.
Yu S, Huang D, Singer W, Nikolic D (2008) A small world of neuronal synchrony. Cereb Cortex 18:2891–2901.
Zuo XN, Ehmke R, Mennes M, Imperati D, Castellanos FX, Sporns O, Milham MP (2012) Network centrality in the human functional connectome. Cereb Cortex 22:1862–1875.
