Professional Documents
Culture Documents
Data Mining in Brain Imaging
Data Mining in Brain Imaging
Data mining in brain imaging is proving to be an effective methodology for disease prognosis and
prevention. This, together with the rapid accumulation of massive heterogeneous data sets, motivates the
need for efficient methods that filter, clarify, assess, correlate and cluster brain-related information. Here,
we present data mining methods that have been or could be employed in the analysis of brain images.
These methods address two types of brain imaging data: structural and functional. We introduce
statistical methods that aid the discovery of interesting associations and patterns between brain images
and other clinical data. We consider several applications of these methods, such as the analysis of task-
activation, lesion-deficit, and structure morphological variability; the development of probabilistic atlases;
and tumour analysis. We include examples of applications to real brain data. Several data mining issues,
such as that of method validation or verification, are also discussed.
1 Introduction
Data mining in brain imaging is an emerging field of high importance for providing
prognosis, treatment, and a deeper understanding of how the brain functions. The
field of data mining addresses the question of how best to use this data to discover new
knowledge and improve the process of decision making. The discovery of associations
between human brain structures and functions (i.e. human brain mapping) has been
recognized as the main goal of the Human Brain Project,1 which is a high-priority
project funded by several government initiatives. Mining problems can be grouped in
three categories:2 identifying classifications, finding sequential patterns, and
discovering associations. Although data mining is a powerful knowledge discovery
technique, there are constraints in the way it can be applied: it is application-
dependent, different applications usually require different mining techniques, and
data must be of a certain size and format.3 In this paper we survey current mining
methods, give a critical review of the main computational obstacles that lie behind our
ability to perform automatic data mining on brain imaging and propose some
solutions.
There are various problems in mining of brain images that need to be addressed.
The first problem is that most fundamental mining algorithms (rule-based learning
systems, neural networks, decision trees, Bayesian networks, logistic regressions, and
so on), which have been used with great success in medicine, assume that data sets
contain only simple numeric and symbolic entries. It is important, therefore, to
Address for correspondence: V Megalooikonomou, Department of Computer and Information Sciences, Temple
University, 314 Wachman Hall, Philadelphia, PA 19122, USA. E-mail: vasilis@cis.temple.edu
types of data. We exploit this fact and consider methods that combine the information
from image and behavioural data about the brain and present methods for developing
probabilistic brain atlases. The results of these methods are new representations of the
information content of brain images and statistical maps.
The rest of this paper is organized as follows: in Section 2 we present the
preprocessing phase of data mining in brain imaging, including the segmentation and
spatial normalization of images. Although smoothing and reduction of noise can be
considered to be part of preprocessing, here we include them as part of the mining
methods in Section 3. In Section 3 we present mining methods that have been used in
structural and functional imaging. These methods are useful for: (a) the efficient
discovery of associations between structures and functions; (b) the classification of
structural information, including both normal structures and abnormalities such as
tumours; and (c) recently, the discovery of associations between gene expressions,
morphology, and function. In Section 4 we present important issues in mining of brain
images that are common to structural and functional brain data. We also consider the
problem of verification of mining methods. This review concludes with a discussion in
Section 5.
2 Data preprocessing
After image data is collected for each subject and before mining is performed, the data
has to pass through a preprocessing phase. This phase identifies and normalizes the
brain objects to be stored in a database and mined later. In anatomically related
studies, after an anatomical (i.e. structural) image is collected for each subject, each
lesion, area, or structure of interest is delineated (segmented) as a region of interest
(ROI) on each slice automatically, semi-automatically, or manually. Extensive work
has been done on automating image segmentation.18–21 The first segmentation
methods were solely intensity based.22 However, since many structures were not
distinguishable on the basis of signal intensity alone, prior spatial information was
incorporated either in the form of intensity gradients,23 prior spatial probability
distribution over signal intensity,18,23,24 or by registering the image to a segmented
atlas25–27 using a spatial probability map for each voxel.28
Using the slice data, each ROI is reconstructed in three dimensions. Then, in both
functional and anatomical images, normalization or image registration has to be
performed to make image data comparable across subjects without morphological and
acquisition variability. This process maps homologous anatomical regions to the same
location in a stereotaxic space, such as the Talairach anatomical atlas.29 Several linear
and nonlinear spatial transformations have been developed to bring the 3-D atlas and
the subject’s 3-D image into register, i.e. spatial coincidence.27,30–32 As an example, the
effect of registration of an MR image to the Talairach atlas using a nonlinear method
based on a 3-D elastically deformable model32 is presented in Figure 1.
In addition to 3-D image data, modern brain image databases usually contain a set
of generic anatomic atlases of the human brain that model the exact shapes and
positions of anatomical structures. A raw MR or fMR image does not identify the
structure to which each voxel (3-D volume element) belongs, but an anatomical atlas
362 V Megalooikonomou et al.
can supply this information (with the accuracy of the registration methods) when
overlaid on the image (see Figure 1). A variety of brain structure maps have been
derived, at several spatial scales, from 3-D tomographic images,33 anatomic speci-
mens,29,34,35 and a variety of histologic preparations that reveal regional cyto-
architecture36 and molecular content. Other brain maps have concentrated on
function,37,38 or neuronal connectivity and circuitry.39,40 Here, for clarity of
presentation, we concentrate on the widely used Talairach atlas.
3 Data mining
We first present methods for discovering associations between brain functions and
structures. Traditionally, two approaches have been employed for functional brain
mapping. The first approach seeks associations between lesioned structures and con-
comitant neurological or neuropsychological deficits – for example, between trauma
lesions and deficits like left visual field deficit. The second approach measures brain
activation in subjects as they are asked to perform certain tasks. We present methods
for both approaches. The problem of efficiently finding similar ROIs in brain image
databases is also addressed. We present methods for extracting knowledge about the
morphological variability of brain structures and about abnormalities such as
tumours. Methods that can be potentially applied to both structural and functional
imaging are also presented.
Figure 2 Statistical parametric map. These displays of a statistical parametric map (SPM), produced using the
SPM99 software package, show two views (one overlaid on a surface map, the other an axial slice) of 3D brain
activations for a simple motor task (courtesy Brain Imaging Lab, Dartmouth-Hitchcock Medical Center).
Data mining in brain imaging 365
subjects.55,67 One problem here is that when using voxel data, comparison of different
scans may not align activations that are slightly misregistered. Recently, the spatial
extent, i.e. the area of activation, has been used to detect significant regions of
activation. If a spatial extent of activation is reported, however, it is sometimes only
given as a voxel count above a threshold, which has been found to be a very unstable
indicator of activation across trials.68
Another issue of concern in functional imaging is the consistency of observations
across subjects. It is common for some activations during a study to be similar across
subjects (and thus significantly correlated with the task under study), and for others to
be idiosyncratic. There has been some report in the literature about the difficulty of
reproducing voxel level significance maps in fMRI.68,69 In the case of Tegeler,69 even
the 2% most significant voxels were found to vary considerably across runs, subjects,
and analysis techniques. Variability in activation maps is a concern; however,
reproducibility of fMRI activations at a regional level has been found to be good in
general across sites, subjects, and techniques,70 and comparisons of signal changes
rather than significance values may address problems in voxel-level comparisons.68
A third issue in functional imaging is the choice of analysis methods for generating
activation data. Several reports have substantiated the difference in activations
observed using different analysis techniques.69,71,72 These may be a concern in a large
database system. However, if results are tagged according to the analysis technique
and parameters used to generate them, multiple analysis methods may actually be
beneficial in supporting a ‘pluralistic’ strategy for analysis.72
A fourth issue in functional imaging is the necessity of limiting assumptions in
analysis. For example, the SPM model assumes a uniform distribution of noise with
covariance between voxels estimated by a Gaussian distribution that decays with
increasing distance, and unfortunately not all data conforms well to this noise model
(noise from unmodelled biological variability such as venous activity may not, for
example). It is difficult to assess the extent that conditions deviate from the model in
practice,52 but tests with synthetic data have demonstrated large biases in the false
positive rate73 as a result of low frequency physiological fluctuations and a variation of
signal-to-noise ratio with imaging rate. Techniques for correcting long-term changes
in mean signal intensity can improve the sensitivity of statistical analysis in this
particular case.74
ADHD+
ADHD-
two-way contingency tables are constructed and for each the Fisher exact test98 is
computed. The associations between structures and deficits are sorted in order of the
p-values returned from the exact tests, and the ones with the lowest p-values are
reported. For the same type of analysis with continuous structural variables one can
use the Mann–Whitney test and logistic regression analysis (the Mann–Whitney stati-
stic is appropriate because the distributions of the fractions of lesioned volumes are
not Gaussian). In exploratory analysis of either categorical or continuous structural
variables, computing a statistic for many pairwise tests leads to the multiple com-
parison problem, i.e. the situation where a certain undesirably high number of the
tests are expected to be positive by chance (see Section 4.1 for a more complete
treatment of the multiple comparison problem).
3.2.1.2 Voxel-based analysis. Atlas-based analysis results are only as good as the atlas
that is being used. Instead of imposing any high level structure on the image data, one
can analyse them on a voxel-by-voxel basis. Voxels are typically labelled as either
normal or abnormal, and thus structural variables are in this case categorical. Given
that the number of voxels that are considered is typically on the order of 107 , a like
number (i.e. 107 ) of Fisher exact tests have to be performed for each of the functional
variables that are examined. This procedure can be seen as clustering the voxels by
functional association.97 The calculation of the contingency table and the Fisher exact
test is computationally intensive, and the multiple-comparison problem is also severe
due to the large number of tests that must be performed. However, in this case it can
be attacked with clustering analysis since false positives will not tend to cluster.
Voxel-based regression analysis can also be used to determine whether voxels in a
certain region are associated with a functional variable. One can construct a regression
equation that relates lesions in a sphere of a given radius and centre to a deficit, and
the ‘causal brain region’ in which lesions are most strongly associated with that deficit
can then be identified.13,97 Let l be a lesion, o a sphere, vðrÞ the volume of a region r,
and iðr1 , r2 Þ the intersection of two regions r1 and r2 . Then the identification of the
causal region is done by calculating the optimal centre and radius for the logistic
regression equation:
log itðdÞ ¼ log ðoddsd Þ ¼ afs þ b
Data mining in brain imaging 369
where oddsd ¼ pd ðÞ=ð1 ÿ pd ðÞÞ, pd ðÞ is the probability of having a certain deficit d,
fs ¼ vðiðl; oÞÞ=vðoÞ is the fraction of the sphere that is lesioned, a=(log odds of d)/
(lesioned fraction of sphere volume), and b is the prior log odds of deficit d.
Given the centre ðx; y; zÞ and the radius r of the sphere, one can find values for the
parameters a and b such that the sum of squares of residuals is minimized. The goal is
to optimize the sphere parameters ðx; y; z; rÞ to obtain the best fit of the data to a
regression line. The solution is the sphere that best discriminates between lesions that
are and are not associated with deficit d. This nonlinear optimization procedure is
computationally intensive, and cannot describe multifocal functional associations.
3.2.1.3 Results from mining lesion-deficit associations. In this section we present results
from the mining process in BRAID.13,14 The Brain Image Database includes images
and clinical information from over 700 subjects from two different studies: the
Cardiovascular Health Study (CHS)99 and the Frontal Lobe Injury in Children
(FLIC) study.100 Visualization applied prior to the analysis procedure can help direct
the analysis by choosing certain structures of the anatomical atlas to examine further
using the statistical tests. Figure 3 shows the sum of lesions over all subjects that did
and did not develop ADHD (Attention-Deficit Hyperactivity Disorder), i.e. ADHD+
and ADHDÿ, respectively. Based on these images and on previous research
implicating a frontal lobe-basal ganglia-thalamic pathway, the right putamen and
the left thalamus (highlighted in Figure 3)y were chosen for further analysis using the
Fisher exact test for categorical and the Mann–Whitney test for continuous structural
variables. The p-values in Table 1 confirm a strong association between lesions in the
two structures and development of ADHD.
Running an exploratory analysis on the CHS data set (300 subjects) using the chi-
square test to evaluate two-way contingency tables for all pairwise combinations of
atlas structures (90) and functional variables (14) returns a list (sorted by p-value) of
structure–function associations.13,97 The five most significant associations are
presented in Table 2. Highly significant lesion-deficit associations detected by BRAID,
such as visual field deficit and lesions in contralateral orbital or cuneate gyrus, are also
consistent with current clinical knowledge.101 The incorrect association between the
left hippocampus and a right visual field deficit is due to registration error, since the
hippocampus is next to the optic radiations that are very well known to be correlated
with a visual field deficit.
Preliminary stepwise logistic regression analysis using continuous structural
variables from the FLIC data set show similar results for the development of ADHD.
This method identifies the left SupCerebellarA (which is lateral to the left putamen
area) as a strong predictor. Results from a preliminary voxel-based analysis for the
ADHD variable of the FLIC data set are presented in Figure 4(a). Each voxel
represents the p-value for the association between the voxel being lesioned and the
development of ADHD. A 3-D reconstruction is shown in Figure 4(b). These results
yDue to the compromised connections between the frontal lobe and these two structures, it is believed that the frontal
lobe is not able to exert its normal oversight function to suppress impulsive urges and behaviours. A common
behavioural pattern in patients with ADHD is impulsivity and lack of self-control.
370 V Megalooikonomou et al.
p-value
(a)
(b)
Figure 4 Voxel-based analysis for development of ADHD. Six slices (a) of the Talairach atlas and a colour bar
that shows the correspondence between p-values and colour values are shown. (b) Visualization of the voxel-
based analysis p-value volume for development of ADHD. The higher the intensity the lower the p-value. Picture
from Megalooikonomou et al.97
are consistent with those of the atlas-based analysis. Figure 5 shows one representative
slice (119) of the Talairach atlas for the voxel-based regression analysis for ADHD.
These results are consistent with all the previous ones for ADHD.
Data mining in brain imaging 371
Table 1 Visualization directed mining. Statistical analysis of selected Talairach atlas structures for association
with ADHD (FLIC data set)97
Table 2 Explorative analysis The five most significant structure-function associations given by the chi-square
analysis on the CHS data set97
(see Figure 7). It is then compared to an average parametric mesh that is formed by
calculating the mean and variation between corresponding points on the mesh.
Finally, displacement vectors are generated for each individual structure. A local
profile of change in structures in certain conditions can be provided through colour-
coded topographic maps (see Figure 8). This method first aligns each brain volume
using distance scaling to control for head size differences, allowing for inter-individual
and group comparisons. A strategy for creating a population-based brain atlas using
(a)
(b) (c)
Figure 6 Morphological variability of the corpus callosum between women and men for a group of elderly
subjects. The posterior part (in white) was found to be significantly larger in women than in men (a). The
average shape of the corpus callosum for (b) men and (c) women in a study by Davatzikos et al.106 Pictures
from Davatzikos et al.106
Data mining in brain imaging 373
Figure 7 Extracting meshes (a) to create a cortical surface database, to search for differences where the
deformation is regarded as an observation from a random vector field. Variability is calculated based on 3D
displacement maps, which locally encode the amount of deformation required (b) to drive each subject’s gyral
pattern into exact correspondence with the average cortex for the group. Pictures from Thompson et al.184
3.2.2.1 Morphological analysis of tree-like structures. Another tool for the analysis of
brain structure and function is through the morphological characterization of
neurological brain structures. Tree-like structures, such as nerve-fibre tracing in
374 V Megalooikonomou et al.
(a) (b)
Figure 9 Creating a population-based brain atlas to quantify local structural variations. A family of high-
dimensional volumetric warps relating a new 3D MRI scan to each normal scan in a brain image database is
calculated (I–II, above). The resulting warps encode the distribution in stereotaxic space of anatomic points that
correspond across a normal population (III), and their dispersion is used to determine the likelihood (IV) of local
regions of the new subject’s anatomy being in their actual configuration. Colour-coded topographic maps
highlight regional patterns of deformity in the anatomy of the new subject. Abnormal structural patterns are
quantified locally, and mapped in three dimensions. Pictures from Thompson et al.185
Data mining in brain imaging 375
Figure 10 Query tumour images (left column) and their nearest neighbours, with respect to morphological
distance. Picture from Korn et al.130
processing different kinds of stimuli. However, considering that multiple genes can be
expressed in the same brain location, and that the time sequence of gene expression
may also be important, makes the problem even more challenging.
where Bj is the index of behavioural functioning for a given region, W ði; jÞ is the
weight assigned to the jth brain region for the ith behavioural score, and Si is the test
score. The method was demonstrated on a sample of hemi-Parkinson patients151 and
later used to examine the sensitivity of cognitive test scores to lesions in specific ROIs,
inter-expert agreement, and intra-expert reliability.152 The method can be used to
relate cognitive test scores to the results of structural and functional imaging, and has
great potential for integrative data mining. Turkheimer et al.187 also quantitatively
examined the relationship between neuropsychological test scores and lesion locations
on structural neuroimaging.
discounted. Taking advantage of this kind of approach increases sensitivity, but it also
adds risk of error at the cluster level (rather than only at the voxel level).84 The overall
error rate can still be controlled, so the net effect is to reduce errors.
1 Strong 0/1
2 Moderate 0.25/0.75
3 Weak 0.49/0.51
380 V Megalooikonomou et al.
conditional probability for each function variable was determined by table lookup.
This probability was then used to generate the binary vector for the function variables,
and Fisher’s exact test of independence was applied to each structure-function pair.
(a)
(b)
Figure 11 Evaluating a mining method. Performance of the Fisher exact test (p 0.001) for (a) uniform (0.5)
prior probabilities and (b) data-derived prior probabilities of structure abnormality, for the three strengths of
lesion-deficit associations from Table 3 that correspond to strong (case 1), moderate (case 2) and weak (case 3)
associations. The difference between the total number of associations detected and the number of true
associations detected is the number of false-positive associations detected for each case. The horizontal line in
(a) represents the total number of simulated edges (69) and in (b) represents the total number of simulated
edges that can be detected (55). Graphs from Megalooikonomou et al.177
382 V Megalooikonomou et al.
(a)
(b)
Figure 12 (a) Evaluating a mining method. Performance of the Fisher exact test (p 0.001) for BNs with degree
4 with 20, 40 and 80 edges. (b) Performance of the Fisher exact test (p 0.001) for BNs with 48 edges, and
with degree 4, 6, and 8. Graphs from Megalooikonomou et al.177
Data mining in brain imaging 383
behavioural variables. On average the specific registration method used reduces the
number of associations discovered by 13% for the same number of subjects when
compared with perfect registration.
5 Concluding remarks
In this review we have presented data mining methods that have been or could be used
for knowledge discovery from brain images of different modalities along with other
clinical data. We have focused on the problems of: (1) finding associations between
structures and functions through task-activation and lesion-deficit studies, (2)
studying the morphological variability of brain structures and finding associations
with certain conditions, (3) classifying shapes of brain structures, including tree-like
structures such as nerve fibres and abnormalities such as tumours, and searching for
similarity, and (4) finding associations between gene expressions, morphology, and
function. We have presented results of applying mining methods to epidemiological
data that demonstrate detection of several clinically meaningful associations in
different studies. These methods can lead to interesting conclusions about the funct-
ional mapping of the human brain, the effect of lesions or other abnormalities in the
development of neurological and neuropsychological deficits, and the effect of certain
diseases and gene expressions on structural morphology and function.
Visualization can help reduce the inherently enormous search space in statistical
analysis. Exploratory analysis through the use of a statistic for many tests produces
reasonable results, although one has to deal with the multiple-comparison problem.
Voxel-based approaches show encouraging results, but are computationally intensive
and even more severely impacted by the multiple comparison (although the latter can
be addressed with clustering analysis). Statistical simulations show that more
advanced mining methods and large sample sizes are required to determine lesion-
deficit associations accurately, with reduced number of false positive associations.
Simulators can be used for verifying and comparing mining methods in brain
imaging. Their use is very important especially in determining the number of subjects
needed to detect all associations while reducing false positives. In particular, in lesion-
deficit analysis, simulators have shown that the number of subjects required to detect
all and only those associations in the underlying model (i.e. the ground truth) may be
in the thousands, even for strong associations, particularly if the spatial distribution of
lesions does not extend to all structures. The more one descends from the 0.5 level for
prior probabilities, the more difficult it becomes to discover associations. These results
underline the necessity of developing large image databases for the purpose of meta-
analysis of data pooled from multiple studies, so that more meaningful results can be
obtained. The testing procedure framework is very important, since it can be used to
characterize the power of methods for detecting multivariate associations while taking
into account the effects of registration and noise. Simulators can also be used in the
evaluation of new analysis methods, as well as in the study of the effect of different
registration and segmentation algorithms.
Existing mining algorithms are limited in that they typically assume data will
consist of individual numeric and symbolic features. We still lack effective algorithms
384 V Megalooikonomou et al.
for learning from data that is represented as a combination of various types (i.e.
multimedia data). Predictions based on the full medical record could potentially
achieve much greater accuracy than those that are limited to one data type. In
addition, prediction accuracy can be improved by inventing more appropriate features
to describe the brain data. We need new methods that actively generate optimal
experiments to collect the most informative data. Another obstacle is integrating data
from different investigators and analysing them jointly. Brain imaging data are
usually collected in a single database for a specific study and with a specific data
mining task in mind, so an additional important issue is interoperability and the
ability to learn from multiple databases.178 Also, the mining algorithms developed so
far tend to be fully automated and therefore do not allow active experimentation, i.e.
guidance from experts at key stages in the search for brain data regularities. Ideally,
human experts should be able to collaborate closely with a mining algorithm to form
hypotheses and test them against the data. In addition, mining methods need to be
able to scale to extremely large data sets. Research during the past few years has
already produced more efficient algorithms for such problems as learning association
rules2 and efficient visualization of large data sets.179 A closer integration of machine
learning algorithms into database management systems is also needed.
Acknowledgements
The authors wish to thank Christos Davatzikos, Eddie Herskovits, Christos
Faloutsos, Paul Thompson, David Isecke, Ling Cheng, and Tilmann Steinberg for
providing pictures, comments, and other helpful information. This work was
supported in part by the Ira DeCamp Foundation, NARSAD and New Hampshire
Hospital. Support was also provided by the Dartmouth Experimental Visualization
Laboratory (DEVLAB).
References
1 Koslow SH, Huerta MF eds. Neuroinformatics: distribution analysis of subtracted PET
an overview of the Human Brain Project. images. Journal of Cerebral Blood Flow
Mahway, NJ: Lawrence Erlbaum, 1997. Metabolism 1988; 8: 642–53.
2 Agrawal R, Imielinski T, Swami A. Database 7 Fox P. Functional brain mapping with
mining: a performance perspective. IEEE positron emission tomography. Seminars in
Transactions on Knowledge and Data Engineering Neurology 1989; 9: 323–9.
1993; 5: 914–25. 8 Fox P, Mintum M. Noninvasive functional
3 Huerta MF, Koslow SH, Leshner AI. The brain mapping by change-distribution
Human Brain Project: an international analysis of averaged PET images of H2150
resource. Trends in Neuroscience 1993; 16: 436–
tissue activity. Journal of Nuclear Medicine
38.
1989; 30: 141–49.
4 Mitchell TM. Machine learning and data
9 Fox P. Physiological ROI definition by image
mining. Communications of the ACM 1999; 42:
31–36. subtraction. Journal of Cerebral Blood Flow
5 Anderson S, Damasio H. Neuropsychological Metabolism 1991; 11: A79–82.
impairments associated with lesions caused by 10 Evans A, Beil C, Marrett S, Thompson C,
tumor or stroke. Archives in Neurology 1990; 47: Hakim A. Anatomical–function correlation
397–405. using an adjustable MRI-based region of
6 Fox P, Mintum M, Reiman E, Raichle M. interest atlas with positron emission
Enhanced detection of brain responses using tomography. Journal of Cerebral Blood Flow
intersubject averaging and change- Metabolism 1988; 8: 513–30.
Data mining in brain imaging 385
11 Evans A, Marrett S, Torrescorzo J, Ku S, Transactions on Medical Imaging 1997; 16: 878–
Collins L. MRI–PET correlation in three 86.
dimensions using a volume-of-interest (VOI) 24 Chang M, Sezan M, Tekalp A, Berg M.
atlas. Journal of Cerebral Blood Flow Metabolism Bayesian segmentation of multislice brain
1991; 11: A69–78. magnetic resonance imaging using three-
12 Arya M, Cody W, Faloutsos C, Richardson J, dimensional Gibbsian priors. Optical
Toga A. A 3D medical image database Engineering 1996; 35: 3206–21.
management system. International Journal of 25 Collins D, Evans A. ANIMAL: validation and
Computerized Medical Imaging and Graphics applications of nonlinear registration-based
1996; 20: 269–84. segmentation. International Journal of Pattern
13 Letovsky SI, Whitehead SH, Paik CH et al. A Recognition and Artificial Intelligence 1997; 11:
brain image database for structure–function 1271–94.
analysis. American Journal of Neuroradiology 26 Gee J, Reivich M, Bajcsy R. Elastically
1998; 19: 1869–77. deforming 3D atlas to match anatomical brain
14 Herskovits EH, Megalooikonomou V, images. Journal of Computed Assisted
Davatzikos C, Chen A, Bryan RN, Gerring JP. Tomography 1993; 17: 225–36.
Is the spatial distribution of brain lesions 27 Miller M, Christensen G, Amit Y, Grenander
associated with closed-head injury predictive U. Mathematical textbook of deformable
of subsequent development of attention-deficit neuroanatomies. Proceedings of the National
hyperactivity disorder?: Analysis with brain- Academy of Sciences 1993; 90:11944–48.
image database. Radiology 1999; 213: 389–94. 28 Kamber M, Shingal R, Collins D, Francis G,
15 Nielsen FA, Hansen LK. Modeling of Evans A. Model-based 3-D segmentation of
brainmap data. In: NIPS ‘99. Denver, multiple sclerosis lesions in magnetic
Colorado; 1999. resonance brain images. IEEE Transactions on
16 Nowinski WL, Fang A, Nguyen BT et al. Medical Imaging 1995; 14: 442–53.
Multiple brain atlas database and atlas-based 29 Talairach J, Tournoux P. Co-planar stereotaxic
neuroimaging system. Computer Aided Surgery atlas of the human brain. Stuttgart: Thieme,
1997; 2: 42–66. 1988.
17 Levrier O, Poline J, Tzourio N, Mazoyer B, 30 Bookstein F. Principal warps: thin-plate
Salamon G. Individual functional splines and the decomposition of
neuroanatomy using PET–MRI integration. deformations. IEEE Transactions on Pattern
In: 31st Annual Meeting of the American Society Analysis and Machine Intelligence 1989; 11: 567–
of Neuroradiology. Vancouver, BC, 1993. 85.
18 Rajapakse J, Giedd J, Rapoport J. Statistical 31 Collins D, Neelin P, Peters T, Evans A.
approach to segmentation of single-channel Automatic 3D intersubject registration of MR
cerebral MR images. IEEE Transactions on volumetric data in standardized Talairach
Medical Imaging 1997; 16: 176–86. space. Journal of Computer Assisted Tomography
19 Pal N, Pal S. A review on image segmentation 1994; 18: 192–205.
techniques. Pattern Recognition 1993; 26: 1277– 32 Davatzikos C. Spatial transformation and
94. registration of brain images using elastically
20 Zhang Y. A survey on evaluation methods for deformable models. Computer Vision and Image
image segmentation. Pattern Recognition 1996; Understanding 1997; 66: 207–22.
29: 1335–46. 33 Damasio H. Human brain anatomy in
21 Worth A, Makris N, Caviness V, Kennedy D. computerized images. Oxford: Oxford University
Neuroanatomical segmentation in MRI: Press, 1995.
Technological objectives. International Journal 34 Talairach J, Szikla G. Atlas d’anatomie
of Pattern Recognition and Artificial Intelligence stereotaxique du telencephale: etudes anatomo-
1997; 11: 116–87. radiologiques. Paris: Masson, 1967.
22 Vannier M, Butterfield R, Rickman D, Jordan 35 Ono M, Kubik S, Abernathey C. Atlas of the
D, Murphy W, Biondetti P. Multispectral cerebral sulci. Stuttgart: Thieme, 1990.
magnetic resonance image analysis. CRC 36 Brodmann K. Vergleichende
Critical Reviews in Biomedical Engineering 1987; Lokalisationslehre der Grosshirnrinde in
15: 117–44. ihren Principien dargestellt auf Grund des
23 Held K, Korps E, Krause B, Wells W, Kikinis Zellenbaues, Barth, Leipzig. In: Some Papers
R, Muller-Gartner H. Markov random field on the Cerebral Cortex. Springfield, IL:
segmentation of brain MR images. IEEE Thomas, 1960: 201–30.
386 V Megalooikonomou et al.
37 Minoshima S, Koeppe R, Frey K, Ishihara M, 50 Lindeberg T, Lidberg Par, Roland PE.
Kuhl D. Stereotactic PET atlas of the human Analysis of brain activation patterns using a 3-
brain: aid for visual interpretation of D scale-space primal sketch. Human Brain
functional brain images. Journal of Nuclear Mapping 1999; 7:166–94.
Medicine 1994; 35: 949–54. 51 Lowe MJ, Sorenson JA. Spatially filtering
38 Bihan DL. Functional MRI of the brain: functional magnetic resonance imaging data.
principles, applications and limitations. Magnetic Resonance in Medicine 1997; 37: 723–
Neuroradiology 1996; 23: 1–5. 29.
39 Essen DV, Maunsell J. Hierarchical 52 Raz J, Turetsky B. Wavelet ANOVA and
organization of functional streams in the fMRI. In: Wavelet Applications in Signal and
visual cortex. Trends in Neurological Sciences Image Processing VIII. Denver, Colorado, 1999.
1983; 6: 370–75. 53 Sijbers J, Dekker AJd, Van der Linden A,
40 Wible CG, Shenton ME, Fischer IA et al. Verhoye TM, Van Dyck D. Adaptive
Parcellation of the human prefrontal cortex anisotropic noise filtering for magnitude MR
using MRI. Psychiatry Research 1997; 76: 29– data. Magnetic Resonance Imaging 1999; 17:
40. 1533–39.
41 Ogawa S, Menon RS, Kim SG, Ugurbil K. On 54 Skudlarski P, Constable RT, Gore JC. ROC
the characteristics of functional magnetic analysis of statistical methods used in
resonance imaging of the brain. Annual Review functional MRI: individual subjects.
of Biophysical and Biomolecular Structures 1998; NeuroImage 1999; 9: 311–32.
27:447–74. 55 Coulon O, Mangin J-F, Poline J-B, Frouin V,
42 Bahn MM. A linear relationship exists among Bloch I. Structural group analysis of
brain diffusion eigenvalues measured by functional maps. In: Kuba MSaAT-P A ed.
diffusion tensor magnetic resonance imaging. 16th International Conference on Information
Journal of Magnetic Resonance 1999; 137: 33–38. Processing in Medical Imaging. Visegrad, 1999:
43 Friston KJ, Williams S, Howard R, 448–53.
Frackowiak RSJ, Turner R. Movement related 56 Worsley KJ, Marrett S, Neelin P, Evans AC.
effects in fMRI time series. Magnetic Resonance Searching scale space for activation in PET
in Medicine 1996; 35: 346–55. images. Human Brain Mapping 1996; 4: 74–90.
44 Thacker NA, Burton E, Lacey AJ, Jackson A. 57 Poline J-B, Mazoyer BM. Enhanced detection
The effects of motion on parametric fMRI in brain activation maps using a multifiltering
analysis techniques. Physiological Measurement approach. Journal of Cerebral Blood Flow
1999; 20: 251–63. Metabolism 1994; 14: 639–42.
45 Ashburner J, Friston KJ. The role of 58 Brammer MJ. Multidimensional wavelet
registration and spatial normalization in analysis of functional magnetic resonance
detecting activations in functional imaging. images. Human Brain Mapping 1998; 6: 378–82.
Clinical MRI/Developments in MR 1997; 7: 26– 59 Jansen M, Uytterhoeven G, Bultheel A. Image
8. de-noising by integer wavelet transforms and
46 Kim B, Boes JL, Bland PH, Chenevert TL, generalized cross validation. Medical Physics
Meyer CR. Motion correction in fMRI via 1999; 26: 622–30.
registration of individual slices into an 60 Ruttimann UE, Unser M, Rawlings RR et al.
anatomical volume. Magnetic Resonance in Statistical analysis of functional MRI data in
Medicine 1999; 41: 964–72. the wavelet domain. IEEE Transactions on
47 Worsley KJ, Marrett S, Neelin P, Vandal AC, Medical Imaging 1998; 17: 142–54.
Friston KJ, Evans AC. A unified statistical 61 Zaroubi S, Goelman G. Complex denoising of
approach to determining significant signals in MR data via wavelet analysis: application for
images of cerebral activation. Human Brain functional MRI. Magnetic Resonance Imaging
Mapping 1996; 4: 58–73. 2000; 18: 59–68.
48 Descombes X, Kruggel F, Cramon DYv. fMRI 62 Kruggel F, Cramon DYv. Physiologically
signal restoration using a spatio-temporal oriented models of the hemodynamic response
Markov random field preserving transitions. in functional MRI. Lecture Notes in Computer
NeuroImage 1998; 8: 340–49. Science. Berlin: Springer, 1999.
49 Descombes X, Kruggel F, Cramon DYv. 63 Kruggel F, Cramon DYv. Modeling the
Spatio-temporal fMRI analysis using Markov hemodynamic response in single-trial
random fields. IEEE Transactions on Medical functional MRI experiments. Magnetic
Imaging 1998; 17: 1028–39. Resonance in Medicine 1999; 42: 787–97.
Data mining in brain imaging 387
64 Friston KJ, Holmes AP, Worsley KJ, Poline 77 Owen CB. Multiple media correlation: theory and
JP, Frith CD, Frackowiak RSJ. Statistical applications. Hanover: Dartmouth College,
parametric maps in functional imaging: a 1998.
general linear approach. Human Brain 78 Owen CB, Makedon F. Computed
Mapping 1995; 2: 189–210. synchronization for multimedia applications.
65 Friston K. Statistical parametric mapping and Dordrecht: Kluwer, 1999.
other analyses of functional imaging data. In 79 Worsley KJ, Poline JB, Friston KJ, Evans AC.
Toga A, Mazziotta J eds. Brain mapping: the Characterizing the response of PET and fMRI
methods. San Diego, CA: Academic Press, data using multivariate linear models.
1996. NeuroImage 1997; 6: 305–19.
66 Friston KJ, Holmes AP, Price CJ, Buchel C, 80 Worsley KJ, Poline J-B, Vandal AC, Friston
Worsley KJ. Multisubject fMRI studies and KJ. Tests for distributed, non-focal brain
conjunction analyses. NeuroImage 1999; 10: activations. NeuroImage 1995; 2: 183–94.
385–96. 81 Frank LR, Buxton RB, Wong EC.
67 Bosch V. Statistical analysis of multi-subject Probabilistic analysis of functional magnetic
fMRI data: assessment of focal activations. resonance imaging data. Magnetic Resonance in
Journal of Magnetic Resonance Imaging 2000; 11: Medicine 1998; 39: 132–48.
61–64. 82 Fu Z, Hui Y, Liang Z-P. Joint spatiotemporal
68 Cohen MS, DuBois RM. Stability, statistical analysis of functional MRI data. In:
repeatability, and the expression of signal Proceedings of IPCIP’98 International
magnitude in functional magnetic resonance Conference on Image Processing, 1998: 709–13.
imaging. Journal of Magnetic Resonance Imaging 83 Svensen M, Kruggel F, Von Cramon DY.
1999; 10: 33–40. Markov random field modelling of fMRI data
69 Tegeler C, Strother SC, Anderson JR, Kim SG. using a mean field EM-algorithm. In Pelillo
Reproducibility of BOLD-based functional ERHaM ed. Second International Workshop on
MRI obtained at 4 T. Human Brain Mapping Energy Minimization Methods in Computer
1999; 7: 267–83. Vision and Pattern Recognition. York. London:
70 Casey BJ, Cohen JD, O’Craven K et al. Springer, 1999: 317–30.
Reproducibility of fMRI results across four 84 Poline J-B, Worsley KJ, Evans AC, Friston KJ.
institutions using a spatial working memory Combining spatial extent and peak intensity
task. NeuroImage 1998; 8: 249–61. to test for activations in functional imaging.
71 Constable RT, Skudlarski P, Mencl E et al. NeuroImage 1997; 5: 83–96.
Quantifying and comparing region-of-interest 85 Worsley KJ, Cao J, Paus T, Petrides M, Evans
activation patterns in functional brain MR AC. Applications of random field theory to
imaging: methodology considerations. functional connectivity. Human Brain Mapping
Magnetic Resonance Imaging 1998; 16: 289–300. 1998; 6: 364–67.
72 Lange N, Strother SC, Anderson JR et al. 86 Horwitz B. Modeling of functional brain
Plurality and resemblance in fMRI data imaging data. In: SPIE Ninth Workshop on
analysis. NeuroImage 1999; 10: 282–303. Virtual Intelligence/Dynamic Neural Networks.
73 Purdon PL, Weisskoff RM. Effect of temporal Stockholm, Sweden, 1999.
autocorrelation due to physiological noise and 87 Holmes AP, Blair RC, Watson JD, Ford I.
stimulus paradigm on voxel-level false- Nonparametric analysis of statistic images
positive rates in fMRI. Human Brain Mapping from functional mapping experiments. Journal
1998; 6: 239–49. of Cerebral Blood Flow Metabolism 1996; 16: 7–
74 Lowe MJ, Russell DP. Treatment of baseline 22.
drifts in fMRI time series analysis. Journal of 88 Bandettini PA, Jesmanowicz A, Wong EC,
Computer Assisted Tomography 1999; 23: 463–73. Hyde JS. Processing strategies for time-course
75 Maas LC, Frederick Bd, Yurgelun-Todd DA, data sets in functional MRI of the human
Renshaw PF. Autocovariance based analysis of brain. Magnetic Resonance in Medicine 1993; 30:
functional MRI data. Biological Psychiatry 161–73.
1996; 39: 640–41. 89 Bullmore E, Brammer M, Williams S et al.
76 McIntosh AR, Brookstein FL, Haxby JV, Statistical methods of estimation and
Grady CL. Spatial pattern analysis of inference for functional MR image analysis.
functional brain images using partial least Magnetic Resonance in Medicine 1996; 35: 261–
squares. NeuroImage 1996; 3: 143–57. 77.
388 V Megalooikonomou et al.
90 Strother S, Kanno I, Rottenberg D. Principal and spatial variability of the optic radiation.
component analysis, variance partitioning, NeuroImage 1999; 10: 489–99.
and functional connectivity. Journal of 104 May A, Ashburner J, Buchel C et al.
Cerebral Flow and Metabolism 1995; 15: 355–60. Correlation between structural and functional
91 McKeown MJ, Makeig S, Brown CG et al. changes in brain in an idiopathic headache
Analysis of fMRI data by blind separation into syndrome [see comments]. Nature Medicine
independent spatial components. Human 1999; 5: 836–38.
Brain Mapping 1998; 6: 160–88. 105 Mummery CJ, Patterson K, Wise RJ,
92 Buchel C, Coull J, Friston K. The predictive Vandenbergh R, Price CJ, Hodges JR.
value of changes in effective connectivity for Disrupted temporal lobe connections in
human learning. Science 1999; 283: 1538–41. semantic dementia. Brain 1999; 122: 61–73.
93 Cressie N. Statistics for spatial data. New York: 106 Davatzikos C, Vaillant M, Resnick SM, Prince
John Wiley, 1993. JL, Letovsky S, Bryan RN. A computerized
94 Tsukimoto H, Morita C. The discovery of approach for morphological analysis of the
rules from brain images. In: Discovery Science – corpus callosum. Journal of Computer
First International Conference, 1998: 198–209. Assisted Tomography 1996; 20: 88–97.
95 Quinlan J. Induction of decision trees. 107 Csernansky J, Joshi S, Wang L et al.
Machine Learning 1986; 1: 81–106. Hippocampal morphometry in schizophrenia
96 Enbank R. Spline smoothing and nonparametric by high dimensional brain mapping.
regression. New York: Marcel Dekker, 1988. Proceedings of the National Academy of Sciences
97 Megalooikonomou V, Davatzikos C, of the United States of America 1998; 95: 11406–
Herskovits E. Mining lesion-deficit 11.
associations in a brain image database. In: 108 Thompson PM, Schwartz C, Toga AW. High-
ACM SIGKDD International Conference on resolution random mesh algorithms for
Knowledge Discovery and Data Mining. San creating a probabilistic 3D surface atlas of the
Diego, CA, 1999: 347–51. human brain. NeuroImage 1996; 3: 19–34.
98 Andersen E. Introduction to the statistical 109 Thompson PM, Schwartz C, Lin RT, Khan
analysis of categorical data. Berlin: Springer, AA, Toga AW. Three-dimensional statistical
1997. analysis of sulcal variability in the human
99 Bryan R, Manolio T. A method for using MR brain. Journal of Neuroscience 1996; 16: 4261–
to evaluate the effects of cardiovascular 74.
disease of the brain: the cardiovascular health 110 Alen L, Richey M, Cahi Y, Gorski R. Sex
study. American Journal of Neuroradiology 1994; differences in the corpus callosum of the
15: 1625–33. living human being. Journal of Neuroscience
100 Gerring J, Brady K, Chen A et al. 1991; 11: 933–42.
Neuroimaging variables related to the 111 Delisi L. Brain imaging studies of cerebral
development of secondary attention deficit morphology and activation in schizophrenia.
hyperactivity disorder in children who have In: Steinhauer SR, Gruzelier H eds.
moderate and severe closed head injury. Neuropsychology, psychophysiology, and
Journal of the American Academy of Child and information processing. Amsterdam: Elsevier,
Adolescent Psychiatry 1998; 37: 647–54. 1991: 147–60.
101 Bryan R, Wells S, Miller T et al. Infarctlike 112 Nelson MD, Saykin AJ, Flashman LA,
lesions in the brain: prevalence and anatomic Riordan HJ. Hippocampal volume reduction
characteristics at MR imaging of the elderly – in schizophrenia as assessed by magnetic
data from cardiovascular health study. resonance imaging: a meta-analytic study.
Radiology 1997; 202: 47–54. Archives in General Psychiatry 1998; 55: 433–40.
102 Mummery CJ, Patterson K, Price CJ, 113 Thompson PM, Moussai J, Zohoori S et al.
Ashburner J, Frackowiak RS, Hodges JR. A Cortical variability and asymmetry in normal
voxel-based morphometry study of semantic aging and Alzheimer’s disease. Cerebral Cortex
dementia: relationship between temporal lobe 1998; 8: 492–509.
atrophy and semantic memory. Annals in 114 Mazziotta JC, Toga AW, Evans A, Fox P,
Neurology 2000; 47: 36–45. Lancaster J. A probabilistic atlas of the human
103 Burgel U, Schormann T, Schleicher A, Zilles brain: theory and rationale for its
K. Mapping of histologically identified long development. The International Consortium
fiber tracts in human cerebral hemispheres to for Brain Mapping (ICBM). NeuroImage 1995;
the MRI volume of a reference brain: position 2: 89–101.
Data mining in brain imaging 389
115 Bookstein F. Biometrics, biomathematics, and Conference on Visualization in Biomedical
the morphometric synthesis. Bulletin of Computing, 1996: 429–38.
Mathematical Biology 1996; 58: 313–65. 130 Korn F, Sidiropoulos N, Faloutsos C, Siegel E,
116 Davatzikos C, Resnick S. Sex differences in Protopapas Z. Fast and effective similarity
anatomic measures of interhemisperic search in medical tumor databases using
connectivity: correlations with cognition in morphology. In: SPIE Proceedings. Boston,
men but not in women. Cerebral Cortex 1998; 8: MA, 1996.
635–40. 131 Guttman A. R-trees: a dynamic index
117 Miller M, Banerjee A, Christensen G et al. structure for spatial searching. ACM
Statistical methods in computational SIGMOD 1984; 47–57.
anatomy. Statistical Methods in Medical 132 Agrawal R, Faloutsos C, Swami A. Efficient
Research 1997; 6: 267–99. similarity search in sequence databases. In:
118 Gee J, LeBriquer L, Barillot C. Probabilistic Foundations of Data Organization and
matching of brain images. In: Bizais Y, Algorithms (FODO) Conference. Evanston, IL,
Barillot C, Di Paola R eds. Information 1993.
processing in medical imaging. Dordrecht: 133 Eden M. A two-dimensional growth process.
Kluwer, 1995: 113–25. In: Fourth Berkeley Symposium on Mathematical
119 Haller J, Christensen G, Joshi S et al. Statistics and Probability. Berkeley, CA:
Hippocampal MR imaging morphometry by University of California Press, 1961.
means of general pattern matching. Radiology 134 Ringwald M, Baldock R, Bard J et al. A
1996; 199: 787–91. database for mouse development. Science 1994;
120 Grenander U, Miller MI. Computational 265: 2033–34.
anatomy: an emerging discipline. Statistical 135 Williams B, Doyle M. An internet atlas of
mouse development. Computerized Medical
Computing and Graphics Newsletter 1996; 7: 3–8.
Imaging and Graphics 1996; 20: 433–47.
121 Grenander U, Miller MI. Computational
136 Toga A, Santori E, Hazani R, Ambach K. A 3D
anatomy: an emerging discipline. Quarterly of
digital map of the rat brain. Brain Research
Applied Mathematics 1998; LVI: 617–94.
Bulletin 1995; 38: 77–85.
122 Sholl D. Dendritic organization in the
137 Cohen F, Yang Z, Huang Z, Nissanov J.
neurons of the visual and motor cortices of the
Automatic matching of homologous
cat. Journal of Anatomy 1953; 87: 387–406. histological sections. IEEE Transactions on
123 Hu MK. Visual pattern recognition by
Biomedical Engineering 1998; 45: 642–50.
moment invariants. IRE Transactions on 138 Ali W, Cohen F. Registering coronal
Information Theory 1962; 8: 179–87. histological 2-D sections of a rat brain with
124 Sadjadi F, Hall E. Three-dimensional moment coronal sections of a 3-D atlas using geometric
invariants. IEEE Transactions on Pattern curve invariants and B-spline representation.
Analysis and Machine Intelligence 1980; 2: 127– IEEE Transactions on Medical Imaging 1998; 17:
36. 957–66.
125 Teh C-H, Chin R. On image analysis by the 139 Rangarajan A, Chui H, Mjolsness E et al. A
methods of moments. IEEE Transactions on robust point matching algorithm for
Pattern Analysis and Machine Intelligence 1988: autoradiograph alignment. Medical Image
10: 496–513. Analysis 1997; 4: 379–98.
126 Karten HJ, Kelly P. Content-based query and 140 Rangarajan A, Chui H, Bookstein F. The
retrieval in neuroscience databases, 1999. softassign Procrustes matching algorithm.
127 Jacobs GA, Theunissen FE. Extraction of Information Processing in Medical Imaging.
sensory parameters from a neural map by London: Springer, 1997: 29–42.
primary sensory interneurons. Journal of 141 Besl P, McKay N. A method for registration of
Neuroscience 2000; 20: 2934–43. 3-D shapes. IEEE Transactions on Pattern
128 Symanzik J, Ascoli GA, Washington SS, Analysis and Machine Intelligence 1992; 14: 239–
Krichmar JL. Visual data mining of brain 56.
cells. Computing Science and Statistics 1999; 31: 142 Pearl J. Fusion, propagation and structuring
445–49. in belief networks. Artificial Intelligence 1986;
129 Rossmanith C, Handels H, Poppl S, Rinast E, 29: 241–88.
Weiss D. Characterisation and classification 143 Herskovits E. Computer-based probabilistic
of brain tumors in three-dimensional MR network construction. PhD thesis. Stanford
image sequences. In: Fourth International University, CA, 1991.
390 V Megalooikonomou et al.
144 Cooper G, Herskovits E. A Bayesian method 157 Baumgartner R, Ryner L, Richter W,
for the induction of probabilistic networks Summers R, Jarmasz M, Somarjai R.
from data. Machine Learning 1992; 9: 309–47. Comparison of two exploratory data analysis
145 Pearl J. Probabilistic reasoning in intelligent methods for fMRI: fuzzy clustering vs.
systems: networks of plausible inference. San principal component analysis. Magnetic
Mateo, CA: Morgan Kaufmann, 1988. Resonance Imaging 2000; 18: 89–94.
146 Bouckaert R. Properties of measures for 158 Golay X, Kollias S, Stoll G, Meier D, Valavanis
Bayesian belief network learning. In: Tenth A, Boesiger P. A new correlation-based fuzzy
Conference on Uncertainty in Artificial logic clustering algorithm for fMRI. Magnetic
Intelligence. Seattle, WA, 1994. Resonance in Medicine 1998; 40: 249–60.
147 Lam W, Bacchus F. Learning Bayesian belief 159 Baumgartner R, Somorjai R, Summers R,
networks – an approach based on the MDL Richter W, Ryner L, Jarmasz M. Resampling
principle. Computational Intelligence 1994; 10: as a cluster validation technique in fMRI.
269–93. Journal of Magnetic Resonance Imaging 2000; 11:
148 Rissanen J. Modeling by shortest data 228–31.
description. Automatica 1978; 14: 465–71. 160 Baune A, Sommer FT, Erb M et al. Dynamical
149 Lam W. Bayesian network refinement via cluster analysis of cortical fMRI activation.
machine learning approach. IEEE NeuroImage 1999; 9: 477–89.
Transactions on Pattern Analysis and Machine 161 Fadili MJ, Ruan S, Bloyet D, Mazoyer B.
Intelligence 1998; 20: 240–51. Unsupervised fuzzy clustering analysis of
150 Gur RC, Trivedi SS, Saykin AJ, Gur RE. fMRI series. In: Proceedings of the 20th Annual
‘Behavioral imaging’ – a procedure for International Conference of the IEEE
analysis and display of neuropsychological Engineering in Medicine and Biology Society, Vol.
test scores: I. Construction of algorithm and 20, Biomedical Engineering Towards the Year
initial clinical application. Neuropsychiatry, 2000 and Beyond. Hong Kong, 1998: 696–99.
Neuropsychology, and Behavioral Neurology 162 Filzmoser P, Baumgartner R, Moser E. A
1988; 1: 53–60. hierarchical clustering method for analyzing
151 Gur RC, Saykin AJ, Blonder LX, Gur RE. functional MR images. Magnetic Resonance
‘Behavioral imaging’: II. Application of the Imaging 1999; 17: 817–26.
quantitative algorithm to hypothesis testing in 163 Goutte C, Toft P, Rostrup E, Nielsen F,
the population of hemiparkinson patients. Hansen L. On clustering fMRI time series.
Neuropsychiatry, Neuropsychology, and NeuroImage 1999; 9: 298–310.
Behavioral Neurology 1988; 1: 87–96. 164 Ledberg A, Akerman S, Roland PE.
152 Gur RC, Saykin AJ, Benton A et al. ‘Behavioral Estimation of the probabilities of 3D clusters
imaging’ – III. Interexpert agreement and in functional brain images. NeuroImage 1998;
reliability of weightings. Neuropsychiatry, 8: 113–28.
Neuropyschology, and Behavioral Neurology 165 Moser E, Baumgartner R, Barth M,
1990; 3: 113–24. Windischberger C. Explorative signal
153 Fisher LD, Belle GV. Biostatistics: a processing in functional MR imaging.
methodology for the health sciences. New York: International Journal of Imaging Systems and
John Wiley, 1993. Technology 1999; 10: 166–76.
154 Friston KJ. Testing for anatomically specified 166 Fischer H, Hennig J. Neural network-based
regional effects. Human Brain Mapping 1997; 5: analysis of MR time series. Magnetic Resonance
133–36. in Medicine 1999; 41: 124–31.
155 Forman SD, Cohen JD, Fitzgerald M, Eddy 167 Horwitz B, Tagamets MA. Predicting human
WF, Mintun MA, Noll DC. Improved functional maps with neural net modeling.
assessment of significant activation in Human Brain Mapping 1999; 8: 137–42.
functional magnetic resonance imaging 168 Baumgartner R, Windischberger C, Moser E.
(fMRI): use of a cluster-size threshold. Quantification in functional magnetic
Magnetic Resonance in Medicine 1995; 33: 636– resonance imaging: fuzzy clustering vs.
47. correlation analysis. Magnetic Resonance
156 Baumgartner R, Somorjai R, Summers R, Imaging 1998; 16: 115–25.
Richter W. Assessment of cluster homogeneity 169 Larntz K. Small-sample comparisons of exact
in fMRI data using Kendall’s coefficient of levels for chi-squared goodness-of-fit statistics.
concordance. Magnetic Resonance Imaging 1999; Journal of the American Statistical Association
17: 1525–32. 1978; 73: 253–63.
Data mining in brain imaging 391
170 Lee C, Shen S. Convergence-rates and power visualization of traditional and multimedia
of 6 power-divergence statistics for testing datasets. In: ACM SIGMOD Conference on
independence in 2by2 contingency table. Management of Data. San Jose, CA, 1995.
Communications in Statistics – Theory and 180 Cohen J. Statistical power analysis for the
Methods 1994; 23: 2113–26. behavioral sciences. Mahway, NJ: Lawrence
171 Oluyede B. A modified chi-square test of Erlbaum, 1987.
independence against a class of ordered 181 Megalooikonomou V, Herskovits E. Mining
alternatives in a RxC contingency table. structure-function associations in a brain
Canadian Journal of Statistics 1994; 22: 75–87. image database. In: Cios K ed. Medical data
172 Harwell M, Serlin R. An empirical study of mining and knowledge discovery. Berlin:
five multivariate tests for the singe-factor Springer, 2000.
repeated measures model. Communications in 182 Narr K, Sharma T, Moussai J et al. 3D maps of
Statistics – Simulation and Computation 1997; cortical surface variability and sulcal
26: 605–18. asymmetries in schizophrenia and normal
173 Tanizaki H. Power comparison of non- populations. In: 5th International Conference on
parametric tests: small-sample properties Functional Mapping of the Human Brain.
from Monte Carlo experiments. Journal of Dusseldorf, 1999.
Applied Statistics 1997; 24: 603–32. 183 Thompson P, Mega M, Toga A. Disease-
174 Osius G, Rojek D. Normal goodness-of-fit tests specific brain atlases. In: Toga A, Mazziotta J,
for multinomial models with large degrees of
Frackowiak R eds. Brain mapping: the disorders.
freedom. Journal of the American Statistical
San Diego, CA: Academic Press, 2000.
Association 1992; 87: 1145–52.
184 Thompson P, Woods R, Mega M, Toga A.
175 Thomas R, Conlon M. Sample-size
Mathematical/computational challenges in
determination based on Fisher exact test for
use in 22 comparative trials with low event creating deformable and probabilistic atlases
rates. Controlled Clinical Trials 1992; 13: 134– of the human brain. Human Brain Mapping
47. 2000; 9: 81–92.
176 Mannan M, Nassar R. Size and power of test 185 Thompson P, Toga A. Detection, visualization
statistics for gene correlation in 22 and animation of abnormal anatomic
contingency-tables. Biometrical Journal 1995; structure with a deformable probabilistic
37: 409–33. brain atlas based on random vector field
177 Megalooikonomou V, Davatzikos C, transformations. Medical Image Analysis 1997;
Herskovits E. A simulator for evaluating 1: 271–94.
methods for the detection of lesion-deficit 186 Ashburner J, Friston KJ. Voxel-based
associations. Human Brain Mapping in press, morphometry – the methods. NeuroImage
2000. 2000; 11: 805–21.
178 Shu Y, Liaw J-S, Berger T, Shahabi C. Data 187 Turkheimer E, Yeo RA, Jones C, Bigler ED.
mining for neuroscience databases. In: 29th Quantitative assessment of covariation
Annual Meeting of Society for Neuroscience. between neuropsychological function and
Miami Beach, FL, 1999. location of naturally occurring lesions in
179 Faloutsos C, Lin K-I. FastMap: a fast humans. Journal of Clinical and Experimental
algorithm for indexing, data mining and Neuropsychology 1990; 12: 549–65.
The t-test compares two groups of samples and determines whether there is a
significant difference; here the groups are MR signal readings during two different
states (conditions): one R state (rest or control) and one T state (test or task). Let xyz
denote a voxel (volume element) of a 3-D image and let N be the total number of
voxels. Then the 3-D images [rxyz;R ], [rxyz;T ] exist for each subject k. Let [rxyz;k ] where
rxyz;k ¼ rxyz;k;Tÿ rxyz;k;R be the voxel-by-voxel subtraction picture of the differences in
392 V Megalooikonomou et al.
and
1 Xn ÿ 2
Varrxyz ¼ rxyz ÿ Erxyz
nÿ1 k
A deformation function dðu; vÞ, is defined at each point ðu; vÞ of the atlas structure, S,
of interest. It measures the enlargement or shrinkage associated with the transforma-
tion from an infinitesimal region around a point in the atlas space to its corresponding
infinitesimal region in the subject space. If the point ðu; vÞ of the atlas space is mapped
to the point [Uðu; vÞ; V ðu; vÞ] in the subject space, then dðu; vÞ is defined by
dðu; vÞ ¼ detfr½Uðu; vÞ; V ðu; vÞg, where r denotes the gradient of a vector function
and detðÞ denotes the determinant of a matrix. Intersubject comparisons are made by
comparing deformation functions. Specifically, let [U1 ðu; vÞ], ½V1 ðu; vÞ; . . . ;
½UN ðu; vÞ; VN ðu; vÞ be the maps from the structure S of the atlas to the structure S
of each of the N subjects of a population. Let also Up ðu; vÞ and Vp ðu; vÞ be the average
of the N functions U1 ; . . . ; UN and V1 ; . . . ; VN , respectively:
X X
Up ðu; vÞ ¼ 1=N Ui ðu; vÞ; Vp ðu; vÞ ¼ 1=N Vi ðu; vÞ
I¼1;...;N I¼1;...;N
The average structure Sp of that population is defined as the collection of the points
where the atlas structure points are mapped:
Sp ¼ [ðu;vÞ2Sa ½Up ðu; vÞ; Vp ðu; vÞ
where Sa is the collection of points belonging to structure S of the atlas. Let p ðu; vÞ be
the point-wise mean of the deformation function of the population:
X
p ðu; vÞ ¼ 1=N di ðu; vÞ
I¼1;...;N
Data mining in brain imaging 393
where d1 ðu; vÞ; . . . ; dN ðu; vÞ are the deformation functions of the N subjects. Then the
difference between the two populations, denoted with subscripts 1 and 2, can be
measured for each structural region as an effect size180 defined as
eðu; vÞ ¼ ðp1 ðu; vÞ ÿ p2 ðu; vÞÞ=ðu; vÞ
where ðu; vÞ is the point-wise standard deviation of the two populations combined.
Without loss of generality, we assume in this section that each variable is Boolean, i.e.
it represents a logical statement that is either true (T) or false (F). The problem of
generating the Bayesian-network structure that is most likely to have generated the
cases in the database D can be restated as
BSmax ¼ argmax PðBS jDÞ
Bs
where BSmax is the network structure (i.e. set of associations) we seek. Using Bayes’
theorem, we obtain
PðDj BS ÞPðBS Þ
BSmax ¼ argmax
BS PðDÞ
Since the prior probability of observing the data is constant for all models, the
problem reduces to solving
BSmax ¼ argmax PðDj BS ÞPðBS Þ
BS