Professional Documents
Culture Documents
1, JANUARY 2010 19
II. METHODS
as similarity measure (see Fig. 1), with expressing the spatial
position in the reference image. The image is subdivided into Consider a reference image with intensity , a
cubic regions of – voxels, overlapping 50% in each di- floating image with intensity and a transforma-
mension (i.e., in 3-D, every voxel belongs to eight regions). The tion μ that maps every reference position to the
joint intensity histogram within each region is constructed using corresponding floating position μ given a set
B-spline Parzen windows [12]. of parameters μ. In the next paragraphs we first describe the
Within this paper, we propose conditional mutual information transformation model μ that defines the space in which
(cMI) [13] as similarity measure. The cMI is calcu- the best solution is sought and then the similarity measure that
lated between the reference and floating intensity distributions calculates how well the reference and deformed floating image
and , given a certain spatial distribution match. Next, we give some details about the spatial resolution
at which both the transformation model and proposed similarity
measure are calculated. To conclude, we describe the calcu-
(6)
lation of the derivatives and the optimization and discuss the
(7) computational complexity of the algorithm.
1Although total correlation (5) and cMI (6) are clearly different, (7)
is equivalent to (11) in [11]. We think this is due to a miscalcula-
Whereas the bi-variate gMI expresses the reduction tion of Studholme et al.: in [11, Eq. (6) and (7)], the authors state that
p(m ) = p(m j r)p(r) and p(m ) = p(m j r)p(r). However, this should
in the uncertainty of due to the knowledge of (and be p(m ) = p(m j r)p(r) and analogous. Note that Studholme uses
vice–versa), cMI expresses the reduction in the uncertainty of a slightly different notation, i.e., p (m ) p(m j r ).
LOECKX et al.: NONRIGID IMAGE REGISTRATION USING CONDITIONAL MUTUAL INFORMATION 21
μ
(19)
μ (16)
The image derivative depends on the image interpo-
lation scheme. We use th degree B-spline image interpolation
C. Spatial Resolution throughout this work, thus the calculation of the image deriva-
tive requires only B-spline derivatives.
Equations (8) and (12) both stipulate the spatial resolution of For PV, the derivative of (16) with respect to a transformation
the algorithm. The former governs the region of influence of a parameter is given by
registration parameter and the latter the scale at which the cMI
and thus similarity measure is calculated. We will use the same μ
settings for the B-spline degree, mesh knots, and spacing in both
formulas. Thus, the local transformation around a certain dis-
placement vector is guided by the conditional joint histogram, μ
(20)
both using the same concept and scale of locality. Therefore, we
also keep the B-spline degree fixed over a single
registration: using B-spline subdivision formula’s, (8) can be re- requiring no image derivative.
fined exactly for constant when is divided
by two. E. Computational Complexity
The extent of the image region contributing to the joint his-
togram for a given spatial bin is the same as the region of Taking the limited-span properties of the B-splines into ac-
influence of the corresponding displacement vector . Due to count, the construction of the PW joint histogram (9) for 3-D im-
the limited span properties of B-splines, this extent is limited ages requires B-spline evaluations per
to a by by cuboid cen- iteration, with the number of voxels, accounting
tred around , with a higher contribution for the voxels closer for the Parzen window histogram update and for the
to . The mesh spacing regulates the locality of the cMI. floating image interpolation. The B-spline evaluations needed to
To represent a more global histogram, a large is chosen, calculate the transformation field at each voxel are performed
yielding a limited number of spatial bins or displacement vec- only once for each multiresolution stage, which is possible if
tors containing contributions from a large image area. A fine the mesh spacing equals an integer number of voxels.
mesh with small and many spatial bins is used to calcu- For the PV histogram, the image interpolation is replaced by the
late a more local histogram and transformation field. In practice, partial volume kernel and the Parzen window histogramming is
a multiresolution scheme is adopted, starting from a coarse mesh no longer used, leading to B-spline evaluations.
and gradually refining it. For most applications we choose Note that the number of evaluations is independent of the mesh
, although we deviate from this for non-isotropic data. spacing. The number of evaluations does not change for cMI,
as the spatial kernel for cMI is the same as used to calculate the
D. Derivatives and Optimization transformation field. If we take the same B-spline degree for all
aspects of the method, the ratios of computational complexity
A limited memory quasi Newton method [18] is adopted for
for are for PW and for PV.
the optimization, using analytical derivatives to avoid discretiza-
However, the number of multiplications required for cMI will
tion errors. The derivative of (11) with respect to a transforma-
increase, as a different joint histogram has to be constructed
tion parameter is given by
for each mesh control point . Whereas each joint inten-
μ sity pair contributes only to 1 (i.e., the global) joint histogram
for gMI, it will contribute to joint
histograms for cMI. This factor amounts to 8, 27, and 64 for
μ , respectively. The number of joint histogram deriva-
(17)
tives will be multiplied with the same factor. Moreover, for gMI,
the memory requirement for the joint histogram and its deriva-
and similar for gMI. The central factor is calculated using tives is μ floating-point numbers, with
and the number of reference and floating bins and μ the
μ number of displacement vectors. For cMI, each spatial bin is
influenced by displacement vec-
tors, thus we need
μ
μ floating-point numbers to store the conditional joint
μ μ histograms and their derivatives, leading to excessive memory
(18) requirements. Luckily, we do not need to store all cMI joint
LOECKX et al.: NONRIGID IMAGE REGISTRATION USING CONDITIONAL MUTUAL INFORMATION 23
Fig. 5. Boxplot of the registration results obtained for 200 artificial multi-
modality image pairs. The graphs show the average intensity difference (AID)
and warping index $ (in voxels) of the registration result compared to the
ground truth. Registrations were performed using global MI (gMI), conditional
MI, and total correlation (TC), using Parzen window (PW) and partial volume
(PV) histogram estimation, and using B-spline degree n= n
2 and = 3.
TABLE I
Fig. 4. Original images and average of the results obtained over all 200 mul- RESULTS OF THE EXPERIMENTS SHOWN IN FIG. 5
timodal registrations using second degree B-splines. The best results (sharpest
average image) are obtained using conditional MI (cMI) and Partial volume
(PV) interpolation.
Fig. 7. Boxplot of the registration results obtained for 200 bias field distor-
tion pairs. The graphs show the average intensity difference (AID) and warping
$
index (in voxels) of the registration result compared to the ground truth. Reg-
istrations were performed using global MI (gMI), conditional MI, and total cor-
relation (TC), using Parzen window (PW) and partial volume (PV) histogram
estimation, and using B-spline degree n= n
2 and = 3.
Fig. 6. Average of the results obtained over all 200 registrations after bias field TABLE II
distortion. RESULTS OF THE EXPERIMENTS SHOWN IN FIG. 7
=
Fig. 8. BrainWeb registration results for (left) monomodal and (right) multimodal and (top to bottom) mesh spline degree n 1; 2, and 3. Each graph represents
the results for 6–254 bins and conditional and global MI. The best results are obtained with n = 2 and 30 bins. The difference is clearest in the multimodal case.
TABLE III structures (and the tumor) are better visible in the MR scan,
REGISTRATION SETTINGS FOR BRAINWEB IMAGES yet the CT scan is required to perform the actual planning and
dose distribution calculation. This registration is a challenging
problem, as MR and CT images differ significantly due to dif-
ferences in rectum position and filling and the CT contrast of
the region of interest is rather limited. Moreover, the MR image
is slightly distorted with a bias field.
We dispose of 41 pairs of patient MR and CT scans. The
pairs are obtained from 14 patients. For each patient, at three
best results are obtained using PV interpolation for the coarse different time points during treatment, an MR and a CT image
registration and PW interpolation for the finer stages. There- were recorded (one CT/MR pair is missing). A trained radio-
fore, we used PV interpolation for stages 1–3 and PW for the therapist manually segmented the rectum in each CT and MR
remaining stages. For the validation, we calculated the warping image. A multiresolution registration scheme is adopted to
index between the ground-truth field and the field obtained by register the images within each pair to each other, using the
the validation registrations. The warping index was evaluated in CT image as reference image. The mesh spacing, as well for
the foreground only, defined by a ROI covering the brain region. the cMI calculation as for the tensor product B-spline field,
The results are summarized in Fig. 8, showing the calcula- is gradually decreased, starting from 512 512 64 voxels
tion time and warping index for different registration settings. (about 400 400 313 mm ) in the first stage to 32 32 8
In most cases, best results are achieved using cMI, however at voxels (about 25 25 40 mm ) in the fifth and last stage.
the cost of an increased calculation time. The biggest advan- Based upon initial experiments, we have chosen to use 30 bins
tage of cMI over gMI is seen for multimodal registration with for the floating and reference intensities in the joint histogram
first and second degree B-splines. High registration errors (up to and PW interpolation.
60 mm) can occur when registration in the final stages diverges The results obtained using cMI and gMI are compared to rigid
from the ground-truth optimum. alignment. As validation measure, the Dice similarity criterium
(DSC) and surface distance (SD) between corresponding rectum
E. Experiment 5: Clinical CT/MR segmentations are evaluated (see Fig. 9). Both for DSC and SD,
In our last experiment, we applied global and conditional MI conditional MI significantly outperformed global
to MR/CT registration for colorectal cancer treatment [20]. The MI and rigid registration. An example comparing segmentations
clinical goal of the registration is to transfer expert delineations obtained by global and conditional MI is shown in Fig. 10. The
of structures of interest from the MR scan to the CT scan. Most cMI registrations took on average 10610 s (8050–13022 s) and
LOECKX et al.: NONRIGID IMAGE REGISTRATION USING CONDITIONAL MUTUAL INFORMATION 27
The conditional mutual information will be the sum of the mu- [6] J. West et al., “Comparison and evaluation of retrospective inter-
tual information of each of the spatial bins. Using as the modality brain image registration techniques,” J. Comput. Assist.
Tomogr., vol. 21, no. 4, pp. 554–566, 1997.
area of the part of contained within the central spatial bin, [7] T. Gaens, F. Maes, D. Vandermeulen, and P. Suetens, “Non-rigid multi-
the joint histogram for this central spatial bin is modal image registration using mutual information,” in Proc. MICCAI,
1998, pp. 1099–1106.
[8] B. Likar and F. Pernuš, “A hierarchical approach to elastic registration
(22) based on mutual information,” Image Vis. Comput., vol. 19, no. 1–2,
pp. 33–44, Jan. 2001.
[9] A. Andronache, P. Cattin, and G. Székely, “Local intensity mapping
Now, we apply a perturbation on such that . for hierarchical non-rigid registration of multi-modal images using the
The global and conditional joint histograms now become cross-correlation coefficient,” in Proc. WBIR, 2006, pp. 26–33.
[10] J. Weese, P. Rösch, T. Netsch, T. Blaffert, and M. Quist, “Gray-value
based registration of CT and MR images by maximization of local cor-
relation,” in Proc. MICCAI, 1999, pp. 656–663.
[11] C. Studholme, C. Drapaca, B. Iordanova, and V. Cardenas, “Defor-
(23) mation-based mapping of volume change from serial brain MRI in the
presence of local tissue contrast change,” IEEE Trans. Med. Imag., vol.
25, no. 5, pp. 626–639, May 2006.
[12] P. Thévenaz and M. Unser, “Optimization of mutual information for
multiresolution image registration,” IEEE Trans. Signal Process., vol.
9, no. 12, pp. 2083–2099, Dec. 2000.
(24) [13] T. M. Cover and J. A. Thomas, Elements of Information Theory, 2nd
ed. New York: Wiley, 2006.
[14] D. Rueckert, L. I. Sonoda, C. Hayes, D. L. Hill, M. O. Leach, and D. J.
Hawkes, “Nonrigid registration using free-form deformations: Appli-
The perturbation will only influence the size of the structures cation to breast MR images,” IEEE Trans. Med. Imag., vol. 18, no. 8,
contained within the central spatial bin, so the contribution of pp. 712–721, Aug. 1999.
[15] D. L. Collins, A. P. Zijdenbos, V. Kollokian, J. G. Sled, N. J. Kabani,
the peripheral spatial bins to cMI and TC remains constant. C. J. Holmes, and A. C. Evans, “Design and construction of a realistic
From (23) and (24), the mutual information can be calcu- digital brain phantom,” IEEE Trans. Med. Imag., vol. 17, no. 3, pp.
lated easily using (2). To create the graph in Fig. 3, we chose 463–468, Jun. 1998.
[16] F. Maes, A. Collignon, D. Vandermeulen, G. Marchal, and P. Suetens,
a radius of 3.5 and 12.5 for the inner and outer circle and a “Multimodality image registration by maximization of mutual infor-
square edge of 32. The local optimum of the gMI is situated mation,” IEEE Trans. Med. Imag., vol. 16, no. 2, pp. 187–198, Apr.
at . 1997.
[17] H.-M. Chen and P. K. Varshney, “Mutual information-based CT-MR
brain image registration using generalized partial volume joint his-
REFERENCES togram estimation,” IEEE Trans. Med. Imag., vol. 22, no. 9, pp.
1111–1119, Sep. 2003.
[1] D. Loeckx, P. Slagmolen, F. Maes, D. Vandermeulen, and P. Suetens, [18] R. Byrd, P. Lu, J. Nocedal, and C. Zhu, “A limited memory algorithm
“Nonrigid image registration using conditional mutual information,” in for bound constrained optimization,” SIAM J. Sci. Comput., vol. 16, no.
Proc. IPMI, 2007, pp. 725–737. 5, pp. 1190–1208, Sep. 1995.
[2] A. Collignon, F. Maes, D. Delaere, D. Vandermeulen, P. Suetens, and [19] D. Loeckx, F. Maes, D. Vandermeulen, and P. Suetens, “Comparison
G. Marchal, “Automated multi-modality image registration based on between Parzen window interpolation and generalised partial volume
information theory,” in Proc. IPMI, 1995, pp. 263–274. estimation for non-rigid image registration using mutual information,”
[3] P. Viola and W. M. Wells, “Alignment by maximization of mutual in- in Proc. WBIR, 2006, pp. 206–213.
formation,” in Proc. ICCV, 1995, pp. 16–23. [20] P. Slagmolen, D. Loeckx, S. Roels, X. Geets, F. Maes, K. Haustermans,
[4] F. Maes, D. Vandermeulen, and P. Suetens, “Medical image regis- and P. Suetens, “Nonrigid registration of multitemporal CT and MR
tration using mutual information,” Proc. IEEE, vol. 91, no. 10, pp. images for radiotherapy treatment planning,” in Proc. WBIR, 2006, pp.
1699–1722, Oct. 2003. 297–305.
[5] J. Pluim, J. Maintz, and M. Viergever, “Mutual-information-based reg- [21] E. D’Agostino, F. Maes, D. Vandermeulen, and P. Suetens, “Atlas-to-
istration of medical images: A survey,” IEEE Trans. Med. Imag., vol. image non-rigid registration by minimization of conditional local en-
22, no. 8, pp. 986–1004, Aug. 2003. tropy,” in Proc. IPMI, 2007, pp. 320–332.