Professional Documents
Culture Documents
net/publication/283166018
CITATIONS READS
11 922
4 authors:
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Andrea Schwahofer on 03 November 2016.
Improving radiotherapy planning in patients with metallic implants using the iterative metal
This content has been downloaded from IOPscience. Please scroll down to see the full text.
(http://iopscience.iop.org/2057-1976/1/2/025206)
View the table of contents for this issue, or go to the journal homepage for more
Download details:
IP Address: 107.173.217.91
This content was downloaded on 03/11/2016 at 11:15
An evaluation of three commercially available metal artifact reduction methods for CT imaging
Jessie Y Huang, James R Kerns, Jessica L Nute et al.
CT metal artifact reduction method correcting for beam hardening and missing projections
Joost M Verburg and Joao Seco
Development of virtual patient models for permanent implant brachytherapy Monte Carlo dose
calculations: interdependence of CT image artifact mitigation and tissue assignment
N Miksys, C Xu, L Beaulieu et al.
PAPER
Keywords: metal artifact reduction, metal implants, radiotherapy planning, iterative raw data reconstruction
Abstract
Metal artifacts in computed tomography (CT) images can cause important errors in dose calculation.
Algorithms developed to reduce these artifacts could improve these images but also introduce a bias in
tissues surrounding metallic implants. The purpose of this study is to validate a pre-commercial metal
artifact reduction tool for radiotherapy treatment planning. The performance of the iterative metal
artifact reduction (iMAR) algorithm developed by Siemens Healthcare is evaluated on a test platform.
A calibration phantom constituted of tissue-equivalent plastics is used to estimate the image bias from
artifact correction. Patient CT data with metal implants (seven dental fillings, one bilateral hip) are
reconstructed using weighted filtered backprojection and corrected using the iMAR algorithm.
Radiotherapy treatment plans are calculated and compared on corrected and uncorrected images
using the collapsed cone convolution dose algorithm implemented in RayStation (RaySearch).
Phantom scans show that iMAR reproduces CT numbers within ±44 HU for tissue equivalent
substitutes in 3.6 cm2 circular ROIs. The effect of this CT number difference on megavoltage photon
dose calculation is shown to be within ±1% dose error. Comparing patient plans from corrected and
uncorrected images, dose differences of up to ±5% are discovered in target volume and organs at risk,
depending on the treatment site. The iMAR algorithm resulted in improvement of image quality in
metal artifact affected CT images for radiotherapy planning. The technique reduces dose errors
significantly while keeping calculated doses in the surrounding tissues within a clinically acceptable
level in comparison to ground truth. Future work could aim at the improvement of benchmark
methods for a clinical environment.
experience to delineate target and OARs [2]. On the fillings. They concluded that ‘normalized MAR sig-
other hand, CT number accuracy is affected by metal nificantly improves image quality in CT imaging ... in
artifacts. This leads to incorrect assignments of elec- patients with metallic dental hardware and revealed
tron density values, which results in errors in dose cal- tumors that were masked by artifacts’ [18]. The quality
culation. With modern techniques like intensity of the artifact corrected image can be further
modulated radiation therapy (IMRT) or particle ther- improved by including a technique called frequency
apy, high dose gradients can be achieved to increase split (FSMAR) into the reconstruction process [19]. It
dose in the target volume and simultaneously spare enhances the visibility of fine details that otherwise
OARs [3]. Particularly in cases of pelvic and head and might get lost during the inpainting step. Based on this
neck tumors, metal artifacts can be adjacent to or even approach, the algorithm iterative metal artifact reduc-
included in the target volume or OAR. This leads to tion (iMAR) was developed by Siemens (Siemens Sec-
inaccuracies in target delineation and dose calcula- tor Healthcare, Forchheim, Germany). A recent study
tion [4]. by Axente et al [20] presented a clinical evaluation of
To prevent major errors in dose calculation, the iMAR for the application in radiotherapy.
electron density of soft tissue is assigned manually to The here presented study aims at a detailed clinical
the artifact affected region. This is time and resource evaluation of the iMAR algorithm for its application in
consuming, and decreases the quality of the treatment radiotherapy. Besides confirming the results found by
as the benefit from highly precise radiotherapy techni- Axente et al, we extend the phantom study in two
ques is degraded. Another approach is to obtain mega- points. Firstly, we investigated a number of different
voltage CT (MVCT) images. MVCT images are less metal inserts (aluminum, titanium, steel and tung-
susceptible to metal artifacts [5]. However, the use of sten) to mimic different clinical situations. Secondly,
MVCT is concerned with low soft tissue contrast and we investigated the performance of iMAR in very close
additional patient dose [6]. proximity to the metal in both, patient and phantom.
Several techniques for metal artifact reduction An extensive elaboration of ROIs in patients was per-
(MAR) are proposed in literature [7–10]. Dual energy formed using an approximate ground truth (GT).
CT based monoenergetic imaging provides a method Additionally, the dose differences between doses from
to reduce artifacts from beam hardening [11]. CT data iMAR and conventionally reconstructed images are
are acquired from two different energies and a mono- evaluated closely using histograms.
energetic extrapolation is performed. The result of this
extrapolation is a virtually monochromatic dataset,
which in principle is free of beam hardening artifacts 2. Materials and methods
[7, 12, 13]. This method is widely used in radiology for
diagnosing regions of interest (ROIs) near metal clips 2.1. Phantom study
or titanium implants which only cause mild artifacts 2.1.1. Data acquisition
[14]. However, the majority of radiotherapy cases with The tissue characterization phantom Gammex 467
metal artifacts in close proximity to target volume or (Gammex, Middleton, WI, USA) was scanned with and
OARs are head and neck patients with dental fillings or without metal inserts. The phantom itself has a
implants. These fillings are often made from Amal- diameter of 32 cm and comprises 16 rod shaped
gam, which has a high mass density and produces tissue substitutes of 5.6 cm length with a diameter of
severe photon starvation artifacts. Metal artifacts can 2.8 cm. The phantom loading pattern is illustrated
be reduced, but not completely corrected with this in figure 1 and table 1. The following metal inserts
algorithm [15, 16]. were introduced into the phantom: aluminum
Another technique for MAR is based on sinogram (ρAl = 2.7 g cm−3), titanium (ρTi = 4.5 g cm−3), steel
inpainting, which was proposed by Kalender et al in (ρsteel = 7.9 g cm−3), tungsten ( ρ W = 19.3 g cm−3).
1987 [9]. It relies on the interpolation of missing data The metals were chosen to simulate materials with
within the sinogram data. In a first step, the metal is densities used in typical medical applications such as
identified in an uncorrected image. The metal-only hip and dental implants. CT data was acquired using a
image is forward projected to find regions in the origi- Siemens Somatom Flash (Siemens Sector Healthcare,
nal sinogram that need to be interpolated. The inter- Forchheim, Germany) CT scanner at 120 kVp at a tube
polated sinogram is then used to reconstruct the current of 300 mAs. Reconstruction parameters were
corrected images using filtered backprojection. chosen as: field of view (FoV) of 500 mm, slice
Although this method effectively reduces metal arti- thickness of 2 mm and a B40f reconstruction kernel,
facts, new artifacts are introduced, especially in the using weighted filtered back projection (WFBP) as
region close to the implant. Meyer et al [17] proposed implemented in the Syngo software (version 2012b,
to apply a normalization to the original sinogram Siemens Sector Healthcare, Forchheim, Germany). In
before interpolation to overcome this problem. This a first scan, a metal free data set has been acquired
normalized metal artifact reduction (NMAR) using a real water insert, serving as GT. This scan has
approach has been investigated from a diagnostic been repeated for each metal insert at the position of
point of view, using CT data of patients with dental real water (figure 1(a)). In a second scan, an image of
2
Biomed. Phys. Eng. Express 1 (2015) 025206 E Bär et al
Figure 1. Configuration of the Gammex phantom tissue equivalent inserts for (a) single metal plugs of different material, placed at the
position of real water; and (b) two steel inserts. The position of the metal inserts is indicated by the shading. The ROIs for artifact
density evaluations are illustrated. The indices correspond to the indices in table 1.
3
Biomed. Phys. Eng. Express 1 (2015) 025206 E Bär et al
Patient Indication
The measurements were performed on WFBP and 3.1. Phantom study: CT number accuracy
iMAR reconstructed images for all head and neck evaluation
patients. Furthermore reference values were estimated Images representative for all phantom setups are
in order to create a GT. Therefore ROIs of the same presented in figure 3. Results of the artifact density
4
Biomed. Phys. Eng. Express 1 (2015) 025206 E Bär et al
Figure 3. Top: original images without metal inserts, serving as ground truth. Middle: original images containing artifacts induced by
different metal inserts. Bottom: iMAR reconstructed images. (c = 40 HU, w = 350 HU).
Figure 4. CT numbers determined in ROI 1 and 2 for images with different metal plugs. The ground truth CT number (GT) is
displayed as a dashed line.
evaluation are shown in figure 4. CT numbers were visually most affected by artifacts, see figure 2. iMAR
determined as described in section 2.1.2. The values improved CT number estimation for all investigated
on the GT image are (−4.5 ± 30.9) HU in ROI 1 and tissue substitutes on the metal artifact affected images.
(−7.3 ± 38.1) HU in ROI 2. CT numbers determined The largest CT number difference between iMAR and
on the data set with aluminum insert are equal for GT is 43.5 HU, observed for the lung substitute in the
both, iMAR and WFBP reconstruction. For all other images containing steel. In the soft tissue region, the
data sets with metal inserts, the CT numbers in ROI 1 CT numbers could be corrected to a level where they
are represented well by iMAR (titanium: (−3.5 ± are not significantly different from the GT, as shown in
35.1) HU, steel: (−17.6 ± 34.6) HU, tungsten (−8.0 table 3 (p < 0.05; ✓, p > 0.05; X).
± 35.4) HU). Results are slightly worse for ROI 2, In the WFBP image containing two steel inserts,
where iMAR overestimates the CT numbers within the
one observes CT number differences of up to
ROI (titanium: (66.6 ± 42.0) HU, steel: (68.5 ±
±351 HU, especially in inserts which are located
36.3) HU, tungsten (73.9 ± 45.3) HU).
Figure 5 displays the results of the CT number directly in the region between the inserts (adipose and
accuracy evaluation. The column charts show that CT liver). Artifacts are corrected well with iMAR, within a
number alteration due to artifacts is strongest in lung, maximum CT number difference of ±33.9 HU (in
liver and cortical bone inserts. These inserts are also liver insert).
5
Biomed. Phys. Eng. Express 1 (2015) 025206 E Bär et al
Figure 5. The CT number differences to ground truth, following equation (1), determined for WFBP and iMAR images. Only absolute
values are displayed here.
Table 3. Results of the significance analysis. Differences in CT num- WFBP image is lower in regions of dark streaks
ber mean values over the investigated ROIs were tested according to
their statistical significance using Studentʼs t-test. Significant differ- compared to dose calculated on GT, with dose
ences between iMAR and ground truth are marked with a tick. differences of up to 5%. Comparing the dose calcu-
p < 0.05 Aluminum Titanium Steel Tungsten lated on the GT image with the dose on iMAR, small
dose differences are observed within the range of ±1%.
Lung ✓ ✓ ✓ ✓
The gamma analysis of GT versus WFBP dose revealed
Adipose ✓ ✓ ✓ ✓
Breast X X X X
62.1% passed dose points with the chosen criteria.
Solid water X X X X iMAR could improve that up to 85.7% passed dose
Liver X X X X points.
Inner bone X X ✓ X Profiles of the dose distributions were measured
B200 X ✓ ✓ ✓ on the central axis and 50 mm off axis. Figure 7 shows
Cortical bone X ✓ ✓ ✓
these profiles. A clear underdose of the WFBP dose in
the region of the dark streak can be observed on the
3.2. Phantom study: dosimetric evaluation central axis as well as off axis. However, the profiles
The calculated doses on GT, WFBP and iMAR images measured on the GT and on the iMAR reconstructed
are displayed in figure 6. The dose calculated on the images are almost equal.
6
Biomed. Phys. Eng. Express 1 (2015) 025206 E Bär et al
Figure 6. Dose distribution calculated on (a) the ground truth image, (b) the WFBP reconstructed image and (c) the iMAR
reconstructed image. Dose difference maps to ground truth determined on (d) WFBP and (e) iMAR data (c = 0 HU, w = 800 HU).
Figure 7. Depth dose profiles measured from the dose distributions. Top panel: central axis dose profile. Bottom panel: off axis dose
profile, 50 mm from central axis. The origin of the x-axis was set to the midpoint between the steel inserts, which corresponds to
10 cm depth in the phantom, marked by the Iso-point in figure 6.
3.3. Patient study: image quality from the reference CT numbers. In the following, the
Results of the image quality analysis using the ROI CT number difference from reference is discussed.
readout are presented in figure 8. The reference values iMAR worked well especially for patients 7 (+5 HU)
for ROIs 1-2 are tabulated in table 4. ROI 1 and 2 were and patient 2 (+22 HU). The least effect was detected
placed within the hypodense streaks. ROI 1: the mean for patient 4 (−94 HU).
CT numbers determined on WFBP image for the The CT numbers determined for ROI 2 on the
different patients are between −301 HU and −846 HU WFBP image are within the same range as for ROI 1.
over the whole patient cohort. With iMAR, mean CT iMAR corrected CT numbers for patients 1, 2, 3 and 6
numbers in ROI 1 are within the range of ±94 HU within the range of ±50 HU. In Patients 3 and 5, iMAR
7
Biomed. Phys. Eng. Express 1 (2015) 025206 E Bär et al
Figure 8. CT numbers for the four ROIs determined on WFBP and iMAR images in comparison to CT numbers determined in a
reference ROI, as described in section 2.2.2.
Table 4. Estimated reference CT numbers (mean) in In regions of bright streaks on the WFBP image,
HU for the patient ROIs, measured in slices which
were not affected by artifacts. the dose calculated on iMAR is higher. Accordingly, in
regions of dark streaks, the dose calculated on the
ROI 1 ROI 2 ROI 3 ROI 4 iMAR image is lower. This is demonstrated in the head
Pat 1 81 111 81 111 and neck patient of figure 10. The target volume (red
Pat 2 38 66 38 66 contour) in this case is directly located within an area
Pat 3 59 44 59 44 of very bright streak artifacts. This leads to a dose dif-
Pat 4 61 87 61 87 ference of up to −5% in the target and +5% in the sur-
Pat 5 28 108 28 108 rounding tissue when calculated on the WFBP image.
Pat 6 107 89 107 89
The dose difference in the prostate case is displayed in
Pat 7 27 72 27 72
figure 10. Maximal dose differences of −5% (seminal
vesicles, within bright region on WFBP) and +5%
(bladder and rectal wall, within dark region on WFBP)
underestimated CT numbers in ROI 2 (−340 HU for
were determined.
both patients). CT number for patient 7 is over-
The calculated gamma values for all seven head
estimated by iMAR (+172 HU).
and neck patients and the prostate patient are shown
ROI 3 and 4 were placed within the hyperdense
in table 5.
streaks. CT numbers determined for ROI 3 on
WFBP are within the range of 246–1697 HU.
With iMAR, ROI 3 in patients 4 and 6 is described very 4. Discussion
good (+35 HU and −41 HU, respectively). For all other
patients, iMAR corrected CT numbers within the 4.1. Phantom study
range of +130 HU. For ROI 4, iMAR correction was Intensities of metal artifacts were simulated to test the
very good for all patients except for patient 7, where the performance of the iMAR algorithm regarding geo-
CT number is still overestimated (+210 HU). metric and image quality reestablishment. The metal
inserts were distinguished in low-Z and high-Z (Z:
3.4. Patient study: dosimetric evaluation atomic number) implant materials. Titanium is a very
Dose difference column charts for all patients are commonly used material in trauma surgery (e.g.
displayed in figure 9. Dose differences within the screws for extremities and spinal cord). Steel is a very
patients vary from case to case. Maximal dose differ- common basic material for all kinds of endoprostheses
ences of up to ±20% can occur in single voxels. The (e.g. hip-, knee-, shoulder joints). Tungsten in turn
dose difference range in all patients in this study is has nearly the same physical density as gold, which is
± 5%. preferred for dental prostheses.
8
Biomed. Phys. Eng. Express 1 (2015) 025206 E Bär et al
Figure 9. Column charts of the voxelwise dose difference between doses calculated on iMAR and WFBP for the whole patient cohort.
Figure 10. Top panel: WFBP and iMAR reconstructed images of one head and neck patient, as well as the dose difference map between
WFBP and iMAR. The target volume is contoured in red. Bottom panel: WFBP and iMAR reconstructed images of the prostate
patient, (C = 40 HU, W = 350 HU).
Table 5. Percentage of evaluated points with γ < 1 within the 50%, Without metal implants present in the images, the
80% and 90% isodose volumes.
iMAR reconstruction algorithm yields the exact same
Patient Case Gamma 50% Gamma 80% Gamma 90% results than a standard WFBP reconstruction. No
negative or undesired effects occur. This was tested in
Head and neck 1 92.1 94.8 94.3
reconstructions of phantom and patient data sets
Head and neck 2 98.5 100.0 100.0
Head and neck 3 96.1 99.7 100.0
without metal inserts. However, the behavior of the
Head and neck 4 96.9 99.9 100.0 algorithm in the presence of metal and sharp tissue
Head and neck 5 99 99.8 100.0 boundaries such as soft tissue and air needs to be
Head and neck 6 96.9 100 99.9 investigated more closely.
Head and neck 7 95.1 95 94.2 In this pre-commercial release of the iMAR soft-
Hip 97.9 95.1 94.5 ware, the reconstruction parameters were set to
9
Biomed. Phys. Eng. Express 1 (2015) 025206 E Bär et al
Table 6. CT number difference that causes a 1% dose difference for a selected set of tissues.
correct strong metal artifacts arising from high-Z Axente et al in their paper used the Acuros XB
metals such as alloy prostheses or severe artifacts from (Varian Medical Systems Inc, Palo Alto, CA) determi-
dental fillings. This setup of reconstruction para- nistic radiation transport method for dose calculation.
meters detects only strong artifacts and was therefor In our comparison, we used a CCC method. Han et al
not able to recognize aluminum as an artifact introdu- [23] have performed an extensive comparison of these
cing metal. It follows that the software did not correct two dose calculation algorithms. They concluded that
the artifacts from aluminum, hence the original WFBP the algorithms are comparable to Monte Carlo dose
reconstructed data equals the corrected data. But high- simulations and only show differences in bone, lung
Z materials cause a nearly complete absorption of the and interface regions, where Acuros XB is slightly
x-rays leading to missing information in the projec- superior to CCC. The here presented results of the
tion data, showing up as severe bright and dark streaks dosimetric analysis will thus be different from the
in WFBP reconstructed image (see figure 2, second results presented in the paper from Axente et al. It is
row). The higher the metal density the stronger are the subject to discussion whether these differences would
artifacts. be clinically relevant. As to compare the influence of
The iMAR algorithm replaces this missing infor- the dose calculation algorithm, further analysis needs
mation iteratively [17]. Hence the resulting image is to be performed applying the two different algorithms
almost artifact free (see figure 2, third row). All tissue to the same phantom setup and subsequent patient
equivalent inserts in the phantom can now be delim- cases.
ited in their geometrical borders as in the GT images. The results found in this paper confirm the results
Differences of the CT numbers in the iMAR data com- of Axenteʼs phantom study and provide com-
pared to the GT is given in figure 4. The highest devia- plementary information on how the algorithm per-
tions of CT numbers after iMAR reconstruction are forms in close proximity to the metal insert. We also
observed for the cortical bone tissue substitute investigated the performance of the algorithm in pre-
with 41 HU (tungsten) and for the lung tissue sub- sence of different metals. We found that in all cases the
stitute with 44 HU (steel). Thus we report higher abso- CT numbers reconstructed by iMAR result in a dose
lute CT number differences between iMAR and GT difference less than 1% compared to GT.
than Axente et al [20], who claimed a CT number
accuracy of ±25 HU in the presence of multiple steel 4.2. Patient study
inserts. Patient images from WFBP and iMAR reconstruction
Our tolerance level for interpretation was set to were compared. In all cases, the overall image quality
that CT number difference which would cause a 1% was improved with iMAR. Areas of bright or dark
error in dose calculation when converting HU in elec- streaks were filled with soft tissue. One of the head and
tron densities. These tolerance levels are listed in neck cases was chosen as example (figure 10(a)). In
table 6 for selected tissue equivalent subsitutes. this case, artifacts arise from dental fillings and affect
Finally it can be summarized that the CT numbers 17 slices on the WFBP image. These artifacts do not
of all tissue equivalent substitutes in the iMAR data allow for precise differentiation of the buccal tissue,
meet the required 1% tolerance level and hence it was master muscle, mandible and infiltrative metastasis.
shown that iMAR can be useful to reconstruct metal With iMAR, the visibility of the metastasis and
artifact affected CT images. surrounding tissues is improved. Residual artifacts are
Furthermore, this is confirmed by the evaluation visible on the depicted slice, especially between the
of the dose distributions: the differences in dose dis- dental fillings itself and between fillings and tissues of
tribution between GT and iMAR are negligible, high density such as bone. These results are confirmed
whereas up to ±5% had been revealed between GT by the remaining head and neck cases.
and WFBP. Figure 10(b) illustrates the image quality of the
The dose accuracy evaluation in the phantom pelvis case (bilateral hip prostheses). On the usual
study revealed dose distribution differences between WFBP reconstructed image the area between the pros-
GT and WFBP of up to −5% in the region of the dark theses is hardly visible. Important anatomical struc-
streak. The dose difference between GT and iMAR is tures are not discernible, such as the borders of the
very small and only show small residual dose errors in prostate and seminal glands as well as the caudal
the field periphery. It can be followed from these bladder wall and ventral rectal wall. Based on this lack
results that the dose calculated on the iMAR image is of anatomical information in the WFBP image, RT
closer to the true dose than the dose calculated on with highly conformal techniques like IMRT is ques-
WFBP image. tionable. On the iMAR reconstructed image in
10
Biomed. Phys. Eng. Express 1 (2015) 025206 E Bär et al
11
Biomed. Phys. Eng. Express 1 (2015) 025206 E Bär et al
[15] Kuchenbecker S, Faby S, Sawall S, Lell M and Kachelrieß M iterative metal artifact reduction algorithm for CT simulation
2015 Dual energy CT: how well can pseudo-monochromatic in radiotherapy Med. Phys. 42 1170–83
imaging reduce metal artifacts? Med. Phys. 42 1023–36 [21] Low D A, Harms W B, Mutic S and Purdy J A 1998 A technique
[16] Schwahofer A, Bär E, Kuchenbecker S, Großmann G, for the quantitative evaluation of dose distributions Med. Phys.
Kachelrieß M and Sterzing F 2015 The application of metal 25 656–61
artifact reduction (MAR) in CT scans for radiation oncology [22] Depuydt T, van Esch A and Huyskens D P 2002 A quantitative
by monoenergetic extrapolation with a DECT scanner Z. Med. evaluation of IMRT dose distributions: refinement and clinical
Phys. at press assessment of the gamma evaluation Radiother. Oncol. 62
[17] Meyer E, Raupach R, Lell M, Schmidt B and Kachelrieß M 309–19
2010 Normalized metal artifact reduction (NMAR) in [23] Han T, Mikell J K, Salehpour M and Mourtada F 2011
computed tomography Med. Phys. 37 5482–93 Dosimetric comparison of Acuros XB deterministic radiation
[18] Lell M M, Meyer E, Kuefner M A, May M S, Raupach R, transport method with Monte Carlo and model-based
Uder M and Kachelrieß M 2012 Normalized metal artifact convolution methods in heterogeneous media Med. Phys. 38
reduction in head and neck computed tomography Invest. 2651–64
Radiol. 47 415–21 [24] Dolezel M, Odrazka K, Zouhar M, Vaculikova M, Sefrova J,
[19] Meyer E, Raupach R, Lell M, Schmidt B and Kachelrieß M Jansa J, Paluska P, Kohlova T, Vanasek J and Kovarik J 2015
2012 Frequency split metal artifact reduction (FSMAR) in Comparing morbidity and cancer control after 3D-
computed tomography Med. Phys. 39 1904–16 conformal (70/74 Gy) and intensity modulated
[20] Axente M, Paidi A, von Eyben R, Zeng C, Bani-Hashemi A, radiotherapy (78/82 Gy) for prostate cancer Strahlenther.
Krauss A and Hristov D 2015 Clinical evaluation of the Onkol. 191 338–46
12