You are on page 1of 5

358 IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, VOL. 9, NO.

3, MAY 2012

Remote-Sensing Image Denoising Using Partial


Differential Equations and Auxiliary Images as Priors
Peng Liu, Fang Huang, Guoqing Li, and Zhiwen Liu

Abstract—In this letter, a new method for denoising multispectral image to improve its spatial resolution. In fact, the
remote-sensing images based on partial differential equations auxiliary noise-free image is a more suitable aid to denoising.
(PDEs) is proposed. The method employs the similarity between In [15] and [16], an auxiliary image as a prior is used to assist
the different band images in a multicomponent image. Initially, in denoising or deblurring the image, but this is not suitable for
one of the noise-free images in multicomponent remote-sensing
images as a prior is introduced into the PDE denoising method.
a remote-sensing image, and the auxiliary noisy image must
To make use of the priors of the noise-free image in denoising, we come from the same sensor. There has been some research
construct a new smoothing term for the PDE so as to compute on denoising based on such an auxiliary image in the remote-
the total variation. The new smoothing term refers to a specific sensing field [3], [8]. In hyperspectral images, the infrared part
smoothing direction and a specific smoothing intensity of the of the spectrum contains noise near the water-vapor absorption
reference image when denoising the noisy image. The proposed band. To denoise these bands, image bands from other parts
smoothing term is added as a new constraint into the PDE of the spectrum can be applied as noise-free images. Recently,
denoising method. Based on the proposed method, the similarity a multispectral and hyperspectral image denoising algorithm
of the directions of the edges between the noisy image and
the reference image enables the new algorithm to smooth out has been proposed [8] and has achieved good results, where
more noise and conserve more detail in the denoising process. within the Bayesian framework, the extra initial information is
We also present the discrete form of the proposed denoising included in the form of a noise-free single-band image.
model. Multispectral remote-sensing images and hyperspectral The goal of our approach is denoising. In this letter, follow-
remote-sensing images are experimented in this letter. A better ing the ideas in [3] and [8], the correlation between the different
performance is achieved by the proposed method when compared bands of multicomponent images is used. The auxiliary image
with other methods. as the prior is introduced into the TV or partial differential equa-
Index Terms—Image denoising, partial differential equation tion (PDE) denoising method. Moreover, the auxiliary image is
(PDE), total variation (TV). applied in the form of a “noise-free” single-component image
(no image is completely noise-free and by “noise-free” we
I. I NTRODUCTION mean “with a high SNR”). To illustrate the proposed method,
we experiment on the multispectral and hyperspectral remote-

I N RECENT years, several classes of denoising algorithms


such as total variation (TV) [1]–[3], wavelets [4]–[8], and
nonlocal means [9], [10] have all achieved much success. These
sensing images.

algorithms are based on different theories, and all show good II. I MAGE D ENOISING BASED ON PDEs
performance in denoising. When denoising an image, the TV When an image is corrupted by noise, the following is used:
method makes use of the geometric features of the image,
the wavelet method makes use of the statistical features of Io = I + n. (1)
the coefficients, and the nonlocal means method makes use
of the redundancy in the image texture features. However, the In (1), Io is the observed image, I is the original image,
features that have been used by these methods all come from and n is the additive noise in the observed image. Usually, n
the noisy image itself. In fact, the image features acquired by is assumed to follow a Gaussian distribution with a zero mean
other sensors from the same scene can also be used as priors in and a variance of σ 2 . The TV denoising model, as described in
denoising. [1] and [2], is denoted by
 
In many situations involving multicomponent remote- λ
sensing images, a single-component image with a higher SNR TV(I) = |∇I|dxdy + (I − Io )2 dxdy. (2)
2
or higher spatial resolution is often available. In the past, such S S
an auxiliary image was applied for fusion [11]–[14] with a 
Here, S is the support area of the image, |∇I|dxdy is the
S
regularization term, and λ is the regularization parameter. ∇I
Manuscript received April 12, 2011; revised June 15, 2011; accepted
July 18, 2011. Date of publication October 26, 2011; date of current version denotes the gradient of I, and |∇I| is the modulus of ∇I.
March 7, 2012. This work was supported by CEODE director foundation and Minimizing the object function of (2) with respect to I, as in
NSF (41001265). [1], [2], we obtain (3) for I the following:
P. Liu, G, Li, and Z. Liu are with the Spatial Data Center, Center for Earth  
Observation and Digital Earth, Chinese Academy of Sciences, Beijing 100094, ∂TV(I) ∇I
China (e-mail: pliu@ne.rsgs.ac.cn; liupeng.liu@gmail.com). = div + λ(I − Io ) = 0. (3)
F. Huang is with the Institute of Geo-Spatial Information Technology, ∂I |∇I|
College of Automation, University of Electronic Science and Technology of
China, Chengdu 611731, China. In (3), div(·) is the divergence operator. To solve (3), the time
Digital Object Identifier 10.1109/LGRS.2011.2168598 marching method described in [1] can be employed, and for the

1545-598X/$26.00 © 2011 IEEE


LIU et al.: REMOTE-SENSING IMAGE DENOISING USING PDES AND AUXILIARY IMAGES AS PRIORS 359

Fig. 1. Introducing the prior of the auxiliary image into the smoothing term

nth iteration, we obtain bands that can be used as priors in the denoising process. In
    this letter, the auxiliary image from another sensor is denoted as
∇I n
I n+1 n
= I + dt div + λ(I − Io ) .
n
(4) the reference image u. Image u is both similar to and different
|∇I n | from the noisy image Io , i.e., the image intensity distributions
Thus, the problem of TV denoising is converted to com- are different, but the edge directions and texture information are
puting the PDE given by (4). In fact, we can consider the similar. If we smooth the noise-free image u using (4) and (5),
corresponding form of div(∇I n /|∇I n |) from the perspective there is a similar smoothing term as given by
of differential geometry [17] as follows:  
  ∇u 1 T
∇I n 1 div = ξ Hu ξ u . (7)
div = ξ Tn HI n ξI n . (5) |∇u| |∇u| u
|∇I |
n |∇I n | I
Equation (5) has been proven in [17]. In (5), ξI n is the unit Similar to (5), (7) has anisotropic and uneven properties.
vector perpendicular to the direction of the image gradient, and However, we do not need to smooth the noise-free u but use
ξITn is the transposition of ξI n [17]. HI n represents the Hessian the priors of the edges of u. To make use of the priors of
matrix [17] of the image I n . Substituting (5) into (4) gives u in denoising I n , we need to construct the new smoothing
  term referring to the specific smoothing direction and specific
1
I n+1 = I n + dt ξ T
n H I n ξI n + λ(I
n
− I o ) . (6) smoothing intensity of image u and (7). Since ξu determines the
|∇I n | I direction of the smoothing and |∇u| determines the strength of
With regard to the effects of the regularization term the smoothing, we can still refer to the local feature ξu and |∇u|
(1/|∇I n |)ξITn HI n ξI n , we can see that, on one hand, due to the of the image u, and there is a proposed hybrid smoothing term,
anisotropic nature of ξITn HI n ξI n , the smoothing is always in as shown in Fig. 1.
the direction of ξI n , which is tangential to the edge, whereas In Fig. 1, the new term (1/|∇u|)ξuT HI n ξu that contains both
on the other hand, due to the uneven property of (1/|∇I n |), the the information of the noise image I n and the reference image
smoothing is always weakened where the gradient of the image u is constructed. For the arbitrary iteration n, the edges and
is large, and this further preserves the edge of the image. In the gradients in both images I n and u should be taken into consid-
next section, another smoothing term that relates to an auxiliary eration in denoising; therefore, we add (1/|∇u|)ξuT HI n ξu into
image of the same scene is constructed and introduced into (6). (6) to give
The direction of the edges and the strength of the gradients  
of the auxiliary image will be used in the PDE-based remote- 1 1 T
I n+1=I n +dt ξ T
n H n
I Iξ n+ ξ H n
I uξ +λ(I n
−I o .
)
sensing image denoising. |∇I n | I |∇u| u
(8)
III. I NTRODUCING AN AUXILIARY I MAGE AS A Now, when we denoise an image using (8), the information
of the auxiliary image will be taken into consideration. In
P RIOR INTO PDE D ENOISING
denoising, the direction of the smoothing refers not only to ξI n
In many situations in the remote-sensing area, multicompo- but also to ξu , and the strength of the smoothing refers not only
nent images are often acquired. Although an image comprised to |∇I n | but also to |∇u|. Therefore, when the auxiliary image
of several bands is corrupted by noise, a single-component u is noise free or not as noisy as Io , |∇u| and ξu also contain
image with a higher SNR is often available. For multispectral less noise, and PDE-based denoising, as given by (8), should
and hyperspectral images, there are often noise-free image give better results. When referring to the form of the time
360 IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, VOL. 9, NO. 3, MAY 2012

Fig. 3. Denoising the simulated noisy image of AVIRIS. σ = 15 (a) Original


image. (b) Noisy image. (c) Reference image. (d) TV, 30.08 dB. (e) Priors–TV,
31.31 dB. (f) Priors–wavelet, 31.24 dB.

image with additive Gaussian noise with the standard deviation


σ, and we denoise the simulated noisy third band and use the
fourth band as the reference image for the prior in Fig. 2.
For the hyperspectral images, the AVIRIS images over
Cuprite, Nevada, were taken. The noisy image is simulated by
contaminating the original image with additive Gaussian noise
Fig. 2. Denoising the simulated noisy Image of CEBERS. σ = 20. with standard deviation σ, and we denoise the simulated noisy
(a) Original image (b) Noisy image. (c) Reference image. (d)TV, 25.92 dB.
(e) priors–TV, 27.05 dB. (f) The PSNR in the iteration. (g) Priors–wavelet, hyperspectral band and use the 12th band as the reference image
25.61 dB. for the prior in Fig. 3.
A state-of-the-art wavelet-based method [8] that has also
marching method [1], we use (9) and (10) to define the discrete
introduced a noise-free image as a prior was compared with
forms of (1/|∇I n |)ξITn HI n ξI n and (1/|∇u|)ξuT HI n ξu in (8)
 n 2 the proposed method, and this method was denoted as the
1 T
n
Ixx Iy − 2Ixy n n n n
Ix Iy + Iyy (Ixn )2 priors–wavelet. The TV denoising method [2] without image
ξ n HI ξ I =
n n
  32 (9) priors is denoted as TV in the experiments. The proposed PDE
|∇I n | I 2
Iyn + (Ixn )2 + β method with the priors of the noise-free image is denoted by
priors–TV in the experiments. When the noise is simulated,
1 T n
Ixx (uy )2 − 2Ixy
n n
ux uy + Iyy (ux )2
ξ u HI n ξ u = . (10) the performance of the different denoising techniques can be
|∇u| 3
((uy )2 + (ux )2 + β) 2 quantitatively described by the PSNR (in decibels) [8]. The
PSNR is defined as
In (9) and (10), β is a small constant [1] to prevent a zero

denominator. ux and uy are the derivatives of u. Ixn , Ixx n n
, Ixy , 2552
n PSNR(I, I) ˆ = 10 log10 (11)
Ixy , and Iyn are the derivatives of I n . Similar to [1], the upwind ˆ
MSE(I, I)
scheme is used in computing ux , uy , Ixn , Ixx n n
, Ixy n
, Ixy , and
n
Iy . In the next section, we will do some denoising experiments where MSE(I, I) ˆ is the MSE between the original noiseless
using (8) and compare its performance with other algorithms. image I and the denoised image I. ˆ
In Fig. 2, we compare the proposed priors–TV method with
IV. E XPERIMENTS AND R ESULTS the TV method [2] and the priors–wavelet method [8] by
To validate and compare the proposed method, we perform using the multispectral image. Fig. 2(a) is the original image,
the simulation experiments and real-data experiments on differ- Fig. 2(b) is the simulated noisy image, Fig. 2(c) is the reference
ent data sets. Multispectral and hyperspectral sensors acquire image, and σ is the standard deviation of the initial noise.
multicomponent images. These data sets contain both the noisy For the priors–TV method and the TV method, the number of
and noise-free images. A higher quality image can be obtained iterations will affect their performance; therefore, the PSNR in
from one of the sensors or from another part of the reflectance the iteration is given in Fig. 2(f), and the optimal location is
spectrum with a higher SNR. We will apply such an image as shown by the circle. Fig. 2(d) and (e) are the optimal results
the noise-free image in the proposed method. for TV and priors–TV in terms of the iterations. We always
use the optimal denoising result of TV and priors–TV in all
the experiments when comparing them with priors–wavelet.
A. Experiments on Simulated Noisy Images
Priors–wavelet is not an iteration method; therefore, we do not
The multispectral images come from the CEBERS satellite. need to select its optimal result. We can see that priors–TV in
There are five bands in the multispectral image from CEBERS. Fig. 2(e) is better than TV in Fig. 2(d). There is less noise left in
The noisy image is simulated by contaminating the original priors–TV than in TV. The similarity of the edges between the
LIU et al.: REMOTE-SENSING IMAGE DENOISING USING PDES AND AUXILIARY IMAGES AS PRIORS 361

TABLE I
C OMPARISON OF THE D IFFERENT M ETHODS W HEN THE S IMULATED N OISY I MAGES H AVE D IFFERENT L EVELS OF N OISE

Because the spectrum of the original image in Fig. 3(a) is near


394.9 nm, and the reference image Fig. 3(c) is the 12th band
whose spectrum is near 472.7 nm. They are close to each
other, and the distributions of the image intensity are similar.
However, the situation is different for the multispectral image
in Fig. 2. Therefore, in Fig. 3, steadier and stronger priors
are introduced into the denoising by the reference image. In
this situation, the performances of priors–TV in Fig. 3(e) and
priors–wavelet in Fig. 3(f) are very similar, and there are no
artificial or fake edges in Fig. 3(f). However, the results of
TV in Fig. 3(d) are oversmoothed (relative to priors–TV and
priors–wavelet).
To further validate the proposed priors–TV method, we
compared these methods with different levels of noise and for
different images in Table I. In Table I, Images 1 and 2 are
multispectral images, and Images 3 and 4 are the hyperspectral
images. σ is the standard deviation of the simulated noise. We
can see that when the variance of noise is small, priors–TV and
Fig. 4. Denoising the image of CEBERS. (a) Noisy second band of CEBERS
image. (b) Reference image from the fourth band of CEBERS image. (c) Result priors–wavelet are similar, although each has its own strong
of TV method. (d) Result of priors–TV method. (e) Result of priors–wavelet point. However, the performance of priors–TV is always better
method. (a) Noisy image. (b) Reference image. (c) TV. (d) priors–TV. than TV whether the noise is large or small. When the noise
(e) priors–wavelet.
is large, the performance of priors–TV is better than that for
priors–wavelet. For the multispectral images (i.e., Images 1
noisy image and the reference image helps priors–TV denoising and 2), the proposed priors–TV shows more advantages than
to conserve more details in the smoothing. In Fig. 2(g), there is priors–wavelet. For the hyperspectral images (i.e., Images 3 and
also less noise in priors–wavelet, but it is over smoothed and 4), Image 3 is near 394.9 nm, and the reference for Image 3 is
artificially overdone. We can see that for multispectral images, the 12th band whose spectrum is near 472.7 nm; they are close
although the edges between the noisy image and the reference to each other, and the distributions of the image intensity are
image are similar, their image intensity distribution is obviously similar. Image 4 is the 189th band of the hyperspectral images
different for the different spectral response functions of the whose spectrum is near 2238.4 nm, but the reference image for
different sensors. In priors–wavelet denoising, the noise-free Image 4 is the 175th band whose spectrum is near 2008.3 nm;
reference image also enhances the edges of resulting image in therefore, the difference in their image intensity distributions is
Fig. 2(g). However, because of the noticeable difference in the large. When the reference image is very similar to the original
distribution of the intensity between the different bands of the image such as Image 3 in Table I or Fig. 3, the performances
multispectral images, many fake edges are introduced into of priors–wavelet and priors–TV are close. However, when the
the resulting image in Fig. 2(g). spectrum of the reference image is far from the original image
In Fig. 3, we compare the proposed method with the TV such as for Image 4 in Table I, the performance of priors–TV is
method [2] and the priors–wavelet method [8] using the hyper- obviously better than that of priors–wavelet.
spectral image. Fig. 3(a) shows the original image, Fig. 3(b)
is the simulated noisy image, and Fig. 3(c) is the reference
B. Experiments on Real Noisy Images
image. σ is the standard deviation of the noise. We can see
that, for the hyperspectral image in Fig. 3, the reference Figs. 2 and 3, and Table I are all simulation experiments. To
image is more similar to the original image than in Fig. 2. further validate the proposed priors–TV method, we compared
362 IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, VOL. 9, NO. 3, MAY 2012

In general, the experiment results on the real noisy images


are similar to the simulation experiments.

V. C ONCLUSION
In this letter, the auxiliary noise-free image has been used
as a prior when we denoise one of the noisy images in the
multicomponent remote-sensing image. The edge information
of the reference image is fully considered, and a new smoothing
term reference to the edges is constructed in the proposed
method. Comprehensive experiments using different multispec-
tral and hyperspectral images with different levels of noise were
carried out. We have also compared the proposed method with
other state-of-the-art methods, and the better performance of
the proposed method is demonstrated. In particular, when the
variance of the noise in the multispectral image is large, the
advantage of the proposed method is more obvious.

ACKNOWLEDGMENT
The authors would like to thank the anonymous reviewers for
their constructive suggestions.
R EFERENCES
[1] L. Rudin, S. Osher, and E. Fatemi, “Nonlinear total variation based noise
removal algorithms,” Phys. D, vol. 60, no. 1–4, pp. 259–268, Nov. 1992.
[2] A. Chambolle, “An algorithm for total variation minimization and appli-
cations,” J. Math. Imag. Vis., vol. 20, no. 1/2, pp. 89–97, Jan. 2004.
[3] P. Liu, D. Liu, and Z. Liu, “Total variation restoration of the defocus image
based on spectral priors,” in Proc. SPIE, 2010, vol. 7830, p. 783 018.
[4] F. Abramovitch, T. Sapatinas, and B. W. Silverman, “Wavelet thresholding
via a Bayesian approach,” J. R. Statist. Soc. Ser. B, vol. 60, no. 4, pp. 725–
749, 1998.
[5] L. Sendur and I. W. Selesnick, “Bivariate shrinkage functions for wavelet-
based denoising exploiting interscale dependency,” IEEE Trans. Signal
Process., vol. 50, no. 11, pp. 2744–2756, Nov. 2002.
[6] J. Portilla, V. Strela, M. J. Wainwright, and E. P. Simoncelli, “Image
denoising using scale mixtures of gaussians in the wavelet domain,” IEEE
Trans. Image Process., vol. 12, no. 11, pp. 1338–1351, Nov. 2003.
Fig. 5. Denoising the image of AVIRIS. (a) Noisy image is the159th band [7] F. Luisier and T. Blu, “SURE-LET multichannel image denoising: Inter-
whose spectrum is near 1871.2 nm. (b) Reference 1 is the 155th band whose scale orthonormal wavelet thresholding,” IEEE Trans. Image Process.,
spectrum is near 1831.4 nm. (c) Reference 2 is the 130th band whose spectrum vol. 17, no. 4, pp. 482–492, Apr. 2008.
is near 1582.3 nm. (d) Result of TV. (e) Result of priors–TV by using [8] P. Scheunders and S. De Backer, “Wavelet denoising of multicomponent
Reference 1. (f) Result of priors–wavelet by using Reference 1. (g) Result images using gaussian scale mixture models and a noise-free image as
of priors–TV by using Reference 2. (h) Result of priors–wavelet by using priors, image processing,” IEEE Trans. Image Process., vol. 16, no. 7,
Reference 2. (a) Noisy image. (b) Reference 1 (c) Reference 2. (d) TV. pp. 1865–1872, Jul. 2007.
(e) Priors–TV by Reference 1. (f) Priors–wavelet by Reference 1. (g) Priors–TV [9] A. Buades, B. Coll, and J.-M. Morel, “A non-local algorithm for image
by Reference 2 (h) Priors–wavelet by Reference 2. denoising,” in Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2005,
pp. 60–65.
the three methods by using real noisy images in Figs. 4 and 5. [10] T. Brox, O. Kleinschmidt, and D. Cremers, “Efficient nonlocal means for
In Fig. 4, we denoise the multispectral image of CEBERS and denoising of textural patterns,” IEEE Trans. Image Process., vol. 17, no. 7,
pp. 1083–1092, Jul. 2008.
in Fig. 5, we denoise the hyperspectral image of AVIRIS. [11] A. Duijster, P. Scheunders, and S. De Backer, “Wavelet-based EM algo-
For multispectral images with relative large noise in Fig. 4, rithm for multispectral-image restoration,” IEEE Trans. Geosci. Remote
there is less noise left in priors–TV of Fig. 4(d) than in TV of Sens., vol. 47, no. 11, pp. 3892–3898, Nov. 2009.
Fig. 4(c). Furthermore, unlike priors–wavelet in Fig. 4(e), there [12] C. Thomas, T. Ranchin, L. Wald, and J. Chanussot, “Synthesis of mul-
tispectral images to high spatial resolution: A critical review of fusion
are no fake edges in priors–TV of Fig. 4(d). methods based on remote sensing physics,” IEEE Trans. Geosci. Remote
For hyperspectral images in Fig. 5, the proposed priors–TV Sens., vol. 46, no. 5, pp. 1301–1312, May 2008.
in Fig. 5(e) and (g) is also better than TV in Fig. 5(d). When [13] K. A. Kalpoma and J. I. Kudoh, “Image fusion processing for IKONOS
1-m color imagery,” IEEE Trans. Geosci. Remote Sens., vol. 45, no. 10,
Reference 1 in Fig. 5(b) is used (i.e., the spectrum of the pp. 3075–3086, Oct. 2007.
reference image is very similar to the noisy image), the per- [14] R. C. Hardie, M. T. Eismann, and G. L. Wilson, “Map estimation for
formance of priors–wavelet in Fig. 5(f) is similar to priors–TV hyper-spectral image resolution enhancement using an auxiliary sensor,”
in Fig. 5(e), or each has its own strong point; however, if IEEE Trans. Image Process., vol. 13, no. 9, pp. 1174–1184, Sep. 2004.
[15] L. Yuan, J. Sun, L. Quan, and H.-Y. Shum, “Image deblurring with
Reference 2 in Fig. 5(c) is used (i.e., the spectrum of the blurred/noisy image pairs,” ACM Trans. Graph., vol. 26, no. 3, pp. 1–10,
reference image is far from the noisy image), the performance Jul. 2007.
of priors–TV in Fig. 5(g) is better than that of priors–wavelet [16] P.-H. Huang, Y.-M. Lin, and S.-H. Lai, “Image deblurring with blur kernel
in Fig. 5(h). Priors–TV mainly makes use of the direction and estimation from a reference image patch,” in Proc. ICPR, 2008, pp. 1–4.
[17] D. Tschumperlé and R. Deriche, “Vector-valued image regularization with
strength of the edges of the reference image, but the intensity PDE’s: A common framework for different applications,” IEEE Trans.
distribution of the reference image has less effect on priors–TV. Pattern Anal. Mach. Intell., vol. 27, no. 4, pp. 506–517, Apr. 2005.

You might also like