Professional Documents
Culture Documents
Application of Papoulis-Gerchberg Method in Image
Application of Papoulis-Gerchberg Method in Image
net/publication/220459504
CITATIONS READS
59 1,299
4 authors, including:
Guna Seetharaman
Air Force Research Laboratory
238 PUBLICATIONS 5,791 CITATIONS
SEE PROFILE
All content following this page was uploaded by Guna Seetharaman on 15 May 2014.
In this paper, we study the Papoulis–Gerchberg (PG) method and its applications to domains of
image restoration such as super-resolution (SR) and inpainting. We show that the method performs
well under certain conditions. We then suggest improvements to the method to achieve better SR
and inpainting results. The modification applied to the SR process also allows us to apply the
method to a larger class of images by doing away with some of the restrictions inherent in the clas-
sical PG method. We also present results to demonstrate the performance of the proposed
techniques.
ð
þ1
The SR process includes three main tasks: an alias-free upsam- SR problem by assuming that the correlated neighbors
pling of the image, thereby increasing the maximum spatial remain similar across scales, and this a priori information is
frequency and removing degradations that arise during the learnt locally from the available image samples across
image capture, viz., blur and noise. In effect, the SR process scales. When a new image is presented, a kernel that best
tries to generate the missing high-frequency components and reconstructs each local region is selected automatically and
minimize aliasing, blur and noise. This has been a field of the super-resolved image is reconstructed by a simple convo-
intense research, and many different methods have been pro- lution operation. Jiji et al. [18] propose a single frame,
posed to obtain high-resolution (HR) images. The methods learning-based SR restoration technique using the wavelet
differ widely in their choice of domain (spatial, spectral or domain to define a constraint on the solution. Wavelet coeffi-
even both), in the choice of the observation model used, in cients at finer scales of the unknown HR image are learnt from
the method of capturing LR images, in the choice of number a set of HR images that are used as a training database. This
of LR images required and also in the choice of cues that learnt image is further used for regularization while super-
are used to extract information from images. Resolution resolving the image. An appropriate smoothness prior is
enhancement using a multiple-aperture camera system has used with discontinuity preservation in addition to the
been carried out by Komatsu et al. [8]. In this, the authors wavelet-based constraint to estimate the HR image. This
use different aperture systems to acquire multiple LR method was further improved by the use of contourlets in
images. An iterative algorithm is used for alternate registration [19] to capture edges better and hence results in better recon-
and reconstruction. Capel and Zisserman [9] use different struction of images. Surveys of various SR techniques have
methods of image registration and then go on to estimate the been presented in [20– 24].
HR image using maximum-likelihood estimation, maximum
a priori (MAP) estimation with priors like Gaussian Markov
3.2. SR using PG method
random field (MRF) and Huber MRFs. Rajan et al. [10, 11]
use blur as a cue for SR. They model the HR image and the The PG algorithm has been used in a modified form by Vander-
blur processes as separate independent MRFs. The HR space walle et al. [25] to super-resolve images when multiple LR
variant blur and SR image are estimated using a MAP estima- registered images are available. We restrict ourselves to the
tor. They also show how MRF models are well suited for mod- case when only a single LR image is available. The initial
eling the structural information in the form of surface normal. image is thus a higher-dimensional grid where the values
The shape from shading problem has also been used for restor- of some pixels are known and some are unknown. The
ation of an HR image in [10]. Joshi et al. [12, 13] make use of unknown pixel values are initially set to zero. In the next
zoom as a cue to perform SR. They consider linear dependency step, the image is passed through a low-pass filter with a nor-
of pixels in a neighborhood of the HR image and model this malized cut-off frequency of s, which is assumed to be the
using a simultaneous auto-regressive (SAR) model. This maximal allowable frequency. This effectively results in blur-
SAR model is then used as a prior to perform SR. Takeda ring of the image and the unknown pixels obtain some values.
et al. [14] extend the theory of kernel regression to derive a However, the known pixel values have also undergone changes
generalized kernel regression method which successfully per- as a result of the filtering. In the next step, the original values
forms SR from single and multiple LR frames as well as image of the known pixels are restored, creating high-frequency
denoising. Under this framework, the bilateral filter becomes a components. The whole process is repeated. Fig. 2 shows the
special case of their kernel regression method. reconstructed image after one such iteration of the method.
Researchers have also attempted to solve the SR problem Fig. 3 compares the output of the method after 50 iterations
using learning-based techniques. These methods try to recog- with that of the standard bicubic interpolation.
nize local features of an LR image and then try to retrieve the Although the method is quite fast, it has some drawbacks.
most likely high-frequency information from the given train- Owing to the steep cut-off in the frequency domain, the resul-
ing samples. They try to re-create the HR image from just a tant HR image has ringing artifacts near the edges. Also, it
single SR image by making use of a image database that is relies heavily on the fact that the measured (known) pixel
used as a training set. Freeman et al. [15] describe image values are the values that are to be obtained in the recon-
interpolation algorithms, which use a database of training structed HR image. In other words, it assumes that the LR
images to create plausible high-frequency details in zoomed images are downsampled versions of the expected HR image
images. Baker and Kanade [16] develop an SR algorithm by without any low-pass (averaging) filtering. Moreover, they
modifying the prior term in the cost function to include the should also be totally free from noise perturbations. Hence,
results of a set of recognition decisions, and call it as it is not able to compensate effectively for blur and noisy
recognition-based SR or ‘hallucination’. Their prior enforces measurement of data. Fig. 3 demonstrates these drawbacks. In
the condition that the gradient of the HR image should Fig. 3a, we have a fairly good-quality LR image, and Fig. 3b
be equal to the gradient of the best matching training image. and c show corresponding results of interpolation using
Candocia and Principe [17] address the ill-posedness of the bicubic and PG methods, respectively. One may notice that
FIGURE 2. Reconstruction with PG method. (a) Starting of iteration 1 with the HR grid partly filled with the given LR data. (b) Image after
low-pass filtering with a normalized cut-off frequency of 0.6. (c) Image after known pixel values are reset to original values. Spectra in log
scale (d) of (a), (e) of (b) and (f) of (c). Notice that the spurious peaks at the higher frequency regions in (d) reduce in (e) due to the spatial filling-in
process. This increases again in (f) and the process continues.
the PG method offers a better result. This can be seen as block of the HR image. The classical PG method will not
sharper features near the nose, eyes and freckles. In Fig. 3d, deliver a good result in such a case as shown in Fig. 3. This
we show a poor-quality LR image. The input image is quite is because we enforce the available pixel values in the LR
blurred. Hence the corresponding HR reconstructions in image to be pixel values in the HR image. This results in an
Fig. 3e and f are poor. The PG method offers no help as it overall blurred HR image. We overcome this drawback by
assumes correct pixel values on the LR grid. Similarly, introducing a different constraint enforcing part. We want to
Fig. 3g shows a noisy LR input image and the corresponding enforce that the super-resolved image obtained after every iter-
HR reconstructions are shown in Fig. 3h and i. For the same ation of the method conforms to the input LR image. To do this,
reason, the PG method does not offer a quality reconstruction. we introduce a back projection part within the PG method.
Iterative back projection has been used in image SR by Irani
and Peleg [26]. However, as opposed to their case, we deal
3.3. Modified PG method with the case where only a single LR image is available.
In our method, after every iteration of the PG method we calcu-
As mentioned in Section 3.2, the PG method cannot deal with late the error between the LR image and the simulated LR image
images that are blurred due to averaging during the downsam- formed by applying the known decimation model to the obtained
pling process. We now propose an extension which allows the SR image. Mathematically, the error e i can be expressed as
method to be applied to SR of images for which the downsam-
pling process is known. In our experiment, we assume that the
ei ¼ y DHzi ; ð5Þ
LR image is formed by averaging and subsampling the HR
image that we aim to recover. This means that for 2
zooming of an image we assume that every pixel in the LR where y is the input LR image lexicographically ordered as a
input image is the average of a corresponding 2 2 pixel vector of size M 2 1, D is the M 2 N 2 decimation matrix
4.3.2. Directional filtering averaging in a 22 window. This defines the forward model
We observe that scratches in images are often highly direc- of the image formation process and defines the parameters
tional. Hence, we investigate the possibility of using a direc- for calculation of the error at every iteration given by equation 5.
tional filtering approach to solve the inpainting problem for We found that the method performs better than the standard
scratches which have some sort of a directional nature such PG method. The results obtained are shown in Fig. 4.
as the image shown in Fig. 7a. A scratch in one direction intro- The images obtained using bicubic and Delaunay interp-
duces higher frequency components in a direction perpendicu- olation shown in Fig. 4b and c are quite blurred with respect
lar to the direction of the scratch. This effect can be seen when to those obtained using the standard PG method shown in
we compare the spectrum of the unscratched original image Fig. 4e. We show that the result is further improved using
with that of the corrupt image. We modified the filtering the modified PG method with the deblurring step. This can
process to take advantage of this directional nature of the be seen in Fig. 4f, which is sharper than the other images
image corruption. Rather than taking a uniform filtering shown even though the Steering Kernel method provides a
mask (which was circular in nature for all the above better interpolation in the peak signal to noise ratio (PSNR)
methods), we use a directional filter with two different sense. Note the increase in textural detail of the hat, the sharp-
cut-offs—one for the direction where higher frequency com- ness of the lips and nose in the figure with respect to those
ponents have been introduced due to the scratch, and obtained using other methods even though the PSNR value
another for all other directions. We set a smaller bandwidth is less. Results with 3 magnification are shown in Fig. 5.
for the filter in the direction perpendicular to that of the
scratches. This has a greater smoothing effect across the
edges of the scratch and allows faster flow of information
across the edges into the region to be inpainted.
5. RESULTS
FIGURE 4. SR results—2 magnification. (a) LR Lena image
5.1. Super-resolution formed by averaging and downsampling of an HR image. (b) 2
zoomed image using bicubic interpolation. (c) Interpolation using
We applied the algorithm discussed in Section 3.3 on a Lena Delaunay triangulation. (d) Upscaled image using Steering Kernel
image of size 128128 to perform 2 zooming. For this Regression [14]. (e) Super-resolved image formed using the standard
we consider the LR image to have been formed by decimation PG method. (f) Super-resolved image formed using the PG method
of the desired 256256 HR image blurred by simple with deblurring.
FIGURE 5. SR results—3 magnification. (a) LR image formed by image using the standard PG method with a normalized
33 pixel averaging and downsampling of an HR image. (b) 3 low-pass filter cut-off of 0.2 is shown in Fig. 7b. Note the
zoomed image using bicubic interpolation. (c) Interpolated image ringing effect in the inpainted regions. Next we tried to
using Steering Kernel Regression [14]. (d) Super-resolved better this method using a smooth filter by doing away with
image formed using the standard PG method. (e) Super-resolved
the ideal low-pass filter and replacing it with a gradual
image formed using the PG method with deblurring.
cut-off. The result obtained shows lesser ringing as pointed
out in Fig. 7c. This results in a 0.19 dB gain over the standard
The bicubic image in Fig. 5b is much blurred compared to the
method, as can be seen in Table 1. Fig. 7(d) illustrates the
output of the standard PG method result shown in Fig. 5c. The
result using DCT instead of the DFT, in order to take advan-
result obtained using the deblurring step, shown in Fig. 5d, can
tage of the greater information packing density of the DCT.
be seen to be sharper than the other images, specifically at the
We see that this produces nearly similar results, in terms
scales and leaves of the pineapple.
of quality and PSNR, as compared to using DFT. The
result obtained using a smooth cut-off on DCT of the image
5.2. Inpainting in the filtering process is shown in Fig. 7e. It shows a
In our experiment, we first took the Lena image and scratched 0.29 dB increase in PSNR from that obtained using a simple
it arbitrarily with a variable width of 3 – 5 pixels. The differ- DCT-based filtering. We then show the result obtained using
ence between this corrupted image and the original image pro- the directional filtering method to take advantage of the direc-
vided us an accurate mask of the scratch. This mask is thus the tionality of the scratches. In this experiment, the low-pass
region which is to be inpainted. Then taking all pixels outside filtering normalized cut-off frequency was kept at 0.1 in the
this mask as known pixel values we perform the standard PG direction perpendicular to that of the scratch and 0.3 elsewhere.
method to obtain an inpainted image. Similar experiment was The result obtained is shown in Fig. 7f. Notice that the ringing
carried out with overlaid text of variable font size. The results along the smoother areas have reduced. We see that in a PSNR
of the method are shown in Fig. 6. Note that while the thinner sense all these modifications result in a better inpainted image
areas are inpainted without any visual artifacts, this is not the than that obtained using the standard PG method.
case for the thicker regions.
We then performed inpainting with the proposed modifi-
cations. This time we used thicker edges of width 5 – 7
6. CONCLUSION
pixels. The scratches used this time were loosely directional
in nature as shown in Fig. 7a. We obtain the scratch mask From the discussions and the results presented above, it is
similar to the method explained above. The resultant inpainted clear that the PG method performs well for SR when one
ACKNOWLEDGMENTS
Funding supports from AOARD, USA and ISRO, India,
are gratefully acknowledged. The authors would also like to
thank the reviewers whose suggestions and constructive criti-
cism have brought this paper to its current state. We would
also like to extend our gratitude to Hiroyuki Takeda for pro-
viding us results of his research to compare with our results.
FIGURE 7. Inpainting results with modified PG method. (a)
Example of a scratched image requiring inpainting. Inpainting
using (b) The PG method using a normalized cut-off frequency of
0.2 for the low-pass filter. (c) Using a smooth filter with a transition
band of 30 pixels. (d) The DCT-based PG method. (e) DCT-based PG
REFERENCES
with smooth cut-off. (f) Directional filtering approach. [1] Sabri, M.S. and Steenart, W. (1978) An approach to band-
limited signal extrapolation: the extrapolation matrix. IEEE
Trans. Circuits Syst., CAS-25, 74– 79.
TABLE 1. PSNR values of results from different inpainting
[2] Slepian, D. (1978) Prolate spheroidal wave functions, Fourier
methods as shown in Fig. 7.
analysis, and uncertainty. V: the discrete case. Bell Syst. Tech. J.,
Method PSNR 57, 1371–1429.
[3] Harris, J. L. (1964) Diffraction and resolving power. J. Opt. Soc.
Scratched image 20.16 Am., 54, 931–936.
PG method 37.67 [4] Papoulis, A. (1975) A new algorithm in spectral analysis and
PG with smooth cut-off 37.86 band-limited extrapolation. IEEE Trans. Circuits Syst.,
PG with directional filtering 37.72 CAS-22, 735– 742.
PG using DCT 37.65 [5] Gerchberg, R. (1974) Super-resolution through error energy
PG using DCT with smooth cut-off 37.94 reduction. Opt. Acta, 21, 709– 720.
[6] Jain, A.K. (2001) Fundamentals of Digital Image Processing.
Prentice-Hall, New Delhi.
does not have multiple images or any HR training data set. It [7] Walsh, D.O. and Nielsen-Delaney, P.A. (1994) Direct method
also does very well in image inpainting. However, it has some for superresolution. J. Opt. Soc. Am., 11, 572– 579.
severe limitations as illustrated by Fig. 3 as it cannot handle [8] Komatsu, T., Igarashi, T., Aizawa, K. and Saito, T. (1993) Very
noisy data. However, this is quite natural as one does not high resolution imaging scheme with multiple different-aperture
have multiple images for the purpose of noise smoothing. cameras. Signal Process., Image Commun., 5, 511–526.
An early termination of the iterative process as suggested in [9] Capel, D. and Zisserman, A. (2003) Computer vision applied to
[4] is one way to stop the propagation of error with future super resolution. IEEE Signal Process. Mag., 20, 75– 86.
[10] Rajan, D., Chaudhuri, S. and Joshi, M.V. (2003) Multi-objective [26] Irani, M. and Peleg, S. (1991) Improving resolution by image
super resolution: concepts and examples. IEEE Signal Process. registration. CVGIP: Graph. Models Image Process., 53,
Mag., 20, 49–61. 231– 239.
[11] Rajan, D. and Chaudhuri, S. (2003) Simultaneous estimation of [27] Bertalmio, M., Sapiro, G., Caselles, V. and Ballester, C. (2000)
super-resolved scene and depth map from low resolution Image inpainting. In Akeley, K. (ed) SIGGRAPH 2000,
defocused observations. IEEE Trans. Pattern Anal. Mach. Computer Graphics Proc., New Orleans, USA, July, pp. 417–
Intell., 25, 1102–1117. 424. ACM Press/ACM SIGGRAPH/Addison Wesley Longman.
[12] Joshi, M.V. and Chaudhuri, S. (2004) Zoom-based super- [28] Masnou, S. (2002) Disocclusion: a variational approach using
resolution through SAR model fitting. Proc. IEEE Int. Conf. level lines. IEEE Trans. Image Process., 11, 68–76.
Image Processing, Singapore, August, pp. 1775–1778. [29] Masnou, S. and Morel, J. (1998) Level lines based disocclusion.
[13] Joshi, M.V., Chaudhuri, S. and Panuganti, R. (2004) Super- Fifth IEEE Int. Conf. Image Processing, Chicago, USA,
resolution imaging: use of zoom as a cue. Image Vis. October, pp. 259–263. IEEE Computer Society.
Comput., 22, 1185– 1196. [30] Ballester, C., Bertalmio, M., Caselles, V., Sapiro, G.
[14] Takeda, H., Farsiu, S. and Milanfar, P. (2007) Kernel regression and Verdera, J. (2001) Filling-in by joint interpolation of
for image processing and reconstruction. IEEE Trans. Image vector fields and gray levels. IEEE Trans. Image Process., 10,
Process., 16, 349 –366. 1200–1211.
[15] Freeman, W.T., Jones, T.R. and Pasztor, E.C. (2002) Example- [31] Chan, T. and Shen, J. (2001) Non-texture inpainting by
based super-resolution. IEEE Comput. Graph. Appl., 22, 56–65. curvature-driven diffusions. J. Vis. Commun. Image Represent,
[16] Baker, S. and Kanade, T. (2002) Limits of super- resolution and 12, 436–449.
how to break them. IEEE Trans. Pattern Anal. Mach. Intell., 24,
[32] Oliveira, M.M., Bowen, B., McKenna, R. and Chang, Y.-S.
1167–1183.
(2001) Fast digital image inpainting. Proc. IASTED Int. Conf.
[17] Candocia, F.M. and Principe, J.C. (1999) Super-resolution of
Visualization, Imaging and Image Processing (VIIP),
images based on local correlations. IEEE Trans. Neural
Marbella, Spain, September, pp. 261–266. ACTA Press.
Netw., 10, 372 –380.
[33] Grossauer, H. and Scherzer, O. (2003) Using the complex
[18] Jiji, C.V., Joshi, M.V. and Chaudhuri, S. (2004) Single-frame
Ginzburg–Landau equation for digital inpainting in 2D and
image super-resolution using learnt wavelet coefficients.
3D. Scale-Space, Isle of Skype, UK, June, Lecture Notes on
Int. J. Imag. Syst. Technol., 14, 105– 112.
Computer Science, Vol. 2695, pp. 225–236. Springer.
[19] Jiji, C.V. and Chaudhuri, S. (2006) Single-frame image super-
[34] Borzi, A., Grossauer, H. and Scherzer, O. (2005) Analysis of
resolution through contourlet learning. EURASIP J. Appl.
iterative methods for solving a Ginzburg-Landau equation.
Signal Process., 2006, 1–11.
Int. J. Comput. Vis., 64, 203– 219.
[20] Chaudhuri, S. (2001) Super-Resolution Imaging. Kluwer Academic
Publishers, Norwell, MA. [35] Hirani, A.H. and Totsuka, T. (1996) Dual domain interaction
image restoration: basic algorithm. Third IEEE Int. Conf.
[21] Park, S.C., Park, M.K. and Kang, M.K. (2003) Super-resolution
Image Processing, 797–800. IEEE Press.
image reconstruction: a technical overview. IEEE Signal
Process. Mag., 20, 21 –36. [36] Patwardhan, K.A. and Sapiro, G. (2003) Projection based
[22] Jiji, C.V., Chaudhuri, S. and Chatterjee, P. (2007) Single frame image and video inpainting using wavelets. IEEE Int. Conf.
image super-resolution: should we process locally or globally? Image Processing, Barcelona, Spain, 14 –18 September, pp.
Multidimens. Sys. Signal Process., 18, 123 –152. 857– 860.
[23] Farsiu, S., Robinson, D., Elad, M. and Milanfar, P. (2004) [37] Chan, R., Shen, L. and Shen, Z. (2005) A framelet-based
Advances and challenges in super-resolution. Int. J. Imag. approach to image inpainting. Technical report. Department of
Syst. Technol., 14, 47–57. Mathematics, Chinese University of Hong Kong, Hong Kong.
[24] Brown, L.G. (1992) A survey of image registration techniques. [38] Strohmer, T. (1995) On discrete band-limited signal extrapo-
ACM Comput. Surv., 24, 325 –376. lation. Math. Anal., Wavelets Signal Process., 190, 323– 337.
[25] Vandewalle, P., Susstrunk, S. and Vetterli, M. (2003) [39] Grochenig, K. and Strohmer, T. (2001) Numerical and
Superresolution images reconstructed from aliased images. theoretical aspects of non-uniform sampling of band- limited
In Ebrahimi, T. and Sikora, T. (eds) SPIE/IS&T Visual signals. In Marvasti, F. (ed) Nonuniform Sampling: Theory
Communication and Image Processing Conf., Lugano, and Practice, Chapter 6, pp. 283– 324. Kluwer Academic
Switzerland, July, pp. 1398–1405. Publishers, Norwell, MA.