• Embed Doc
  • Readcast
  • Collections
  • CommentGo Back
 
Principal Component Analysis with Pre-emphasis for Compression of Hyperspectral Imagery
Euisun Choi
Fundamental Tech. Res., Nautilus Hyosung183 Hogye-Dong, Dongan-GuAnyang-Si 431-080, KOREAeschoi@hyosung.com 
Hyunsoo Choi, and Chulhee Lee
Dept. Electrical & Electronic Eng., Yonsei University134 Shinchon-Dong, Seodaemoon-KuSeoul 120-749, KOREAchulhee@yonsei.ac.kr  
 Abstract 
 —In this paper, we propose to use the principalcomponent analysis for the compression of hyperspectral images.When hyperspectral images are compressed using conventionalimage compression algorithms, discriminant features of originaldata may be lost during compression process. In order topreserve such discriminant information, we first apply a linearfeature extraction method to the original data. Then, weemphasize discriminant features and use the principalcomponent analysis in order to compress the images whosediscriminant features are enhanced. Experiments show that theproposed method provides improved classification accuraciesthan existing compression algorithms.
 Keywords-Principal component analysis; pre-emphasis;compression; classification, hyperspectral imagery
I.
 
I
 NTRODUCTION
 The advancement of sensor technology produces very largeamounts of data with hundreds of spectral bands, which mayoften cause difficulties in transmitting and archiving suchhyperspectral data. As a result, there is an increasing need for efficient compression algorithms for hyperspectral images. Acharacteristic of hyperspectral imagery is interband correlation.This interband correlation can be exploited to compresshyperspectral images and a number of compression algorithmshave been proposed [1-4]. For instance, the principalcomponent analysis (PCA) can be used to spectrallydecorrelate hyperspectral data. When the principal componentanalysis is applied to hyperspectral images such as AVIRISdata that have 220 channels, it is found that most energy isconcentrated along a few eigenvectors. In general, whenhyperspectral images are compressed using conventionalimage compression algorithms, which have been developed tominimize mean squared errors, discriminant features of theoriginal data may be lost during compression process. In order to preserve such critical information, we should take intoaccount discriminating capability of remotely sensed data indesigning compression algorithms. Recently, a compressionmethod with pre-enhancement of discriminant features has been proposed [9]. In the compression method, a linear transform is applied to the original data and discriminantfeatures are enhanced. Then, the inverse of the linear transformis applied and the 3D SPIHT algorithm is used to compress theenhanced data. A problem with the 3D SPIHT algorithm is thatit requires a large memory. In this paper, we explore the use of the principal component analysis as a compression methodwith pre-enhanced data. Experiments with AVIRIS data showthat the proposed method provides improved classificationaccuracies than the conventional PCA method for hyperspectral data compression.II.
 
P
RINCIPAL
C
OMPONENT
A
 NALYSIS
 The principal component analysis is a widely used methodfor information extraction and data compression, which provides a set of uncorrelated principal components such thatthe data have maximum variance in the transformed space [5].The new set of bands (principal components) is obtained fromthe
 N 
original spectral bands as follows:()
Y A X M 
=. (1)
where
 X 
is a vector of spectral intensities,
=
 E 
[
 X 
], and
 A
isthe
 N 
×
 N 
unitary matrix derived from the covariance matrix
 X 
Σ
of the spectral bands ([()()]
 X 
E X M X
Σ=
). Thecolumns of 
 A
are the
 N 
normalized eigenvectors of 
 X 
Σ
. ThePCA can compact the total energy into relatively a fewcoefficients. However, the conventional PCA does not takeinto account discriminating features. In classification problems,the discriminant information, which is needed to distinguish between classes, may be small in energy.III.
 
E
 NHANCING
D
ISCRIMINAT
F
EATURES
 In order to enhance discriminant features of hyperspectraldata, we first apply a feature extraction method to the originaldata. In this paper, we use the decision boundary featureextraction (DBFE) method [6] in order to extract discriminantfeatures, though one may use any feature extraction method. In[6], the discriminantly informative features are extracted usingthe decision boundary feature matrix given by1()()()
 DBFM 
 N X N X p X d K 
Σ=
∫ 
(2)
where
 N 
(
 X 
) is a vector normal to the decision boundary,
 p
(
 X 
) a probability density function, ()
 K P X d
=
∫ 
, and
is thedecision boundary, and the integral is performed over theentire decision boundary. It has been reported that theeigenvectors of the decision boundary feature matrixcorresponding to non-zero eigenvalues are the necessary
0-7803-9050-4/05/$20.00 ©2005 IEEE.704
 
feature vectors to achieve the same classification accuracy asin the original space for the given classification problem.Let the
 N 
 E 
be the
 N 
-dimensional Euclidean space and{}
i
φ 
a new feature vector set produced by a feature extractionalgorithm where
i
φ 
is a
 N 
×
1 column vector, and it is assumedthat
i
φ 
has more discriminating power than
 j
φ 
if 
i j
<
. It isalso assumed that {}
i
φ 
is a basis for the
 N 
-dimensionalEuclidean space. One can always make an independent featurevector set into an orthonormal basis by using the Gram-Schmidt procedure [7]. Then, an observation,
 X 
, can berepresented by
1
 N i ii
 X 
α φ 
=
=
(3)
In the proposed method, the coefficients of feature vectorswhich are dominant in discriminating power are enhanced asfollows:
1
 N i i ii
 X w
αφ 
=
=
(4)
where
i
w
is a weight in accordance of discriminating power of the corresponding feature vector. Then the pre-enhanced data
 X 
is compressed using the conventional PCA method. Inorder to reconstruct the original data from the compressed data,the following equation can be used:
1
1ˆ
 N i iii
 X w
 βφ 
=
=
 
whereˆ
 X 
is a pre-enhanced data reconstructed by the inversePCA andˆ
 X 
represents the final reconstructed data fromˆ
 X 
. Itis assumed that {}
i
φ 
and {}
i
w
are available at both theencoder and decoder.There can be a number of possible weight functions for (4).In this paper, we tested various types of weight function andempirically selected the following 2 types of weight function:weightfct1:1()weightfct2:(width=5bands)
i i i DBFi
w eigenvaluesow stair fct 
λ λ 
=+=Σ
=
.
Most feature extraction methods, which include canonicalanalysis and the decision boundary feature extraction method,require that covariance matrices should be invertible.However, due to high correlations between adjacent bands, inmost cases, the covariance matrix of 220 spectral bands maynot be invertible, even though there are a large number of training samples. In order to address this problem, a possiblesolution is to group spectral bands. In this paper, we divide thedata into two groups.IV.
 
E
XPERIMENTAL
ESULTS
 The data used in this paper was acquired by the AirborneVisible/Infrared Imaging Spectrometer (AVIRIS), whichcontains 220 spectral bands, and the size of images is2166
×
614 [8]. From the data, we selected a sub-region of 256
×
256. In order to evaluate the performance of the proposed algorithm, we computed SNRs and classificationaccuracies, and compared them with those of non-enhanceddata. In this paper, the bit-rate is defined on a per-band basis.That is, 1 bpp indicates 1 bit per pixel for each band. The SNR is defined as
210
[]10log
xSNRSE 
=.
where
2
ˆ[()]
SE E x x
=
. In the experiment, we chose 15classes from the selected area. Fig. 1 shows the selected area(50
th
band) with 15 classes chosen, and Table I describes theclass information.
Fig. 1. Sub-region of 256 x 256 with 15 classes chosen.TABLE I. D
ESCRIPTION OF CLASS INFORMATION
.
Class species No. samplesNo. trainingsamples
Buildings 375 100Corn 819 100Corn-CleanTill 966 100Corn-CleanTill-EW 1320 100Grass 172 100Hay 231 100 NotCorpped 480 100Pasture 483 100Soybeans 836 100Soybeans-CleanTill 1050 100Soybeans-CleanTill-EW 1722 100Water 189 100Trees 330 100Wheat 940 100Woods 252 100
The Gaussian ML classifier was used assuming theGaussian distribution for each class, and the 220 bands werereduced to 20 features by combining adjacent bands for classification process. It is noted that combining adjacent bands was performed after decompressing the compressed data.In other words, the compression was performed using allspectral bands.First, we divided 220 spectral bands of AVIRIS data intotwo spectral divisions (2 groups), and enhanced each datagroup by applying (2)-(4). Then, the pre-enhanced data arecompressed using the conventional PCA method. Fig. 2 showsSNRs of the PCA compression method with different numbersof eigenimages used for compression. It was observed that theweight function 1 and 2 showed almost the same compression
0-7803-9050-4/05/$20.00 ©2005 IEEE.705
of 00

Leave a Comment

You must be to leave a comment.
Submit
Characters: ...
You must be to leave a comment.
Submit
Characters: ...