• Embed Doc
  • Readcast
  • Collections
  • CommentGo Back
 
Hyperspectral Image Analysis -A Robust Algorithm using Support Vectorsand Principal Components
Sindhumol S.
 Department of Information Technology, Avenir Computer Services Export Pvt. Ltd., Kochi, Kerala. sindhumol_s@yahoo.co.in
M.Wilscy
 Department of Computer Science,University of Kerala,Thiruvananthapuram, Kerala.wilsyphilipose@rediffmail.com
Abstract
This paper presents a new algorithm for hyperspectral image analysis using spectral-anglebased Support Vector Clustering (SVC) and Principal Component Analysis (PCA)
 .
 In the classical approachto hyper-spectral dimensionality reduction based on Principal Component Analysis (PCA), no meaning or behavior of the spectrum is considered and results areinfluenced by majority components in the scene. A spectral angle based classification beforedimensionality reduction is a possible solution to this problem. Clustering based on support vectors using  spectral based kernels is proposed in this work, whichis found to generate good results in hyperspectral image classification. The algorithm is tested with twohyperspectral image data sets of 210 bands each,which are taken with HYper-spectral Digital ImageryCollection Experiment (HYDICE) air-borne sensors. Acomparative study of the proposed algorithm and other two conventional algorithms (PCA alone and  PCA with Spectral Angle Mapping (SAM)) is alsodone.
1. Introduction
Imaging the Earth through several hundredcarefully selected continuous spectral bands iscommonly referred to as hyper-spectral imaging. Thistechnique is found to greatly enhance the ability for deriving land cover information through image fusionand classification in the field of remote sensing.Several supervised and unsupervised algorithm existsfor hyperspectral classification [1]-[3], but thesealgorithms fail to provide accurate results. One of themost prominent applications of hyper-spectral imageryis land cover classification [3]. For many years,researchers have recognized the importance of landcover for a variety of development in several fieldssuch as agriculture, environment, forestry, geology,and hydrology.The conventional methods in remote sensing for hyper-spectral image analysis were based ondimensionality reduction using PCA. But principalcomponents calculated depend on the statistical meanof the image. That means the more frequently occurredobjects are given more importance and less frequentlyoccurred objects are ignored. To resolve this, attemptswere made to make object based analysis instead of doing whole image analysis. In visual classification,this approach generated amazing results. But, thequality of the resulting image is found to vary ondifferent classification methods used. Biophysicalmeaning of the spectral signature also has animportant role in hyperspectral image classification.Considering all these requirements, a method based onsupport vectors using spectral based kernel is proposedin this work for hyperspectral image classification.The kernel used in this method is one based onSAM [4]. A spectral angle matrix is first found for the pixels involved in a hyper-spectral image. Input dataspace is mapped to a higher-dimensional feature spaceusing a local kernel and spectral angle values [5].Support vectors are found using these values. Spectralsignatures are formed from the classes and principalcomponents are calculated [6]. A few among these principal components contain the significantinformation for detailed analysis and target detection.Two images - URBAN (307x307x210) andTERRAIN (500x307x210)-of 210 bands eachcollected using HYDICE airborne sensor are used totest the robustness of the algorithm. A comparativestudy on the results of the three algorithms PCA alone,PCA with SAM and PCA with SAM and SVC arecarried out.This paper is divided into five sections. Section 2is an overview of the related works. The proposedalgorithm is outlined in Section 3. Section 4 deals with
Proceedings of the International Conference on Computing: Theory and Applications (ICCTA'07)0-7695-2770-1/07 $20.00 © 2007
 
Authorized licensed use limited to: Univ of Puerto Rico Mayaguez - Library. Downloaded on February 18, 2009 at 21:38 from IEEE Xplore. Restrictions apply.
 
the data sets used, tests conducted and resultsobtained. A comparative analysis on existingalgorithms is also given here. Section 5 concludes thework 
.
 
2. Related Works
Recent researches in hyperspectral image analysismainly focus on applying supervised or unsupervisedclassification methods [7]-[9] before dimensionalityreduction using classical approaches like PCA [6] or Independent Component Analysis [10]. Classificationmethods such as the Maximum Likelihood Classifier (MLC) [9] are highly dependent on assumptionsrelated to statistical distribution of the data and theywill not give good results in hyper-spectralclassification. Though neural networks [11] have anadvantage as they can work with complex datasets,these are slow in the training phase. Recently, SupportVector Machines (SVMs) [12][13] that canoutperform other hyperspectral image classifiers have been proposed. To make classification more simpleand robust keeping the quality of the resultscomparable to the same obtained through SVM, aclustering based on support vectors [14] isimplemented here. The kernel used in SVC discussedin [14] is based on geometric distance betweenvectors, which does not take spectral meaning and behaviour of the data into consideration. Toincorporate the spectral knowledge also into theclassification, the spectral based kernel [5] is used inthe clustering process associated with this work. In arelated work [15] based on SAM and PCA, spectralscreening is done before dimensionality reduction insuch a manner that if the spectral distance betweentwo vectors exceeds a particular threshold, they belongto separate classes. The major steps involved in our work are described in the next section.
3. Proposed Method
The mathematical concept used in this work is built on the theory of spectral angles, support vectorsand principal components. Spectral based kernels areused to find out the support vectors and thereby uniquespectral signatures. Initially the unique signature set isempty. Using support vectors with spectral–basedkernels, a clustering is done first. To do this, theoriginal data is mapped to a feature space using SAMand RBF kernel [5] and clustered. The bestrepresentative of each cluster is added to a unique set.This unique set is used to find out the mean in thespectral de-correlation process thereby avoiding the problem of giving more importance to the frequentlyoccurred data.A ray casting through each pixel in the hyper-spectral image cube results in a vector called the pixelvector, which gives the spectral signature of theground region (or the material) represented by that pixel. As a first step in the algorithm, the spectralangle matrix and kernel function for SVC arecalculated. If 
x
and
y
are two ‘n’ dimensional pixelvectors, where ‘n’ is the number of bands in the inputimage, then the angle between two vectors can begiven as,
α
(
x,
 
y
) =cos
-1
((
x
 
y
)
  ⁄  
(||
x
| |.||
y
||)) (1)As described in SVC algorithm [14], let
x
i
be avector in the data space X, where the number of attributes for each
x
i
is
n
’. Using a nonlinear mapping
Φ
,
the data is mapped from X to some highdimensional feature space and the smallest enclosingsphere of radius R is found. It is described by theconstraints:||
Φ
(
x
 j
)-
a
||
2
2
 j, (2)where || . || is the Euclidean norm and ‘
a’
is the center of the sphere. At each point
x
the distance of its imagein feature space from the center 
a
of the sphere isdefined as,
2
(
x
) = ||
Φ
(
x
)-
a
||
2
(3)The support Vectors are found by solving for Lagrange Multipliers as follows [14]: Theoptimization problem is,Minimize
W=
 Σ 
 j
 K( 
 x 
 j
 ,
 x 
 j
 )
 β 
 j
-
 Σ 
i,j
 β 
i
 
 β 
 j
 K( 
 x 
i
 ,
 x 
 j
 )
(4)Such that 0
 
 β 
i
 
C,where C is the regularization parameter,
 β 
i
is the i
th
 Lagrange Multiplier and
 K 
is the kernel matrix.The next step in the proposed method is clusteringand classification. The square of the radial distance of the image of 
x
from the center of the sphere is given by eq. (3) and can be written as,
 R
2
 x 
 )=K( 
 x 
 ,
 x 
 )-2
 Σ 
 j
 
 β 
 j
K( 
 x 
 ,
 x 
 j 
 )+
 Σ 
i, j
 β 
i
 
 β 
 j
 K( 
 x 
 ,
 x 
 j 
 )
 
(5)where
 β 
i,
 β 
 j
are Lagrangian multipliers
 
and
 K( 
 x 
 ,
 x 
 j 
)is an appropriate Mercer kernel[5]. In this work theRBF kernel given by
 K( 
 x 
 ,
 x 
 j 
 )=exp(-q ||
α
(
x
,
x
 j
)||) (6)where
α
 x 
 ,
 x 
 j 
 )
is the spectral angle between vectors
x
and
x
 j
 
and ‘q’
 
is the width parameter of the kernelwhich decides the number of clusters formed in thedata space.The radius of the sphere is calculated asR = {R(
x
i
) |
x
i
 
is a support vector} (7)The contours that enclose the points in data space aredefined by the set {
x
| R(
x
) = R}, and they describethe boundary of the clusters. To differentiate betweenthe points belonging to different clusters the conceptof connected components is used [14]. To do this, anadjacency matrix
 Aij
 between a pair of point
x
i
 
and
x
 j
 
whose images lie in or on the sphere in feature space isdefined as,
Proceedings of the International Conference on Computing: Theory and Applications (ICCTA'07)0-7695-2770-1/07 $20.00 © 2007
 
Authorized licensed use limited to: Univ of Puerto Rico Mayaguez - Library. Downloaded on February 18, 2009 at 21:38 from IEEE Xplore. Restrictions apply.
 
Aij
= 1 if, for all
y
on the line segment connecting
x
i
 
and
x
 j
, R
(
y
)
≤ 
R
= 0 otherwise
.
Clusters are now defined as the connected componentsof the graph induced by
 A
[14]
.
 There is a high level redundancy in hyper-spectraldata because of high correlation among adjacent bands. PCA [5] decorrelates the correlated data and itis perhaps the most popular dimension reductiontechnique for hyper-spectral image analysis. Theoutcome of PCA is a number of principal components(PCs) in descending order of significant information.The first few among them contain the maximumvariance and the remaining is usually neglected.The final step of the proposed method isdimensionality reduction using PCA. Consider the k 
th
 pixel vector 
x
 
from a hyper-spectral image cube of n bands.
x
=[x
k1
, x
k2
,…x
kn
]
T
(8)The mean pixel vector can be defined as
m
 x 
=(1/p)
 Σ 
 x 
(9)where
=1…
 p
and ‘
 p
’ is the number of pixel vectors inthe image cube. The covariance matrix is given by
 x 
=(1/p)
 Σ 
 x 
-
m
 x 
 
m
 x 
(10)The eigen-values and eigen-vectors are found using
Cyclic Jacobi Method 
[16].The transformation matrix
V
=[
eig
1
|eig
2
…|eig
n
] (11)where
eig
i
 
is the eigen vector corresponding to the i
th
 eigen value in the decreasing order.The principal components are computed as
F
=
V
T
S, (12)where S is the original hyperspectral data set. The firstfew principal components, especially the first three,contain the maximum variance in decreasing order. Sothese components can be further analyzed inhyperspectral image analysis to collect information for visual as well as digital interpretation.
4. Result Analysis and Discussion
The proposed algorithm for hyper-spectral imageanalysis is tested to check the improvement inclassification using two hyper-spectral imagesURBAN and TERRAIN. The input hyper-spectralimages are tested with three algorithms, PCA only,PCA with SAM, and PCA with SAM and SVC.URBAN is a HYDICE image with a spatial resolutionof 307x307 and 210 spectral bands. It is downloadedfrom Internet, the location or time of acquisition is notknown. It consists of urban features like buildings,trees, roads, playgrounds, grass etc. The PCA methodis first applied and the resulting images are shown in
Figure 1(a)-(c)
.
 
PC1, PC2 and PC3 are the first, secondand third principal components respectively in thedecreasing order of significant information. From
Figure 1(a)-(c)
, it is evident that small details areignored because of the abundance of major data.The principal components obtained after aclustering based on SAM with threshold
α
=0.0055
°
isshown in
Figure 2(a)-(c).
PCA is applied after a SAM based clustering. A spectral signature from each classis taken to form the spectral signature set. To compute‘Mean’, only the elements in this set are used, therebyassigning equal priority to all the scenes in the image-cube [15]. On varying the threshold value
α  
 
the details
 
in
 
the principal components also varies. By trial anderror method one can find a suitable value for 
α  
 
to getgood results.
(a) PC1 (b) PC2
 
(c) PC3Figure 1. Principal components resultingfrom PCA based dimensionality reduction.(a) PC1 (b) PC2
 
(c) PC3
 
Figure 2. Principal components obtainedfor PCA with SAM for threshold
α
=0.0055
°
.
Proceedings of the International Conference on Computing: Theory and Applications (ICCTA'07)0-7695-2770-1/07 $20.00 © 2007
 
Authorized licensed use limited to: Univ of Puerto Rico Mayaguez - Library. Downloaded on February 18, 2009 at 21:38 from IEEE Xplore. Restrictions apply.
of 00

Leave a Comment

You must be to leave a comment.
Submit
Characters: ...
You must be to leave a comment.
Submit
Characters: ...