You are on page 1of 2

patches are also related to different types, such as a deforesta- Considering aN+1 = a1 and oN+1 = o1 , we can obtain the

can obtain the co-


tion area in a forest region, or a region containing a roof in a ordinates of a closed shape. Figure 6 illustrates a cycle and its
urban imagery [18]. Based on these considerations, 3 groups of transformation to the polar coordinates. Given the shapes, we
metrics are defined [52]: can extract various geometric features, such as area, perimeter,
direction, or bounding ellipse. In this scheme, a cycle with con-
• Patch metrics qualify individual patches and characterizes stant values outcomes a circle, and different cycles draw differ-
their spatial and contextual information. Examples include ent shapes according to their properties. Henceforth, this type
the area of a polygon, perimeter, and compacity. As an of feature is named as Polar.
example, one patch can be defined as a forest fragment. Moreover, a polar representation provides a new visualiza-
tion scheme that can help us to describe the pattern represented
• Class metrics integrate all patches of a given type inside in the cycle. A first idea when using annual cycles suggests
a specific area, by simple or weighted averaging. The splitting the polar representation into 4 quadrants related to the
weighted averaging scheme can reflect a greater contribu- 4 seasons. Hence, other features such as average values per
tion of large patches to the overall index. Instances include season can also be computed. Another feature, called Polar
average shape index, and patch size standard deviation. Balance, calculates the standard deviation of the area per sea-
These metrics are used to define, for instance, the amount son, which indicates the stability of the profile throughout the
of houses in a block, or the average size of croplands in a cycles.
state. Phenological indicators plus linearity metrics are the com-
• Landscape metrics concern all patch types or classes in- monly used metrics in the literature, therefore we refer to them
side a specific area. These metrics are integrated by a sim- as basic features. The remaining features are of the polar
ple or weighted averaging, and they reflect combined patch type. Table 5 describes the available multi-temporal features
mosaic. Landscape metrics include average perimeter-area in GeoDMA.
ratio and patch size coefficient of variation.
2.3. Module “Data mining for detecting land cover and change
Table 4 describes some types of landscape ecology features. patterns”
In the data mining module (Figure 7) the interpreter selects
2.2.3. Multi-temporal features representative (training) samples of the expected patterns. All
patterns compose the land cover typology, and some algorithm
Multi-temporal features include several descriptors for cy-
will create automatically a classification model based on train-
cles. This group encompasses phenological indicators, a well
ing samples. The classification model can be stored for further
known set of metrics for time series, as described in [60] and
and manual analysis. This model shall be used to classify the
[38], including the dates of the beginning or end of a growing
entire database, or different databases with the same expected
season, the length of the green season, and so on. Besides phe-
typology.
nological, we suggested to use the linearity metrics [75] and
shape measures based on polar representation of cycles.
According to [36], the standard computational models of
Class
time do not consider that certain events or phenomena may be Information

recurring. The term cycle can also be used to capture the no- Interpreter
Classification
Classification
Sample Model
tion of recurring events. To support cycle’s visualization, [17] Selection
Algorithm

proposed a time-wheel legend, resembling a clock face, divided


into several wedges according to the data instances. Database
Data Mining
In our case, we adapted the time wheel legend by plotting
each cycle of the profile, and by projecting values to angles in
the interval [0, 2π]. Let a cycle be a function f (x, y, T ), where Figure 7: The interpreter defines a typology and the set of representative sam-
(x, y) is the spatial position of a point, and T is a time interval ples, which are used to create the classification by applying the data mining
algorithm.
t1 , . . . tN , and N is the number of observations in such a cycle.
The cycle can be visualized by a set of values vi ∈ V, where vi is
Usually, the search for patterns includes the automatic execu-
a possible value of f (x, y) in time ti . Let its polar representation
tion of a classification algorithm and a phase of feature evalua-
be defined by a function g(V) → {A, O} (A corresponds to the
tion by the interpreter. As [61] points out, the inclusion of data
abscissa axis in the Cartesian coordinates, and O to the ordinate
mining techniques in the classification process can increase the
axis) where
speed and also reduces the empirical nature of the feature se-
2πi lection process and the creation of classification models. One
ai = vi cos( ) ∈ A, i = 1, . . . N (1) mechanism to evaluate the features is provided in the visualiza-
N
tion module, which displays features in a scatterplot to visualize
and the data distribution in the feature space, as shown in Figure 8.
2πi According to [21], data mining is one step of a process called
oi = vi sin( ) ∈ O, i = 1, . . . N. (2) knowledge discovery in databases – KDD. In this sense, KDD
N
5
0.7

value
v3 0.40.4

0.6
0.6

[v2 cos(π/2), v2 sin(π/2)]


0.5
0.5
0.20.2

0.4
0.4

[v1 cos(0),v1 sin(0)]


0 0
[v3 cos(π),v3 sin(π)]
[v5 cos(2π-Δ),v5 sin(2π-Δ)]

0.3
0.3

v2 v4
0.2
0.2 -0.2
-0.2

[v4 cos(3π/2),v4 sin(3π/2)]

0.1
0.1

-0.4
-0.4

v1 angle
v5
0 0
0 1 2 3 4 5 6 7 8 -0.6 -0.4 -0.2 0 0.2

0 π/2 π 3π/2 2π-Δ -0.6 -0.4 0.2 0 0.2

Figure 6: When the values of the cycle are associated to a certain angle (left), the closed shape is created from its polar transformation (right).

involves data preparation, search for patterns, knowledge eval- 3. Experimental results
uation, and refinement, possible in multiple iterations. The
search for patterns includes the automatic execution of a clas- In this Section we present 2 case studies to illustrate the ef-
sification algorithm and the evaluation of features by the inter- fective use of the GeoDMA system. The first experiment uses
preter. segmentation-based features to map urban land cover classes.
Since the interpreter knows the typology behind the data he The second one uses multi-temporal features to map land cover
must create a description of the expected patterns by selecting
training samples for each pattern, which must be representa-
tive over the images. These samples are represented by a set
of features. Afterwards, in the supervised classification step,
the algorithm uses these training samples to build a classifica-
tion model. Although GeoDMA provides 3 classification algo-
rithms (decision trees, SOM, and neural networks) the focus of
the experiments in this paper is on decision trees [63]. In a clas-
sifier based on decision trees, thresholds are applied to object’s
features. Observations satisfying the thresholds are assigned to
the left branch, otherwise to the right branch [34]. In the final
step, classes are assigned to the terminal nodes (or leaves) of
the tree.

2.4. Module “Classification evaluation”


The output of GeoDMA is a thematic map, created by apply-
ing the classification model to the database. According to [21],
this step is part of a repetitive process, in which the interpreter
evaluates the results visually and statistically. Depending on the
obtained accuracy, the interpreter repeats some previous steps
aiming to create a better classification model.
According to [28], a strong and experienced evaluator of seg-
mentation techniques is the human eye/brain combination. In
addition, when dealing with multi-temporal analysis, the vali-
dation is often not straightforward, since independent reference
sources must be available during the change interval [79]. How-
ever, it is always important to establish measures of correctness
of the results with ground truth data. According to [14], valida-
tion has become a standard component of any land cover map
derived from remotely sensed data.
GeoDMA computes error matrices and the Kappa statistics
[23] for a classification result. In the sample selection module
the system automatically divides the samples into training and
validation sets, randomly. Results are compared with validation Figure 8: The input image (top), visualization tools using a map of features
(middle), and a scatterplot (bottom). The map shows the feature ratio of band
samples to create the error matrix. In cases where the sample set 1, showing it is a proper feature to distinguish the target Trees. The scatterplot
is small, GeoDMA provides error evaluation based on Monte shows the feature space of features mean of bands 0 and 2, distinguishing the
Carlo simulation [65] using only training samples. target Roofs from the rest of the image.

You might also like