net/publication/251401579
3 authors:
Miloš Železný
University of West Bohemia
All content following this page was uploaded by Miroslav Jiřík on 19 January 2018.
This paper describes a method for texture-based segmentation. Texture features are
extracted by applying a bank of Gabor filters using a two-sided convolution strategy.
The probabilistic texture model is represented by a Gaussian mixture that is trained with
the Expectation-Maximization algorithm. The texture similarity obtained in this way is
used as the input of a Graph cut method. We show that the combination of texture
analysis and the Graph cut method produces good results.
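$$G(x, y) = \alpha(x, y) \cdot \cos\left(\frac{2\pi x'}{\lambda} + \phi\right) \tag{1}$$

$$\alpha(x, y) = \exp\left(-\frac{x'^2}{2\sigma_x^2}\right) \exp\left(-\frac{y'^2}{2\sigma_y^2}\right) \tag{2}$$

$$x' = (x - m_x)\cos(\gamma) - (y - m_y)\sin(\gamma), \qquad y' = (x - m_x)\sin(\gamma) + (y - m_y)\cos(\gamma) \tag{3}$$

Fig. 3: Schematic image of two-sided convolution strategy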
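As a rough illustration of Eqs. (1)-(3), the sketch below samples one Gabor filter on a small square grid. It assumes that the primed coordinates are the standard rotation of (x, y) by the angle γ about the centre (m_x, m_y); the function name `gabor_kernel`, the grid layout and the parameter values are hypothetical and not taken from the paper.

```python
import math

def gabor_kernel(size, lam, phi, sigma_x, sigma_y, gamma, m_x=0.0, m_y=0.0):
    """Sample G(x, y) of Eq. (1) on a size x size grid centred on the origin."""
    if size % 2 == 0:
        raise ValueError("size must be odd so that the grid has a centre pixel")
    half = size // 2
    kernel = []
    for y in range(-half, half + 1):
        row = []
        for x in range(-half, half + 1):
            # Eq. (3): rotated coordinates (standard rotation, an assumption)
            xr = (x - m_x) * math.cos(gamma) - (y - m_y) * math.sin(gamma)
            yr = (x - m_x) * math.sin(gamma) + (y - m_y) * math.cos(gamma)
            # Eq. (2): Gaussian envelope
            envelope = (math.exp(-xr ** 2 / (2 * sigma_x ** 2))
                        * math.exp(-yr ** 2 / (2 * sigma_y ** 2)))
            # Eq. (1): envelope modulated by a cosine wave of wavelength lam
            row.append(envelope * math.cos(2 * math.pi * xr / lam + phi))
        kernel.append(row)
    return kernel

# Illustrative parameters only
kernel = gabor_kernel(size=5, lam=4.0, phi=0.0, sigma_x=2.0, sigma_y=2.0, gamma=0.0)
```

A bank of filters would be obtained by calling such a function for several values of γ and λ; shifting (m_x, m_y) moves the centre of the envelope.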
After convolving the filters with the training image, we obtain the filter responses $R_n$, where $n$ represents the direction of the half plane. From these responses we create a feature vector.

1 The work has been supported by the grant of The University of West Bohemia: “Inteligentni
metody strojoveho vnimani a porozumneni”, project No. SGS-2010-054 and by the grant
ED1.1.00/02.0090 – New Technologies for Information Society.

After texture feature extraction, we need a texture model that provides a good similarity measure. We model the training textures in the feature vector space as a mixture of Gaussians (one model for each direction n):

$$U_n(x_P \mid \Theta_n) = \sum_{i=1}^{K} \alpha_{ni}\, h_{ni}(x_P \mid \Theta_{ni}) \tag{4}$$

$$h_i(x \mid \Theta_i) = \frac{\exp\left(-\frac{1}{2}(x - \mu_i)^T B_i^{-1} (x - \mu_i)\right)}{(2\pi)^{D/2} \sqrt{\det B_i}} \tag{5}$$

Here, the index P denotes a point in the image, $x_P$ is the feature vector of the point P, $\alpha_{ni}$ are the mixing weights, and $\Theta_n$ is the collection of parameters $\{\alpha_{n1}, \ldots, \alpha_{nK}; \Theta_{n1}, \ldots, \Theta_{nK}\}$. The function $h_{ni}$ is a multivariate Gaussian density with mean $\mu_i$ and covariance matrix $B_i$ in a D-dimensional feature space. This mixture of Gaussians is trained by the Expectation Maximization algorithm.

The Expectation Maximization (EM) algorithm is used for finding maximum likelihood estimates of parameters in statistical models. It is an iterative method which alternates between an expectation step and a maximization step. During the expectation step, the current estimate of the parameters is used to compute a new expectation of the log-likelihood. During the maximization step, the parameters which maximize this expectation are computed. These estimated parameters are then used as the input to the next expectation step, and so on.

… is equivalent to the maximum flow problem, which is one of the classical optimization problems.

The first step is to obtain a weighted graph G(V,E) from the image, where V represents the set of nodes and E represents the set of edges. Each node corresponds to a pixel in the image, and each edge between these nodes corresponds to a neighboring relationship between them and is called an n-link (neighborhood link). The graph contains two additional nodes, the source (representing the objects) and the sink (representing the background). These special nodes are called terminals. Each node from the set V is connected to the terminals by a link called a t-link (terminal link).

We can imagine the graph as a pipeline system (in the basic version of the Graph cut method, described in [3]). Each pipe has its own capacity, given by the weight of the corresponding edge. Segmentation consists of gradually filling this pipeline with a medium through the source while the medium leaks out through the sink. Some of the pipes become saturated in this process, which means that the flow through such a pipe has reached the maximum given by its capacity. After a certain amount of the medium has been sent into the system, a state can occur in which there is no path from the source to the sink without a saturated edge. At this point the segmentation ends, and the objects are separated from the background by the set of saturated edges. This set of saturated edges represents a cut of the graph. The graph, the t-links, the n-links and a cut of the graph are shown in figure 4.
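The pipeline picture can be tried out on a toy graph. The sketch below pushes flow along augmenting paths found by breadth-first search (the Edmonds-Karp algorithm, chosen here for brevity and not necessarily the max-flow algorithm of [3]) until no unsaturated path from the source to the sink remains. The two-pixel graph, its capacities and the name `min_cut` are invented for illustration.

```python
from collections import deque

def min_cut(capacity, source, sink):
    """Return (maximum flow, nodes still reachable from the source)."""
    # residual[u][v] is the capacity still free in the pipe u -> v
    residual = {u: dict(nbrs) for u, nbrs in capacity.items()}
    for u, nbrs in capacity.items():
        for v in nbrs:
            residual.setdefault(v, {}).setdefault(u, 0)

    def reachable():
        # Breadth-first search along pipes that are not yet saturated
        parent = {source: None}
        queue = deque([source])
        while queue:
            u = queue.popleft()
            for v, free in residual[u].items():
                if free > 0 and v not in parent:
                    parent[v] = u
                    queue.append(v)
        return parent

    flow = 0
    while True:
        parent = reachable()
        if sink not in parent:
            # Every source-sink path contains a saturated edge: the cut is found
            return flow, set(parent)
        # Bottleneck capacity along the path that was found
        bottleneck = float("inf")
        v = sink
        while parent[v] is not None:
            bottleneck = min(bottleneck, residual[parent[v]][v])
            v = parent[v]
        # Send the bottleneck amount through the path
        v = sink
        while parent[v] is not None:
            u = parent[v]
            residual[u][v] -= bottleneck
            residual[v][u] += bottleneck
            v = u
        flow += bottleneck

# Toy graph: pixels "a" and "b", source "s" (object), sink "t" (background)
pipes = {
    "s": {"a": 9, "b": 2},   # t-links to the source
    "a": {"t": 1, "b": 3},   # t-link to the sink and n-link a -> b
    "b": {"t": 8, "a": 3},   # t-link to the sink and n-link b -> a
}
flow, object_side = min_cut(pipes, "s", "t")
```

In this toy case the nodes that remain connected to the source form the object side of the cut, and the saturated edges leaving that set form the cut itself.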
Before the final segmentation, the object texture similarity and the background texture
similarity are computed.
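Both similarities are evaluations of a trained mixture of the form of Eqs. (4)-(5). The sketch below shows the idea under a strong simplification: the features are one-dimensional, so a scalar variance stands in for the covariance matrix B_i. The function names, the initial means and the toy data are assumptions for illustration, not the authors' implementation.

```python
import math

def gaussian_pdf(x, mean, var):
    """One-dimensional counterpart of the Gaussian density in Eq. (5)."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2 * math.pi * var)

def mixture_density(x, weights, means, variances):
    """Eq. (4): weighted sum of the component densities."""
    return sum(w * gaussian_pdf(x, m, v)
               for w, m, v in zip(weights, means, variances))

def fit_em(data, initial_means, iterations=50):
    """Train a 1-D Gaussian mixture with the EM algorithm."""
    k = len(initial_means)
    means = list(initial_means)
    weights = [1.0 / k] * k
    centre = sum(data) / len(data)
    spread = sum((x - centre) ** 2 for x in data) / len(data)
    variances = [spread] * k
    for _ in range(iterations):
        # Expectation step: responsibility of each component for each sample
        resp = []
        for x in data:
            joint = [w * gaussian_pdf(x, m, v)
                     for w, m, v in zip(weights, means, variances)]
            total = sum(joint)
            resp.append([j / total for j in joint])
        # Maximization step: re-estimate weights, means and variances
        for i in range(k):
            n_i = sum(r[i] for r in resp)
            weights[i] = n_i / len(data)
            means[i] = sum(r[i] * x for r, x in zip(resp, data)) / n_i
            var_i = sum(r[i] * (x - means[i]) ** 2 for r, x in zip(resp, data)) / n_i
            variances[i] = max(var_i, 1e-6)  # floor keeps a component from collapsing
    return weights, means, variances

# Toy feature values forming two well separated groups
data = [0.9, 1.0, 1.1, 4.9, 5.0, 5.1]
weights, means, variances = fit_em(data, initial_means=[0.9, 5.1])
similarity = mixture_density(1.0, weights, means, variances)
```

With this toy data the two means settle near 1.0 and 5.0, and the mixture density is high for feature values that resemble the training samples and low elsewhere.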
Image segmentation
$$C(L) = \lambda R(L) + B(L) \tag{6}$$

$$R(L) = \sum_{p \in P} R_p(L_p) \tag{7}$$

$$B(L) = \sum_{(p,q) \in \mathcal{N}} B_{(p,q)}\, \delta(L_p, L_q) \tag{8}$$

$$\delta(L_p, L_q) = \begin{cases} 1, & L_p \neq L_q \\ 0, & \text{otherwise} \end{cases} \tag{9}$$
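To make Eqs. (6)-(9) concrete, the following sketch evaluates the cost of two labelings of a three-pixel row. The dictionary layout, the label names and all numbers are made up for illustration.

```python
def segmentation_cost(labels, regional, boundary, lam=1.0):
    """Evaluate C(L) = lam * R(L) + B(L) for one labeling L."""
    # Eq. (7): sum of the regional penalties of the chosen labels
    regional_term = sum(regional[p][labels[p]] for p in labels)
    # Eqs. (8)-(9): an n-link contributes only where the two labels differ
    boundary_term = sum(w for (p, q), w in boundary.items()
                        if labels[p] != labels[q])
    return lam * regional_term + boundary_term

# Regional penalties R_p(L_p) for three pixels in a row (illustrative values)
regional = {
    0: {"obj": 1, "bkg": 5},
    1: {"obj": 2, "bkg": 4},
    2: {"obj": 6, "bkg": 1},
}
# Boundary penalties for the neighbouring pairs (0, 1) and (1, 2)
boundary = {(0, 1): 3, (1, 2): 2}

split = {0: "obj", 1: "obj", 2: "bkg"}
all_object = {0: "obj", 1: "obj", 2: "obj"}

cost_split = segmentation_cost(split, regional, boundary, lam=2)       # 2*(1+2+1) + 2 = 10
cost_all = segmentation_cost(all_object, regional, boundary, lam=2)    # 2*(1+2+6) + 0 = 18
```

Here the split labeling is cheaper: paying the boundary penalty of one n-link costs less than forcing pixel 2 into the object class. The Graph cut method finds the labeling with the lowest such cost without enumerating all labelings.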
Fig. 5: Background terminal links
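For example, for the object texture:

$$R_p^O = -\ln\left(\frac{1}{N} \sum_{n=1}^{N} U_n^O(x_P \mid \Theta_n)\right) \tag{10}$$

$$Q_n = \log\left(\frac{U_n^O \cdot U_{n'}^B}{U_{n'}^O \cdot U_n^B}\right) \tag{11}$$

$$n' = \left(n + \frac{N}{2}\right) \bmod N \tag{12}$$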
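A small numerical sketch may help to read Eqs. (10)-(12) together with the n-link weight of Eq. (13), which follows below. It assumes that Eq. (11) is the log-ratio of the object and background likelihoods in direction n against those in the opposite direction n', that directions are indexed from 0 with an even N, and that σ² in Eq. (13) is passed in directly. All function names and likelihood values are hypothetical.

```python
import math

def opposite_direction(n, num_directions):
    """Eq. (12): index of the half plane opposite to n (0-based, even N assumed)."""
    return (n + num_directions // 2) % num_directions

def regional_cost(likelihoods):
    """Eq. (10): negative log of the mean likelihood over all N directions."""
    return -math.log(sum(likelihoods) / len(likelihoods))

def boundary_measure(u_obj, u_bkg, n):
    """Eq. (11) as read here: log-ratio of direction n against its opposite."""
    m = opposite_direction(n, len(u_obj))
    return math.log((u_obj[n] * u_bkg[m]) / (u_obj[m] * u_bkg[n]))

def nlink_weight(q, sigma2):
    """Eq. (13): near 1 when q is close to 0, near 0 when |q| is large."""
    return math.exp(-q * q / (2.0 * sigma2))

# Likelihoods of one pixel for N = 4 directions (illustrative values):
# direction 0 looks like object, the opposite direction 2 like background.
u_obj = [0.8, 0.5, 0.1, 0.5]
u_bkg = [0.1, 0.5, 0.8, 0.5]

q_across = boundary_measure(u_obj, u_bkg, 0)   # large: the textures differ across the pixel
q_along = boundary_measure(u_obj, u_bkg, 1)    # zero: both sides look the same
weight_across = nlink_weight(q_across, sigma2=1.0)
weight_along = nlink_weight(q_along, sigma2=1.0)
object_cost = regional_cost(u_obj)
```

Under these assumptions a pixel lying on a texture boundary gets a large |Q_n| in the direction across the boundary and therefore a small n-link weight, which makes a cut at that position cheap.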
$$B_{np} = \exp\left(-\frac{Q_n^2}{2\sigma^2}\right) \tag{13}$$

Here, $n'$ represents the direction opposite to the n-th half plane, N is the number of directions, the index p indicates the point of the image, and $\sigma$ usually represents the intensity variation. In this case it can be estimated as the variation of $Q_n$.

Figures 7 and 8 show the neighborhood links, calculated by using equation (11). They show the weights in the vertical (fig. 7) and horizontal (fig. 8) directions.

We used this technique on many test images and compared the results with thresholding, the Gabor filters method (GF) and a simple method combining Gabor filters with Graph cut (GFGC); see table 1. For quantitative evaluation of the segmentation results, we use the classification errors FP (false positive) and FN (false negative). False positive represents the percentage of pixels which do not belong to the object but which have been assigned to it. False negative, on the other hand, represents the percentage of pixels which belong to the object but which have been classified as background.
Method            FP [%]    FN [%]    Total error [%]
Threshold         2.2793    3.7328    6.0121
Gabor filters     1.5012    5.3847    6.8859
GFGC              1.4958    6.0415    7.5373
Two sided GFGC    0.7164    0.2155    0.9319
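The FP and FN measures can be computed from a segmentation mask and a ground-truth mask as sketched below. The sketch assumes that both percentages are taken relative to the total number of pixels, which the text does not state explicitly; the function name and the ten-pixel example are illustrative only.

```python
def classification_errors(predicted, truth):
    """Return (FP, FN, total error) in percent of all pixels.

    predicted and truth are equally long sequences of 0/1 labels,
    where 1 marks an object pixel.
    """
    if len(predicted) != len(truth):
        raise ValueError("masks must have the same number of pixels")
    total = len(truth)
    # Background pixels that were assigned to the object
    false_pos = sum(1 for p, t in zip(predicted, truth) if p and not t)
    # Object pixels that were classified as background
    false_neg = sum(1 for p, t in zip(predicted, truth) if t and not p)
    fp = 100.0 * false_pos / total
    fn = 100.0 * false_neg / total
    return fp, fn, fp + fn

# Ten-pixel toy masks: one object pixel is missed, one background pixel is added
truth = [1, 1, 1, 0, 0, 0, 0, 0, 0, 0]
predicted = [1, 1, 0, 1, 0, 0, 0, 0, 0, 0]
fp, fn, total_error = classification_errors(predicted, truth)
```

As in the table, the total error is the sum of the two percentages.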
Example
Results
References
Conclusion