Professional Documents
Culture Documents
Abstract It has been several years Background subtraction techniques were put into
use in vision and image applications for motion detection. However, most of the
methods fall short of providing fine results due to dynamic backgrounds, illumination
variation, noise, etc. Uniqueness of the proposal is construction of a steady back-
ground from a video sequel. In the editorial, proposal is to develop a steady back-
ground representation from a certain video sequel. The background is updated on
arrival of each frame. For detecting moving objects, the constructed background has
been compared with diverse frames of the video sequel. For this, the background
model is developed using combination of Local Binary Pattern (LBP) and Gaussian
averaging. Gaussian averaging employs different forms that occur with time to
confines the underlying opulence of the background. Likewise, a spatial region of
hold is used by LBP. The projected proposal depends on spatio-temporal forms
occurring with time to fabricate a suitable model background. Efficacy of the pro-
jected proposal is established by comparing the outcomes with some of the existing
avant-garde background subtraction methods on open standard records.
1 Introduction
Dynamic detection of object from video sequels [1] is a major difficulty in Image
processing and computer vision. Accurate recognition of motion in objects is vital for
several applications of artificial intelligence and image processing. Dynamic object
detection is being used extensively in the areas like robotics, activity recognition [2],
video surveillance [3], gait and face-based human recognition [4], etc.
Different object detection methods generally used are Background subtraction,
frame differencing, and Optical flow. Out of these, Background subtraction has
been the most accepted and popular method for detecting an object. Background
subtraction constitutes of two basic steps-background modeling and subtraction [5].
Frame differencing technique for object detection uses a reference background
image for comparing frames. Places, where changes are seen are identified as
dynamic object in the case of frame differencing.
In the Background Subtraction (BGS) model proposed by Heikkila and Olli [6],
both the concepts of frame differencing and background updation had been used. In
this, background is updated according to current frame. Nevertheless this background
model is rigid against non-static background, noise, and illumination variation.
Based on this concern, Wren et al. [7] proposed a flexible BGS method by means of
running Gaussian average. In this researchers have represented background auton-
omously at every pixel position by correcting a Gaussian pdf (probability distribution
function) above a series of preceding frames. By means of intensity comparison of
every pixel positions in the marked frame and the consequent factors of Gaussian pdf,
the positions of the dynamic objects in a frame are attained.
So as to model a background correctly, a multi-valued background scheme is
required. Stauffer and Grimson [8, 9] projected a scheme where multiple Gaussian
pdfs are used to model pixels at a specific position of an image frame. This method
gives improved results in non-static backgrounds and is found flexible and robust.
Of late, this model was modified incorporating dynamic textures for detecting
dynamic objects from non-static backgrounds [10]. From these, it can be concluded
that the methods mentioned above do not consider spatio-contextual information for
detecting changes and are pixel based. Such a spatio-contextual BGS scheme that
uses non-altering textural property, LBP histogram has been scrutinized in [11].
From these analyses, it is obvious that various techniques for object detection
applying background subtraction had been suggested by several analysts in the area
of Image Processing. As in the case of everything, these algorithms have their own
advantages and drawbacks. It can be brought to a conclusion that, for a good
background subtraction technique to be designed, it requires construction and
updation of a steady background which can exemplify the background of a video. It
must also distinguish the effect of noise and dynamic background.
In this work that is proposed, efforts has been taken to merge the benefits of
Gaussian model based background construction and the idea of Local binary pattern
for non-static object detection from a video sequel. Therefore the model can be
termed as a spatio-temporal model. Here, the proposal is a pioneer BGS method to
detect non-static objects from static and dynamic backgrounds. Uniqueness of the
model is in developing an unwavering spatio-temporal background scheme from a
certain video sequel which proficiently illustrates the features of background in the
sight. It is also noticed that the projected model offers better outcomes against small
variations in light, noise, and dynamic background. The efficacy of the scheme
proposed has been established by comparing with the BGS models using threshold,
Moving Object Detection Using Local Binary Pattern … 369
and two different Gaussian mixture models. The observation is that this proposal
offers better outcome in comparison with other existing methods considered.
As objects in a video are usually moving objects, this kind of variability changes
are pretty evident. When an object shifts from one frame to another throughout the
line of sight of a camera, image of an entity may change significantly. Listed below
are the characteristic challenges of background subtraction are Dynamic Back-
ground, Occlusion, Clutter, Camouflage, Presence of Shadow, Motion of the
Camera and Video Noise.
2 Proposed Algorithm
For texture matching in computer vision, one of the vital attribute descriptor is
Local Binary Pattern (LBP). It is a potent way of texture description. LBP operator
tags pixels of a scene by thresholding the zone (neighbors) of every pixel along
with the middle value and by arraying the outcome to binary codes [12, 13]. Local
Binary Pattern has few striking properties like gray scale invariance, non-parametric
computational simplicity, illumination invariant and high discriminative.
The basic one has eight pixels as neighbors, which are easily extensive to
contain a bigger neighborhood with any number of pixels. In LBP, a middle pixel is
taken and its M neighbors are considered. Then the difference between middle pixel
and its M neighbor pixels are computed to get the M-bit binary number. If the
neighborhood value is less than central pixel, the value is 0 and otherwise 1. From
this arose 2M discrete decimal values for binary pattern. The process is as illustrated
in the figure beneath. The value of the middle pixel is 7. Here we compares from the
upper pixel and maintains a clockwise direction to cover all its neighbors.
M −1
LBP M, R ðcx , cy Þ = ∑ T Iq − Ic ð1Þ
q=0
1 α≥0
T ðαÞ =
0 α<1
370 A.P. Athira et al.
Here Ic is the central pixel value with co-ordinate (cx, cy) and Iq is the neigh-
boring value of the pixels in a circle of radius 1 unit (Fig. 1).
The primary step for any foreground segmentation method is background con-
struction. The steadiness of the background model constructed decides accuracy of
object detection. The accurateness of segmentation gets corrupted if the background
constructed is degraded by noise. Even the dynamic background reduces the quality
of segmented foreground. Hence, it is very essential to put up a vigorous back-
ground construction scheme for precise moving object detection.
In the projected proposal, the model background is constructed by taking into
account the series of image frames that are previously available. The key concept of
the proposed work is the dependent pixels of similar texture in the background at a
certain position of the image that follows Gaussian distribution at temporal course.
This scheme is constructed using intensity values of these dependant pixels. The
dependency of pixels is checked using LBP histogram. For this reason, in the
projected proposal we have utilized LBP histogram to verify whether the pixel in a
certain position of the background model formed is reliant on the pixel in the
equivalent position in the succeeding image frames. Initially we set the first frame
as background.
Here µ(γ, δ) is the background model and I1(γ, δ) is the first frame. We test the
similarity between the pixels at location (γ, δ) using the following steps.
Step 1: Construct the LBP histogram for each pixel using binary masks for
background image and target image. LBP histogram for each pixel is computed
over neighboring pixels in a circular region.
Step 2: Find the similarity between target histogram and background histogram
corresponding to the pixel value (γ, δ). The similarity between the histograms can
be find out using the concept of correlation
Moving Object Detection Using Local Binary Pattern … 371
∑x ðHt ð xÞ − Ht ÞðHb ð xÞ − Hb Þ
DðHt , Hb Þ = qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi ð3Þ
∑x ðHt ð xÞ − Ht Þ2 ∑x ðHb ð xÞ − Hb Þ2
Here D(Ht, Hb) is similarity measure, Ht is the target LBP histogram corre-
sponding to particular pixel (γ, δ), and Hb is the background LBP histogram cor-
responding to the same pixel (γ, δ).
Step 3: If the value of D(Ht, Hb) does not deviate from 1, then it means the pixels
are similar, and the corresponding pixel is taken as of backgrounds.
If a pixel in the position (γ, δ) is noticed alike with the constructed background
then the model background is updated by taking into consideration the notion of
Gaussian averaging. Background can be updated using the given formula,
ði − 1Þ μðt − 1Þ ðγ, δÞ + It ðγ, δÞ
μt ðγ, δÞ = ð4Þ
i
and
1 i
var ðγ, δÞ = ∑ jIt ðγ, δÞ − μt ðγ, δÞj2 ð6Þ
i k=0
where var(γ, δ) represents the variance for pixel location (γ, δ).
8
< Iðγ, δÞ − μt ðγ, δÞ
1 if abs pffiffiffiffiffiffiffiffiffiffiffiffi ≥ Th
Rt + 1 = varðγ, δÞ ð7Þ
:
0 Otherwise
Here the absolute value is represented by abs(). At times the detected region is
not connected so for better result we have used some morphological techniques. Th
represents a manually given threshold value. The concept of morphological closing
on an image is used in the proposed work to remove small dark spots and connect
small bright cracks. Updation of background is the next stage of background
subtraction. In this proposal, the updation of the background model at each time
(t + 1) has been performed as below,
ηIt + 1 ðγ, δÞ + ð1 − ηÞμt ðγ, δÞ if Rt + 1 ðγ, δÞ = 0
μðt + 1Þ ðγ, δÞ = ð8Þ
βIt + 1 ðγ, δÞ + ð1 − βÞμt ðγ, δÞ Otherwise
and
ηðIt + 1 ðγ, δÞ − μt ðγ, δÞ2 + ð1 − ηÞvart ðγ, δÞ, if Rt + 1 ðγ, δÞ = 0
varðt + 1Þ ðγ, δÞ =
βðIt + 1 ðγ, δÞ − μt ðγ, δÞ2 + ð1 − βÞvart ðγ, δÞ Otherwise
ð9Þ
where vart(γ, δ) represents the variance of Gaussian model at pixel location (γ, δ). η
represents the constant value that varies from 0 to 1 and changes with every frame,
η is the ratio of total number of background pixels by total number of pixels. In
order to adjust swiftly for the pixels identified as background and gradually for the
pixels identified as foreground, β has to be much lesser than η and β usually is zero.
The projected proposal is executed in python with OpenCV and is run on Intel Core
i3 with Ubuntu OS and 4 GB RAM. For ascertaining efficacy of the projected
method, trials have been carried out on four video sequels. The chosen video
sequels are ‘Water Surface’, ‘PETS2006’, ‘Traffic’ and ‘Curtain’. So as to legiti-
mate the projected proposal, results achieved from the method is compared with
two different Gaussian algorithms [14, 15] and thresholding method.
For object detection by means of manual thresholdng method [1], the object in
an intended frame can be perceived by taking into consideration the absolute dif-
ference of background & intended frame and manual thresholding follows. Here,
0 to 255 is the range of threshold value. In the experiment different threshold values
are considered.
Moving Object Detection Using Local Binary Pattern … 373
The foremost instance that is appraised in this analysis is PETS2006 video sequel.
The video is taken from a railway station and depicts the area throughout a day.
Two images of this video sequence are presented in Fig. 2a. The detected object by
proposed work and three existing methods are given in Fig. 2b–d. From these
outcomes, it is examined that manual thresholding [1], GMM-1 [14], and GMM-2
[15] based Background Subtraction techniques have recognized non-static objects
from the frames with higher false rate. Compared to the three existing methods
proposed method give better result.
The second example that is appraised in our analysis is a non-static background
having one non-static object. Here we have chosen dynamic Water Surface video
sequence. The video is attained in a less illumine situation with dynamic back-
ground because of variation in surface of water in the lake. Two actual video
sequence of the video is presented in Fig. 3a. The object detected using thresh-
olding, GMM-1, GMM-2are given in Fig. 3b–d, respectively. In Thresholding and
(a) Video sequence (b) Object detected using (c) Object detected using
Thresholding GMM 1
(d) Object detected using Gaussian 2 (e) Moving object detected using
proposed work
(a) Water Surface –Video frames (b) Object detected using (c) Object detected using
thresholding GMM-1
(d) Object detected using GMM-2 (e) Object detected using proposed work
(a) Curtain-Video frames (b) Object detected using (c) Object detected using
Thresholding GMM-1
(d) Object detected using GMM-2 (e) Object detected by proposed work
(a) Traffic video frames (b) Object detected using (c) Object detected using
Thresholding GMM-1
(d) Object detected using GMM-2 (e) Object detected using proposed work
Finally we are considered traffic video sequence. This video sequence is noisy
and camera jitter also considered taking place in this video sequence. In our pro-
posed work we are using spatio-temporal method so the problem of camera jitter
can be solved. Because of camera jitter existing algorithms such as thresholding,
GMM-1, and GMM-2 falsely detect the moving object. In the case of GMM-1
because of noise and poor picture quality most of the portion of background is
detected as shadow. Proposed work give accurate result, so proposed work can
solve camera jitter, poor image clarity problems (Fig. 5).
The proposed work is found to offer superior results in comparison with
Thresholding, and two GMM-based approaches.
References
1. Bovic, A.L.: Image and Video Processing. Academic Press, New York (2000)
2. Lin, W., Sun, M.T., Poovendran, R., Zhang, Z.: Activity recognition using a combination of
category components and local models for video surveillance. IEEE Trans. Circuits Syst.
Video Technol. 18(8), 1128–1139 (2008)
3. Piccardi, M., Jan, T.: Mean-shift background image modelling. In: Proceedings of
International Conference on Image Processing, vol. 5, pp. 3399–3402 (2004)
4. Shi, Q., Cheng, L., Wang, L., Smola, A.: Discriminative human segmentation and recognition
using semi-Markov model. In: Proceedings of IEEE Conference on Computer Vision and
Pattern Recognition, pp. 1–8 (2008)
5. Subudhi, B.N., Ghosh, S., Ghosh, A.: Moving object detection using Gaussian background
model and Wronskian framework. In: Proceedings of 2nd International Conference on
Advances in Computing, Communications and Informatics, pp. 1775–1780 (2013)
6. Heikkila, J., Silven, O.: A real-time system for monitoring of cyclists and pedestrians. In:
Second IEEE Workshop on Visual Surveillance Fort Collins, Colorado, June 1999, pp. 74–81
7. Wren, C., Azarbayejani, A., Darrell, T., Pentland, A.: Pfinder: realtime tracking of the human
body. IEEE Trans. Pattern Anal. Mach. Intell. 19(7), 780–785 (1997)
8. Stauffer, C., Grimson, W.: Learning patterns of activity using real time tracking. IEEE Trans.
Pattern Anal. Mach. Intell. 22(8), 747–767 (2000)
9. Stauffer, C., Grimson, W.E.L.: Adaptive background mixture models for real-time tracking.
In: Proceedings of International Conference on Computer Vision and Pattern recognition,
pp. 2246–2252 (1999)
10. Chan, A., Mahadevan, V., Vasconcelos, N.: Generalized Stauffer and Grimson background
subtraction for dynamic scenes. Mach. Vis. Appl. 22, 751–766 (2011)
11. Heikkila, M., Pietikainen, M.: A texture-based method for modeling the background and
detecting moving objects. IEEE Trans. Pattern Anal. Mach. Intell. 28(4), 657–662 (2006)
12. Ojala, T., Pietikainen, M., Maenpaa, T.: Multiresolution gray-scale and rotation invariant
texture classification with local binary patterns. IEEE Trans. Pattern Anal. Mach. Intell. 24(7),
971–987 (2002)
13. Ahonen, T., Matas, J., He, C., Pietikäinen, M.: Rotation invariant image description with local
binary pattern histogram fourier features. In: Salberg, A.-B., Hardeberg, J.Y., Jenssen, R.
(eds.) SCIA 2009. LNCS, vol. 5575, pp. 61–70. Springer, Heidelberg (2009)
14. Zivkovic, Z.: Improved Adaptive Gaussian Mixture Model for Background Subtraction
(2004)
15. KadewTraKuPong, P., Bowden, R.: An improved adaptive background mixture model for
real-time tracking with shadow detection (2001)
16. Zivkovic, Z.: Efficient adaptive density estimation per image pixel for the task of background
subtraction. Pattern Recogn. Lett. 27(7), 773–780 (2006)