You are on page 1of 5

TAAI2015 Tainan, Taiwan Nov.

20-22, 2015

Fall down detection for surveillance system of


health care
Wei Quan Naoyuki Kubota
Graduate School of System Design Graduate School of System Design
Tokyo Metropolitan University Tokyo Metropolitan University
Tokyo, Japan 191-0065 Tokyo, Japan 191-0065
Email: quan-weil@ed.tmu.ac.jp Email: Naokubota@nifty.com

Abstract-Since the world technology grows faster and faster, system. In Section 3, we will briefly explain the main steps
the people is becoming much more health than ever before, thus of the whole system, and the proposed methodology for
getting longer of living age. On the other hand however, the
fall detection will explained in section 4. Further experiment
rising number of elderly people also course the problem such
as the aging of population. One case is that the population of
results will be presented in Section 5. Last but not least, the
elderly who live alone is increasing and more assistance should final section will draw the conclusions and future extension.
support on the situation the health care resource is less. Thus we
proposed the surveillance system to apply for this situation. This II. REV IEW
paper focuses on the surveillance system which focuses on the
According to [1], the fall detection system can be broadly
individual house to detect the unmoral behavior such as falling
down when elderly people lives alone. And comparing with the
categorized into two types: wearable device based system
most popular methodology such as Aspect Ratios, the method and context-aware system. The first type briefly includes the
we proposed has conquered its weakness and performed will in collaboration between the aware sensor (in most cases the
most situations. sensor is camera) and the wearable device. The vast majority
of wearable fall detectors are in the form of accelerometer
I. INTRODUCTION
devices. Some of them also incorporate other sensors such as
The rapid development of Computer Science, video surveil­ gyroscopes to obtain information about the patient's position.
lance system has been widely used in various fields such as By wearing these assist device the data will get much more
bank surveillance and pedestrian tracking, etc[5]. On the other exact and complicated.
hand, requirement also becomes overwhelming tendency for Nevertheless, since the fall detection system always serve
video surveillance system. the people, we should considering that elderly people always
Camera tracking system has become a common need in forget to put on the device. By missing the wearable device
the today's society. This system is dealt in biometrics and the whole system will decrease the accuracy largely and will
situation/context awareness technologies. Currently, the bio­ ignore some accident, thus we will have to remind the elderly
metric technology is widely used in the real world situation. to wear their device, This seems much less convenience.
Contrarily, the increase use of the camera tracking system has Based on this truth, we focus on the research of second type,
resulted to linear decrease of its price each year. i.e. context-aware system. Handling and computing all the
In recent years, image processing has been used in lots of information from the data that sent only from the camera.
fields such as bank surveillance and pedestrian tracking [3] [6]. Or we can even using embedded device such as DSP ships to
Background Construction system which using static sensors compute the image data since it is collected by the camera.
can be used in the area of static situation such as plaza mall,
ATM, etc. Nevertheless, despite it's wide application for public III. SYSTEM O V ERV IEW

security, the increasing number of ordinary fields have intro­ The system is similar to the other tracking system. The
duced the surveillance system for the more convenient life. main flow follows the : background construction/foreground
For example, the house surveillance system for elderly people abstract, foreground extraction, foreground human modeling
who live alone. According to the World Health Organization, and fall detection system, which can be seen in figure 1. And
approximately 28-35% of people aged 65 and over fall each this part will explain each step respectively.
year increasing to 32-42% for those over 70 years of age. And
falls are a major cause of fatal injury especially for the elderly. A. Background Construction/Foreground abstract

Considering that large part of these aged people live alone, it In most cases, the cameras settled in the house are fixed in
is significant that introducing the assistant system automatic some certain places such as the roof and the corner of wall.
surveillance to support them health care surveillance. Thus in most time the image that shot by cameras are the
The remaining of this paper is organized as follow: Section same. Based on this, background model. In video surveillance
2 briefly reviews the referent methodology of fall detection system, the sensors are typically stationary and scene of the

978-1-4673-9606-6/15/$31.00 ©2015 IEEE 232


TAAI2015 Tainan, Taiwan Nov. 20-22, 2015

_ 0

Human
Foreground Fall
Video Foreground Modeling
Object detection
frames Detection (Head
Extract Evaluation
Detection)

Fig. 1. The main flow of the fall detection system.

monitor is static in the most part of time, thus it is possible to


construct a background and detect change for monitoring[7].
Different from certain fast-tracking algorithms, research based
on the foreground detecting provides more accurate monitoring
effort in video tracking.
Background modeling is often applied in various applica­
tions to model the background and detect the moving objects in
the scene like in video surveillance, optical motion capture and Fig. 2. The result of foreground detection by CodeBook. The top is original
multimedia. The simplest way to model the background is to image, and the bottom is foreground flxels
acquire a background image which doesn't include any moving
object. In certain conditions, the background is unavailable and
always be changed under critical situations like illumination
Model (GMM) which using several Gaussian distributions to
changes, objects being introduced or removed from the scene.
represent the probability density function to handle the obsta­
To handle these problems of robustness and adaptation, many
cle of situation mentioned before. To make background and
background modeling methods have been developed and the
foreground modeling more multifarious, in 2002, Elgammal,
most recent surveys can be found. These background modeling
A. et al. proposed a method to construct the background pdf by
methods can be classified in the following categories: Ba­
using kernel density estimation. Construct probability density
sic Background Modeling, Statistical Background Modeling,
function of background for each pixel in the image by the
Fuzzy Background Modeling and Background Estimation.
kernel function is constructed by every sample data separately.
Other classifications can be found in term of prediction , recur­
sion , adaptation, or modality. All these modeling approaches Reading the literature, two remarks can be made: (1) The
are used in background subtraction context which presents the most used model is the pixel-wise MOG one proposed by
following steps and issues: background modeling, background Stauffer and Grimson due to a good compromise between
initialization, background maintenance, foreground detection, robustness to the critical situations and the constraints (CT,
choice of the feature size (pixel, a block or a cluster), choice of MR). (2) There are many improvements of this MOG model as
the feature type (color features, edge features, stereo features, shown by the different acronyms found like GMM, TLGMM,
motion features and texture features). STGMM, SKMGM, TAPPMOG and STAPPMOG. (3) Other
Based on the methods which have mentioned before, we can proposed method such as Kernel Density Estimation is also
classify them into several types: most of these are based on the widely used in video tracking system for background construc­
statistical model which means catching the suddenly change tion and foreground detection. All the developed strategies
of gray or color density for a pixel that appears in the scenes. attempt to be more rigorous statistically or to introduce spatial
Among these mature algorithms, most understandable method and/or temporal constraints. The objective is to group these
is just to calculate the maximum and minimum values of gray different strategies in one paper and classify them following
scale for each pixel and the absolute difference between two the different steps and issues of the MOG model. The idea is
coherent images. For a pixel in a sequence of image, the pixel not to present a numeric comparative evaluation using ROC
in this frame will be regarded as a foreground pixel if the or PDR analysis of the different algorithms due their large
absolute value is larger than a threshold. But because of its number and because this evaluation can be generally found in
simplified structure, it performs less ideal when applied in the related paper.
some complicated situations. Thus Gaussian distribution was In our system, we chose CodeBook algorithm to construct
presented for the construction of probability density function the background and detect foreground. The codeBook was
(pdf) for each pixel to get the foreground. However, single proposed by Kyungnam Kim, etc [2]. Comparison with other
Gaussian model cannot handle some certain situation such as multimode modeling algorithms shows that the codebook
the density of a pixel has several distribution ranges, thus Chris algorithm has good properties on several background modeling
Stauffer and W.E.L Grimson proposed the Gaussian Mixture problems. The result can be seen in figure 2.

233
TAAI2015 Tainan, Taiwan Nov. 20-22, 2015

• CORE POINT

• E < '"

• NOISE POINT

Fig. 3. The illustration of DBSCAN algorithms

B. Foreground Extraction

Even though various clustering algorithms can be applied,


clustering in video surveillance has many restrictions. For
example, K-means algorithm is one of the most famous
clustering algorithms because of its efficient perform. Despite
these advantages, several features make it unsuitable for clus­
tering in video surveillance such as it has to initial k before
calculation and cannot exclude noise, etc. For the reason of Fig. 4. The Top part is the original image and the bottom is the result of the
foreground object extraction by using DBSCAN, where red points represent
self-adaptive, an algorithm should be selected and applied in the edge points
video monitoring.
In order to make sure the accuracy, we chose DBSCAN
algorithm to fulfill this part. DBSCAN is a density-based can not recognize the fall of human in some certain case.
clustering algorithm. The foreground detection only gives the Based on this truth, we proposed one simplified model.
foreground individual pixels, without any other information. The model divides the human shape into two parts: head
Hence we have to know some detail such as the shape of and torso. And detect the head and torso respectively and
tracking objects. Thus have to choose a cluster algorithm for analysis the relationship between them to analogizing the
the ROI extraction[4]. human current status at the same time.
The main step of the DBSCAN may have three: core point For the part of human head, it will be described as the
extraction, edge point extraction and noise point exclusion. following:
Core point is the point whose density within its area is larger
than a certain threshold. The result of extraction can be seen (1)
in figure 3.
where He(x, y) represents the 2-dimensional positional cen­
IV. HUMAN FALL DETECTION troid point and Fe refer to the pixels which constructing the
Since we have extracted the foreground human shape by the head. Similarity, the torso part can also been defined as the
steps described above, the only step left is the fall detection. As same. Generally the head is concluded by the hair and the skin.
we explained before, Aspect ratio[8] can detect the fall expect for the pixel p with the RGB value {r, g, b}, we can evaluate
some special cases, for instance, when the people fall towards
1.2g < r < 2.2g and
or back to the direction of the camera. In this situation, since
1.2b < r < 2.2b and
the people only change the length of height, the ratio of the
60 < r (2)
height and width will not change strongly, aspect ratio detector
may sometimes ignore this accident, thus some terrible result
would be happen.
Even there exit various kinds of human models, even some where the p will be regarded as skin if Skinp is equal to
of them are extremely close to the original human model, there 1, and will be regarded as body otherwise. This method is
is still lack of suitable model for fall down detection. Among under the assumption of the skin color of Asian. By using
these algorithms, Aspect Ratio of human height and width is this method we can evaluate the position of the face. In case
used most frequently. The structure is quite simple and the that some other parts of skin will confuse the analysis, the
accuracy can be also ideal in most cases. Nevertheless, it still color of hair will also introduced to help to identify the head.

234
TAAI2015 Tainan, Taiwan Nov. 20-22, 2015

In our experiment, we expect the people are Asian and the


color of hair are black in most situations.
Despite the head part, the rest part of this foreground object
would be marked as the torso definitely. Similar with the head,
the body part can be also described as:

(3)

where Tc (x, y) represents the centroid position of torso part


and Ft is the total pixels. The torso can be also divided into
body and bottom in further research, but right now we just
regard them as on part for the simplification.
After the head and torso have been constructed, we can
confine the relationship of these two parts. And the relationship Fig. 5. The comparison between Aspect Ratio and proposed method. In this
of them is described as two values here: the angle and distance case both Aspect Ratio and proposed method proposed the alert correctly.

of the centroid points. the two evaluating value e and dist of


a human will be computed by:
figure, the blue and red point represent the center of the head
Yh - Yt and torso respectively. In normal status, in rectangle and the
e = arctan (4)
Xh - Xt line will stay in green, and if the fall detection happens, the
color will turn to red.
And the dist can be computed by Euclidian distance as:
On another situation, which can be seen in figure 6, that the
person fall along the direction of the camera. The top is the
(5)
normal standing status, which both Aspect Ratio and proposed
For the first parameter angel e, the human will be regarded method do not make the alert. In the middle figure, the man sit
as standing(normal status) if the degree of e is within some on the ground, without lying, thus the system did not propose
certain range like : alert either. But in the bottom figure, the man lying on the
ground then the system made the alert. At the same time, we

Status =
{ Unmoral
Normal Threshold1
Otherwise
< e < Threshold2 can see that the alert signal which proposed by Aspect Ratio
still behave normal. Proper setting of the parameter will help
(6) us to distinguish the sitting status and lying status.
In this case, we supposed the Thresholdl and 2 will be equal
VI. CONCLUSION
to 45° and 135° respectively, and any status which is out of
Here we just make a row model for indoor human detection.
this range will be regarded as suspicious and make further
It performed better comparing with the standard Aspect Ratio,
evaluation.
and because the simple structure, it can be also embedded into
Despite these case, there are some other possibilities which
hardware chip such as DSP chip for camera. Nevertheless,
the human fall towards or back to the direction of the camera.
the really situation can be much more complicate then the
We should use other way to conquer this weakness. And for
experiment situation. We will continue working on this topic
the parameter dist, the human will be regarded as fall down
and make deeper research in the future time. And the head
if dist is less then a threshold as:
locating by using skin color is not stable in real situation. We

Status =
{ UNnmora
ormal
l
Threshold3
Otherwise
< dist
(7)
should consider the more stable methods to improve it.
Shadow is another weakness of this system. Considering
that the living house is limit, the shadow would cause a lot of
By combining equation (6) and (7) we can get a method problem. Our future extend would also find a better solution
to evaluate the fall detection status and has ideal performance to solve this problem.
according to the experiment. The detail of the experiment will
REFERENCES
be explain in the next section.
[1] Igual. R, Medrano. C and Plaza. I, Chanllenges, issues and trends in fall
V. EXPERIMENT RESULT detection systems , Biome. Eng. Online, 2013.
[2] K. Kim, T. H. Chalidabhongse, D. Harwood and L. Davis" Real-time
To prove our assumption, we tested several videos with Foreground-Background Segmentation using Codebook Model , Real­
different situation. The foreground human will be marked with time Imaging, Volume 11, Issue 3, Pages 167-256, June 2005.
[3] J. Wang, Analogue winner-take-all neural networks for determining
the rectangle and the head and torso of it are described within maximum and minimum signals , Int. J. Electronics, vol. 77, no. 3,
the rectangle too. In our experiment, the warning mark of pp. 355-367, 1994.
two comparing methods - Aspect ratio and proposed method [4] Ester . M, Kriegel . H. E, Sander J., Xu. X, A density-based algorithm
for discovering clusters in large spatial databases with noise , Proc.
- will be marked with different color of rectangle and line. 2rid Int. Conf. on Knowledge Discovery and Data Mining, Portland, OR,
The explanation of the system can be seen in figure 5. In this AAAI Press, 1996.

235
TAAI2015 Tainan, Taiwan Nov. 20-22, 2015

Fig. 6. The comparison between Aspect Ratio and proposed method. The
top is normal standing status, and the middle is sitting on the ground, and the
bottom is lying on the ground

[5] Trucco . E and Plakas . K, Video Tracking: A Concise Survey, IEEE


JOURNAL OF OCEANIC ENGINEERING, Vol. 31, No. 2, 2006.
[6] I. Haritaoglu, D. Harwood, L. Davis, W4: a real time system for detecting
and tracking people, Proceedings of the IEEE Conference on Autoatic
Face and Gesture Recognition, pp. l5-64, 1999.
[7] K. P. Karmann and A. von Brandt, Moving object recognition using
and adaptive background memory, Time-Varying Image Processing
and Moving Object Recognition. Amsterdam, The Netherlands: Elsevier,
1990.
[8] Miaou SG, Sung PH, Huang CY, A customized human fall detection
system using omni-camera images and personal information, Proceed­
ings of the 1st Distributed Diagnosis and Home Healthcare Conference.
Arlington: Institute of Electrical and Electronics Engineers; 2006.

236

You might also like