Professional Documents
Culture Documents
INDIVIDUAL SETECTION
TEAM MEMBERS
AJAY KRISHNAN 20BEC1355
JANAKI MANOJ 20BEC1213
OBJECTIVE
•
• Δx(y) = horizontal (vertical) scale of an object at y
• Δxref = horizontal (vertical) reference scale.
• q(y) is the ratio for different locations
• Yref= reference location
• Perspective correction method yv=the line where extension of parelle line
meet(vanishing point)
• Computation of total number of foreground pixels
• imgY = height of the processing image.
• N(y) =number of foreground pixels in the yth row
• Npixel = total no of foreground pixels.
• q(y) = ratio for different locations
• Estimating the relation between total foreground pixels and no of people by using neural network
• METHOD 1
• By directing giving the foreground as input and finding the relation with total number of people .
M =f(x)
Where M =total no of people
X = no of foreground pixels
•
• METHOD 2
• Based on Closed Foreground Pixels
Solid foreground blob represent moving people
Scattered pixels represent stationary crowd
In order to bring uniformity areas located with people are covered by white pixels and others by
black pixels. And these are given as iput to the neural network and relationship is found out.
M = F2(c)
M= total no of individual
c = total no of closed foreground pixels
• METHOD 3
• Both foreground pixels and foreground pixels after closing operation is given to the neural
network and the relation is found out.
• M = f3(c,x)
• It is seen that the foreground pixels after doing closed operation is used for further process.
HUMAN TRACKING/DETECTION
• Only segmenting foreground blobs would work as the resolution of the image is low.
• For feature detection
• The four hour video was taken at 10 fps, and the image resolution is 640 ∗ 480 pixels. One image scene
every 100 seconds was used for the evaluation
• total of 153 images were extracted from the original four-hour video.
• In the set of images, the number of people in the scene ranges from 36 to 222.
• The training set consists of 102 images, which were formed by taking the first two images out of every
three consecutive images
• The test set is composed of the remaining 51 images.
• To increase the speed of people counting, all the images were resized to 320 ∗ 240 pixels
RESULTS
• It was seen that the accuracy increased after doing closing operation
FUTRE IMPROVEMENTS
• Texture inside the foreground region can be used as another input for the neural network
• Combination of foreground pixels and feature point clustering method feature point can be used
to get more feature points of human being.
• Higher resolution camera can be used for accurately detecting human and non human objects.