Hds Report Print

HUMAN DETECTION SYSTEM USING DEEP LEARNING
Submitted in partial fulfilment of the requirements of the degree of

Bachelor of Engineering
By
Ms. POURNIMA GHUDE ARMIET/AI21/GP218
Ms. JYOTI JADHAV ARMIET/AI21/JJ219
Mr. SANDIP PASHTE ARMIET/IT21/SP226
Mr. VIKAS SHARMA ARMIET/AI21/SV222
Supervisor:
Prof. Mayank Mangal
ALAMURI RATNAMALA INSTITUTE OF ENGINEERING AND

TECHNOLOGY
Affiliated to
UNIVERSITY OF MUMBAI
Department of Information Technology (IT)

Academic Year – 2022-2023
CERTIFICATE
This mini project report entitled “Human Detection System using Deep Learning”
by Ms. Pournima Ghude, Ms. Jyoti Jadhav, Mr. Vikas Sharma, Mr. Sandip Pashte is
approved for the degree of Bachelor of Engineering in Information Technology
for academic year 2022 – 2023.
Examiners
1.
2.
Supervisor
1.
Prof. Mayank Mangal
Head of the Department Principal

Prof. Shailesh Nandgaonkar
Date:
Place:
Declaration
We declare that this written submission represents my ideas in my own words and where others' ideas
or words have been included, we have adequately cited and referenced the original sources. We also
declare that we have adhered to all principles of academic honesty and integrity and have not
misrepresented or fabricated or falsified any idea/data/fact/source in my submission. We understand
that any violation of the above will be cause for disciplinary action by the Institute and can also evoke
penal action from the sources which have thus not been properly cited or from whom proper permission
has not been taken when needed.
_______________ _________________ __________________ ___________________

Sandip Pashte Pournima Ghude Jyoti Jadhav Vikas Sharma
TABLE OF CONTENTS
CH.
NO TOPIC NAME PAGE NO.
LIST OF FIGURES I
LIST OF TABLES I
LIST OF SYMBOLS AND ABBREVATION I
ABSTRACT II
INTRODUCTION 1
1
1.1 Introduction 2
1.2 Objective of the project 2

1.3 Main Purpose 2
2 LITERATURE REVIEW 4
SYSTEM DESIGN AND ARCHITECTURE 6
3 3.1 Existing System 7

3.2 Proposed System 7
3.3 Technology 7
3.4 Methodology 8
3.5 Working 8
3.6 Unified Modelling Language (UML) 9-12
HARDWARE AND SOFTWARE 13
4
4.1 Hardaware Requirment 14
4.2 Software Requirment 15
PROJECT IMPLEMENTATION 16
5 5.1 Implementation 17-18
CONCLUSION 19
FUTURE WORK 21
REFERENCE 22
ACKKNOWLEDGMENT 24
LIST OF FIGURES
Sr. No. Fig. No. Figure Name Page No.
1 3.6.1 Class Diagram 12
2 3.6.2 Use Case Diagram 13
3 3.6.3 System Flowchart 14
4 3.6.4 System Architecture 15
5 3.6.4 Output 17
List of tables
Sr.No. Table No. Table Name Page No.
1 2.1 Review of Literatures 4
2 4.2.1 Application 7
LIST OF SYMBOLS AND ABBREVIATIONS
RAM Random Access Memory

GPU Graphics Processing Unit
SE Standard Edition
VMS Virtual Machine Specification
API Application Programming Interface
I
ABSTRACT
Targeting the current Covid 19 pandemic situation, this report identifies the need of crowd management. Thus,
proposes an effective and efficient real-time human detection and counting solution specifically for shopping
malls by producing a system with GUI and management functionalities. Besides, comprehensively reviews and
compares the existing techniques and similar systems to select the ideal solution for this scenario. Specifically,
advanced deep learning computer vision techniques are decided by using TensorFlow for detecting and
classifying the human objects each detected human object and perform counting using intrusion line judgement.
Additionally using “TensorFlow” format for better and faster real-time computation using GPU instead of using
CPU as the traditional target machine. The experimental results have proven this implementation combination to
be 91.07% accurate and real-time capable with testing videos from
The internet to simulate the shopping mall entrance scenario.
I
HUMAN DETECTION SYSTEM
CHAPTER 1
INTRODUCTION
Alamuri ratnamala institute of engineering and technology 1

1. INTRODUCTION
1.1 Introduction
Real-time human detection and tracking is a vast, challenging and important field of research. It has
wide range of applications in human recognition, human computer interaction (HCI), video surveillance
etc. The research for biometric authentication of a person has reached far but the real-time tracking of
human beings has not gained much importance. Tracking of human being can be used as a prior step in
biometric face recognition. Keeping continuous track of person will allow to identify person at any time.
The system consist of two parts first human detection and secondly tracking. Human detection step is
split into face detection and eye detection. Face is a vital part of human being represent most important
information about the individual. Eyes are the important biometric feature used in person identification.
Face detection is done using skin color-based methods. Color model is used to detect skin regions as it
represents intensity and color information separately. For eye region detection projection function and
pixel count methods are used.
1.2 Objective of the Project

The motive of object detection is to recognize and locate all known objects in a scene. Preferably in 3D
space, recovering pose of individual in 3D is very important for robotic control systems. Imparting
intelligence to machines and making robots more and more autonomous and independent has been a
sustaining technological dream for the mankind. It is our dream to let the robots take on tedious, boring,
or dangerous work so that we can commit our time to more creative tasks. Unfortunately, the intelligent
part seems to be still lagging behind. In real life, to achieve this goal, besides hardware development,
we need the software that can enable robot the intelligence to do the work and act independently. One
of the crucial components regarding this is vision, apart from other types of intelligences such as
learning and cognitive thinking.

1.3 Main Purpose

A number of surveillance scenarios require the detection and tracking of people. Although person
detection and counting systems are commercially available today, there is need for further research to
address the challenges of real-world scenarios. The focus of this work is the segmentation of groups of
people into individuals and tracking them over time. The relevant applications of this algorithm are
people counting and event detection. Experiments document that the presented approach leads to robust
people counts.

CHAPTER 2
LITERATURE REVIEW

2. LITERATURE REVIEW
2.1 Review of literatures:

Extracting high level features is an important field in video indexing and retrieving. Identifying the
presence of human in video is one of these high-level features, which facilitate the understanding of
other aspects concerning people or the interactions between people. Our work proposes a method for
identifying the presence of human in videos. Experimental results demonstrate the successfulness of the
algorithm used and its capability in detecting faces under different challenges. The proposed work is
crucial in lots of applications whose concern is mainly human activities and can be a basic step in such
activities. So, for that an algorithm has been proposed to detect the presence of human in video sequence

CHAPTER 3
SYSTEM DESIGN AND ARCHITECTURE

3.1 Existing System:

Most existing digital video surveillance systems rely on human observers for detecting specific
activities ina real-time video scene. However, there are limitations in the human capability to monitor
simultaneous events in surveillance displays. Hence, human motion analysis in automated video
surveillance has becomeone of the most active an attractive research topics in the area of computer
vision and pattern recognition.
3.2 Proposed System:

When presented with an image or video, object detection TensorFlow works by identifying known
objects from all instances through the help of computer vision. The history of object detection is as
recent as the internet. The first recorded neural network for object detection was Overfeat, as
developers believed objectdetection would help improve image identification. TensorFlow object
detection combines Deep and Machine Learning for object recognition. Through APIs, developers
can run TensorFlow trained models rather than build from scratch. This saves them a lot of time and
improves their predictive accuracy. Also, TensorFlow object detection tutorial ensures anyone can
apply a model without prior knowledge of Machine Learning. However, they'd still need a basic
understanding of Python.
3.3 Applications:
People counting has a wide range of applications in the context of pervasive systems. These
applications range from efficient allocation of resources in smart buildings to handling emergency
situations. There exist several vision-based algorithms for people counting. Each algorithm performs
differently in terms of efficiency, flexibility and accuracy for different indoor scenarios. Hence,
evaluating these algorithms with respect to different application scenarios, environment conditions
and camera orientations will provide a better choice for actual deployment. For this purpose, in our
paper the most commonly implemented Frame Differencing, Circular Hough Transform and
Histogram of Oriented Gradient based methods are evaluated with respect to different factors like
camera orientation, lighting, occlusion etc. The performance of these algorithms under different
scenarios demonstrates the need for more accurate and faster people counting algorithms.

3.4 Methodology:
TensorFlow is an open-source API from Google, which is widely used for solving machine learning
tasks that involve Deep Neural Networks. TensorFlow Object Detection API is an open-source library
made based on TensorFlow for supporting training and evaluation of Object Detection models. Today
we will take a look at “TensorFlow Detection Model”, which is a collection of pre-trained models
compatible with TensorFlow Object Detection API. PyCharm is a programming language that
translates an abstract idea into a program design we can see on screens. PyCharm presents a three-
step approach for creating programs which are to design the appearance of the application, assign
property settings to the objects of your program & write the code to direct specific tasks at runtime.
3.5 Working:
User just need to download the file and run the main.py on their local system. On the starting window
of the application, user will be able to see START and EXIT option using whichuser can start the
application or exit from the application. When user starts the application using START button, a
new window will open, which allows user withoptions like, DETECT FROM IMAGE, DETECT
FROM VIDEO or DETECT FROM CAMERA. When user selects any of the first two option,
he/she needs to select the respective files using SELECTbutton. User can preview the selected file
using PREVIEW button, and detect and count the humans using DETECTbutton. And when user
selects, the last option of detecting through camera, user need to open the Camera, usingOPEN
CAMERA button, As soon as camera opens, detection process will start. After detection process gets
completed or user manually completes it, two graph get plotted,
1.) Enumeration Plot(Human Count Vs. time) and
2.) Avg. Accuracy Plot(Avg. Accuracy Vs. time).
Along with this two plots, an option to generate crowd report also appears, On clicking on it, a crowd
reportin form of PDF is generated and saved automatically at the project file location. In the crowd
report generated, there will be information like, What is Max Human Count, Max Accuracy,Max
Avg. Accuracy, and also a two line status about crowd.

3.6 UNIFIED MODELING LANGUAGE (UML)
3.6.1 Class Diagram:
Fig 5.3.1: Class Diagram
The HUMAN DETECTION SYSTEM Class Diagram is a modeled diagram that explain its classes and
relationships. The diagram depicts the names and attributes of the classes, as well as their links and,
their methods. It is the most essential type of UML diagram which is critical in software development.
It is an approach to show the system’s structure in detail, including its properties and operations.
The HUMAN DETECTION SYSTEM must have a designed diagram to define the classes neededfor
the desired outcome. It is used to model the items that make up the system, depict their relationships,
and define what those objects perform and the services they provide.

3.6.2 Use Case Diagram:
Fig 5.3.2: Use Case Diagram
The objective of a use case diagram is to show the interactions of numerous items called actors with
the use case and to capture fundamental functionalities of a system. As you see through the diagrams,
there are the use cases involved to define the core functions of a system. These processes were
expected by the users to be connected to produce a certain output. Being a programmer, this could be
an important role that the HUMAN DETECTION SYSTEM general Use Case Diagram should have

3.6.3 System Flowchart Diagram:
Fig 5.3.3: System Flowchart
The data included in the System flow chart diagram was labeled properly to guide the developers
onthe graphical representation of the HUMAN DETECTION SYSTEM.

3.6.4 System Architecture:
Fig 5.3.4: System Architecture
A system architecture is the conceptual model that defines the structure, behavior, and more views of
a system. An architecture description is a formal description and representation of a system, organized
in a way that supports reasoning about the structures and behaviors of the system. These diagrams
visualize the boundaries, along with the software, nodes, and processors that make up the system.
They can also help you understand how different components communicate with each other. Not only
that, but they also give you an overview of the physical hardware in the system.

CHAPTER 4
HARDWARE AND SOFTWARE REQUIRMENT

4.1 Hardware Requirements:

 Laptop or Computer
 Processor: Intel core i5 (or above version)
 RAM: 8GB (or above)
 Storage: 512 GB SSD (or above)
 GPU: GTX 1080 (or above)
4.2 Software Requirements:

 Platform: Windows 10/11
 Python
 PyCharm
PYTHON:
Python is a multi-paradigm programming language. Object-oriented programming and structured

programming are fully supported, and many of its features support functional programming and
aspect- oriented programming (including metaprogramming and metaobjects).Many other paradigms
are supported via extensions, including design by contract and logic programming. Python uses
dynamic typing and a combination of reference counting and a cycle-detecting garbage collectorfor
memory management. It uses dynamic name resolution (late binding), which binds method and
variablenames during program execution.
Its design offers some support for functional programming in the Lisp tradition. It has filter,
mapandreduce functions; list comprehensions, dictionaries, sets, and generator expressions. The
standard library has two modules (itertools and functools) that implement functional tools borrowed
from Haskell and Standard ML. Rather than building all of its functionality into its core, Python was
designed to be highly extensible via modules.

This compact modularity has made it particularly popular as a means of adding programmable
interfaces to existing applications. Van Rossum's vision of a small core language with a large
standard library and easily extensible interpreter stemmed from his frustrations with ABC, which
espoused the opposite approach.
Python strives for a simpler, less-cluttered syntax and grammar while giving developers a choice in
their coding methodology. In contrast to Perl's "there is more than one way to do it" motto, Python
embraces a "there should be one—and preferably only one—obvious way to do it" philosophy.
Python's developers strive to avoid premature optimization and reject patches to non-critical parts
of the When speed is important, a Python programmer can move time-critical functions to extension
modules written in languages such as C; or use PyPy, a just-in-time compiler. Cython is also
available, which translates a Python script into C and makes direct C-level API calls into the Python
interpreter.
Python's developers aim for it to be fun to use. This is reflected in its name—a tribute to the British
comedy group Monty Python and in occasionally playful approaches to tutorials and reference
materials, such as examples that refer to spam and eggs instead of the standard foo andbar. The
programming language's name 'Python' came from the BBC Comedy series Monty Python's Flying
Circus. Guido van Rossum thought he needed a name that was short, unique and slightly mysterious,
and so, he decided to name the programming language 'Python'.
A common neologism in the Python community is pythonic, which has a wide range of meanings
related toprogram style. "Pythonic" code may use Python idioms well, be natural or show fluency in
the language, orconform with Python's minimalist philosophy and emphasis on readability. Code that
is difficult tounderstand or reads like a rough transcription from another programming language is
called un-pythonic. Python users and admirers, especially those considered knowledgeable or
experienced, are often referred toas Pythonistas.
PYCHARM:
PyCharm is a dedicated Python Integrated Development Environment (IDE) providing a wide range
of essential tools for Python developers tightly integrate to create a convenient environment for
productive Python, web, and data science development.

CHAPTER 5
IMPLEMENTATION

5.1 PROJECT IMPLEMENTATION


CONCLUSION
Alamuri Ratnamala Institute of Engineering and Technology 19

CONCLUSION
Despite years of progress in the United States, alcohol-impaired driving remains a major preventable
public health and safety issue. Over the past 10 years progress has stalled, and for thepast 2 years the
number of alcohol-impaired fatalities has increased. Alcohol-impaired driving crashes cost the nation
billions of dollars in terms of loss of life, injury and disability, lost productivity, and social and
economic consequences, and it needs to become a priority issue across stakeholders and sectors.
Changes in the social, economic, technological, and clinical arenas haveopened up new opportunities
to take action on this multifaceted problem. A renewed and comprehensive approach is needed to
successfully reduce serious injuries and fatalities. Furthermore, a one-size-fits-all approach will not
suffice, as U.S. states make up a patchwork of different existing policies, needs, and resources. Vision
Zero, a philosophy that is grounded in thebelief that no death caused by a motor vehicle crash is
acceptable, provides a framework to engage novel stakeholders and a system with the goal of
eliminating alcohol-impaired traffic fatalities. With this philosophy, each alcohol-impaired driver
represents a system failure.

FUTURE SCOPE
Big data applications are consuming most of the space in industry and research area. Among the
widespreadexamples of big data, the role of video streams from CCTV cameras is equally important
as other sources like social media data, sensor data, agriculture data, medical data and data evolved
from space research. Surveillance videos have a major contribution in unstructured big data. CCTV
cameras are implemented inall places where security having much importance. Manual surveillance
seems tedious and time consuming.Security can be defined in different terms in different contexts
like theft identification, violence detection, chances of explosion etc.
In crowded public places the term security covers almost all type of abnormal events. Among them
violencedetection is difficult to handle since it involves group activity. The anomalous or abnormal
activity analysisin a crowd video scene is very difficult due to several real-world constraints. The
paper includes a deep- rooted survey which starts from object recognition, action recognition, crowd
analysis and finally violencedetection in a crowd environment. Majority of the papers reviewed in
this survey are based on deep learningtechnique. Various deep learning methods are compared in
terms of their algorithms and models.

REFERENCES

REFERENCES
1. D. Comaniciu and V. Ramesh, "Robust detection and tracking of human faces with an active camera,"
in Proceedings of Third IEEE International Workshop on Visual Surveillance, pp. 11-18, 2000.
2. H. Yang, L. Shao, F. Zheng, L. Wang, and Z. Song, "Recent advances and trends in visual tracking:
A review," Neuro Computing, pp. 1-9, 2011.
3. Hemdan, S. Karungaru, and K. Terada, "Facial features-based method for human tracking,"
Workshop on Frontiers of Computer Vision, 2011.
4. C. N. RaviKumar and B. A, "An efficient skin illumination compensation model for efficient face
detection," IEEE Industrial Electronics IECON, pp. 3444-3449, 2006.

ACKNOWLEDGEMENT
In our project we are extremely thankful to our project guide Prof. Mayank Mangal for his
valuable support and time. We would like to take this opportunity to acknowledgement the
innumerable guidance and support extended to us by our co guide in preparation of the
synopsis. We also want to thank our honorable principle for his support. Our foremost thanks
go to my well-wishers and colleagues. We are graceful to all staff members, non-teaching staff
and all our friends us the helping hand.

Hds Report Print

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Hds Report Print

Uploaded by

Copyright:

Available Formats

HUMAN DETECTION SYSTEM USING DEEP LEARNING

Submitted in partial fulfilment of the requirements of the degree of

ALAMURI RATNAMALA INSTITUTE OF ENGINEERING AND

Department of Information Technology (IT)

Head of the Department Principal

_______________ _________________ __________________ ___________________

1.2 Objective of the project 2

3 3.1 Existing System 7

5 5.1 Implementation 17-18

Sr. No. Fig. No. Figure Name Page No.

1 3.6.1 Class Diagram 12

2 3.6.2 Use Case Diagram 13

3 3.6.3 System Flowchart 14

4 3.6.4 System Architecture 15

Sr.No. Table No. Table Name Page No.

1 2.1 Review of Literatures 4

RAM Random Access Memory

Alamuri ratnamala institute of engineering and technology 1

1.2 Objective of the Project

Alamuri ratnamala institute of engineering and technology 2

1.3 Main Purpose

Alamuri ratnamala institute of engineering and technology 3

Alamuri ratnamala institute of engineering and technology 4

2.1 Review of literatures:

Alamuri ratnamala institute of engineering and technology 5

Alamuri ratnamala institute of engineering and technology 6

3.1 Existing System:

3.2 Proposed System:

Alamuri ratnamala institute of engineering and technology 7

Alamuri ratnamala institute of engineering and technology 8

3.6 UNIFIED MODELING LANGUAGE (UML)

3.6.1 Class Diagram:

Fig 5.3.1: Class Diagram

Alamuri ratnamala institute of engineering and technology 9

3.6.2 Use Case Diagram:

Fig 5.3.2: Use Case Diagram

Alamuri ratnamala institute of engineering and technology 10

3.6.3 System Flowchart Diagram:

Fig 5.3.3: System Flowchart

Alamuri ratnamala institute of engineering and technology 11

3.6.4 System Architecture:

Fig 5.3.4: System Architecture

Alamuri ratnamala institute of engineering and technology 12

HARDWARE AND SOFTWARE REQUIRMENT

Alamuri ratnamala institute of engineering and technology 13

4.1 Hardware Requirements:

 Processor: Intel core i5 (or above version)

 RAM: 8GB (or above)

 Storage: 512 GB SSD (or above)

 GPU: GTX 1080 (or above)

4.2 Software Requirements:

Python is a multi-paradigm programming language. Object-oriented programming and structured

Alamuri ratnamala institute of engineering and technology 14

Alamuri ratnamala institute of engineering and technology 15

Alamuri ratnamala institute of engineering and technology 16

5.1 PROJECT IMPLEMENTATION

Alamuri ratnamala institute of engineering and technology 17

Alamuri ratnamala institute of engineering and technology 18

Alamuri Ratnamala Institute of Engineering and Technology 19

Alamuri Ratnamala Institute of Engineering and Technology 20

Alamuri Ratnamala Institute of Engineering and Technology 21

Alamuri Ratnamala Institute of Engineering and Technology 22

Alamuri Ratnamala Institute of Engineering and Technology 23

_______ _ _________