By
Ms. POURNIMA GHUDE ARMIET/AI21/GP218
Ms. JYOTI JADHAV ARMIET/AI21/JJ219
Mr. SANDIP PASHTE ARMIET/IT21/SP226
Mr. VIKAS SHARMA ARMIET/AI21/SV222
Supervisor:
Prof. Mayank Mangal
Affiliated to
UNIVERSITY OF MUMBAI
This mini project report entitled “Human Detection System using Deep Learning”
by Ms. Pournima Ghude, Ms. Jyoti Jadhav, Mr. Vikas Sharma, Mr. Sandip Pashte is
approved for the degree of Bachelor of Engineering in Information Technology
for academic year 2022 – 2023.
Examiners
1.
2.
Supervisor
1.
Prof. Mayank Mangal
Date:
Place:
Declaration
We declare that this written submission represents our ideas in our own words and where others' ideas
or words have been included, we have adequately cited and referenced the original sources. We also
declare that we have adhered to all principles of academic honesty and integrity and have not
misrepresented, fabricated or falsified any idea/data/fact/source in our submission. We understand
that any violation of the above will be cause for disciplinary action by the Institute and can also evoke
penal action from the sources which have thus not been properly cited or from whom proper permission
has not been taken when needed.
LIST OF FIGURES
LIST OF TABLES
LIST OF SYMBOLS AND ABBREVIATIONS
ABSTRACT
INTRODUCTION
1.1 Introduction
CONCLUSION
FUTURE WORK
REFERENCES
ACKNOWLEDGEMENT
LIST OF FIGURES
5 3.6.4 Output
LIST OF TABLES
2 4.2.1 Application
LIST OF SYMBOLS AND ABBREVIATIONS
ABSTRACT
In response to the Covid-19 pandemic, this report identifies the need for crowd management and
proposes an effective and efficient real-time human detection and counting solution, aimed specifically at
shopping malls, in the form of a system with a GUI and management functionalities. It also comprehensively
reviews and compares existing techniques and similar systems to select the most suitable solution for this
scenario. Specifically, deep learning computer vision techniques implemented with TensorFlow are chosen for
detecting and classifying human objects, and each detected human object is counted using intrusion line
judgement. Additionally, the “TensorFlow” format is used for better and faster real-time computation on a GPU
instead of a CPU, the traditional target machine. In the experiments, this combination of techniques was
91.07% accurate and capable of real-time operation on test videos taken from the internet to simulate the
shopping mall entrance scenario.
HUMAN DETECTION SYSTEM
CHAPTER 1
INTRODUCTION
1.1 Introduction
Real-time human detection and tracking is a vast, challenging and important field of research. It has a
wide range of applications in human recognition, human-computer interaction (HCI), video surveillance,
etc. Research on biometric authentication of a person has advanced considerably, but real-time tracking
of human beings has received much less attention. Tracking of human beings can be used as a preliminary
step in biometric face recognition: keeping continuous track of a person makes it possible to identify that
person at any time. The system consists of two parts: human detection, followed by tracking. The human
detection step is split into face detection and eye detection. The face is a vital part of a human being and
carries the most important information about the individual. Eyes are an important biometric feature used
in person identification. Face detection is done using skin color-based methods. A color model that
represents intensity and color information separately is used to detect skin regions. For eye region
detection, projection function and pixel count methods are used.
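The skin color-based step described above can be illustrated with a minimal sketch. The report does not name the color model or its thresholds, so the YCbCr model, the full-range BT.601 conversion and the Cb/Cr ranges below are assumptions chosen only for illustration; the function names are hypothetical.

```python
# Minimal sketch of skin-color classification in a color model that separates
# intensity (Y) from color (Cb, Cr). ASSUMPTIONS: YCbCr is used, and the
# Cb/Cr ranges are commonly quoted illustrative values, not the report's own.

CB_RANGE = (77, 127)   # assumed chrominance-blue range for skin
CR_RANGE = (133, 173)  # assumed chrominance-red range for skin


def rgb_to_ycbcr(r, g, b):
    """Convert 8-bit RGB to full-range YCbCr (BT.601 coefficients)."""
    y = 0.299 * r + 0.587 * g + 0.114 * b
    cb = 128 - 0.168736 * r - 0.331264 * g + 0.5 * b
    cr = 128 + 0.5 * r - 0.418688 * g - 0.081312 * b
    return y, cb, cr


def is_skin(r, g, b):
    """Classify one pixel using only its color components (Cb, Cr)."""
    _, cb, cr = rgb_to_ycbcr(r, g, b)
    return (CB_RANGE[0] <= cb <= CB_RANGE[1]
            and CR_RANGE[0] <= cr <= CR_RANGE[1])


def skin_mask(image):
    """image: rows of (r, g, b) tuples; returns rows of booleans."""
    return [[is_skin(*pixel) for pixel in row] for row in image]
```

Because the decision ignores Y, the same pixel is classified consistently under moderate changes in brightness, which is the reason for choosing a model that separates intensity from color.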
CHAPTER 2
LITERATURE REVIEW
CHAPTER 3
SYSTEM DESIGN AND ARCHITECTURE
3.3 Applications:
People counting has a wide range of applications in the context of pervasive systems. These
applications range from efficient allocation of resources in smart buildings to handling emergency
situations. There exist several vision-based algorithms for people counting. Each algorithm performs
differently in terms of efficiency, flexibility and accuracy for different indoor scenarios. Hence,
evaluating these algorithms with respect to different application scenarios, environment conditions
and camera orientations will provide a better choice for actual deployment. For this purpose, in our
paper the most commonly implemented Frame Differencing, Circular Hough Transform and
Histogram of Oriented Gradients based methods are evaluated with respect to different factors like
camera orientation, lighting, occlusion etc. The performance of these algorithms under different
scenarios demonstrates the need for more accurate and faster people counting algorithms.
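As an illustration of the simplest of the methods named above, the following is a minimal sketch of Frame Differencing on grayscale frames stored as nested lists. The threshold value and the function names are assumptions made for this example; a real deployment would typically operate on camera frames through an image library.

```python
# Minimal sketch of frame differencing: a pixel is marked as "moving" when its
# grayscale value changes by more than a threshold between two frames.
# ASSUMPTIONS: frames are equal-sized lists of rows of 0-255 integers, and the
# threshold of 25 is an arbitrary illustrative value.


def frame_difference_mask(prev_frame, curr_frame, threshold=25):
    """Return a binary mask (1 = changed pixel) for two grayscale frames."""
    return [
        [1 if abs(curr - prev) > threshold else 0
         for prev, curr in zip(prev_row, curr_row)]
        for prev_row, curr_row in zip(prev_frame, curr_frame)
    ]


def motion_ratio(mask):
    """Fraction of pixels marked as changed; 0.0 for an empty mask."""
    total = sum(len(row) for row in mask)
    return sum(sum(row) for row in mask) / total if total else 0.0
```

The method is fast but only reacts to change between frames, so a person standing still disappears from the mask, which is one reason its accuracy depends strongly on the scenario.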
3.4 Methodology:
TensorFlow is an open-source library from Google, which is widely used for solving machine learning
tasks that involve deep neural networks. The TensorFlow Object Detection API is an open-source library
built on TensorFlow to support training and evaluation of object detection models. This work looks at
the “TensorFlow Detection Model Zoo”, which is a collection of pre-trained models compatible with the
TensorFlow Object Detection API. PyCharm is an integrated development environment (IDE) for Python
in which an abstract idea is turned into a program design we can see on screen. Programs are created
with a three-step approach: design the appearance of the application, assign property settings to the
objects of the program, and write the code to direct specific tasks at runtime.
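To make the use of a pre-trained detection model concrete, the sketch below shows how human detections might be picked out of a model's output and counted. It assumes the output follows the usual TensorFlow Object Detection API layout (detection_boxes, detection_classes, detection_scores) and that the model was trained with the COCO label map, in which class 1 is "person". The 0.5 score threshold and the function name are hypothetical, and the model call itself is not shown.

```python
# Minimal sketch: count people in one frame's detection output.
# ASSUMPTIONS: `detections` is a dict of plain Python lists laid out like the
# TensorFlow Object Detection API output, the label map is COCO (person = 1),
# and min_score = 0.5 is an illustrative confidence threshold.

PERSON_CLASS_ID = 1  # assumed COCO label id for "person"


def count_people(detections, min_score=0.5):
    """Return (human_count, average_score) for one frame."""
    scores = [
        score
        for cls, score in zip(detections["detection_classes"],
                              detections["detection_scores"])
        if int(cls) == PERSON_CLASS_ID and score >= min_score
    ]
    average = sum(scores) / len(scores) if scores else 0.0
    return len(scores), average


# Example with made-up values: two confident people, one weak one, one dog.
sample = {
    "detection_boxes": [[0.1, 0.1, 0.5, 0.3], [0.2, 0.4, 0.9, 0.6],
                        [0.0, 0.0, 0.2, 0.2], [0.5, 0.5, 0.9, 0.9]],
    "detection_classes": [1, 1, 1, 18],
    "detection_scores": [0.9, 0.7, 0.3, 0.95],
}
human_count, avg_score = count_people(sample)
```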
3.5 Working:
The user only needs to download the project files and run main.py on their local system. The starting
window of the application shows START and EXIT options, with which the user can start the
application or exit from it. When the user starts the application with the START button, a new
window opens offering three options: DETECT FROM IMAGE, DETECT FROM VIDEO and DETECT
FROM CAMERA. For either of the first two options, the user selects the respective file using the
SELECT button, can preview the selected file using the PREVIEW button, and detects and counts the
humans using the DETECT button. For the last option, detection through the camera, the user opens
the camera using the OPEN CAMERA button; as soon as the camera opens, the detection process
starts. After the detection process completes, or the user manually ends it, two graphs are plotted:
1.) Enumeration Plot (Human Count vs. Time) and
2.) Avg. Accuracy Plot (Avg. Accuracy vs. Time).
Along with these two plots, an option to generate a crowd report also appears. Clicking it generates a
crowd report in the form of a PDF, which is saved automatically at the project file location. The crowd
report contains information such as the Max Human Count, Max Accuracy and Max Avg. Accuracy,
as well as a two-line status about the crowd.
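The values listed in the crowd report can be derived from the per-frame measurements that feed the two plots. The following is a minimal sketch of that summary step only; the sample layout, the crowd_limit threshold, the wording of the status lines and the function name are all assumptions, and PDF generation is not shown.

```python
# Minimal sketch of the crowd report summary.
# ASSUMPTIONS: each sample is (human_count, max_accuracy, avg_accuracy) for
# one processed frame; crowd_limit and the status text are hypothetical.


def summarize_crowd(samples, crowd_limit=25):
    """Return the report fields as a dict, or None if nothing was detected."""
    if not samples:
        return None
    max_count = max(count for count, _, _ in samples)
    if max_count > crowd_limit:
        status = ["Region is crowded.", "Human count exceeds the limit."]
    else:
        status = ["Region is not crowded.", "Human count is within the limit."]
    return {
        "max_human_count": max_count,
        "max_accuracy": max(acc for _, acc, _ in samples),
        "max_avg_accuracy": max(avg for _, _, avg in samples),
        "status": status,
    }


report = summarize_crowd([(3, 0.91, 0.80), (5, 0.97, 0.85), (4, 0.88, 0.86)])
```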
The HUMAN DETECTION SYSTEM class diagram models the system's classes and their relationships.
The diagram depicts the names and attributes of the classes, as well as their links and their methods.
The class diagram is the most essential type of UML diagram and is critical in software development.
It shows the system’s structure in detail, including its properties and operations.
The HUMAN DETECTION SYSTEM needs a class diagram to define the classes required for the
desired outcome. It is used to model the items that make up the system, depict their relationships,
and define what those objects do and the services they provide.
The objective of a use case diagram is to show the interactions of the various actors with the use
cases and to capture the fundamental functionality of a system. As the diagrams show, the use cases
define the core functions of the system. Users expect these processes to be connected to produce a
certain output. For a programmer, this is the important role that the HUMAN DETECTION SYSTEM
general use case diagram should fulfil.
The data included in the system flow chart diagram is labeled properly to guide the developers on
the graphical representation of the HUMAN DETECTION SYSTEM.
A system architecture is the conceptual model that defines the structure, behavior, and other views of
a system. An architecture description is a formal description and representation of a system, organized
in a way that supports reasoning about the structures and behaviors of the system. These diagrams
visualize the boundaries, along with the software, nodes, and processors that make up the system.
They can also help you understand how different components communicate with each other. Not only
that, but they also give you an overview of the physical hardware in the system.
CHAPTER 4
Python
PyCharm
PYTHON:
Python's design offers some support for functional programming in the Lisp tradition. It has filter,
map and reduce functions; list comprehensions, dictionaries, sets, and generator expressions. The
standard library has two modules (itertools and functools) that implement functional tools borrowed
from Haskell and Standard ML. Rather than building all of its functionality into its core, Python was
designed to be highly extensible via modules.
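A short example may help show the functional features listed above. It uses only the standard library; the per-frame human counts are made-up sample data.

```python
# Functional-style tools from the Python core and standard library.
# The `counts` list is made-up sample data (human count per frame).
from functools import reduce
from itertools import accumulate

counts = [3, 0, 5, 2, 0, 7]

non_empty = list(filter(lambda n: n > 0, counts))    # frames with people
doubled = list(map(lambda n: n * 2, non_empty))      # apply a function to each
total = reduce(lambda a, b: a + b, counts, 0)        # fold into one value
running_total = list(accumulate(counts))             # itertools: running sums
squares = [n * n for n in counts if n > 0]           # list comprehension
busy_frames = sum(1 for n in counts if n >= 5)       # generator expression
```

In Python 3, reduce lives in the functools module rather than among the built-in functions, which is why it is imported explicitly here.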
This compact modularity has made it particularly popular as a means of adding programmable
interfaces to existing applications. Van Rossum's vision of a small core language with a large
standard library and easily extensible interpreter stemmed from his frustrations with ABC, which
espoused the opposite approach.
Python strives for a simpler, less-cluttered syntax and grammar while giving developers a choice in
their coding methodology. In contrast to Perl's "there is more than one way to do it" motto, Python
embraces a "there should be one—and preferably only one—obvious way to do it" philosophy.
Python's developers strive to avoid premature optimization and reject patches to non-critical parts
of the CPython reference implementation that would offer marginal increases in speed at the cost of
clarity. When speed is important, a Python programmer can move time-critical functions to extension
modules written in languages such as C, or use PyPy, a just-in-time compiler. Cython is also
available, which translates a Python script into C and makes direct C-level API calls into the Python
interpreter.
Python's developers aim for it to be fun to use. This is reflected in its name, a tribute to the British
comedy group Monty Python, and in occasionally playful approaches to tutorials and reference
materials, such as examples that refer to spam and eggs instead of the standard foo and bar. The
programming language's name 'Python' came from the BBC comedy series Monty Python's Flying
Circus. Guido van Rossum thought he needed a name that was short, unique and slightly mysterious,
and so he decided to name the programming language 'Python'.
A common neologism in the Python community is pythonic, which has a wide range of meanings
related to program style. "Pythonic" code may use Python idioms well, be natural or show fluency in
the language, or conform with Python's minimalist philosophy and emphasis on readability. Code that
is difficult to understand or reads like a rough transcription from another programming language is
called un-pythonic. Python users and admirers, especially those considered knowledgeable or
experienced, are often referred to as Pythonistas.
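The difference between the two styles can be seen in a small, hypothetical example that labels per-frame human counts. Both functions produce the same result; the function names and data are invented for illustration.

```python
# Two ways to build labels such as "frame 0: 3" from a list of counts.


def label_counts_unpythonic(counts):
    """Reads like a transcription from a C-style language."""
    labels = []
    i = 0
    while i < len(counts):
        labels.append("frame " + str(i) + ": " + str(counts[i]))
        i = i + 1
    return labels


def label_counts_pythonic(counts):
    """Uses enumerate, an f-string and a list comprehension."""
    return [f"frame {i}: {count}" for i, count in enumerate(counts)]
```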
PYCHARM:
PyCharm is a dedicated Python Integrated Development Environment (IDE) providing a wide range
of essential tools for Python developers, tightly integrated to create a convenient environment for
productive Python, web, and data science development.
CHAPTER 5
IMPLEMENTATION
CONCLUSION
Despite years of progress in the United States, alcohol-impaired driving remains a major preventable
public health and safety issue. Over the past 10 years progress has stalled, and for the past 2 years the
number of alcohol-impaired fatalities has increased. Alcohol-impaired driving crashes cost the nation
billions of dollars in terms of loss of life, injury and disability, lost productivity, and social and
economic consequences, and it needs to become a priority issue across stakeholders and sectors.
Changes in the social, economic, technological, and clinical arenas have opened up new opportunities
to take action on this multifaceted problem. A renewed and comprehensive approach is needed to
successfully reduce serious injuries and fatalities. Furthermore, a one-size-fits-all approach will not
suffice, as U.S. states make up a patchwork of different existing policies, needs, and resources. Vision
Zero, a philosophy that is grounded in the belief that no death caused by a motor vehicle crash is
acceptable, provides a framework to engage novel stakeholders and a system with the goal of
eliminating alcohol-impaired traffic fatalities. With this philosophy, each alcohol-impaired driver
represents a system failure.
FUTURE SCOPE
Big data applications occupy a large share of both industry and research. Among the
widespread examples of big data, video streams from CCTV cameras are as important
as other sources such as social media data, sensor data, agriculture data, medical data and data from
space research. Surveillance videos make a major contribution to unstructured big data. CCTV
cameras are installed in all places where security is of great importance. Manual surveillance
is tedious and time consuming. Security can be defined in different terms in different contexts,
such as theft identification, violence detection, chances of explosion, etc.
In crowded public places the term security covers almost all types of abnormal events. Among them,
violence detection is difficult to handle since it involves group activity. Analysis of anomalous or
abnormal activity in a crowd video scene is very difficult due to several real-world constraints. The
paper includes a deep-rooted survey which starts from object recognition and action recognition,
moves on to crowd analysis and finally covers violence detection in a crowd environment. The majority
of the papers reviewed in this survey are based on deep learning techniques. Various deep learning
methods are compared in terms of their algorithms and models.
REFERENCES
1. D. Comaniciu and V. Ramesh, "Robust detection and tracking of human faces with an active camera,"
in Proceedings of Third IEEE International Workshop on Visual Surveillance, pp. 11-18, 2000.
2. H. Yang, L. Shao, F. Zheng, L. Wang, and Z. Song, "Recent advances and trends in visual tracking:
A review," Neurocomputing, pp. 1-9, 2011.
3. Hemdan, S. Karungaru, and K. Terada, "Facial features-based method for human tracking,"
Workshop on Frontiers of Computer Vision, 2011.
4. C. N. RaviKumar and B. A, "An efficient skin illumination compensation model for efficient face
detection," IEEE Industrial Electronics IECON, pp. 3444-3449, 2006.
ACKNOWLEDGEMENT
We are extremely thankful to our project guide, Prof. Mayank Mangal, for his
valuable support and time. We would like to take this opportunity to acknowledge the
innumerable guidance and support extended to us by our co-guide in the preparation of the
synopsis. We also want to thank our honorable principal for his support. Our foremost thanks
go to our well-wishers and colleagues. We are grateful to all staff members, non-teaching staff
and all our friends who lent us a helping hand.