
RESUME SCREENING

MINOR PROJECT SYNOPSIS
of
BACHELOR OF TECHNOLOGY
in
COMPUTER SCIENCE & ENGINEERING
by

Shivansh Singhal (Enrollment No: 20514802720)
Anurag Kumar Thakur (Enrollment No: 35714802720)
Taniya Sharma (Enrollment No: 20214802720)

Guided by:
Dr. Ashish Sharma
And
Dr. Sandeep Tayal

DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING


MAHARAJA AGRASEN INSTITUTE OF TECHNOLOGY
(AFFILIATED TO GURU GOBIND SINGH INDRAPRASTHA UNIVERSITY, DELHI)
Resume Screening
Differentiating Resumes from Different Fields

INTRODUCTION

Traditional resume screening is time-intensive and subjective. This project
proposes an automated alternative that aims to make the hiring process faster
and more consistent. It combines Python libraries (Pandas, NumPy, Matplotlib,
and Seaborn) with Natural Language Processing (NLP) techniques to automate
and streamline resume evaluation. Using these data manipulation and
visualization tools, the system aims to improve the accuracy and efficiency of
candidate assessment, mitigate bias, and provide a more equitable and
effective way of identifying the most qualified candidates.

PROBLEM STATEMENT

The traditional resume screening process is time-consuming and subjective,
which makes it hard to identify the most qualified candidates efficiently. Manual
review is prone to human bias and oversight, and the sheer volume of resumes
received adds to the workload of hiring teams, increasing the likelihood that
qualified candidates are overlooked. This project addresses these issues with a
streamlined, automated solution built on Natural Language Processing (NLP) and
data manipulation techniques. The aim is to mitigate bias and improve the
accuracy and speed of candidate evaluation, for a more effective and equitable
hiring process.

OBJECTIVE AND SCOPE

The project's scope covers the entire resume screening process within the
hiring domain. It involves automating candidate evaluation by integrating
Natural Language Processing (NLP) techniques with Python libraries such as
Pandas, NumPy, Matplotlib, and Seaborn. The scope extends to handling
diverse resume formats, extracting relevant information, and presenting
structured outputs. Data visualization tools make the results easier to
interpret, supporting efficient and unbiased candidate screening.

The primary objective is to streamline the hiring process by addressing the
shortcomings of manual resume screening. The project aims to automate the
extraction of key information from resumes using NLP, with Pandas and
NumPy handling data manipulation. Matplotlib and Seaborn are used to
visualize the results, making it easier for hiring teams to assess and select
qualified candidates.
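
As a rough illustration of the visualization objective, the sketch below plots
how many resumes fall into each category using Pandas and Matplotlib. The
category names and the `predicted` series are made-up placeholders, not project
results, and the off-screen backend is chosen only so the sketch runs without a
display.

```python
import matplotlib
matplotlib.use("Agg")  # render off-screen so the sketch runs without a display
import matplotlib.pyplot as plt
import pandas as pd

# Hypothetical screening output: one predicted category per resume.
predicted = pd.Series(
    ["Data Science", "Java Developer", "Data Science",
     "HR", "Data Science", "Java Developer"],
    name="Category",
)

# Count resumes per category, most frequent first.
counts = predicted.value_counts()

# One bar per category.
fig, ax = plt.subplots(figsize=(6, 3))
ax.bar(counts.index, counts.values)
ax.set_xlabel("Predicted category")
ax.set_ylabel("Number of resumes")
ax.set_title("Resumes per category")
fig.tight_layout()
```

In a notebook the figure would be displayed inline; elsewhere it could be
written to disk with `fig.savefig(...)`.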

METHODOLOGY

Dataset Preparation

The project utilizes a Kaggle dataset comprising 1000 resumes, forming the
foundation for our methodology. Employing Pandas and NumPy, we conduct data
preprocessing and manipulation to ensure data integrity. Natural Language
Processing (NLP) techniques are then applied to extract relevant information from
the resumes. The structured data is visualized using Matplotlib and Seaborn,
providing insights for streamlined candidate evaluation. This dataset-driven
approach ensures the practical applicability and effectiveness of our resume
screening methodology.
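
The preprocessing step could look roughly like the sketch below. It is a
minimal example under stated assumptions: the column names `Category` and
`Resume`, the three sample rows, and the cleaning rules in the hypothetical
helper `clean_resume` are illustrative and not taken from the project's
dataset or notebook.

```python
import re

import pandas as pd


def clean_resume(text: str) -> str:
    """Normalize raw resume text (hypothetical helper, illustrative rules)."""
    text = re.sub(r"https?://\S+", " ", text)      # drop URLs
    text = re.sub(r"\S+@\S+", " ", text)           # drop e-mail addresses
    text = re.sub(r"[^A-Za-z0-9+#]+", " ", text)   # keep letters, digits, '+' and '#'
    return re.sub(r"\s+", " ", text).strip().lower()


# Tiny stand-in for the Kaggle data; column names are assumptions.
df = pd.DataFrame({
    "Category": ["Data Science", "Java Developer", "Data Science"],
    "Resume": [
        "Skills: Python, Pandas & NumPy.\nContact: jane@example.com",
        "Built REST APIs in Java; see http://example.com/portfolio",
        "Skills: Python, Pandas & NumPy.\nContact: jane@example.com",
    ],
})

# Remove rows with missing text and exact duplicate rows, then clean the text.
df = df.dropna(subset=["Resume"]).drop_duplicates().reset_index(drop=True)
df["cleaned"] = df["Resume"].apply(clean_resume)
```

Keeping `+` and `#` is a deliberate choice in this sketch so that skill names
such as C++ and C# survive the cleaning.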

Training and Evaluation

In this project, three scikit-learn components are central to training and
evaluation: KNeighborsClassifier, OneVsRestClassifier, and accuracy_score.
KNeighborsClassifier classifies resumes by similarity, using the k-nearest
neighbors algorithm. OneVsRestClassifier wraps a base classifier in a
one-vs-rest strategy, fitting one binary classifier per class, which also extends
the model to multilabel classification tasks, important for diverse resume
content. accuracy_score is not a classifier but a metric: it evaluates the models
by measuring the fraction of predictions that match the true labels. Together,
these components make the project adaptable and provide a quantitative basis
for evaluating the screening.
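
The three components might fit together roughly as follows. This is a sketch,
not the project's actual notebook: the TF-IDF vectorizer, the 75/25 split,
`n_neighbors=3`, and the twelve toy resumes are all assumptions made for
illustration.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.multiclass import OneVsRestClassifier
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline

# Toy stand-in for cleaned resume text and its categories (made up).
texts = [
    "python pandas numpy machine learning models",
    "machine learning python statistics regression",
    "deep learning python data analysis",
    "data analysis pandas statistics python",
    "java spring hibernate backend services",
    "java microservices spring rest api",
    "backend java maven rest services",
    "spring boot java api development",
    "recruitment payroll employee relations",
    "employee onboarding recruitment policies",
    "payroll benefits employee engagement",
    "hr policies recruitment talent acquisition",
]
labels = ["Data Science"] * 4 + ["Java Developer"] * 4 + ["HR"] * 4

# Hold out a stratified quarter of the data for evaluation.
X_train, X_test, y_train, y_test = train_test_split(
    texts, labels, test_size=0.25, stratify=labels, random_state=42
)

# TF-IDF features (an assumption) feeding a one-vs-rest k-NN classifier.
model = make_pipeline(
    TfidfVectorizer(),
    OneVsRestClassifier(KNeighborsClassifier(n_neighbors=3)),
)
model.fit(X_train, y_train)

predictions = model.predict(X_test)
accuracy = accuracy_score(y_test, predictions)  # fraction of exact matches
print(f"Accuracy on held-out resumes: {accuracy:.2f}")
```

On a dataset this small the score means little; the point is only the order of
the steps: vectorize, fit, predict, score.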

Modelling

1. KNeighborsClassifier:
The KNeighborsClassifier is a fundamental model in our methodology,
implementing the k-nearest neighbors algorithm. In the context of resume
screening, it evaluates a resume by its similarity to others in the dataset:
the resume is assigned the class label that is most common among its k nearest
neighbors, so that patterns already present in the data drive the decision. The
KNeighborsClassifier lets the project identify commonalities and differences
between resumes, supporting a data-driven approach to candidate assessment.
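
To make the voting rule concrete, here is a from-scratch sketch of k-nearest
neighbors in NumPy. It is for illustration only (the project itself uses
scikit-learn's KNeighborsClassifier); the function `knn_predict`, the
two-dimensional keyword-count features, and the sample values are hypothetical.

```python
from collections import Counter

import numpy as np


def knn_predict(train_X, train_y, query, k=3):
    """Return the majority label among the k training points nearest to query."""
    # Euclidean distance from the query to every training vector.
    distances = np.linalg.norm(train_X - query, axis=1)
    # Indices of the k smallest distances.
    nearest = np.argsort(distances)[:k]
    # Majority vote over the neighbors' labels.
    votes = Counter(train_y[i] for i in nearest)
    return votes.most_common(1)[0][0]


# Toy features per resume: [mentions of "python", mentions of "java"].
train_X = np.array([[5, 0], [4, 1], [0, 5], [1, 4], [6, 1]], dtype=float)
train_y = ["Data Science", "Data Science", "Java Developer",
           "Java Developer", "Data Science"]

prediction = knn_predict(train_X, train_y, np.array([4.0, 0.0]))
print(prediction)  # Data Science
```

For the query `[4, 0]` the three nearest training resumes all carry the
"Data Science" label, so the vote is unanimous.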

2. OneVsRestClassifier:
The OneVsRestClassifier extends the project's applicability to multilabel
classification scenarios. Resume content is diverse, and a candidate may
possess skills spanning multiple categories; the OneVsRestClassifier handles
each label independently by fitting one binary classifier per label. This is
important for capturing the various skills, experiences, and qualifications
that candidates present in their resumes. The OneVsRestClassifier improves the
project's adaptability, so that it can process and categorize resumes with
multiple skill sets.
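
A small multilabel sketch of this idea follows. The skill labels, the
three-dimensional keyword-count features, and the use of `MultiLabelBinarizer`
with `n_neighbors=1` are assumptions made for illustration, not details taken
from the project.

```python
import numpy as np
from sklearn.multiclass import OneVsRestClassifier
from sklearn.neighbors import KNeighborsClassifier
from sklearn.preprocessing import MultiLabelBinarizer

# Toy features per resume: [python mentions, sql mentions, java mentions].
X = np.array([
    [5, 0, 0],
    [4, 4, 0],
    [0, 5, 0],
    [0, 0, 5],
    [0, 4, 4],
    [5, 0, 4],
], dtype=float)

# Each resume may carry several skill labels at once (made-up data).
skills = [
    {"python"},
    {"python", "sql"},
    {"sql"},
    {"java"},
    {"java", "sql"},
    {"java", "python"},
]

# Encode the label sets as a binary indicator matrix, one column per skill.
mlb = MultiLabelBinarizer()
Y = mlb.fit_transform(skills)

# One binary k-NN classifier is fitted per label column.
clf = OneVsRestClassifier(KNeighborsClassifier(n_neighbors=1)).fit(X, Y)

# Predict the skill set of a new resume and decode it back to label names.
predicted = mlb.inverse_transform(clf.predict(np.array([[4.0, 3.0, 0.0]])))
print(predicted)  # [('python', 'sql')]
```

Because each label gets its own binary classifier, the model can return zero,
one, or several skills for the same resume.
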
TECHNOLOGIES USED

Operating System:
• Windows 10/11 x64

Language Used:
• Python

Editors:
• Jupyter Notebook
• Google Colab

Libraries:
• Pandas
• NumPy
• Matplotlib
• Seaborn
• TensorFlow

Fig: A simple flowchart representing the phases of development.


CONCLUSION

The "Resume Screening using NLP" project is a step toward automated
candidate evaluation. Using the KNeighborsClassifier and OneVsRestClassifier
models, the system handles the variety of resume content and supports both
single-category classification and multilabel scenarios. The
KNeighborsClassifier provides a data-driven view of resume similarity, while
the OneVsRestClassifier accommodates resumes with diverse skill sets,
reflecting the multifaceted nature of candidate qualifications.

Using accuracy_score as the performance metric gives a quantitative
assessment of the models. Through data manipulation, NLP techniques, and
visualization tools, the project streamlines the hiring process and addresses
the inefficiencies and biases associated with manual resume screening.

Together, these models form an automated solution that aims to speed up
hiring while making it more objective and accurate. The project's value lies in
its adaptability to diverse resume formats and its ability to distill meaningful
insights from large datasets, making recruitment more efficient and effective.
