You are on page 1of 2

Named Entity Recognition System using Deep Learning

Project Domain / Category


Natural Language Processing/Deep Learning

Abstract / Introduction
Named Entity Recognition (NER) is a fundamental task in natural language processing (NLP) that involves
identifying and classifying named entities in text. Named entities are real-world objects, such as people,
organizations, locations, dates, and times. NER has a wide range of applications, including information
extraction, machine translation, question answering, and text summarization.

The admin (student) will develop a system to detect Named entity and find accuracy by applying appropriate
deep learning techniques for Named entity recognition. The system will also compare which techniques are
best for Named entity recognition and why.

Functional Requirements:

Admin (Student) will perform all the following tasks.

1. Data Annotation: Annotate dataset for named entities and preprocess the text data. There will be
following tags for dataset (ORGANIZATION, PERSON, LOCATION, DATE, TIME, MONEY,
PERCENT, FACILITY, GPE).
2. Model Selection and Development: Investigate NER models and select an appropriate architecture
(e.g., BiLSTM-CRF, LSTM, CRF, BERT-based models) for development.
3. Train & Test Data: Split data into 70% training and 30% testing data sets.
Train the selected model on the annotated dataset to learn to recognize named entities effectively.
4. Evaluation and Fine-tuning: Assess the model's performance using standard NER evaluation metrics
(e.g., F1-score, precision, recall) and fine-tune the model for improved accuracy.
5. Confusion Matrix: Create a confusion matrix table to describe the performance of a classification
model.
6. Accuracy Evaluation: Find the accuracy of all techniques and compare their accuracy.
*This project will also tell us which machine learning technique is better to detect Named entity.
7. Named Entity Categorization: Develop a system that not only recognizes entities but also
categorizes them into predefined categories (e.g., person, organization, location).
8. Integration with Applications: Build an interface for users to input text and integrate the NER
system into applications that demonstrate its utility.

Dataset:
https://drive.google.com/file/d/1oR5fq7VbUBPIVX64SBmY19LFcnVK0cYu/view?usp=drive_link
*You must use your VU email id to access/download the dataset.

Tools:
 Python
 jupyter notebook
 Colab
 PyQt
 wxPython
 Tkinter
 Kivy
 PySimpleGUI
Prerequisite:
Artificial Intelligence, Machine Learning, and Natural Language Processing Concepts,
"Admin (student) s will cover a short course relevant to the mentioned concepts besides SRS and
Design initial documentation or see the links below."

Helping Material
Python
https://www.python.org/
https://www.w3schools.com/python/
https://www.tutorialspoint.com/python/index.htm
Deep Learning:
https://www.tutorialspoint.com/python_deep_learning/index.htm
https://www.tutorialspoint.com/deep-learning-tutorials/index.asp
https://www.youtube.com/watch?v=VyWAvY2CF9c
https://www.youtube.com/watch?v=6M5VXKLf4D4

Supervisor:
Name: Umair Ali
Email ID: umairali@vu.edu.pk
Skype ID: live:umairalihamid_1

You might also like