Professional Documents
Culture Documents
Project Report on
Submitted by
HARSHITHA B
4VZ20MC032
Prof. KUMAR P K
Assistant Professor
Dept. of Computer Science & Engineering
VTU, Centre of PG Studies, Mysuru
CERTIFICATE
This is to certify that the project work entitled “RELEVANCE FEATURE DISCOVERY
FOR TEXT MINING” is bonafide work carried out by HARSHITHA B bearing
4VZ20MC032 at Department of Computer Science and Engineering, Visvesvaraya
Technological University, Centre for Post Graduate Studies, Mysuru, in partial fulfilment
for the award of Master of Computer Applications, Visvesvaraya Technological University,
Belagavi during the academic year 2021-2022. It is certified that all the corrections/suggestions
indicated during Internal Assessment have been incorporated in the report. The project report
has been approved as it satisfies the academic requirements in respect of Project work
prescribed for the Master of Computer Applications degree.
External Examiners
1.
2.
DECLARATION
I, HARSHITHA B, student of 4th semester MCA, Dept. of CSE (MCA Program), Post
Graduate Studies Mysuru, V T U M y s u r u , bearing 4VZ20MC032 hereby declare
that the project entitled “RELEVANCE FEATURE DISCOVERY FOR TEXT
MINING” has been carried out by me under the supervision of Internal Guide, Prof.
KUMAR P K, Assistant Professor, Dept of CS&E, VTU, Center for PG Studies, Mysuru,
submitted in the partial fulfilment of the requirements for the award of the Degree of
Master of Computer Applications by the Visvesvaraya Technological University,
Belagavi during the academic year 2021-2022. This report has not been submitted to any
other Organization/University for any award of degree or certificate.
Name: HARSHITHA B
USN: 4VZ20MC032
Signature:
ACKNOWLEDGEMENT
The satisfaction & euphoria that accompany the successful completion of any task
would be incomplete without the mention of people who made it possible because Success
is the abstract of hard work & perseverance, but steadfast of all is encouragement
guidance”. So, I acknowledge all those whose guidance and encouragement served as a
beacon light & crowned our efforts with success.
I also thank all the staff members of Dept. of CSE, Department of PG Studies, VTU,
Mysuru for their help during the course of this project.
I also thank our parents, family members & friends, for their continuous and great
support and encouragement throughout this project.
Name: HARSHITHA B
USN: 4VZ20MC032
ABSTRACT
The process of creating a concise, clear, and accurate summary of a lengthy text document
is known as text summarization. In order to learn the proper quantity of information quickly,
text summary is crucial. Because text contains a bigger amount of characters, it is more
challenging to grasp. Text summarization is a vital tool in today's world. Extractive text
summarization (ETS) and abstractive text summarization are the two subcategories of text
summarization (ATS). Compared to ATS, ETS is easier. ETS extracts the crucial phrases or
words from the incoming text content using algorithms.
Automatic text summarization basically uses machine learning language and natural language
processing to summarise the provided paragraph. There will be a massive increase in volume
of text data coming from different fields. This book volume is a priceless source of knowledge
and information that must be skillfully distilled in order to be of use. The primary methods for
automatic text summarization are explained in this review. We examine the various
summarising procedures and discuss the merits and drawbacks of the various approaches.
Extractive approach and abstractive approach are the two types that will be used. Finding the
data subset that encompasses all of the data is the fundamental concept of summarization.
Reducing unused data is urgently important. Being Manual summarization of the document is
challenging, necessitating the use of automatic techniques. It has been suggested to take
approaches that are motivated for using the deep learning techniques by automatic machine
translation, more especially by framing there is a issue in text summarization of a sequence-to-
sequence learning problem.
CONTENTS
Chapter 1
1.1 Introductin ………………………………………………………………………….....1
1.2 Problem Statement……………………………………………………………………..2
1.3 Scope and Objectives…………………………………………………………………..2
Chapter 2
Literature Survey……………………………………………………………………………...3
2.1 Survey Papers…………………………………………………………………………..3
2.2 Tools/Technology Survey……………………………………………………………………...4
2.3 Software Development Tools…………………………………………………………10
Chapter 3
System Analysis……………………………………………………………………………...11
3.1 Existing System……………………………………………………………………….11
3.2 Proposed System……………………………………………………………………...11
3.3 Study Feasibility……………………….……………………………………………...12
Chapter 4
System Design……………………………………………………………………………….14
4.1DataflowDiagram………………………………………………………………………14
4.2 Use Case Diagram………………………………………………………...…………...16
4.3 Sequence Diagram…………………………………………………………………......18
4.4 Activity Diagram………………………………………………………………..……...19
Chapter 5
Software requirement specification……………………………………………………….....21
5.1 Introduction……………………………………………………………………………21
5.2 Functional Requirements………………………………………………………………21
5.3 Non Functional Requirments………………………………………………………….22
5.4 Hardware Requirements……………………………………………………………….23
5.6 Software Requirments………………………………………………………………....23
Chapter 6……………………………………………………………………………………24
System Implemantation……………………………………………………………………..24
6.1 module Wise Implimentation Details…………………………………………………24
6.2 Code Design…………………………………………………………………………...25
6.3 Screenshots……………………………………………………………………………28
Chapter 7
System Testing………………………………………………………………………………30
7.1 Introduction…………………………………………………………………………..30
7.2 Testing Strategies……………………………………………………………….…...31
7.3 Test Case…………………………………………………………………………….32
Chapter 8
Conclusion And Future Enhancement……………………………………………………34
Publication………………………………………………………………………………..35
Chapter 9
Bibliography………………………………………………………………………………..36
Chapter 9………………………………………………………………………………………….40-40
Conclusion.………………………………………………………………………………………40-40
Paper Publication…………………………………………………………………………………41-42
LIST OF FIGURES AND TABLES
Fig 5.4.4: snapshot of Test and validating of Unet Keras model …………………………………………….23
Fig 5.5.4: snapshot of Test and validating of Trans Unet model …………………………………………….27
LIST OF TABLES