Professional Documents
Culture Documents
by
Athulya S. – AM.EN.P2MCA15011
Salma Shaji – AM.EN.P2MCA15022
Department of Computer Science and Application
June 2017
AMRITA VISHWA VIDYAPEETHAM,
AMRITA UNIVERSITY
AMRITA SCHOOL OF ENGINEERING, AMRITAPURI CAMPUS
BONAFIDE CERTIFICATE
This is to certify that the thesis entitled “A Graph-Based Relation Extraction Method
for Question Answering System” submitted by Athulya S, AM.EN.P2MCA15011
and Salma Shaji, AM.EN.P2MCA15022 in partial fulfillment of the requirements for
the award of the degree of Master of Computer Applications is a bonafide record of
the work carried out under our guidance and supervision at Amrita School of Engi-
neering, Amritapuri.
SIGNATURE SIGNATURE
We , Athulya S and Salma Shaji, hereby declare that this project entitled “A Graph-
Based Relation Extraction Method for Question Answering System” done at Amrita
Vishwa Vidyapeetham is a record of original work done by us under the guidance of
Mrs.Veena G, Department of Computer Science And Applications, Amrita School of
Engineering, Amritapuri and this work has not formed on the basis of the award of any
degree/diploma/fellowship or a similar award to any candidate in any University, to the
best of our knowledge.
Place: Amritapuri
Date:
First of all we would like to thank the Almighty for giving us the courage to complete
this project work successfully. We express our gratitude to our respected Chancellor
Sri Mata Amritanandamayi Devi for being a backbone to achieve this project suc-
cessfully.
We express our deep gratitude to Dr S.N. Jyothi, Principal, Amrita School of Engi-
neering, Amritapuri.
Also we would like to express our deep-felt gratitude to Mrs. Manjusha Nair and
Mrs. Kavitha K.R, Project coordinators, Department of Computer Science and Appli-
cation, Amrita School of Engineering, Amritapuri for their primary support throughout
the project.
Our heartfelt thanks to Internal Guide, Mrs. Veena G, Assistant Professor, Amrita
School of Engineering, Amritapuri, who has supported and guided us throughout the
project period by continual encouragement through a relaxed approach.
We would also like to thank our friends for giving their valuable information and to
our family for their moral support.
Last but not the least we would like to thank all the people who are directly or indi-
rectly involved in this project for granting their support.
ABSTRACT
List of Tables...............................................................................................................
List of Figures .............................................................................................................
1 Introduction……………………………………………………………………....1
1.1 Overview of Question Answering System…………………………………..2
1.1.1 Questions………….………………………………………………...2
1.1.2 Answers…………………………………………………………......3
1.1.3 Data Sources………………………………………………………...3
1.2 Background and Context……………………………………………………4
1.3 Scope and Objectives………………………………………………………..6
1.4 Related Work………………………………………………………………..6
1.5 Overview of the Thesis……………………………………………………...9
2 Problem Description……………………………………………………………10
3 Methodology……………………………………………………………………12
3.1 Document Processing……………………………………………………...14
3.1.1 Pre-processing…………………………………………………………..15
3.1.1.1 Tokenization………………………………………………………..15
3.1.1.2 Parts-of-Speech Tagging…………………………………………...15
3.1.1.3 Named Entity Recognition…………………………………………16
3.1.1.4 Syntactic Dependency Parsing……………………………………..17
3.1.2 Coreference Resolution…………………………………………………18
3.1.3 Gender Analysis………………………………………………………...19
3.1.3.1 Naïve Bayes Classifier……………………………………………..19
3.1.4 Relation Extraction……………………………………………………..20
3.1.5 Graph Generation……………………………………………………….21
3.2 Query Processing……………………………………………………..........22
3.2.1 Pre-processing…………………………………………………………..22
3.2.2 Coreference Resolution…………………………………………………23
3.2.3 Relation Extraction……………………………………………………..24
3.3 Answer Extraction…………………………………………………………25
4 Results and Analysis……………………………………………………………28
4.1 Result……………………………………………………………………….29
4.1.1 Document Processing…………………………………………………...29
4.1.2 Query Processing……………………………………………………….31
4.1.3 Answer Extraction….…………………………………………………...32
4.2 Evaluation Metrics…………………………………………………………..33
4.3 Performance Evaluation…………………………………………………….34
4.4 Analysis……………………………………………………………………..34
5. Discussion and Conclusion……………………………………………………...35
5.1 Summary…………………………………………………………………...36
5.2 Future Work………………………………………………………………..37
References…………………………………………………………………………38
Appendix A : Research Paper……………………………………………………...40
LIST OF TABLES