Professional Documents
Culture Documents
FIE 2019 2
Learning Through Discussions
Live
Online
Class
FIE 2019 3
Problem Statement
Knowledge
Generation
FIE 2019 5
Summarizing Posts from the Discussion Forum
FIE 2019 6
Text Summarization Approaches
Abstractive Extractive
Summarization Summarization
• Understand • Extract
the main important
concepts of a sentences
document from
• Generate new document
sentences
which are not
seen in the
original
document
FIE 2019 7
Text Summarization Approaches
Supervised Unsupervised
Summarization Summarization
• Use data sets • Do not use
that are annotated data
labelled by • Use linguistic
human and statistical
annotators information
obtained from
the document
itself
FIE 2019 8
Solution Design
• Convert into lower case, remove trailing and ending
spaces
Data • Tokenize each post into sentences, remove stop words
Processin • Perform Lemmatization using NLTK’s WordNet based
g Lemmatizer API
FIE 2019 9
Data Set
Business Process Modelling and Solutioning
2nd year undergraduate course within the BSc (IS)
FIE 2019 10
Data Set
The students were given a case study of the sales
process currently implemented in a sandwich shop
(e.g. Subway)
FIE 2019 12
User Interface: Input Discussion Forum Data
FIE 2019 13
Visual Dashboard: Topics and Summaries
FIE 2019 14
Tool Evaluation
What are some recommendations to improve the process and
state the rationales for it?”
Cluster: “Resources for reducing cycle time”
FIE 2019 15
Tool Evaluation
FIE 2019 16
Summary
We presented a text mining based approach to
analyse the discussion forums and generate topic
based summaries
Future Work
Improve spell check performance