
1

AHSANULLAH UNIVERSITY OF SCIENCE AND TECHNOLOGY


Department of Computer Science and Engineering
2

SUPERVISOR

Mohammad Moinul Hoque


Associate Professor, Dept. of CSE, AUST

PRESENTED BY

Asadullahhil Galib 15.02.04.022


Mahimul Islam 15.02.04.047
Fariha Nuzhat Majumdar 15.02.04.054
Nazmul Huda Auvy 13.02.04.081
SUMMARIZATION 3

What is text summarization?

▪ Extracting the important sentences from any kind of document(s)
▪ Giving a general overview of a document
▪ Providing maximum coverage of the information
▪ Must include all crucial points

Main document

Summary
HYBRID BANGLA TEXT SUMMARIZATION 4

▪ Limited number of summarizers exist for the Bangla language

▪ Saves reading time on large documents by discarding irrelevant and redundant data

▪ Machine-generated summaries are free from human bias


CATEGORIES OF TEXT SUMMARIZATION 5

Text Summarization

▪ Abstractive Summarization
▪ Extractive Summarization
METHODS OF AUTOMATIC TEXT SUMMARIZATION 6

Automatic text summarization divides into two types:

Generic Summarization
A generic summary gives users the overall sense of a document. It typically contains the core information of the document.

Topic-Centric Summarization
Topic-centric summaries are generally restricted to one topic. The topic is typically based on keywords supplied by a human.
LITERATURE REVIEW 7

“An approach to summarizing Bengali news documents”

Significance:
▪ Stop-word removal, stemming, and tokenization as preprocessing; sentence ranking based on thematic terms and sentence position

Document resource: “Ananda Bazar Patrika”
Evaluation: Precision 53.80%, Recall 55.60%, F-Score 54.60%
(Unigram-based recall score 0.4122 vs. 0.3991 for the LEAD baseline)
LITERATURE REVIEW 8

“A study on text summarization techniques and implement few of them for Bangla language”

Significance:
▪ Sentence location, cue-phrase presence, title-word presence, term frequency, and numerical data

Document resource: “The Daily Prothom Alo”
Evaluation: Average accuracy 71.3%
LITERATURE REVIEW 9

“Topic-Based Bangla Opinion Summarization”

Significance:
▪ Identifying the sentiment information in a document and aggregating it to generate the summary

Document resource: “Ananda Bazar Patrika”
Evaluation: Precision 72.15%, Recall 67.32%, F-Score 69.65%
LITERATURE REVIEW 10

“Automated Bangla Text Summarization by Sentence Scoring and Ranking”

Significance:
▪ Calculating sentence scores based on different aspects such as frequency, sentence position, cue-phrase presence, and the skeleton of the document

Document resources: “The Daily Prothom Alo”, “The Daily Ittefaq”, “The Daily Jugantor”
Evaluation: 83.57 percent of summary sentences match
LITERATURE REVIEW 11

"Automatic Bengali news documents summarization by introducing sentence frequency and clustering"

Significance:
▪ Sentence ranking
▪ Clustering

Document resources: “The Daily Prothom Alo”, “The Daily Jugantor”
Evaluation: Precision 60.8%, Recall 66.4%, F-Score 63.2%
RESEARCH OPPORTUNITIES 12

Paper | Document resource | Evaluation | Research problem and improvement scope
“An approach to summarizing Bengali news documents” | “Ananda Bazar Patrika” | Precision: 53.80%, Recall: 55.60%, F-Score: 54.60% | Scope for improvement in the ROUGE score.
“A study on text summarization techniques and implement few of them for Bangla language” | “Daily Prothom Alo” | Average accuracy 71.3% | Sometimes generates summary sentences that are entirely different from the human-generated summaries.
“Topic-Based Bangla Opinion Summarization” | “Ananda Bazar Patrika” | Precision: 72.15%, Recall: 67.32%, F-Score: 69.65% | The main problem is subjectivity.
“Automated Bangla Text Summarization by Sentence Scoring and Ranking” | “The Daily Prothom Alo”, “The Daily Ittefaq”, “The Daily Jugantor” | 83.57% of summary sentences match | Scope for improvement in the percentage of matching summary sentences.
“Automatic Bengali News Documents Summarization by Introducing Sentence Frequency and Clustering” | “The Daily Prothom Alo”, “The Daily Jugantor” | Precision: 0.608, Recall: 0.664, F-Score: 0.632 | Scope for improvement in the ROUGE score.
PROPOSED MODEL 13

Start

Input Document

Pre-Processing

Document Type Separation

Key Word Ranking Sentiment Scoring Text Ranking

Weighted Summation of Scores

Generated Output (top 40% of sentences)

End
PROPOSED MODEL 14

Start

Input Document
INPUT DOCUMENT
15

Dataset generated in two forms

All documents in four categories were collected from the renowned Bengali newspaper “The Daily Prothom Alo”:

News Category | Number of documents
Politics | 100
Economic | 100
Entertainment | 100
Accidents | 100

** The summaries we treat as gold summaries were created by randomly selected people.
INPUT DOCUMENT
16

Dataset collected from the “Bangla Natural Language Processing Community” (BNLPC)

There are two datasets for the evaluation of a Bangla text summarization system:

▪ 200 Bangla news documents
▪ 3 human-generated model summaries for each document
PROPOSED MODEL 17

Start

Input Document

Pre-Processing
PREPROCESSING
18

▪ We perform pre-processing by splitting the input documents into words

নিহত দুজধ্ির মধ্যে একজি িারী অিেজি পু রুষ

নিহত দুজধ্ির মধ্যে একজি িারী অিেজি পু রুষ


PREPROCESSING
19

We removed the stop words from the documents using a predefined stop-word list

অতএব অথচ অথবা অনেক অবধি অেয অন্তত


PROPOSED MODEL 20

Start

Input Document

Pre-Processing

Document Type Separation


DOCUMENT TYPE SEPARATION
21

Keyword Extraction
Algorithm for Keyword Extraction

We worked on keywords instead of key phrases. For each category, we created one list storing all the unique words and another storing their frequencies.

Word score calculation formula:

Word score = Word frequency / Number of words present in all documents of the category
WORDS 'সড়ক', 'গত', 'দুঘর্টোয়', 'দুই', 'ধেহত’

FREQUENCIES 163, 105, 105, 101, 98


DOCUMENT TYPE SEPARATION
22

Algorithm for Classification of Unknown Documents

▪ Cross-check each and every word of the document against the four files prepared earlier that store the keywords with their scores.

▪ Choose the class with the highest score among the four categories.
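The two steps above can be sketched as a small function; `category_scores` stands in for the four keyword files, and the contents are illustrative placeholders:

```python
def classify(document_words, category_scores):
    """Pick the category whose stored keyword scores best match the document.

    document_words: list of tokens from the unknown document.
    category_scores: dict mapping category name -> {word: score}.
    """
    totals = {}
    for category, scores in category_scores.items():
        # Cross-check every document word against this category's keyword list
        totals[category] = sum(scores.get(word, 0.0) for word in document_words)
    # Choose the class with the highest accumulated score
    return max(totals, key=totals.get)

categories = {
    "accident": {"road": 0.4, "dead": 0.2},
    "politics": {"election": 0.5},
}
print(classify(["road", "dead", "today"], categories))  # "accident"
```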
PROPOSED MODEL 23

Start

Input Document

Pre-Processing

Document Type Separation

Key Word Ranking Sentiment Scoring Text Ranking


SENTENCE SCORING BASED ON KEYWORD 24

Here we calculate the total score of the sentence based on its keywords
ACTUAL NEWS

নিহত দুজধ্ির মধ্যে একজি িারী অিেজি পু রুষ।


SENTENCE SCORING BASED ON KEYWORD 25

নিহত দুজধ্ির মধ্যে একজি িারী অিেজি পু রুষ

Value of নিহত : 0.0026474323
Value of দুজধ্ির : 0.0001080585
Value of একজি : 0.0007293946
Value of িারী : 0.0002161169
Value of অিেজি : 0.0000540292
Value of মধ্যে and পু রুষ : 0 (these are stop words)

CALCULATION
0.0026474323 + 0.0001080585 + 0 + 0.0007293946 + 0.0002161169 + 0.0000540292 + 0 = 0.0037550315
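The per-sentence keyword score is just the sum of the word scores, with stop words contributing zero. A minimal sketch, using romanized placeholders for the Bengali words and the scores shown on the slide:

```python
def sentence_score(sentence_words, word_score, stop_words):
    """Sum the keyword scores of a sentence's words; stop words score 0."""
    return sum(0.0 if w in stop_words else word_score.get(w, 0.0)
               for w in sentence_words)

# Reproducing the slide's arithmetic with romanized placeholder tokens
word_score = {"nihoto": 0.0026474323, "dujoner": 0.0001080585,
              "ekjon": 0.0007293946, "nari": 0.0002161169,
              "onnojon": 0.0000540292}
stop_words = {"moddhe", "purush"}
sentence = ["nihoto", "dujoner", "moddhe", "ekjon",
            "nari", "onnojon", "purush"]
print(sentence_score(sentence, word_score, stop_words))  # ~0.0037550315
```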
SENTENCE SCORING BASED ON KEYWORD 26

Algorithm for Individual Sentence Scoring Based


on Keyword

Value measured by our proposed model: 0.003755031500000004
SENTENCE SCORING BASED ON SENTIMENT 27

Here we calculate the total score of the sentence based on sentiment

ACTUAL NEWS

Our picked line


SENTENCE SCORING BASED ON SENTIMENT 28

0.0001 0.0001 0.0001 0.0001 0.0001 0.0001 0.0001 0.0001 0.0001

SSCORE= 0.0001+ 0.0001+ 0.0001+ 0.0001+ 0.0001+ 0.0001+ 0.0001+ 0.0001+ 0.0001= 0.0009

Algorithm for Sentence Ranking


Based on Sentiment Scoring
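The sentiment score of a sentence is likewise a sum of per-word values. A sketch under the assumption that a per-word polarity lookup is available (the original system obtained polarities from the PolyGlot library; the dictionary here is a hypothetical stand-in):

```python
def sentiment_score(sentence_words, polarity):
    """Sum per-word sentiment values; words without a polarity contribute 0."""
    return sum(polarity.get(w, 0.0) for w in sentence_words)

# Nine words, each contributing 0.0001, as in the slide's SSCORE example
polarity = {f"w{i}": 0.0001 for i in range(9)}
sentence = [f"w{i}" for i in range(9)]
print(round(sentiment_score(sentence, polarity), 4))  # 0.0009
```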
SENTENCE SCORING BASED ON TEXT RANKING 29

0.0637 S1= দাউদকাধিনত সড়ক দুঘটর্োয় ধেহত ৪


S1 S2
S2= কুধিল্লার দাউদকাধি উপনেলায় যাত্রীবাহী বাস দুঘটর্োয়
চারেে ধেহত এবং কিপনে ১৫ েে আহত হনয়নে

S3 S3= উপনেলার ধসংগুলাদীধঘর পাড় এলাকায় ঢাকা-চট্টগ্রাি


িহাসড়নক গতকাল ররাববার ধদবাগত রাত রদড়র্ার ধদনক এ
দুঘটর্ো ঘনর্
SENTENCE SCORING BASED ON TEXT RANKING 30

▪ This method tokenizes each sentence from the training data sets and converts it into a sentence vector. To do so, we used “word2vec” imported from “gensim.models”.

▪ To score sentences with the TextRank method, we need the similarities between sentences. First the vector model was loaded, and each sentence was converted to a vector using “sentence2vec”.
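As an illustration (not the exact implementation), the TextRank step can be sketched in pure Python: cosine similarity between sentence vectors builds the graph, and PageRank-style power iteration produces the scores. The short vectors below stand in for the word2vec/sentence2vec vectors the system actually builds:

```python
import math

def textrank(sent_vectors, damping=0.85, iters=50):
    """Score sentences by PageRank over a cosine-similarity graph."""
    def cos(a, b):
        dot = sum(x * y for x, y in zip(a, b))
        na = math.sqrt(sum(x * x for x in a))
        nb = math.sqrt(sum(x * x for x in b))
        return dot / (na * nb) if na and nb else 0.0

    n = len(sent_vectors)
    # Similarity matrix; no self-loops
    sim = [[0.0 if i == j else cos(sent_vectors[i], sent_vectors[j])
            for j in range(n)] for i in range(n)]
    row = [sum(sim[j]) for j in range(n)]  # outgoing weight of each sentence
    scores = [1.0 / n] * n
    for _ in range(iters):  # power iteration
        scores = [(1 - damping) / n + damping * sum(
                      sim[j][i] / row[j] * scores[j]
                      for j in range(n) if row[j] > 0)
                  for i in range(n)]
    return scores

# Sentence 1 is similar to both others, so it should rank highest
vecs = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]
scores = textrank(vecs)
print(max(range(3), key=lambda i: scores[i]))  # 1
```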
PROPOSED MODEL 31

Start

Input Document

Pre-Processing

Document Type Separation

Key Word Ranking Sentiment Scoring Text Ranking

Weighted Summation of Scores


HYBRID SCORING
32

We assign a weight to each of the three scores; the final ranking is based on their weighted combination.

KR = Keyword Ranking
SS = Sentiment scoring
TR = Text Ranking
W1 = A percentage of the total Keyword score
W2 = A percentage of the total Sentiment score
W3 = A percentage of the total Text Ranked score
HYBRID SCORING
33

Hybrid Scoring 1 (Keyword Ranking and Sentiment Scoring)


Keyword Ranking * W1 + Sentiment scoring * W2 = Combined Score of a sentence

W1 = 40 percent of the total keyword ranking score


W2 = 60 percent of the total sentiment score

Hybrid Scoring 2 (Keyword Ranking, Sentiment Scoring and Text ranking)


Keyword Ranking * W1 + Sentiment scoring * W2 + Text Ranking * W3 = Combined Score of a sentence

W1 = 30 percent of the total keyword ranking score


W2 = 20 percent of the total sentiment score
W3 = 50 percent of the total Text Rank score

Hybrid Scoring 3 (Keyword Ranking, Sentiment Scoring and Text ranking)


Keyword Ranking * W1 + Sentiment scoring * W2 + Text Ranking * W3 = Combined Score of a sentence

W1 = 50 percent of the total keyword ranking score


W2 = 20 percent of the total sentiment score
W3 = 30 percent of the total Text Rank score
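The three hybrid settings above are all instances of one weighted sum. A minimal sketch; the numeric inputs are illustrative placeholders, not values from the experiments:

```python
def hybrid_score(kr, ss, tr, w1, w2, w3=0.0):
    """Weighted combination of Keyword Ranking (KR), Sentiment Scoring (SS),
    and Text Ranking (TR) scores for one sentence."""
    return kr * w1 + ss * w2 + tr * w3

# The three weight settings from the slide, with placeholder scores
hybrid1 = hybrid_score(0.00375, 0.0009, 0.0, 0.40, 0.60)        # KR + SS
hybrid2 = hybrid_score(0.00375, 0.0009, 0.12, 0.30, 0.20, 0.50)  # KR + SS + TR
hybrid3 = hybrid_score(0.00375, 0.0009, 0.12, 0.50, 0.20, 0.30)
print(hybrid2)
```

Sentences are then sorted by the combined score and the top 40% are kept.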
PROPOSED MODEL 34

Start

Input Document

Pre-Processing

Document Type Separation

Key Word Ranking Sentiment Scoring Text Ranking

Weighted Summation of Scores

Generated Output (top 40% of sentences)

End
SAMPLE GENERATED OUTPUT 35

Actual News System Generated Summary (Hybrid 2)


"দাউদকান্দিতে সড়ক দুর্ঘটনায় ন্দনহে ৪" এতে র্টনাস্থতলই বাতসর যাত্রী চট্টগ্রাতির রলাহাগাড়ার
ন্দরফাে রহাতসন (২৯), সাহাব উন্দিন (৩৫) ও আবু
কুন্দিল্লার দাউদকান্দি উপতেলায় যাত্রীবাহী বাস দুর্ঘটনায় চারেন ন্দনহে এবং হান্দনফ (৬) এবং সােকান্দনয়ার আিোদ রহাতসন (২৮)
কিপতে ১৫ েন আহে হতয়তে। উপতেলার ন্দসংগুলাদীন্দর্র পাড় এলাকায় িারা যান। হাইওতয় পু ন্দলশ ও স্থানীয় রলাকেন োনান,
ঢাকা-চট্টগ্রাি িহাসড়তক গেকাল ররাববার ন্দদবাগে রাে রদড়টার ন্দদতক এ ঢাকা রেতক চট্টগ্রািগািী রসৌন্দদয়া পন্দরবহতনর যাত্রীবাহী
দুর্ঘটনা র্তট। হাইওতয় পু ন্দলশ ও স্থানীয় রলাকেন োনান, ঢাকা রেতক বাসটি দাউদকান্দির ন্দসংগুলাদীন্দর্র পাতড় রপৌৌঁোতল চালক
চট্টগ্রািগািী রসৌন্দদয়া পন্দরবহতনর যাত্রীবাহী বাসটি দাউদকান্দির ন্দসংগুলাদীন্দর্র ন্দনয়ন্ত্রণ হারান। উপতেলার ন্দসংগুলাদীন্দর্র পাড় এলাকায়
পাতড় রপৌৌঁোতল চালক ন্দনয়ন্ত্রণ হারান। গান্দড়টি েখন িহাসড়তকর পাতশ একটি ঢাকা-চট্টগ্রাি িহাসড়তক গেকাল ররাববার ন্দদবাগে রাে
গাতের সতে ধাক্কা রলতগ আটতক যায়। এতে র্টনাস্থতলই বাতসর যাত্রী চট্টগ্রাতির রদড়টার ন্দদতক এ দুর্ঘটনা র্তট।
রলাহাগাড়ার ন্দরফাে রহাতসন (২৯), সাহাব উন্দিন (৩৫) ও আবু হান্দনফ (৬)
এবং সােকান্দনয়ার আিোদ রহাতসন (২৮) িারা যান। আহে বযন্দিতদর িতধয
আটেনতক আশঙ্কােনক অবস্থায় ঢাকা রিন্দিতকল কতলে হাসপাোতল পাঠাতনা
হতয়তে। দাউদকান্দি হাইওতয় োনার ভারপ্রাপ্ত কিঘকেঘ া (ওন্দস) আবু ল কালাি
আোদ দুর্ঘটনায় হোহে হওয়ার খবতরর সেযো ন্দনন্দিে কতরতেন।
EVALUATION MEASURE 36

PRECISION: ROUGE precision = No. of overlapping words / No. of words in the generated summary

RECALL: ROUGE recall = No. of overlapping words / No. of words in the gold summary

F-MEASURE: F-measure = 2 * Precision * Recall / (Precision + Recall)

ROUGE-1: the overlap of unigrams (single words) between the system and reference summaries.

ROUGE-2: the overlap of bigrams between the system and reference summaries.
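The three formulas can be sketched as one function over clipped n-gram counts; this is a simplified illustration of ROUGE-N, not the evaluation script used in the experiments:

```python
from collections import Counter

def rouge_n(generated, gold, n=1):
    """Clipped n-gram overlap precision, recall and F-measure (ROUGE-N)."""
    def ngrams(words):
        return Counter(tuple(words[i:i + n]) for i in range(len(words) - n + 1))
    gen, ref = ngrams(generated), ngrams(gold)
    overlap = sum((gen & ref).values())  # clipped overlap count
    precision = overlap / max(sum(gen.values()), 1)
    recall = overlap / max(sum(ref.values()), 1)
    f = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f

p, r, f = rouge_n("the accident killed four".split(),
                  "four died in the accident".split())
print(round(p, 2), round(r, 2))  # 0.75 0.6
```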
EXPERIMENTAL SETUP 37

Categories and number of documents for training and testing:

Document Category | Number of Documents | Testing (1st setup) | Training (1st setup) | Testing (2nd setup) | Training (2nd setup)
Politics | 100 | 50 | 50 | 70 | 30
Economics | 100 | 50 | 50 | 70 | 30
Entertainment | 100 | 50 | 50 | 70 | 30
Accidents | 100 | 50 | 50 | 70 | 30

Table: Categories and number of documents for training and testing


EXPERIMENTAL SETUP
38

Classification Result in 1st Setup

Classification Result in 2nd Setup


EXPERIMENTAL SETUP
39

Using our own classifier we have classified the BNLPC documents:

Dataset No. of Docs Accident Economics Entertainment Politics

Dataset-1 100 29 35 12 24

Dataset-2 100 18 19 6 57
CLASS SPECIFIC RESULT 40

Average ROUGE 1 Score of the system for Category Accidents (1st Setup)

0.6456 0.6606

Average ROUGE 2 Score of the system for Category Accidents (1st Setup)

0.5762 0.5902
CLASS SPECIFIC RESULT 41

Average ROUGE 1 Score of the system for Category Economics (1st Setup)

0.5619 0.5656

Average ROUGE 2 Score of the system for Category Economics (1st Setup)

0.4705 0.4756
CLASS SPECIFIC RESULT 42

Average ROUGE 1 Score of the system for Category Entertainment (1st Setup)

0.4893 0.4841

Average ROUGE 2 Score of the system for Category Entertainment (1st Setup)

0.3981 0.3839
CLASS SPECIFIC RESULT 43

Average ROUGE 1 Score of the system for Category Politics (1st Setup)

0.6708 0.6667

Average ROUGE 2 Score of the system for Category Politics (1st Setup)

0.5645 0.5585
CLASS SPECIFIC RESULT 44

Average ROUGE 1 Score of the system for Category Accidents (2nd Setup)

0.6280 0.6132

Average ROUGE 2 Score of the system for Category Accidents (2nd Setup)

0.5470 0.5280
CLASS SPECIFIC RESULT 45

Average ROUGE 1 Score of the system for Category Economics (2nd Setup)

0.5853 0.5854

Average ROUGE 2 Score of the system for Category Economics (2nd Setup)

0.4921 0.4934
CLASS SPECIFIC RESULT 46

Average ROUGE 1 Score of the system for Category Entertainment (2nd Setup)

0.4702 0.4509

Average ROUGE 2 Score of the system for Category Entertainment (2nd Setup)

0.3695 0.3538
CLASS SPECIFIC RESULT 47

Average ROUGE 1 Score of the system for Category Politics (2nd Setup)

0.6601 0.6519

Average ROUGE 2 Score of the system for Category Politics (2nd Setup)

0.5525 0.5422
EXPERIMENTAL RESULT 48

Avg ROUGE-1 Scores

Metric | KeyWord | Sentiment | Text Rank | New Hybrid
Precision | 0.542631685 | 0.508663132 | 0.508635158 | 0.547183881
Recall | 0.606858059 | 0.693553635 | 0.692635234 | 0.656746838
F-measure | 0.55973558 | 0.57365212 | 0.57410977 | 0.584185785

Average ROUGE-1 Scores of the system for different methods (1st setup)
EXPERIMENTAL RESULT 49

Avg ROUGE-1 Scores

Metric | KeyWord | Text Rank | New Hybrid
Precision | 0.536450081 | 0.500673054 | 0.537122573
Recall | 0.621884818 | 0.691011787 | 0.664016871
F-measure | 0.562766 | 0.568768895 | 0.593866852

Average ROUGE-1 Scores of the system for different methods (2nd setup)
EXPERIMENTAL RESULT 50

Avg ROUGE-2 Scores

Metric | KeyWord | Sentiment | Text Rank | New Hybrid
Precision | 0.447528188 | 0.421271345 | 0.410866401 | 0.459600427
Recall | 0.505991455 | 0.581238746 | 0.590964907 | 0.557536232
F-measure | 0.463635094 | 0.476486973 | 0.473836208 | 0.492378553

Average ROUGE-2 Scores of the system for different methods (1st setup)
EXPERIMENTAL RESULT 51

Avg ROUGE-2 Scores

Metric | KeyWord | Text Rank | New Hybrid
Precision | 0.439922462 | 0.400916393 | 0.447010528
Recall | 0.519289684 | 0.587950582 | 0.562046722
F-measure | 0.464852314 | 0.466303582 | 0.486369813

Average ROUGE-2 Scores of the system for different methods (2nd setup)
EXPERIMENTAL RESULT 52

1st model summary


Classification and ROUGE Scores of summaries for different categories (for BNLPC data set 1)

0.7071 0.6348

Classification and ROUGE Scores of summaries for different categories (for BNLPC data set 2)

0.6517 0.5842
EXPERIMENTAL RESULT 53

Average ROUGE Scores of summaries for BNLPC data sets


EXPERIMENTAL RESULT 54

2nd model summary


Classification and ROUGE Scores of summaries for different categories (for BNLPC data set 1)

0.6543 0.5828

Classification and ROUGE Scores of summaries for different categories (for BNLPC data set 2)

0.6526 0.5836
EXPERIMENTAL RESULT 55

Average ROUGE Scores of summaries for BNLPC data sets


EXPERIMENTAL RESULT 56

3rd model summary


Classification and ROUGE Scores of summaries for different categories (for BNLPC data set 1)

0.6712 0.6016

Classification and ROUGE Scores of summaries for different categories (for BNLPC data set 2)

0.6737 0.6048
EXPERIMENTAL RESULT 57

Average ROUGE Scores of summaries for BNLPC data sets


EXPERIMENTAL RESULT 58

Average ROUGE Scores of the proposed system summaries for BNLPC data sets
COMPARISON WITH EXISTING MODELS 59

Average ROUGE Scores of our proposed system and other existing systems for BNLPC data sets

Methods Precision Recall F-Measure


Our Proposed Model
Majharul Haque [2015]
Kamal Sarkar [2012]

Average ROUGE Scores of summaries of our proposed system and an existing web-based system for our data set

Methods | ROUGE-1 Precision | ROUGE-1 Recall | ROUGE-1 F-Measure | ROUGE-2 Precision | ROUGE-2 Recall | ROUGE-2 F-Measure
Our Proposed Model | 0.5472 | 0.6567 | 0.5842 | 0.4596 | 0.5575 | 0.4923
Existing Web-Based System | 0.6220 | 0.4019 | 0.4652 | 0.5249 | 0.3253 | 0.3769
TIME ANALYSIS 60

We calculated the average time required to generate a summary, on a machine with the following configuration:

❑ Intel Core i5 processor

❑ 8 GB RAM

❑ 256 GB SSD
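Per-document times of the kind reported below can be measured with a simple wall-clock harness; `summarize` here is a hypothetical stand-in for any of the four methods, not the project's actual code:

```python
import time

def average_time(summarize, documents):
    """Wall-clock time to summarize a batch, plus the per-document average."""
    start = time.perf_counter()
    for doc in documents:
        summarize(doc)
    total = time.perf_counter() - start
    return total, total / len(documents)

# Stand-in summarizer: just pick the longest sentence of each document
docs = [["short one.", "a much longer sentence here."]] * 10
total, per_doc = average_time(lambda d: max(d, key=len), docs)
print(per_doc <= total)  # True
```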
TIME ANALYSIS 61

Time needed to generate summary of 10 documents for Category Accident

Methods | 1st setup: time for 10 docs (s) | 1st setup: per-doc time (s) | 2nd setup: time for 10 docs (s) | 2nd setup: per-doc time (s)
Keyword Ranking | 40.329 | 4.0329 | 30.0367 | 3.00367
Sentiment Scoring | 0.1462 | 0.01462 | 0.1462 | 0.01462
Text Ranking | 1.8191 | 0.18191 | 2.6004 | 0.26004
Hybrid System (.3KR+.3SS+.5TR) | 85.6817 | 8.56817 | 74.2117 | 7.42117
TIME ANALYSIS 62

Time needed to generate summary of 10 documents for Category Economics

Methods | 1st setup: time for 10 docs (s) | 1st setup: per-doc time (s) | 2nd setup: time for 10 docs (s) | 2nd setup: per-doc time (s)
Keyword Ranking | 409.2141 | 40.92141 | 195.7745 | 19.57745
Sentiment Scoring | 8.3297 | 0.83297 | 8.3297 | 0.83297
Text Ranking | 53.6738 | 5.36738 | 42.6732 | 4.26732
Hybrid System (.3KR+.3SS+.5TR) | 985.47 | 98.547 | 548.6031 | 54.86031
TIME ANALYSIS 63

Time needed to generate summary of 10 documents for Category Entertainment

Methods | 1st setup: time for 10 docs (s) | 1st setup: per-doc time (s) | 2nd setup: time for 10 docs (s) | 2nd setup: per-doc time (s)
Keyword Ranking | 54.8148 | 5.48148 | 66.2184 | 6.62184
Sentiment Scoring | 0.5999 | 0.05999 | 0.5999 | 0.05999
Text Ranking | 6.0225 | 0.60225 | 6.3414 | 0.63414
Hybrid System (.3KR+.3SS+.5TR) | 113.0268 | 11.30268 | 154.5379 | 15.45379
TIME ANALYSIS 64

Time needed to generate summary of 10 documents for Category Politics

Methods | 1st setup: time for 10 docs (s) | 1st setup: per-doc time (s) | 2nd setup: time for 10 docs (s) | 2nd setup: per-doc time (s)
Keyword Ranking | 189.5014 | 18.95014 | 278.1268 | 27.81268
Sentiment Scoring | 12.0796 | 1.20796 | 12.079 | 1.2079
Text Ranking | 56.2164 | 5.62164 | 56.3719 | 5.63719
Hybrid System (.3KR+.3SS+.5TR) | 580.9077 | 58.09077 | 705.4806 | 70.54806
DIFFICULTIES ENCOUNTERED 65

▪ Gold summaries are not 100 percent accurate
▪ Limited resources for Bengali documents
▪ The “PolyGlot” library is not well trained
▪ Problems splitting sentences with the “PolyGlot” library
▪ Takes a moderate amount of time in some cases


FUTURE WORKS 66

▪ Named Entity Recognition
▪ Headline & Numerical Values
▪ Multithreading System
▪ Train New Categories
▪ FastText Library
67

ANY QUESTIONS?
