Team Members:
Saumitra Pathak (19BCE2411)
Shivam Bansal (19BCE0930)
Arkaraj Ghosh (19BCE24218)
Debalay Dasgupta (19BCE2423)
Pratyay Piyush (19BCE2364)
Proposed Architecture:
Methodology
The proposed model gathers data from three distinct media sources and then applies a
hybrid strategy built around abstractive text summarization. The long video transcripts
and journal datasets are first pre-processed with an extractive summarizer to produce a
homogeneous dataset for T5. The T5 transformer model is then applied in the next
stage, taking the ontological relationships into account. The proposed hybrid model is
evaluated on a test dataset and produces a predicted summary. The summaries
produced from the several sources are merged into a single document so that they can
be accessed in the shortest amount of time.
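The extractive pre-processing stage described above is not specified in detail here; a minimal sketch of one common approach, frequency-based sentence scoring, is shown below. The function name `extractive_summary` and the scoring scheme are illustrative assumptions, not the project's actual implementation.

```python
import re
from collections import Counter

def extractive_summary(text, num_sentences=3):
    """Toy extractive summarizer: score each sentence by the average
    corpus frequency of its words, then keep the top-scoring sentences
    in their original order. (Illustrative only; the proposed system
    may use a different extractive method.)"""
    sentences = re.split(r'(?<=[.!?])\s+', text.strip())
    if len(sentences) <= num_sentences:
        return text.strip()
    freq = Counter(re.findall(r'[a-z]+', text.lower()))

    def score(sentence):
        tokens = re.findall(r'[a-z]+', sentence.lower())
        return sum(freq[t] for t in tokens) / max(len(tokens), 1)

    ranked = sorted(range(len(sentences)),
                    key=lambda i: score(sentences[i]), reverse=True)
    keep = sorted(ranked[:num_sentences])  # restore document order
    return ' '.join(sentences[i] for i in keep)
```

A pass like this shortens long video transcripts and journal articles to a uniform length before the abstractive T5 stage sees them.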
T5 Transformer Model
The T5 model summarizes the long video transcripts and research journals before
sending them to the LSTM model for a better abstractive summary. The T5 transformer
model produced excellent results when evaluated on CNN/DM, MSMO, and XSUM,
scoring over 42 ROUGE and 43 BLEU on the MSMO dataset. The T5 model is composed
of stacked encoder-decoder layers, each combining self-attention with a
feed-forward network.
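The self-attention-plus-feed-forward layer structure mentioned above can be illustrated with a toy sketch. This is not T5 itself: real T5 layers use learned Q/K/V projection matrices, multiple heads, layer normalization, and residual connections, all of which are omitted here for clarity.

```python
import math

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def self_attention(X):
    """Scaled dot-product self-attention over a list of token vectors,
    with identity Q/K/V projections (T5 learns these as weight matrices)."""
    d = len(X[0])
    out = []
    for q in X:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d)
                  for k in X]
        weights = softmax(scores)
        # Each output vector is a weighted mix of all token vectors.
        out.append([sum(w * v[j] for w, v in zip(weights, X))
                    for j in range(d)])
    return out

def feed_forward(X):
    """Position-wise feed-forward step (here just a ReLU, no weights)."""
    return [[max(0.0, x) for x in row] for row in X]

def encoder_layer(X):
    """One toy encoder layer: self-attention followed by feed-forward."""
    return feed_forward(self_attention(X))
```

Stacking several such layers in the encoder, and pairing them with decoder layers that additionally attend to the encoder output, gives the encoder-decoder structure the section describes.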