Second Progress Report
On
Text Summarization
Bachelor of Technology
(Computer Science and Engineering)
Submitted By
To develop a Deep Learning Model using Transformer architecture for Abstractive Text Summarization
PROGRESS:
OBJECTIVE ACHIEVED:
1. We implemented Multi-Headed Attention and a Feed-Forward Neural Network. The input is split into
multiple heads, and after processing, the outputs of all the heads are concatenated.
2. We built the fundamental units of the encoder and decoder, and expanded these into 4 encoder/decoder
layers.
3. We stacked all the intermediate layers in a custom Model class.
4. We applied a custom learning rate scheduler that helps the model converge faster.
5. We trained the model with Sparse Categorical Cross-Entropy loss and the Adam optimizer.
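The head split and concatenation described in point 1 can be sketched in NumPy. The head count (4) and model dimension (128) below are illustrative assumptions, not the report's actual hyperparameters.

```python
import numpy as np

def split_heads(x, num_heads):
    # (batch, seq_len, d_model) -> (batch, num_heads, seq_len, d_model // num_heads)
    batch, seq_len, d_model = x.shape
    x = x.reshape(batch, seq_len, num_heads, d_model // num_heads)
    return x.transpose(0, 2, 1, 3)

def combine_heads(x):
    # Inverse of split_heads: concatenate the per-head outputs back into d_model.
    batch, num_heads, seq_len, depth = x.shape
    return x.transpose(0, 2, 1, 3).reshape(batch, seq_len, num_heads * depth)

x = np.random.rand(2, 10, 128)        # (batch, seq_len, d_model)
heads = split_heads(x, num_heads=4)   # (2, 4, 10, 32)
restored = combine_heads(heads)       # (2, 10, 128)
```

Splitting and recombining are exact inverses, so no information is lost; each head simply attends over a lower-dimensional slice of the representation.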
ADDITIONAL WORK:
PROPOSED ARCHITECTURE
Most competitive neural sequence transduction models have an encoder-decoder structure [5, 2, 35]. Here, the
encoder maps an input sequence of symbol representations (x1, ..., xn) to a sequence of continuous representations z
= (z1, ..., zn). Given z, the decoder then generates an output sequence (y1, ..., ym) of symbols one element at a time.
At each step the model is auto-regressive [10], consuming the previously generated symbols as additional input
when generating the next. The Transformer follows this overall architecture using stacked self-attention and point-
wise, fully connected layers for both the encoder and decoder.
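The auto-regressive generation loop described above can be sketched as follows. Here `encode` and `decode_step` are hypothetical stand-ins for the trained encoder and decoder; in the real model both are neural networks.

```python
def greedy_decode(encode, decode_step, src_tokens, start_id, end_id, max_len=50):
    z = encode(src_tokens)             # continuous representations (z1, ..., zn)
    out = [start_id]
    for _ in range(max_len):
        next_id = decode_step(z, out)  # consumes previously generated symbols
        out.append(next_id)
        if next_id == end_id:          # stop once the end token is emitted
            break
    return out
```

The key property is that each step feeds all previously generated symbols back into the decoder, which is what makes the model auto-regressive.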
Fig. 1. Architecture of the model

Encoder and Decoder Blocks

1. The first step in calculating self-attention is to create three vectors from each of the encoder's input
vectors (in this case, the embedding of each word). So for each word, we create a Query vector, a Key
vector, and a Value vector. These vectors are created by multiplying the embedding by three matrices that
we trained during the training process.
2. The second step in calculating self-attention is to calculate a score. Say we’re calculating the self-attention
for the first word in this example, “Thinking”. We need to score each word of the input sentence against this
word. The score determines how much focus to place on other parts of the input sentence as we encode a
word at a certain position.
3. The third and fourth steps are to divide the scores by 8 (the square root of the dimension of the key
vectors used in the paper, 64; this leads to more stable gradients, and while other values are possible,
this is the default), then pass the result through a softmax operation. Softmax normalizes the scores so
they are all positive and add up to 1.
Custom Learning Rate
The Transformer paper also suggests training with a custom learning rate scheduler, which helps the model
converge faster.
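The schedule from the Transformer paper is lrate = d_model^-0.5 * min(step^-0.5, step * warmup_steps^-1.5): the rate rises linearly during warmup, then decays with the inverse square root of the step. A minimal sketch (the d_model and warmup values below are illustrative assumptions):

```python
def custom_lr(step, d_model=128, warmup_steps=4000):
    # Linear warmup for the first warmup_steps, then inverse-sqrt decay.
    return d_model ** -0.5 * min(step ** -0.5, step * warmup_steps ** -1.5)
```

The two branches of the `min` intersect exactly at `step == warmup_steps`, which is where the learning rate peaks.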
EXPECTED RESULT:
Our model is expected to take less computation time and fewer resources than other approaches, and to
produce more accurate summaries than those approaches.
REFERENCES:
4. https://machinelearningmastery.com/how-does-attention-work-in-encoder-decoder-recurrent-neural-networks/
5. https://medium.com/analytics-vidhya/https-medium-com-understanding-attention-mechanism-natural-language-processing-9744ab6aed6a