Welcome to Scribd!

Skip carousel

PDF Summarizer

Uploaded by

AZHARUDEEN S

0% found this document useful (0 votes)

4 views4 pages

gooood

Original Title

PDF summarizer.docx

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

gooood

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

4 views4 pages

PDF Summarizer

Uploaded by

AZHARUDEEN S

gooood

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 4

Search inside document

TECHNICAL APPROVAL COMMITTEE

GUIDE APPROVAL FORM

Date: __ / __ / 2023
Starting Date of Work

Sl.
No Student Name Reg. No. Role Signature
.

1 Raghulraj M 7376212AD177 Team Leader

Applying for the work: Project

PDF summarizer
Title of Work

(To be Filled by Faculty In charge)

No. of students: 1
I acknowledge that I will act as a faculty in charge for the aforementioned students and guide
them to complete the work by adopting the guidelines provided.

Lab Name:DATA SCIENCE –BIGDATA Name & Signature of the Faculty In charge
ANALYTICS LAB with the date
(In case of Faculty belonging to any special lab)
Add process flow chart or simulated image of prototype or any relevant image related to your idea
Idea/Approach Details

Problem Statement
The problem addressed by this project pertains to the need for a versatile and efficient PDF summarization tool. PDF documents often contain lengthy and detailed
content, making it challenging for users to extract essential information quickly. Traditional manual summarization methods may be time-consuming and may not provide
the comprehensiveness desired by users.

Proposed Solution
The proposed solution is to develop a PDF Summarizer using the Flask web framework, capable of performing both abstractive and extractive summarization. This tool
aims to:

● Read and understand the content of PDF files using the PyPDF library.
● Provide abstractive summarization of the PDF content using pre-trained large language models via their API.
● Perform extractive summarization using the Latent Semantic Analysis (LSA) summarizer.
● Offer users the option to choose between abstractive and extractive summarization methods.
● Present summarized content in an easily accessible and user-friendly format.

The PDF Summarizer will empower users to quickly obtain concise summaries of PDF documents, enhancing their ability to digest and utilize information effectively.

METHODOLOGY

STEP 1 :

Develop a web application using Flask to create an intuitive user interface for uploading PDF files and selecting summarization methods.

STEP 2 :

Utilize the PyPDF library to extract and convert the content of uploaded PDF files into a suitable format for summarization.
STEP 3 :

Implement abstractive summarization using pre-trained large language models via their API to generate human-like summaries.

STEP 4:

Utilize the LSA summarizer to perform extractive summarization by identifying key sentences within the PDF content.

STEP 5:

Design a user-friendly interface that allows users to upload PDF files, choose summarization methods, and view summarized content.

Describe the features / functions of the concern work here

⮚ PDF Content Analysis: The tool extracts and analyzes content from uploaded PDF files.
⮚ Abstractive Summarization: Abstractive summarization generates human-like summaries using pre-trained language models via their API.
⮚ Extractive Summarization (LSA): Extractive summarization identifies key sentences in the PDF content to create concise summaries.
⮚ User Selection: Users can choose between abstractive and extractive summarization methods.
⮚ User-Friendly Interface: The web application provides an intuitive interface for PDF upload and summarization method selection.

Describe your required technologies / facilities to complete the prescribe work here

❖ Flask Framework: Utilized for building the web-based PDF Summarizer and creating the user interface.
❖ PyPDF Library: Employed to read and extract the content of PDF files for summarization.
❖ Pre-Trained Language Models API: Integrated to perform abstractive summarization by generating human-like summaries.
❖ LSA Summarizer: Used for extractive summarization by identifying key sentences from the PDF content.

Signature of Faculty In Charge

Networking Project: Students Guide
From Everand
Networking Project: Students Guide
Mohammad Nor Ihsan Md Zin
Rating: 4 out of 5 stars
4/5 (2)
Study Assistant Bot
Document4 pages
Study Assistant Bot
AZHARUDEEN S
No ratings yet
Software Engineer's Reference Book
From Everand
Software Engineer's Reference Book
John A McDermid
No ratings yet
Industry Synopsis GNDEC
Document10 pages
Industry Synopsis GNDEC
Sukhjinder Singh Sandhu
No ratings yet
WBP MP
Document10 pages
WBP MP
Prathamesh Sodage
No ratings yet
Ip File
Document28 pages
Ip File
Arpit Mishra
No ratings yet
Workforce Performance Builder 4 Day Navigator Agenda v2
Document5 pages
Workforce Performance Builder 4 Day Navigator Agenda v2
Bora Gür
No ratings yet
Final Semester Mca Bca Pgdca Project Guidelines
Document8 pages
Final Semester Mca Bca Pgdca Project Guidelines
Taru Goel
No ratings yet
Development of A Tool For Quick Result Analysis
Document5 pages
Development of A Tool For Quick Result Analysis
International Journal of Innovative Science and Research Technology
No ratings yet
Synopsis Major Ipuranklist
Document8 pages
Synopsis Major Ipuranklist
Gaurav Chaudhary PGP 2022-24 Batch
No ratings yet
PROJECT FILE of Ip
Document18 pages
PROJECT FILE of Ip
jakir ansari
No ratings yet
WBP Micro Project
Document27 pages
WBP Micro Project
Pranav Mhatre
100% (1)
Fyp Project Proposal Template 2016
Document11 pages
Fyp Project Proposal Template 2016
Mughira Rajput
0% (1)
Project Synopsis Mit 2021
Document3 pages
Project Synopsis Mit 2021
Kiran Rajput
No ratings yet
Aims Institute of Higher Education: Project Schedule
Document8 pages
Aims Institute of Higher Education: Project Schedule
Jeevan Manjunath
No ratings yet
WBP Micro Project
Document27 pages
WBP Micro Project
Pranav Mhatre
No ratings yet
Project Synopsis Shreyansh
Document13 pages
Project Synopsis Shreyansh
ADITYA PATEL
No ratings yet
PGDLAN-PROJECT GUIDE (English)
Document6 pages
PGDLAN-PROJECT GUIDE (English)
Smita Mohanty
No ratings yet
Poornima University, Jaipur: Report of Technical Seminar On Introduction To Hadoop
Document13 pages
Poornima University, Jaipur: Report of Technical Seminar On Introduction To Hadoop
Ajay Gupta
No ratings yet
FYP Proposal Template For Development
Document9 pages
FYP Proposal Template For Development
Areej 56
No ratings yet
Bachelor of Computer Applications: Address Book
Document62 pages
Bachelor of Computer Applications: Address Book
Anonymous pTp5YokwWj
No ratings yet
Online Training Institute: A Project Report
Document22 pages
Online Training Institute: A Project Report
rudra saha
No ratings yet
Comsats University Islamabad: Department of Computer Science
Document14 pages
Comsats University Islamabad: Department of Computer Science
Muhammad Faraz Saleem
No ratings yet
Micro-Project: Government Polytechnic Solapur
Document25 pages
Micro-Project: Government Polytechnic Solapur
adbfasdf
No ratings yet
Chapter 1: Introduction: 1.1 Background
Document14 pages
Chapter 1: Introduction: 1.1 Background
anis
No ratings yet
Major Project File
Document18 pages
Major Project File
Adarsh Srivastava
No ratings yet
College Merit List &time Table Generator: A Project Report
Document11 pages
College Merit List &time Table Generator: A Project Report
pradhyumn gupta
No ratings yet
Individual Project
Document16 pages
Individual Project
api-252215967
No ratings yet
2021 V12i757
Document4 pages
2021 V12i757
YASH SHARMA
No ratings yet
Workforce Performance Builder 9 Day Desktop Agenda v2
Document6 pages
Workforce Performance Builder 9 Day Desktop Agenda v2
Bora Gür
No ratings yet
Ip Porject Done by (Suman, Aditya, Deepak)
Document33 pages
Ip Porject Done by (Suman, Aditya, Deepak)
venkat manoj
No ratings yet
FYP Proposal Template For Development
Document9 pages
FYP Proposal Template For Development
Bilal Tahir
No ratings yet
Micro Project: Shri H. H. J. B. Polytechnic
Document12 pages
Micro Project: Shri H. H. J. B. Polytechnic
borse
100% (1)
FYP Proposal Template
Document9 pages
FYP Proposal Template
20014198-085
No ratings yet
Synopsis
Document7 pages
Synopsis
Mohit Desarda
No ratings yet
R&D of University
Document17 pages
R&D of University
Nitin Gera
No ratings yet
A Project Synopsis On Online Notice Boar
Document29 pages
A Project Synopsis On Online Notice Boar
Maltesh Kammar
No ratings yet
Student Result Management System Project Reportdocx 3 PDF Free
Document49 pages
Student Result Management System Project Reportdocx 3 PDF Free
Sagar Chauhan
No ratings yet
1 Aditi Project Draft 4
Document15 pages
1 Aditi Project Draft 4
Aditya
No ratings yet
Student Result Management System Project Report
Document49 pages
Student Result Management System Project Report
Nishant Chaudhary
65% (49)
Student Result System
Document12 pages
Student Result System
gayan wickramarathna
No ratings yet
Gui SMS Microproject
Document16 pages
Gui SMS Microproject
chopademehul
No ratings yet
Java MP
Document18 pages
Java MP
556-Harsh Tandel
No ratings yet
Visualising and Forecasting Stocks Using Dash
Document4 pages
Visualising and Forecasting Stocks Using Dash
AZHARUDEEN S
No ratings yet
Student Attendance Master
Document43 pages
Student Attendance Master
Palas Manna
100% (1)
Deogiri Institute of Technology Andmanagement Studies, Aurangabad
Document45 pages
Deogiri Institute of Technology Andmanagement Studies, Aurangabad
Najiya Shaikh
No ratings yet
Summer Training Report by Swarnima (04720602020)
Document50 pages
Summer Training Report by Swarnima (04720602020)
t6890014
No ratings yet
Training For Bigdata and Hadoop: #I Background and Introduction
Document9 pages
Training For Bigdata and Hadoop: #I Background and Introduction
Rashtra Bhushan
No ratings yet
Synopsis - Note Sharing Application Using Django
Document12 pages
Synopsis - Note Sharing Application Using Django
naina nautiyal
No ratings yet
Praveen PDF
Document65 pages
Praveen PDF
avnish
No ratings yet
Sampe Project Report
Document75 pages
Sampe Project Report
y 2j
No ratings yet
Projectdefinitionhsai
Document4 pages
Projectdefinitionhsai
api-482724270
No ratings yet
1.final Yr - UG Calender - Winter-2022 - FINAL - 14-06-2022
Document15 pages
1.final Yr - UG Calender - Winter-2022 - FINAL - 14-06-2022
kumbharpratiksha94
No ratings yet
Project Report Quiz
Document27 pages
Project Report Quiz
VD Deepakk
No ratings yet
WWW - Kashipara.in: A Project Report On
Document58 pages
WWW - Kashipara.in: A Project Report On
prince jha
No ratings yet
Document 1
Document32 pages
Document 1
krishadhikari25
No ratings yet
Dept of Mca: G.L. Bajaj Institute of Technology & Management, Greater Noida
Document4 pages
Dept of Mca: G.L. Bajaj Institute of Technology & Management, Greater Noida
Satyam Srivastava
No ratings yet
Design & Development of Website For DIT: A Project Report On
Document65 pages
Design & Development of Website For DIT: A Project Report On
avnish
No ratings yet
"Title of Project Based Learning": Firstname Last Name
Document18 pages
"Title of Project Based Learning": Firstname Last Name
Siddhant Giri
No ratings yet
PHP - Kamlesh Khatri
Document40 pages
PHP - Kamlesh Khatri
Kamlesh Khatri
No ratings yet
T-64 - Wikipedia
Document22 pages
T-64 - Wikipedia
danko1du2458
No ratings yet
Liebert-HPC S 006 022-TS-EN-EMEA-273571
Document54 pages
Liebert-HPC S 006 022-TS-EN-EMEA-273571
Breno ETCENGENHARIA
No ratings yet
Java Fundamentals 7-1: Classes, Objects, and Methods Practice Activities
Document4 pages
Java Fundamentals 7-1: Classes, Objects, and Methods Practice Activities
203 057 Rahman Qolbi
No ratings yet
myOUM App
Document12 pages
myOUM App
Kee CH
No ratings yet
David 2105 PrelimExams
Document10 pages
David 2105 PrelimExams
JAZPER DAVID
No ratings yet
Veri Log Tutorial
Document10 pages
Veri Log Tutorial
Mubashir Ali
No ratings yet
6060fs Manual de Operación PDF
Document446 pages
6060fs Manual de Operación PDF
Victor Ballesteros
100% (1)
WP Rubrik Zero Trust Microsoft Environments
Document8 pages
WP Rubrik Zero Trust Microsoft Environments
Simone Tormen
No ratings yet
01 Hardware and Loop
Document43 pages
01 Hardware and Loop
karthick
No ratings yet
PDR C366veh
Document3 pages
PDR C366veh
GI Calbuth
No ratings yet
Yzfr1 2007 PDF
Document442 pages
Yzfr1 2007 PDF
Michał Fujak
No ratings yet
MTZ Worldwide April 2011 PDF
Document63 pages
MTZ Worldwide April 2011 PDF
Tamara Siqueira
No ratings yet
SJ-20141127113509-001-ZXSDR R8872A (HV1.0) Product Description - 732736
Document20 pages
SJ-20141127113509-001-ZXSDR R8872A (HV1.0) Product Description - 732736
Rehan Haider Jaffery
No ratings yet
Arrow Product Overview
Document13 pages
Arrow Product Overview
Rafael Zurita
100% (1)
Electrical Machines and Power-Electronic Systems For High-Power Wind Energy Generation Applications
Document38 pages
Electrical Machines and Power-Electronic Systems For High-Power Wind Energy Generation Applications
mmr
No ratings yet
Castrol Oil
Document2 pages
Castrol Oil
Ji Boy
No ratings yet
Course Content: SAP Fiori Implementation (SAPX03)
Document3 pages
Course Content: SAP Fiori Implementation (SAPX03)
Jathin Varma Kanumuri
No ratings yet
Technical Spec Agitator
Document74 pages
Technical Spec Agitator
naresh kumar
No ratings yet
Balachandar Sridharan: Devops Engineer
Document3 pages
Balachandar Sridharan: Devops Engineer
Vishnu Selvam
No ratings yet
Full Question Bank by Ravi Taori 201025122739
Document345 pages
Full Question Bank by Ravi Taori 201025122739
Mr. Retrospector
No ratings yet
Chapter Two Total Quality Management - PPT Download
Document18 pages
Chapter Two Total Quality Management - PPT Download
Bhuvanesh Bala
No ratings yet
App Mdtk-Wk-Elc-Vdr-Ont-Dwg-001 - 001 - Ga Drawing For Panel Distribution Board
Document83 pages
App Mdtk-Wk-Elc-Vdr-Ont-Dwg-001 - 001 - Ga Drawing For Panel Distribution Board
Dilara Azqila Yasmin
No ratings yet
Scada Compone NTS: Prepared By:-Animesh Ghosh Roll No - 4 M.Tech (EE)
Document28 pages
Scada Compone NTS: Prepared By:-Animesh Ghosh Roll No - 4 M.Tech (EE)
Animesh Ghosh
No ratings yet
Oracle Golden Gate Microservices Installation 191 - 220628 - 082404
Document16 pages
Oracle Golden Gate Microservices Installation 191 - 220628 - 082404
ganesh rajan
No ratings yet
Navman MULTI 3100 en
Document19 pages
Navman MULTI 3100 en
Pepe Marí Mayans
No ratings yet
Catalogue PDF
Document66 pages
Catalogue PDF
Mariano Reyes
No ratings yet
PDF Mobieye 700 Service Manual v10 en 1632361886 Compress
Document20 pages
PDF Mobieye 700 Service Manual v10 en 1632361886 Compress
DONOVAN FELIPE CORDOBA MARTINEZ
100% (1)
PYTHON For Class9
Document5 pages
PYTHON For Class9
Aura
No ratings yet
Samarpan: "Where Ideas Flow Without Any Resistance"
Document17 pages
Samarpan: "Where Ideas Flow Without Any Resistance"
Ayush Munjal
No ratings yet
Jurnal 3
Document3 pages
Jurnal 3
Arin Rujin
No ratings yet