Integration of Bangla Script Recognition Support in OCRopus (Thesis Presentation by Shouro Chowdhury)

This is the undergraduate thesis presentation by Shouro Chowdhury, who worked on integrating Bangla script recognition support into OCRopus.

Copyright: Attribution Non-Commercial (BY-NC)

Uploaded by Md. Abul Hasnat

Integration of Bangla script recognition support in OCRopus

By
Shouro Chowdhury

Supervisor
Dr. Mumit Khan

Co-supervisor
Md. Abul Hasnat

What is OCRopus?

OCRopus is an open source, state-of-the-art document analysis and OCR system, featuring pluggable layout analysis, pluggable character recognition, statistical natural language modeling, and multi-lingual capabilities.

Reference: "The OCRopus Open Source OCR System", Thomas M. Breuel, Proceedings IS&T/SPIE 20th Annual Symposium 2008.

Why OCRopus?

A complete OCR framework.
Multi-lingual capabilities.
Rich image preprocessing and layout analysis library.
Two recognition engines:
  Tesseract.
  bpnet (multilayer perceptron neural network) classifier.
Statistical natural language model (OpenFST) as a post-processor.
Developed with generous support from Google and other organizations.

Framework overview

Training

Recognition

Training Tesseract

Step 1: Create training image data

Step 2: Make Box file
Step 3: Run Tesseract for Training
Step 4: Clustering
Step 5: Compute the Character Set
Step 6: Prepare Dictionary data files
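
The box file made in Step 2 and the character set computed in Step 5 are closely related. As a rough sketch (this is not the Tesseract tooling itself), and assuming the Tesseract 2.x box format of one `text left bottom right top` entry per line, the distinct labelled units could be collected as follows. `Box`, `parse_box_lines`, and `character_set` are hypothetical names, and the sample coordinates are invented for illustration.

```python
from typing import List, NamedTuple, Set


class Box(NamedTuple):
    text: str      # the character (or character cluster) inside the box
    left: int
    bottom: int
    right: int
    top: int


def parse_box_lines(lines: List[str]) -> List[Box]:
    """Parse box-file lines of the assumed form 'text left bottom right top'."""
    boxes = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        # Split from the right so the text field may hold several code points.
        text, left, bottom, right, top = line.rsplit(maxsplit=4)
        boxes.append(Box(text, int(left), int(bottom), int(right), int(top)))
    return boxes


def character_set(boxes: List[Box]) -> Set[str]:
    """Collect the distinct labelled units that appear in the boxes."""
    return {box.text for box in boxes}


# Invented sample entries, not taken from the presentation.
sample = [
    "ক 10 20 30 48",
    "ত 34 20 52 48",
    "ক 60 20 80 48",
]
boxes = parse_box_lines(sample)
print(sorted(character_set(boxes)))
```

Splitting from the right matters for Bangla, where a single labelled unit may consist of more than one Unicode code point.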

Recognition with Tesseract

Generate a properly segmented character image.
Pass this image to the Tesseract recognizer.

Training bpnet

Two approaches:
From individual character images.
From text line images.
1st approach:
Individual character image.
Character Unicode.
2nd approach:
Text line image.
Color-segmented image of the text line.
Transcription of the text line.

Recognition with bpnet

Generate a color segmented character image.

Pass this image to the bpnet classifier.
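
A color-segmented image records which pixels belong to which character. As a sketch of the idea, assuming a convention in which each segment number is packed into a 24-bit RGB value and white marks the background (the exact encoding OCRopus expects should be checked against its own documentation), a label map could be converted like this. `labels_to_rgb` is a hypothetical helper.

```python
from typing import List, Tuple

RGB = Tuple[int, int, int]
BACKGROUND: RGB = (255, 255, 255)  # assumed background color


def labels_to_rgb(labels: List[List[int]]) -> List[List[RGB]]:
    """Map segment numbers (0 = background) to packed 24-bit RGB colors."""
    image = []
    for row in labels:
        out_row = []
        for label in row:
            if label == 0:
                out_row.append(BACKGROUND)
            else:
                # Pack the segment number into the red, green and blue bytes.
                out_row.append(
                    ((label >> 16) & 0xFF, (label >> 8) & 0xFF, label & 0xFF)
                )
        image.append(out_row)
    return image


# Two segments (1 and 2) on a tiny 2 x 5 label map.
labels = [
    [0, 1, 1, 0, 2],
    [0, 1, 0, 0, 2],
]
rgb = labels_to_rgb(labels)
print(rgb[0])
```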

Example: Training data generation for Tesseract

Step 1: Create training image data

Step 2: Make Box file

Step 3: Run Tesseract for Training

Step 4: Clustering

Example: Training data generation for Tesseract

Step 5: Compute the Character Set

Step 6: Prepare Dictionary data files

Word List : 180K words

Frequent word List : 30K words
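
The slides do not say how the 30K frequent-word list was derived. One plausible approach, shown here only as a sketch, is to count word occurrences in a text corpus and keep the most common entries. The helper name `build_word_lists`, the whitespace tokenization, and the tiny sample corpus are all assumptions.

```python
from collections import Counter
from typing import List, Tuple


def build_word_lists(corpus: str, n_frequent: int) -> Tuple[List[str], List[str]]:
    """Return (all distinct words, the n_frequent most common words)."""
    counts = Counter(corpus.split())  # naive whitespace tokenization
    all_words = sorted(counts)
    frequent = [word for word, _ in counts.most_common(n_frequent)]
    return all_words, frequent


# Invented sample corpus for illustration only.
corpus = "কত কাল কত দিন কত কাল"
words, frequent = build_word_lists(corpus, n_frequent=2)
print(words)
print(frequent)
```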

References:
http://crblpocr.blogspot.com/2008/07/how-to-train-bangla-and-devanagari.html
http://crblpocr.blogspot.com/2008/08/why-tesseract-need-to-train-all.html

Example: Testing Tesseract

Step 1: Generate a properly segmented character image.

Step 2: Pass this image to the Tesseract recognizer.

aদভনণঢড
hOCR output
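
hOCR embeds recognition results in HTML, with each element's bounding box stored in its `title` attribute. The sketch below builds one `ocr_line` element; the function name `hocr_line` and the sample coordinates are illustrative and are not taken from the presentation.

```python
from html import escape


def hocr_line(text: str, x0: int, y0: int, x1: int, y1: int) -> str:
    """Wrap recognized text in an hOCR line span with its bounding box."""
    return '<span class="ocr_line" title="bbox {} {} {} {}">{}</span>'.format(
        x0, y0, x1, y1, escape(text)
    )


print(hocr_line("কত কাল", 12, 30, 240, 72))
```

Escaping the text keeps the output valid HTML even when the recognizer emits characters such as `<` or `&`.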

Example: Training data generation for bpnet

Approach 1: From individual character image

Character image | Character Unicode

Image – transcription mapped list file

Example: Training data generation for bpnet

Approach 2: From line image

Line image

Color image

Line transcription

Image – transcription mapped list file
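
The presentation does not show the layout of the image-transcription list file. Purely as an illustration, the sketch below writes one tab-separated pair of image path and transcription per line. The file layout, the file names, and the helper `write_list_file` are all assumptions rather than the format OCRopus actually requires.

```python
import os
import tempfile
from typing import Iterable, Tuple


def write_list_file(path: str, pairs: Iterable[Tuple[str, str]]) -> None:
    """Write 'image_path<TAB>transcription' lines in UTF-8."""
    with open(path, "w", encoding="utf-8") as handle:
        for image_path, transcription in pairs:
            handle.write(f"{image_path}\t{transcription}\n")


# Hypothetical file names and transcriptions.
pairs = [
    ("line_0001.png", "কত কাল"),
    ("line_0002.png", "সে প্রভাতে"),
]
list_path = os.path.join(tempfile.mkdtemp(), "train.list")
write_list_file(list_path, pairs)

with open(list_path, encoding="utf-8") as handle:
    lines = handle.read().splitlines()
print(lines)
```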

Example: Testing bpnet classifier

Step 1: Generate a color segmented character image.

Fig: Input text Image

Fig: Segmented color Image

Step 2: Pass this image to the bpnet classifier.

কত কাল কবে সে প্রভাতে

hOCR output

Example: Testing bpnet classifier (best NN parameters)

nhidden - 500
epochs - 300
learningrate - 0.2
testportion - 0
normalize - 1
shuffle - 1
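
Of these parameters, `nhidden`, `epochs`, and `learningrate` configure the network, while `testportion`, `normalize`, and `shuffle` govern how the samples are handled. The slides do not define their exact semantics in bpnet, so the sketch below shows only one plausible reading of the three data-handling parameters: the per-vector min-max scaling, the split rule, and the helper name `prepare` are assumptions.

```python
import random
from typing import List, Sequence, Tuple

Sample = Tuple[List[float], str]  # (feature vector, class label)

# Values reported on the slide.
PARAMS = {"nhidden": 500, "epochs": 300, "learningrate": 0.2,
          "testportion": 0, "normalize": 1, "shuffle": 1}


def prepare(samples: Sequence[Sample], testportion: float, normalize: int,
            shuffle: int, seed: int = 0) -> Tuple[List[Sample], List[Sample]]:
    """Return (train, test) after optional scaling and shuffling."""
    data = [(list(features), label) for features, label in samples]
    if normalize:
        # Assumed meaning: scale each feature vector into the range [0, 1].
        for features, _ in data:
            low, high = min(features), max(features)
            span = (high - low) or 1.0  # avoid dividing by zero
            features[:] = [(value - low) / span for value in features]
    if shuffle:
        random.Random(seed).shuffle(data)
    n_test = int(len(data) * testportion)  # number of held-out samples
    return data[n_test:], data[:n_test]


# Invented feature vectors for illustration only.
samples = [([0.0, 128.0, 255.0], "ক"), ([10.0, 10.0, 20.0], "ত")]
train, test = prepare(samples, PARAMS["testportion"],
                      PARAMS["normalize"], PARAMS["shuffle"])
print(len(train), len(test))
```

With `testportion` set to 0, as on the slide, every sample is used for training and none is held out.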

CRBLP segmenter plugin

Four-step character segmentation algorithm.

Projection profile based.
Elimination of the joining errors
Shadow character segmentation
Touching character segmentation
Elimination of splitting errors
Reordering the modifiers
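
To illustrate the projection-profile idea the segmenter is based on, the sketch below counts the ink pixels in each column of a binary image and cuts wherever the count falls to zero. This is only the basic technique, not the CRBLP plugin: it does not handle the Bangla headline (matra) that joins characters, nor the joining-error, splitting-error, and modifier-reordering steps. `vertical_profile` and `split_columns` are hypothetical names.

```python
from typing import List, Tuple


def vertical_profile(image: List[List[int]]) -> List[int]:
    """Count ink pixels (value 1) in every column of a binary image."""
    return [sum(column) for column in zip(*image)]


def split_columns(profile: List[int]) -> List[Tuple[int, int]]:
    """Return (start, end) column ranges, end exclusive, of non-empty runs."""
    segments = []
    start = None
    for x, count in enumerate(profile):
        if count > 0 and start is None:
            start = x                      # a run of ink begins
        elif count == 0 and start is not None:
            segments.append((start, x))    # the run ended before column x
            start = None
    if start is not None:
        segments.append((start, len(profile)))
    return segments


# Tiny binary image: two blobs separated by one empty column.
image = [
    [1, 1, 0, 0, 1, 0],
    [1, 0, 0, 1, 1, 0],
    [1, 1, 0, 0, 1, 0],
]
profile = vertical_profile(image)
print(profile)
print(split_columns(profile))
```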

Acknowledgement

Mark Stillwell (University of Hawaii)

Prof. Thomas Breuel (IUPR)
Ilya Mezhirov (IUPR)
Yves Rangoni (IUPR)
Ray Smith (Google Research)

CRBLP Lab

References

http://code.google.com/p/ocropus/
http://code.google.com/p/tesseract-ocr/
http://code.google.com/p/ocropus-bengali/
http://crblpocr.blogspot.com/
