PES College of Engineering: Seminar on "Handwriting to Text Conversion"
SEMINAR ON:
“Handwriting to Text Conversion”
Submitted by:
Aniket Gangawane 21
Akshay Wankhede 107
Shahid Jaber Shaikh 106
Krushna Bangar 64
Shahid Shaikh 105
GUIDE:
Dr. V.B. Kamble
Outline
Introduction
Challenges in Handwriting Recognition
Objective
Requirement
System Development
Methodology
Conclusion
References
Introduction
The Optical Character Recognition (OCR) market is expected to reach USD 13.38 billion by 2025,
with year-on-year growth of 13.7%. This growth is driven by the rapid digitization of business
processes, in which OCR is used to reduce labor costs and save man-hours. Although OCR is often
considered a solved problem, one key component of it, Handwriting Recognition or Handwritten
Text Recognition (HTR), remains challenging. The high variance in handwriting styles across
people and the poor quality of handwritten text compared to printed text pose significant
hurdles in converting it to machine-readable text. Nevertheless, it is a crucial problem to
solve for industries such as healthcare, insurance and banking.
Recent advances in deep learning, such as the advent of transformer architectures, have
accelerated progress on handwritten text recognition. Recognizing handwritten text is termed
Intelligent Character Recognition (ICR) because the algorithms it requires need much more
intelligence than those for generic OCR.
Challenges in Handwriting Recognition:
1. Strokes vary widely and ambiguously from person to person.
2. An individual's handwriting style is inconsistent and also varies
from time to time.
3. The source document or image is often of poor quality because it has
degraded over time.
4. Text in printed documents sits in a straight line, whereas a person
writing on blank white paper need not keep a line of text straight.
5. Cursive handwriting makes separating and recognizing individual
characters challenging.
6. Handwritten text can slant to the right by a variable amount, in
contrast to printed text, where all the text sits upright.
7. Collecting a good labelled dataset for training is expensive compared
with generating synthetic data.
Disadvantages of Existing System