Professional Documents
Culture Documents
Steps Involved in Text Recognition and Recent Research in OCR A Study
Steps Involved in Text Recognition and Recent Research in OCR A Study
C. Preprocessing methods
Published By:
Retrieval Number A2670058119/19©BEIESP Blue Eyes Intelligence Engineering &
3095 Sciences Publication
Steps Involved in Text Recognition and Recent Research in OCR; A Study
(a)
(c)
Fig.2 Edge Detection (a) Original Image(b)
Prewittmethod (c) Cannymethod
Spatial
Image
Filtering
Process
Point Mask
processing processing
techniques techniques
Published By:
Retrieval Number A2670058119/19©BEIESP 3096 Blue Eyes Intelligence Engineering &
Sciences Publication
International Journal of Recent Technology and Engineering (IJRTE)
ISSN: 2277-3878, Volume-8, Issue-1, May 2019
(a)
(b)
(c) (d)
Fig. 5 (a) Original Image (b) Contrast Enhancement using understanding problems to normalize variations in
Histogram Equalization (c) Original imageHistogram(d) illumination.For each level of brightness 'j' in the actual
Histogram of the processed image image, the level value (k) for the new pixel can be
calculated as,
Histogram equalization: Histogram equalization is a global i
i 0
type method thatgives the histogram of the whole pixels (0- (4)
255) range and this technique can be applied in image
Published By:
Retrieval Number A2670058119/19©BEIESP 3097 Blue Eyes Intelligence Engineering &
Sciences Publication
Steps Involved in Text Recognition and Recent Research in OCR; A Study
WhereT is thetotal pixels in an image (Russ 2007). The selecting or searching most relevant features (Mohamad et
contrast enhancement of an image using histogram al. 2015).
equalization and its histogram has been shown in figure 5.
Feature set should have a sufficient discriminating power to
G. Mask Processing Techniques enable the accurate classification even among very similar
symbols. Thus, features which are beneficial for
Averaging filters: Average filter also known as mean filter classification ensure that objects from different classes have
is auncomplicated method for smoothing images.It different values; objects from the identical class have similar
minimizes the intensity variation amount between adjacent feature values. A good feature set should be efficient about
pixels also used to decrease or remove the noise. The computation time.
averaging filter will be acting as a low-pass frequency
filterthatreduces the intensity of spatial derivatives in the K. Text Recognition
image. Mean filterdepends on a kernel that represents
thesize and shapeof the neighborhood pixelsto be sampled Considering the type of text, machine-printed and
duringmean value computation. The 3×3 square kernel or handwritten OCR methods are two majorpart of interest in
mask is shown in Figure 6 (Mori 2010);other large masks the text recognition. The difficulties for fixed and multi-font
like 5×5, 7x7, and 9x9 could be used for smoothing in OCR are solved with little constraints.The individual
severe case. character separation of full text image are very important
step in recognition.
H. Segmentation
J. Feature Extraction
Most of the Chinese characters
Feature extraction is the stage to eliminate redundancy from havecomparatively independent
the data. Theclassification accuracy can be enhanced by substructurescalled radicals
Published By:
Retrieval Number A2670058119/19©BEIESP 3098 Blue Eyes Intelligence Engineering &
Sciences Publication
International Journal of Recent Technology and Engineering (IJRTE)
ISSN: 2277-3878, Volume-8, Issue-1, May 2019
Published By:
Retrieval Number A2670058119/19©BEIESP 3099 Blue Eyes Intelligence Engineering &
Sciences Publication
Steps Involved in Text Recognition and Recent Research in OCR; A Study
REFERENCE
Published By:
Retrieval Number A2670058119/19©BEIESP 3100 Blue Eyes Intelligence Engineering &
Sciences Publication