Professional Documents
Culture Documents
handwritten characters
a review
Dl D2 D3 D4 D5 11 I2 I3 I4
features Transformation - + + + + 0 - - 0
Zoning _ 0 - - 0 - + + -
Moments 0 0 - + + 0 - 0 -
(13) comple.uity qf implementation; and the densities of points in these regions are used
(14) independence ~ whether supplementary as feature vectors.
techniques are needed. Moments. The moments of black points about a
chosen centre, such as the centre of gravity, are used
The above criteria are applied to compare recognition as features.
techniques which can be classified into three categories, n-tuples. The occurrence of black or white elements,
according to the way in which features are extracted or joint occurrences of these elements, are used as features.
Characteristic loci. For each white (i.e. empty) point
l global features in the background of the character, vertical and horizon-
l distribution of points tal vectors are generated, and the number of times the
0 topological features line segments are intersected by these vectors are used
as features.
Crossings and distances. For crossings, features are
Global features measured by the number of times line segments are
crossed by vectors in specified directions. In the case
This technique extracts features from every point inside of distances, the distances of elements or line segments
a frame surrounding the character, and features show from a given boundary, such as the frame, are used
no reflection of any local, geometrical or topological as features.
properties.
Template matching and correlations. Features are
Geometrical and topological features
determined by the state (white or black) of all points
within the frame. A measure of the similarity between
This approach is based on extracting the geometrical
the input character and store references is obtained by
and topological features which describe the physical
the matching and correlation of points or groups of
makeup of the character. They may represent both global
points in the frame.
and local properties. This is by far the most popular
Transformations and series expansions. Using the
method used by researchers and common features to
points in the frame as features, as in template matching,
extract include:
results in a very large feature vector. (The dimension
will be high because all points in the frame will be held
l strokes and curves in various directions e.g.
in the vector.) It is possible to reduce the dimensionality
and extract features invariant to some global defor-
mation, such as a translation or rotation. Rotational
I I \
transformations include those of Fourier, Walsh, Haar l end points, intersections of line segments and loops,
and Hadamard whilst Karhunen-Loeve use series l stroke relations, angular properties, sharp protrusions
expansions. and intrusions in connection with contour analysis.
The popularity of such an approach can be attributed In the feature list, a loop is signified by a continuous
to its high tolerance to distortion and style variations stroke, starting in one octant, proceeding to the adjacent
when compared with other recognition techniques, as octant, and returning to the original octant. In Figure
well as its ability to tolerate a certain amount of rotation 3a the octagonal grid is shown, with the octant numbers
and translation. used. Figures 3b to 3h show examples of characters to
It is possible to involve decision making at various illustrate the selected features.
stages during the recognition process, and thus avoid There are ten possible symbols, aO,al a9, which
the waste of time and resources if a character can be are assigned to each octant depending on the nature
rejected at an early stage in processing4. of the line segment within that octant. Each octant may
Such a recognition model would be split into two have up to four features, indicating four different stroke
phases, as shown in Figure 2. It starts with global detec- segments, and for any octant having less then four fea-
tion of features, which is then followed by local analysis tures (i.e. parts of strokes) the entries are zero rather
when the first stage provides a non-unique solution, i.e. than one of the symbols a0.. a9.
the character cannot be uniquely identified. The feature vector for a character consists of these
Within these stages four symbols for each octant, or zero, and a boolean
value to indicate whether or not two strokes cross in
The conversion takes place of the physical image to an octant; m = 2 if two strokes do cross, and m = I
a digital pattern through the use of an input device if none cross. The vector can be represented thus
such as a high speed optical scanner.
The preprocessor centres and thins the pattern, and X = (.ul 1,x12,.~13,.~14,m;.u21,.~22,.u23,.~24,m; .;
locates the start and end of each stroke. .uil ,.ui2,xi3,.ui4,m; x8 1..u82..~83..~84,m)
The first stage feature extractor determines the basic
features which are collected as a feature vector. where
The first stage classifier operates on feature vectors
and assigns characters to a primary category or rejects i = 1,2,3...8
.x,, > (a,!,
the character, involving no detailed analysis. j = 1,2,3,4
The first level decision maker uses previous experi- .k = 0,1.2.. . 9
ence, from the learning set, to see if the character
can be uniquely identified without involving more
or
detailed local analysis.
If a non-unique classification has been reached, the
X,, = 0
second stage classifier is used to detect local features
to remove any of the ambiguities realised at the first
stage. Although these features a0.. a9 help categorize a
character with a minimum of feature extraction. it is
not possible to identify uniquely all characters as some
First stage analysis will have identical vectors. In such cases the second stage
classifier will have to remove any ambiguities.
The first stage of the recognition process involves group- E.ytraction of ,fiatures. Once the start, intermediate
ing the input character into one of several subgroups. and end points of each stroke have been found (by the
9 h
Figure 3. The octugonaf grid (a) and examples offeature types (h)-(h)
preprocessor), yielding individual strokes, feature o determining when a stroke crosses a boundary,
extraction is performed. Each stroke is tracked and l calculating features a0. . , a9 as each octant is crossed;
features obtained by: l ordering of features into feature vectors.
Most online systems use as features the sequence of Online systems have been designed to recognize a vary-
subareas traversed by the pen or the sequence of ing set of alphabets from numerals and alphanumerics
directions of the drawing. Typically, feature vectors are to Japanese Katakana characters, as well as for specific
created by means of angles, structure (i.e. clockwise or purposes such as input of computer programs of com-
anti-clockwise curves etc.), zoning or Fourier descrip- puter aided design requiring FORTRAN symbols and
tors. Classi~cation is achieved by a table lookup scheme special symbols for CAD applications.
i.e. check if that vector has been previously encountered
and thus known. In an attempt to reduce the amount
of processing of data, a hierarchical scheme can be Recognition rates
adopted. operating on only part of the feature vector.
This would be particularly advantageous when the In terms of recognition rates it is not normal for an
vector has many dimensions. online system to exceed 95% accuracy, but this figure
may be seen as more than acceptable because: