You are on page 1of 7

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/221389517

Writer Identification for Handwritten Telugu Documents Using Directional


Morphological Features

Conference Paper · November 2010


DOI: 10.1109/ICFHR.2010.108 · Source: DBLP

CITATIONS READS
22 2,653

3 authors:

Pulak Purkait Rajesh Kumar


Toshiba Research Europe Limited Institute of Forensic Science, Aurangabad
38 PUBLICATIONS   319 CITATIONS    7 PUBLICATIONS   231 CITATIONS   

SEE PROFILE SEE PROFILE

Bhabatosh Chanda
Indian Statistical Institute
213 PUBLICATIONS   2,761 CITATIONS   

SEE PROFILE

Some of the authors of this publication are also working on these related projects:

Phd subject View project

Document Image Analysis (DIA) View project

All content following this page was uploaded by Pulak Purkait on 31 March 2015.

The user has requested enhancement of the downloaded file.


Writer Identification for Handwritten Telugu Documents using Directional
Morphological Features

Pulak Purkait1 , Rajesh Kumar2 and Bhabatosh Chanda1


1
ECSU, Indian Statistical Institute, Kolkata, India
2
Directorate of Forensic Science, MHA, GOI, New Delhi, India
{pulak r, chanda}@isical.ac.in and rajeshkumar512@gmail.com

Abstract dividuals of different age groups, sex and educational


background using image processing and pattern recog-
Linking a person based on handwritten documents is nition techniques.
one of the oldest technique that is used by crime investi- Writer identification methods can be divided into
gators and forensic scientists. The importance of writer two major categories: text independent and text depen-
recognition in anthrax letter cases has made this exami- dent strategies. In text independent strategies a code
nation popular in recent years. In this paper we propose book of graphemes is generated from the text block and
four feature set namely directional opening, directional the features are extracted from these graphemes [1]. It
closing, direction erosion and k-curvature features for is text independent in the sense that the features doesn’t
writer recognition for Telugu handwritten documents. depend on the semantic content of the text. On the other
Each of the features are extracted from the words after hand text dependent approach is based on comparison
dividing them into a number of cells and then subjected of characters, words and/or line based on their seman-
to a nearest neighbor classifier for writer recognition. tic content. The text dependent approach is similar to
Although the results of each of the feature set is quite en- a forensic document examiner approach and in the cur-
couraging, the directional opening feature outperforms rent study we follow the same.
other feature sets. In text dependent approach words are looking more
powerful than characters as far as writer recognition
Keywords: Writer Identification, Telugu handwrit- is concerned, more discriminatory power of word than
ten documents, directional morphological features, k- characters is also evident from the literature [12, 13].
curvature . A writer may have the habit to write a character differ-
ently at different position in a word. To illustrate this
phenomenon we take an example of English words. In
1 Introduction the Figure 1 one can see the difference in shape and de-
sign of ’a’, ’r’ and ’t’ in three different words although
Linking a person based on handwritten documents is the writer for all the three words are the same. Vari-
one of the most oldest technique that is used by crime ation in the design of these characters are the part of
investigators and forensic scientists. It is evident from natural variation (intra writer variation) of that particu-
the literature [9] that Osborne(1929) [5] was the one of lar writer. Thus if a writer recognition system is based
the earliest researcher who has given a systematic ap- on character features only, these words may be recog-
proach for questioned document examination and writer nized as written by different writers and gives enough
recognition. scope of error. Similar kind of thing is applicable for
Writer Identification is based on the hypothesis that Telugu script also. The above argument gives sufficient
every individual has its own writing habits and that is reason to choose words for the writer identification and
sufficiently different from others. This hypothesis was verification purposes for the current study.
validated by different researchers starting from Osborne Research on writer Identification is as old as hand-
who has used New comb rule of probability to give a writing itself. As far as automatic writer Identification
statistical basis for document examination [3]. In recent is concerned a comprehensive review of work done be-
years Srihari [11] has validated the hypothesis by doing fore 1989 is given in [7]. A survey of recent research in
an extensive experiment on the handwriting of 1500 in- this field can be found in [1]. Although lots of research

1
(a) (b) (c)

Figure 1. An illustration of variations of


characters in different words written by
same writer (a): ‘art’ (b): ‘rat’ and (c)‘tar’

is going on in this field, there are two major issues that


make this problem challenging : natural variation and
disguise. Natural variation is variation in handwriting
Figure 2. An exemplar of handwritten Tel-
of an individual writer. The disguise is a kind of auto
ugu document
forgery by a writer to deny his or her writing later on
for some benefits. In this paper we are not addressing
any kind of forgery.
In this paper we deal with the problem using mor- document to do further analysis. Each word image is
phological directional features and k-curvature features. thinned and then skeleton is dilated with ‘square’ struc-
After pre-processing for the scanned image of Telugu turing element of fixed radius to normalize the stroke
handwritten document, features are extracted from each width.
word. Segmented word of similar content is size nor-
malized and divided into a number of cells of fixed 3 Identification Framework
size. Features are extracted from each cell and the fea-
ture vectors of different words due to different writer is 3.1 Feature Extraction
subjected to nearest neighbor classifier for identification
purposes. Line quality is one of the most important charac-
The rest of the paper is organized as follows. In teristics used by forensic document examiners to dis-
Section 2, we describe about the handwritten Telugu tinguish between different writers. Good line quality
data set and their pre-processing. Recognition scheme means smooth and round curve throughout the writing
is proposed in Section 3. Results and discussions on while the poor line quality is having tremors in strokes
the proposed method is placed in Section 4 followed and rough curves throughout the handwriting. So line
by the concluding remarks and future directions of the quality encompasses many features that stem from the
present work in Section 5. dynamic processes used to guide the writing instrument
as it moves across the paper. These elements include
2 Data acqusition and pre-processing pen pressure, speed of execution, pen lifts, consistency
and uniformity of the writing, rhythm and writing skills
An exemplar of Telugu data is prepared such that it [4]. Writings that are exhibiting poor line quality is in-
contains almost every character of Telugu alphabet and dicative of an inexperience writer or one who is suffer-
also some frequently used words. 110 handwritten doc- ing from physical and mental disability.
uments from 22 writers, 5 from each is prepared. All the To capture the line quality (structural information)
five documents of a single writer are written in his/her we propose four sets of features namely directional
most natural writing habits. opening, direction closing, directional erosion and k-
Each document is digitized at 300 dpi using a flatbed curvature [8, 2] based features.
scanner as a gray-scale image. The gray-scale doc- Arrangement of characters in a word is as important
ument image is converted into binary using Otsu’s as other individuality like structure and design of writ-
method [6]. Morphological filter [10] (opening fol- ing strokes. The deviation of different characters from
lowed by closing) is used for noise removal. While ex- baseline may be considered as one of the most impor-
amining the document it is not necessary to take each tant features. To get the deviation of strokes from base-
and every word into consideration, even forensic doc- line we draw a compact bounding box of the word and
ument examiners search for the most frequent words divide it into a number of cells. The size and number
with peculiarities. So 10 words are extracted from each of cell may vary according to the length of the word.

2
The features are extracted from each cell of a word to
capture the arrangement of strokes in a word. In this
investigation we have tried different sizes of cell; how-
ever a cell of size 18 × 18 gives reasonably good result.
So for the entire investigation we have taken cells of the
same dimension. Figure 3. Four structuring element (a)-(d)
with center as the first element used to
3.1.1 Directional Opening find the images containing the portion of
skeleton in the respective direction.
The pre-processed image is opened in four direction
using a ‘line’ structuring element of a size of twice
the average width of the strokes and the normalized
3.1.4 k-curvature features
mass of the opened image in each cell is taken as a
feature. Let I be the pre-processed word image and The slope of chain code at any point is a multiple of
[S 1 , S 2 , S 3 , S 4 ] be the four ‘line’ structuring element 45◦ , and the slope of a crack code is a multiple of 90◦ .
in four direction (horizontal, vertical and two diagonal In order to arrest finer variation in slope, we use some
directions). Let N be the number of cells in which a type of smoothing over, say, k such chain code to define
particular word has been divided and mj i be the mass k-curvature.
of a word in j th cell after opening I with the structuring Here left and right k-slopes at a point P on a curve are
element S i . The normalized mass Mj i of that partic- defined as the slopes of the line joining P to the points
ular word in j th cell after opening with S i is given as k steps away along the curve on each side of P, and the
mij
Mji = maxj=1,···,N (mij )
. Thus for a word with number k-curvature of P as the difference between its left and
of cells N, we have a directional opening feature vector right k-slopes. In this definition it is assumed that the
of length 4N. curve from P to the fixed k-steps away can be approx-
imated by a straight line segment. This assumption is
better satisfied if k is small. On the other hand, in the
3.1.2 Directional Closing case of small k, the slope may be influenced by small
perturbation due to noise. So value of k is selected com-
Instead of opening mentioned above, the pre-processed promising these two conflicting characteristics of slope
word image is closed with the same ‘line’ structuring measure.
elements and normalized mass of the closed image in We calculate k-slopes at each point of the morpho-
each of the cell is taken as a feature. Normalization is logical skeleton of the object as the angle with respect
also done in the same way. Thus for a word with number to horizontal axis, and hence the k-curvature as acute
of cells N, a directional closing feature vector is also of angle between left and right segment at P.
length 4N. We take k = 4 and observe that the value of k-
curvature in most of the pixels lies between 90◦ and
3.1.3 Directional Erosion 180◦ , with a bias towards 180◦. This suggests to finer
bias in the interval after 90◦ at the time of quantization.
Another morphological feature is obtained from the pre- So we quantize the interval [0◦ , 180◦ ] into five unequal
processed skeleton image X. Skeleton image is eroded bins as follows : 0◦ ≤ θ < 90◦ , 90◦ ≤ θ < 120◦,
by the diagonal structuring elements shown in figure 120◦ ≤ θ < 140◦ , 140◦ ≤ θ < 160◦ and 160◦ ≤ θ ≤
3(a)-3(d), to get the number of co-occurrences of skele- 180◦ . We make histogram on the basis of this distri-
tal pixels in four different direction. Here structuring el- bution. Now there are four possibility as shown in the
ements are S 1 = {(0, 0), (1, 1)}, S 2 = {(1, 0), (0, 1)}, figure 4(a)-(d) which are different in terms of charac-
S 3 = {(0, 0), (1, 0)}, and S 4 = {(0, 0), (0, 1)}. Each ter shape but their histograms of k-curvature are nearly
image X̂i = XΘS i , i = 1, ..., 4 contains the pixels at identical.
which a particular directional co-occurrences on skele- So we need to distinguish among the histogram of
ton occur, where Θ is the morphological erosion op- curve segments shown in Figure 4. The first two i.e. (a)
eration. Then we divide each image X̂i into N cells and (b) and the rest two i.e. (c) and (d) can be sepa-
and make a normalized histogram of directional co- rated by the difference of the row number and column
occurrences (sji , j = 1, ..., N ) of portion of skeleton number of the end points of the curve segments. Then
in each cell. In this case also we get a feature vector of they may be further distinguished by checking the loca-
length 4N. tion of mid-point with respect to the straight line joining

3
Table 3. Inter writer variation of characters
’pra’ by different writers

Figure 4. Four possibilities (a)-(d) where (a) (b) (c) (d) (e) (f)
histogram of k-curvature are identical.

the end points of the curve segments. Then we divide Y


k
the image into cells of size 18 × 18 and calculate the Pi (A) = pi j (1)
normalized histogram having (5 × 4) 20 bins for each j=1
cell to get 20N dimension feature vector, where N is the
number of cells into which the word image is divided. The authorship of the document A is decided based
on the maximum probability Pi (A), i = 1, 2, · · · , W .

3.2 Writer Identification 4 Results and Discussions

Writer Identification is a one to many comparison From Table 1, One can see that the directional open-
between a query document and the database documents ing feature set outperforms other feature sets. A max-
associated with different writers. Features from a docu- imum accuracy of 82.70% is obtained for word with
ment is compared to features from the document of all word index 6. For that particular word (word index 6)
the writers in the database and the inference is made every feature set gives the best performance in their re-
against the comparisons. For writer identification we spective class.
have followed ‘Leave-one out’ strategy. Out of 5 doc- From the table, it is also evident that peculiarities of a
uments from each writer, one document is kept out for word is more important than the length of the word. The
testing and remaining 4 documents are used as training longest words with word index 5 and 2 could not per-
data. All the combinations of 1 document out was tried form well in comparison to the words with word index
and an average accuracy is reported in results and dis- 6 and 1 having more peculiar characters (first character
cussions section. i.e. ‘pra’ of these words)with lots of variations within
For Writer identification nearest neighbor looks an writers. Table 3 shows inter writer variation of charac-
obvious choice since the number of writers (class) may ter ’pra’ by six different writers. Variations shown in
vary. We use Euclidean distance as similarity measure Table 3 support the higher accuracy of word with word
between different words. The writer of the query or index 6 and 1.
test document is identified if the distance to the near- Table 2 depicts the performance of nearest neighbor
est neighbor is less than a predefined threshold. when the different number of words is combined to take
decision on a document according to Section 3.4. Each
of the four feature sets could link more than 98% doc-
3.3 Combining word-level performance for a uments with their respective writers when ten words
document is taken into account. Directional opening feature is
the highest performer over all the four feature sets that
Forensic document examiners, while examining the could link all the documents with their respective writ-
documents for writer identification look for individu- ers. Merely 5 words in combination gives an accuracy
ality of different words and make a decision based on of more than 90% with all the four feature set.
the combined performance (individuality) of different
words in a document. Following a similar strategy to 5 Conclusion and Future works
link a writer from a document we also combine the per-
formances of different words in a document. In the present study we have proposed four sets of
Let pi j be the probability of a word wj of a particular feature namely directional opening, directional clos-
document A to be of written by the writer i. The proba- ing, directional erosion and k-curvature features for the
bility of a document A having words wj , j = 1, 2, · · · , k writer recognition for handwritten Telugu documents.
to be of writer i can be given as After noise removal, 10 words are segmented from each

4
Table 1. Identification results on individual words
word index word Directional opening Directional closing Directional erosion k-curvature
1 75.00 67.47 69.09 61.82
2 68.18 65.45 46.36 63.64
3 78.18 76.36 53.63 62.72
4 67.27 69.09 60.00 57.27
5 60.90 60.90 52.72 72.73
6 82.72 82.72 72.78 72.73
7 69.09 64.55 60.90 68.18
8 72.73 68.18 67.27 65.45
9 72.73 70.00 44.54 56.36
10 70.00 60.90 53.63 57.27

Table 2. writer identification on document level by combining the performance of individual


word
No. of words Directional opening Directional closing Directional erosion k-curvature
1 71.73 68.55 58.09 63.82
2 86.89 84.32 72.18 79.49
3 93.12 91.73 81.53 87.65
5 98.21 97.38 90.91 94.71
7 99.29 98.67 95.03 97.64
10 100.00 99.09 98.18 99.09

document. Segmented word of similar content is size [2] B. Chanda and D. D. Majumdar. Digital Image Process-
normalized and divided into a number of cells of fixed ing and Analysis. Prentiece Hall of India, New Delhi,
size. All the feature sets are extracted from 10 different 2005.
words of each of the document in the data set. The pro- [3] R. A. Huber and A. M. Headrick. Handwriting Identifi-
posed features give an encouraging results on both word cation: facts and fundamentals. CRC Press, 1999.
[4] J. S. Kelly and B. S. Lindblom. Scientific Examina-
and document level but the directional opening outper- tion of Questioned Documents. Taylor and Francis, New
forms other feature set. York, 2006.
We will continue our investigation as future work to [5] A. S. Osborne. Questioned Documents. Boyd Printing
see the usefulness of the proposed features on a larger Co., New York, 1929.
database. [6] N. Otsu. A threshold selection method from gray-level
histogram. IEEE Trans. System, Man and Cybernatics,
Acknowledgements 6:62–66, 1979.
[7] R. Plamondon and G.Lorrete. Automatic signature veri-
fication and writer identification -the state of art. Pattern
We would like to acknowledge Venkatappaiah Ku- Recognition, 22(2):107–131, 1989.
rapati for collection and preparation of digital Telugu [8] A. Rosenfeld and A. C. Kak. Digital Picture Process-
handwritten documents. ing, Volume 2. Academic Press, London, 1982.
[9] R. E. Saferstien. Criminalistics: An Introduction To
References Forensic Science. Prentice Hall, 2006.
[10] P. Soille. Morphological Image Analysis. Springer Ver-
lag, Berlin, 1999.
[1] M. Bulacu and L. Schomaker. Text-independent writer
[11] S. N. Srihari, S. H. Cha, H. Arora, and S. Lee. Indi-
identification and verification using textural and allo-
viduality of handwriting. Journal of Forensic Sciences,
graphic features. IEEE Trans. on Pattern Analysis and
47(4):1–17, 2002.
Machine Intelligence, 29(4):701–717, 2007.

5
[12] C. I. Tomai, B. Zhang, and S.N.Srihari. Discrimina- dividuality using word features. In Seventh Interna-
tory power of handwritten words for writer recogni- tional Conference on Document Analysis and Recogni-
tion. In Seventeenth International Conference on Pat- tion, Edinburgh, pages 1142–1146, 2003.
tern Recognition, Cambridge, pages 638–641, 2004.
[13] B. Zhang and S.N.Srihari. Analysis of handwriting in-

View publication stats

You might also like