IV.
E
XPERIMENTAL
S
ETUP
We have developed a rule based transliteration systemusing JAVA SE DEVELOPMENT KIT (JDK), VERSION 6,MYSQL Server 5.5 for database. The program flow and thesystem architecture of Hindi to English rule basedtransliterator is shown in Fig1. The string of Hindi language isfirst searched in the database created for specialized spellingused in proper names, if the string match is found in thedatabase then its transliteration equivalent is produced
.
If thestring is not found in the database, it is then transliteratedusing nine rules given below:
Observations and Rules Creation
Sample results of our experimentation are shown in TablesIII-XI where sample Hindi strings, expected Englishtransliteration (commonly used English spellings),transliteration through ITRANS mappings and resultsproduced by our transliteration system are shown.
TABLE
IIIS
AMPLE
1
(
TOTAL
45
STRINGS TESTED
)
HindiStringExpectedTransliterationThroughMappingRule BasedTransliteration
पलक
PALAK PALAKA PALAK
अभय
ABHAY ABHAYA ABHAY
भरत
BHARAT BHARATA BHARAT
करण
KARAN KARANA KARAN
1)
Rule1:
The observation of Table III gives the rule that if length of proper noun is of 3 characters containing no vowel(matraa)
then ‘
A
’ is removed from the last position.
TABLE
IVS
AMPLE
2
(
TOTAL
45
STRINGS TESTED
)
HindiString
ExpectedTransliteration
ThroughMapping
Rule BasedTransliteration
हरशद
HARSHAD HARASHADA HARSHAD
तनमय
TANMAY TANAMAYA TANMAY
अरनव
ARNAV ARANAVA ARNAV
अकबर
AKBAR AKABARA AKBAR
2)
Rule2:
The observation of Table IV gives the rule that if length of proper noun is of 4 characters containing novowel (matraa
) then ‘
A
’ is removed from the second and
last position
.
TABLE
VS
AMPLE
3
(
TOTAL
45
STRINGS TESTED
)
HindiString
ExpectedTransliteration
ThroughMapping
Rule BasedTransliteration
कपल
KAPIL KAPILA KAPIL
अभनव
ABHINAV ABHINAVA ABHINAV
वपन
VIPIN VIPINA VIPIN
म क ल
MUKUL MUKULA MUKUL
3)
Rule3:
The observation of Table V gives the rule that if proper n
oun ends with a consonant then ‘
A
’ should be
removed from last position in English spelling.
TABLE
VI
S
AMPLE
4
(
TOTAL
45
STRINGS TESTED
)
HindiString
ExpectedTransliteration
ThroughMapping
Rule BasedTransliteration
वमल
VIMLA VIMALAA VIMLA
पदम
PADMA PADAMAA PADMA
ब शर
BUSHRA BUSHARAA BUSHRA
बलदव
BALDEV BALADEVA BALDEV
4)
Rule4:
The observation of Table VI gives the rule that if two consonants occur in succession, latter consonant followedby a vowel in a proper noun and index of first consonant
should be greater than 1, then ‘
A
’ is removed from first
consonant during transliteration.
TABLE
VIIS
AMPLE
5
(
TOTAL
45
STRINGS TESTED
)
HindiString
ExpectedTransliteration
ThroughMapping
Rule BasedTransliteration
आसफ
AASIF AASIPHA AASIF
आमन
AAMNA AAMANAA AAMNA
हमन
HIMANI HIMAANII HIMANI
गरम
GARIMA GARIMAA GARIMA
5)
Rule5:
The observation of Table VII gives the rule that if proper noun begins with
‘
आ
’
it is replaced
with ‘AA’, and i
f Hindi WordTransliterationRulesEnglish OutputSpecialSpellingsDatabaseMapping TablesDatabase Look Up
Fig. 1. Program Flow of the Transliteration system
(IJCSIS) International Journal of Computer Science and Information Security,Vol. 10, No. 8, August 201220http://sites.google.com/site/ijcsis/ISSN 1947-5500