Professional Documents
Culture Documents
Madhav Gopal
Centre for Linguistics, SLL & CS,
J.N.U., New Delhi
mgopalt@gmail.com
Keshav Niranjan
Department of Computer Science,Deshbandhu College,
University of Delhi, New Delhi, India
keshav.niranjan@gmail.com
Abstract
In this paper we present the language divergence in Sanskrit to Hindi Machine Translation, concentrating on the
first, second and third person pronouns. Sanskrit uses many strategies for encoding things; one idea/information
can be expressed by many ways, and this is evident from our pronoun study also. Sanskrit and Hindi, though
closely related, have many typological and lexical differences regarding these usages. We attempt to map these
things in this article to facilitate Machine Translation of Sanskrit pronouns into Hindi. As the scope of our study
is limited to pronouns only, our focus is on the structural divergence which is prevalent in this regard.
Keywords: Language divergence, Pronouns, Machine Translation, pronoun compounding.
1. Introduction
Language divergence (Dorr, 1993; Dave et. al. 2002)
refers to the differences in lexical and syntactic
choices that languages make in expressing ideas. It
occurs when the underlying concept of a sentence
gets manifested differently in different languages.
Sanskrit
Rendering
(2)
yat
REL.NEU.3SG
asmadya-m
na
hi
tat
1POSS-NEU.SG
par-em.
other-MAS.3PL.GEN
What is ours cannot be others. <vaikputra- kath,
mtsp, PT >
Herein, asmadya is in neutral gender due to its
agreement with the neutral antecedent yat.
jo hamara hai vo dusare ka nahi ho sakata.
In this example the pronominal adjective of Sanskrit
with neuter gender changes into a cardinal pronoun
with genitive and masculine attributes. The neuter
gender is not realized in Hindi consequently this
divergence takes place.
(3)
madya-
namaskra-
1POSS-M.SG greeting-M.SG
vcya-
communicable-M.SG
bagavata
instance ,
lord/M.SG.GEN
(1)
ayam
asmadya-
DEM.PRX.M.SG.NOM
1POSS-M.SG.NOM
bndhava-.
brother-M.SG.NOM
This is our brother. <mrkhapaita-kath, apkr,
PT>
Herein, the form asmadya is agreeing with the
possessed noun bndhava with its case, gender and
number features.
yaha hamara bhai hai.
the roots asmat (used for to refer more than one), and
mat (used for singular) are used and they serve as the
PT>
(6)
atra ca
vttim
meaning.
stipend.F.ACC consumer-PL.GEN
mad-da-ttm
bhujn-nm
paca-at
(4)
pait-nm
pundit-PL.GEN
tiha-ti.
five-hundred sit-SG.PRS
adya asmat-svm
pigalaka-
bhta-
bhta-parivra-
ca
vartate
paNdita hain.
(7)
madya-bhr-ea
ati-rnta-
1POSS.SG-burden-M.SG.INS
very-tired-M.SG
tvam
you.NOM
asmad-bhpate
ignorance 1POSS-king.SG.GEN
utsarga-m
car-an
excretion-ACC conduct-PRPL
cirbha-bhkaam karoti.
cucumber-eating
do-3SG.PRS
yat
that
purastool-
inflect
for
all
cases
except
vocative.
mtsp, PT >
kath,
(10)
na asti s
not is
tul
scale.F.SG.NOM
(8)
tvady-
3SG.DEF.F 2POSS.SG-F.SG.NOM
we
forest-dweller.M.PL
yumadya-m
ca
2POSS.PL-N.SG and
jalnte
water-inside home-N.SG
men hai.
in Hindi.
in Hindi.
(11)
yumad-darana-mtra-anurakta-y
2POSS.PL-appearance-only-infatuated-F.INS 1SG.INS
tm prada-tta
self
(9)
tat
rtrau
api
ayam.
give-PSPL.M this
tvadya-m
(12)
maa-m
tvad-varjam
tyaktv
anya bhart
monastery-M.SG.ACC leave.PSPL
2POSS.SG-barring other
anyatra
api
me na
EMPH
mahe
may
y-symi.
husband
bhav-iyati
manas-i
mind-SG.LOC
iti.
(13)
kintu tvat-prrtan-siddhy-artyam
but
sarasvati-
2POSS.SG-prayer-completion-for Saraswati-
vinodam
kar-iymi
entertainment
do-1SG.FUT
(16)
sa
mtbd, PT >
api
na
EMPH
not came
(14)
te
mtula-
yata ?
aho gaja
ayam yumat-kula-atru
kat, mtbd>
2POSS.PL-family-enemy
(15)
adjectives.
yadyapi tvadya-vacanam
na
karoti,
(17)
bhavadya-shas-ena
still
master self-fault-destuction-SG.DAT
vcya.
aham
tua.
speakable
Though the master does not pay any heed to your
words, still for the destruction of your own faults you
should speak to him. <dantilagorambhayo kath,
mtbd, PT>
3. Conclusion
In this paper we have presented Sanskrit pronouns
and the divergence that take place during their
translation into Hindi. The Hindi pronouns, not that
sophisticated
morphologically,
have
complex
A dhyy
ca,
adhish sthsm
stras
kldhvanoratyanta
(like
akathitam
sa yoge,
Gruyter.
in
this
language
pair.
Also,
Gopal, M.:
system.
System. SISSCL.
Habash,
N.
and Dorr, B.
Translation
(2002),
Divergences:
Handling
Combining
References
Structures
of
Indian
Languages,
Translation
Machine
Munich: Lincom-Europa.
Translation
and
Language
Divergence,
B.
(1994),
Classification
of
Machine
Appendix 1
Abbreviations used:
ACC:
Imperative,
IMPF:
Imperfect,
INDEF:
Indefinite
PRPL:
PRX:
ldpr:
aparkitakrakam
labdhapraam,
apkr: