Professional Documents
Culture Documents
I. INTRODUCTION
Natural language processing is an area of computer
science, artificial intelligence, and language concerned
with
the
interactions
between computers
system and human like (natural) languages. NLP is
related to the field of computer-human interaction.
Natural language processing refers to computer systems
that analyse, easy to understand, or produce one or more
languages for human, such as English, Hindi, Punjabi,
Japanese, Italian, or Russian. Many problems in NLP
involve natural language understanding - that is, enabling
computers to derive particular meaning from human or
natural language input. Natural Language Processing has
various Tasks:-Part-of-Speech Tagging, Chunking,
Named Entity Recognition, Semantic Role Labelling,
Languages Models, Semantically Related Words
(Synonyms) [1].
Machine Translation is the branch of natural language
processing which strives to convert natural languages
(such as Hindi, English etc.) to another natural language
by the use of machines. It is the field of computational
www.ijsret.org
785
International Journal of Scientific Research Engineering & Technology (IJSRET), ISSN 2278 0882
Volume 3, Issue 4, July 2014
g (x )= wi k ( x , zi )
i 1
Examples
Global India
President Pranab Mukherjee,
Chandigarh, Mount Everest
three fifty a m, 12:30 p.m.
$567,175 million Canadian
12-06-1991, June
25.22 %, fifty pct,
Stonehenge Washington
www.ijsret.org
786
International Journal of Scientific Research Engineering & Technology (IJSRET), ISSN 2278 0882
Volume 3, Issue 4, July 2014
III. METHODOLOGY
EXPERIMENTAL SETUP
AND
Tokenization
Change in
Classifier
form
Training
5648
Testing
5566
Correct
4633
IV. EVALUATION
Make a NER
model by
machine
learning
Analysis of
NER model
Accuracy
Precision
Translation
Model
A) Evaluation Matrices
1) Accuracy: - It is the main objective of our system to
find the correct name entities from the data set and
translate effectively of that name entities. To find the
quality of output, this formula is usedAccuracy (%) =correct words/Total Name entities *100
Language
Model
Target Text
www.ijsret.org
787
International Journal of Scientific Research Engineering & Technology (IJSRET), ISSN 2278 0882
Volume 3, Issue 4, July 2014
96.35%
LOC, ORG
97.85%
Fig 3- Evaluation of metrics
Table 3 - RR09-Shuf-unbiased-E65
NER
Precision
Recall%
FB1%
%
Person,
85.55%
87.01%
90%
MISC
LOC,OR
G
\
86.06%
87.35%
V. CONCLUSION
In our research work, we formulated a Support Vector
Method for supervised learning with structured and
interdependent outputs. It is based on a joint feature map
over input/output pairs, which covers a large class of
interesting models including weighted context-free
grammars. To solve the resulting optimization problems,
we proposed a simple and general text mining or NLP
approach for extract entity and translate it .SVM given
better results than others.
86.70%
REFERENCES
96.55%
LOC, ORG
98.03%
Table 5- ZJ03-Permute-biased-E100-R1e-5-EPH
NER
Precision% Recall%
FB1%
Person,
MISC
LOC,ORG
\
89.56%
85.76%
86.89%
83.26%
80.43%
80.92%
www.ijsret.org
788
International Journal of Scientific Research Engineering & Technology (IJSRET), ISSN 2278 0882
Volume 3, Issue 4, July 2014
[19]https://www.google.co.in/#q=LANGUAGE+INDEP
ENDENT+NAMED+ENTITY+RECOGNITION.
[20] Brahmaleen K. Sidhu,Arjan Singhand Vishal Goyal,
Identification of Proverbs in Hindi Text Corpus and
their Translation into Punjabi,JOURNAL OF
COMPUTER SCIENCE AND ENGINEERING,
VOLUME 2, ISSUE 1, JULY 2010.
[21] Thoudam Doren Singh, Kishorjit Nongmeikapam,
Asif Ekbal and Sivaji Bandyopadhyay, Named Entity
Recognition for Manipuri Using Support Vector
Machine, 23rd Pacific Asia Conference on Language,
Information and Computation, pages 811818.
[22] Georgios Paliouras, Vangelis Karkaletsis, Georgios
Petasis and Constantine D. Spyropoulos, Learning
Decision Trees for Named-Entity Recognition and
Classification,
Institute
of
Informatics
and
Telecommunications, NCSR Demokritos, 15310.
[23]http://en.wikipedia.org/wiki/Natural_language_proce
ssing
[24] Kamal Deep and Vishal Goyal, DEVELOPMENT
OF A PUNJABI TO ENGLISH TRANSLITERATION
SYSTEM, International Journal of Computer Science
and Communication Vol. 2, No. 2, July-December 2011,
pp. 521-526.
[25] Yunita Sari, M. Fadzil Hassan, Norshuhani Zamin,
A Hybrid Approach to Semi-Supervised Named Entity
Recognition in Health, Safety and Environment
Reports,International Conference on Future Computer
and Communication 2009 IEEE.
www.ijsret.org
789