Professional Documents
Culture Documents
WordNet
Presented By :
Arushi Gaur (15UCS026)
Sonal Jain (15UCS054)
Sakshi Sachar (15UCS117)
Priyanka Sharma (15UCS170)
The Problem
In today's world of digital communication where people use various social media
platforms like twitter for fast and frequent conversations, they use various
abbreviations and phrases. These words don’t exist in any dictionary and it is very
inconvenient for novice people to search and understand the meanings of such words.
There is a set of research papers called "Five Papers" which present the
development and the implementation of WordNet.We have referred the following
research papers for our analysis which helped us in understanding the evolution and
the implementation of the WordNet.
A. Lyk- like
B. Talkig- talking
C. Havin- having
Issues Faced
1) Searching Dataset : We were finding it difficult to find tweets in which these
abbreviations and phrases were used. Mostly were short forms of the words , for
eg. words formed by removing vowels, but there were very less abbreviations or
phrases. We then found conversational dataset made public by Microsoft and
analysed it.
2) WordNet has limited words. For eg. it does not include pronouns, interrogative
words, determiners, conjunctions and prepositions. So while checking in
WordNet, common words like what,when,the etc. were not existing.
3) Context: Difference in the context of the words . For eg : ‘I’ used as personal
pronoun in a tweet but exists as Iodine in WordNet.
Plan for next Semester
Our plan for next Semester is:
1. We will map the first category of words, that exists in standard dictionaries to
the wordnet by taking their meaning from oxford and mapping them to the
specific synset they belong like noun, adjectives, adverbs and verbs. And for
special category of words which doesn’t exist in wordnet like determiners,
prepositions, pronouns, conjunctions, and articles we will add a category in
wordnet and map those words in wordnet accordingly.
2. For second type of words, which are actually abbreviations, phrases and short
forms(like ttyl which means talk to you later) which don’t exists in any
dictionary with well defined meanings are to be mapped to wordnet by entering
their meanings manually and making a special category for phrases and short
forms.