Professional Documents
Culture Documents
DR.VMS
Sentiment analysis
A technique used to interpret and classify emotions in subjective data. Sentiment analysis
is often performed on textual data to detect sentiment in emails, survey responses, social
media data, and beyond.
Text classification
NLP is used to teach a machine how to read and understand human languages. Trained
machines can extract the relationships between words, identify the entities in a sentence
(i.e., entity-recognition), etc.
Tokenizing
Breaking up a stream of characters into words, punctuation marks, numbers and other
discrete items.
Parts of speech
corpus is a collection of data selected with a descriptive or applicative aim as its purpose
a corpus must possess a common set of fundamental properties, including
representativeness, a finite size and existing in electronic format.
The linguistic data consortium
Founded in 1992 and based at the University of Pennsylvania in the United States, this
research and development center is financed primarily by the National Science
Foundation (NSF). Its main activities consist of collecting, distributing and annotating
linguistic resources which correspond to the needs of research centers and American
companies which work in the field of language technology. The linguistic data consortium
(LDC) owns an extensive catalog of written and spoken corpora which covers a fairly
large number of different languages.
LFG-GPSG
In LFG one parses sentences and builds up functional structures, in GPSG sentences are
parsed and translated into formulas of intentional logic, hardly anyone knows how to
generate from f-structures or from logical formulas
LFG
Lexical Functional Grammar arose in the late 1970’s through the collaboration of Joan
Bresnan (a linguist) and Ronald Kaplan
Lexical Functional Grammar emphasizes analysis of certain phenomena in lexical and
functional terms,
LFG-Lexical Functional Grammar
https://www.link.cs.cmu.edu/cgi-bin/link/construct-page-4.cgi#submit
https://demo.allennlp.org/dependency-parsing/MjYwODE5Ng==
Pronunciation
structure of words in their written (graphemic) form and spoken (phonemic) form. It has two
forms namely inflection and derivation.
Inflection:
It is related to the grammatical function of words of the same part of speech;
e. g. the paradigm of the verb play as:
Play, plays, played, playing
Derivation:
It is related to the production of new words of different parts of speech;
e. g. nation - (a noun )
national- (an adjective )
nationalize- ( a verb )
Morphological Analyser
A morphological analyzer can extract the base forms from inserted documents in
computers.
The applications which are achieved in this respect are:
a: hyphenation (segmenting words into their morphs),
b: spelling correction,
c: stemming which reduces the related words as possible. The problem of such
computational programs is the input which should be very broad. Other forms of
application are parsing and generating natural language utterances in written or spoken
form and machine translation. (Trost, 2006)
Syntax