Professional Documents
Culture Documents
Components of nlp
Entity extraction--extracting entities like person,organisation,geographies,events
etc.
Syntactic analysis--proper ordering of words.(where as symantic means weather
sentense forming proper meaning )
Pragmatic analysis-- extracting information from text.
POS taggers--piece of software that reads and assign parts of speech to each word.
A corpus is a large and structured set of machine-readable texts that have been
produced in a natural
Brown corpus,
features of a text corpus in NLP
a. Count of the word in a document
b. Vector notation of the word
c. Part of Speech Tag
d. Basic Dependency Grammar
conversational interface
mixes voice, chats,with images,videos etc.
Masked language modelling is the process in which the output is taken from the
corrupted input.(help master down stream task)
N-gram in NLP is simply a sequence of n words, and we also conclude the sentences
which appeared more frequently.helps
predicting next word.
A corpus is a large and structured set of machine-readable texts that have been
produced in a natural