NLP stands for Natural Language Processing. Natural language processing is a subfield of computer science,
information engineering, and artificial intelligence concerned with the interactions between computers and human
languages, in particular how to program computers to process and analyse large amounts of natural language data.
NLP Applications:-
Spam Detection – You may have noticed that Gmail now shows far less spam than it did a few years back.
Gmail also sorts your mail into tabs such as Social and Promotions. All of this happens using NLP: Google
analyses the content of each mail and files it under the matching category.
POS tagging (Part-of-Speech tagging) – Identifies which terms in a sentence are nouns, adverbs, adjectives or
pronouns.
NER (Named-Entity Recognition) – Identifies which parts of a sentence refer to people, organisations, dates,
locations, events etc.
Sentiment Analysis – Analysing customer reviews for sentiment and extracting insights. Basically, marking
reviews as positive, negative or neutral automatically, based on sentence analysis.
Lexical Analysis – The process of converting a stream of individual characters (normally arranged as lines)
into a sequence of lexical tokens (tokenization, for instance into the "words" and punctuation symbols that
make up source code) to feed into the parser.
Corpus Analysis is the study of language as expressed in corpora (bodies) of "real world" text.
Lexicon Analysis is the study of words and their meanings. Example :- an English dictionary, a Spanish dictionary
etc.
Machine Translation – Translating text from one language to another, including identifying (auto-detecting) the
source language. Example :- Google Translate
Information Extraction – You may have noticed that Google reads your emails and messages and adds events to
your calendar. That is information extraction running in the background using NLP.
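Three of the ideas listed above (tokenization, corpus word counts and lexicon-based sentiment marking) can be sketched in a few lines of plain Python. This is only a minimal illustration, not how Gmail or any production system works: the word lists POSITIVE and NEGATIVE, the sample reviews and all function names are made up for this example, and real sentiment analysis uses much larger lexicons or trained models.

```python
import re
from collections import Counter

# Hypothetical toy lexicon (assumption); real lexicons hold thousands of entries.
POSITIVE = {"good", "great", "love", "excellent", "happy"}
NEGATIVE = {"bad", "poor", "hate", "terrible", "slow"}

# A word (optionally with an apostrophe part, e.g. "don't") or one punctuation symbol.
TOKEN_PATTERN = re.compile(r"\w+(?:'\w+)?|[^\w\s]")


def tokenize(text):
    """Lexical analysis: turn a character stream into lowercase tokens."""
    return [token.lower() for token in TOKEN_PATTERN.findall(text)]


def word_frequencies(documents):
    """Corpus analysis: count word tokens (punctuation skipped) over all documents."""
    counts = Counter()
    for document in documents:
        counts.update(t for t in tokenize(document) if t[0].isalnum())
    return counts


def sentiment(text):
    """Lexicon-based sentiment: positive hits minus negative hits."""
    tokens = tokenize(text)
    score = sum(t in POSITIVE for t in tokens) - sum(t in NEGATIVE for t in tokens)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


if __name__ == "__main__":
    reviews = [
        "Great phone, I love the camera!",
        "Terrible battery and slow charging.",
        "It arrived on Monday.",
    ]
    for review in reviews:
        print(sentiment(review), "|", tokenize(review))
    print(word_frequencies(reviews).most_common(3))
```

Counting lexicon hits is the simplest possible scheme. It ignores negation ("not good") and context, which is exactly the kind of ambiguity discussed in the next part.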
Why is NLP hard? Math is the only universal language. That is the primary reason why all machines, at the
ground level, run on mathematics only.
Language is ambiguous. Words have multiple meanings and synonyms, and their meaning changes based on context.
This is the primary reason why NLP is hard, and research is ongoing to make this processing easier and to make
conversations between machines and humans smoother and more understandable for both.
The main problem with NLP is that of perception: the same sentence can be perceived differently by different
people, and this holds true even for machines. Considering the example mentioned above, a machine too will have
a similar interpretation of the sentence.
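To make the ambiguity point concrete, here is a rough sketch of picking a word's meaning from its context by counting overlapping cue words. The SENSES table, its cue words and the function name guess_sense are hypothetical and exist only for this illustration. Real word-sense disambiguation relies on large sense inventories or trained models.

```python
import re

# Hypothetical mini sense inventory (assumption): each sense lists a few cue words.
SENSES = {
    "bank": {
        "financial institution": {"money", "loan", "account", "deposit"},
        "side of a river": {"river", "water", "fishing", "shore"},
    }
}


def guess_sense(word, sentence):
    """Return the sense whose cue words overlap most with the sentence, or None."""
    context = set(re.findall(r"\w+", sentence.lower()))
    best_sense, best_overlap = None, 0
    for sense, cues in SENSES.get(word, {}).items():
        overlap = len(context & cues)
        if overlap > best_overlap:
            best_sense, best_overlap = sense, overlap
    return best_sense  # None when the context gives no cue at all


if __name__ == "__main__":
    print(guess_sense("bank", "She opened an account at the bank."))
    print(guess_sense("bank", "We sat on the bank of the river."))
    print(guess_sense("bank", "I went to the bank."))
```

The last call has no cue word in its context, so the sketch returns None. A human reader would also need more context there, which is the perception problem described above.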