Analytics
Session 12
Dr. Mohit Malhan
80% of Data is Unstructured
• Database notes
• Call center transcripts
• Other CRM
• Email
• Open-ended survey responses
• Web pages
• News Groups
• Documents themselves
• Competitive information
• Reviews, tweets, comments
• Photos, videos, infographics
This means that decision-makers often rely on only 20% of the available data.
What is Text Analytics?
Types of Text
• Dynamic Text
• Static Text
Types of Text
• Dynamic Text
• Dynamic text is real-time, user-generated text that expresses an
opinion about content or information posted on social media.
• Static Text
Examples:
• Wiki content
• A blog page
• Word documents
• Corporate reports
• Electronic mail (e-mail)
• News transcripts
Copyright © 2018 Gohar F. Khan
Deployment Models
• On-premise model
• It is a comparatively expensive option, but it provides extra
security and control.
Key players
Applications for Text Analysis
• Social media data
• Surveys
• ‘Reading’ email
• Call centre data
• Abstracts
• Document management
• Corporate history
• Scientific publications
• Thematic understanding of website
• Database notes
Purpose of Text Analytics
• Sentiment Analysis
• Intention Mining
Purpose of Text Analytics
• Sentiment Analysis
• Sentiment analysis analyzes and categorizes social
media text (mostly dynamic text) as being positive,
negative, or neutral.
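A minimal sketch of the idea, assuming a lexicon-based approach (one of several possible techniques, not necessarily the one used by any tool named in these slides). The word lists `POSITIVE` and `NEGATIVE` and the function `classify_sentiment` are hypothetical and purely illustrative.

```python
import re

# Hypothetical mini-lexicons; a real project would use a much larger, curated list.
POSITIVE = {"good", "great", "love", "excellent", "happy"}
NEGATIVE = {"bad", "terrible", "hate", "poor", "slow"}


def classify_sentiment(text):
    """Label text as 'positive', 'negative', or 'neutral' by counting lexicon hits."""
    tokens = re.findall(r"[a-z']+", text.lower())
    # Net score: positive hits minus negative hits.
    score = sum(t in POSITIVE for t in tokens) - sum(t in NEGATIVE for t in tokens)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


print(classify_sentiment("I love this phone, great camera"))
```

Counting words like this ignores negation and sarcasm ("not good"), which is one reason production systems usually rely on trained models instead.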
Purpose of Text Analytics
• Intention Mining
• Intention or intent mining aims to discover users’ intentions
(such as buy, sell, recommend, quit, desire, or wish) from
social media text.
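A rough illustration of intent mining, assuming simple cue-phrase matching. The intention labels come from the slide; the cue phrases in `INTENT_CUES` and the function `detect_intents` are hypothetical examples, not a validated resource.

```python
import re

# Hypothetical cue phrases per intention; illustrative only.
INTENT_CUES = {
    "buy": ["want to buy", "looking to buy", "going to order"],
    "sell": ["for sale", "want to sell", "selling my"],
    "recommend": ["recommend", "you should try"],
    "quit": ["cancel my", "switching to", "done with"],
}


def detect_intents(text):
    """Return the sorted list of intentions whose cue phrases appear in the text."""
    normalized = re.sub(r"\s+", " ", text.lower())
    return sorted(
        intent
        for intent, cues in INTENT_CUES.items()
        if any(cue in normalized for cue in cues)
    )


print(detect_intents("Selling my old bike, would recommend the brand"))
```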
Purpose of Text Analytics
• Trends Mining
• Trends mining, also known as predictive analytics,
is used to predict future events.
• Concept Mining
• Concept mining aims to extract ideas and concepts from
documents.
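To make the trends-mining idea concrete, here is a small sketch that assumes the "trend" is the number of mentions of a topic per period and that a straight-line fit is adequate. Both are simplifying assumptions, and `forecast_next` is a hypothetical helper name.

```python
def forecast_next(counts):
    """Fit a least-squares line to per-period counts and extrapolate one period ahead."""
    n = len(counts)
    if n < 2:
        raise ValueError("need at least two periods")
    xs = range(n)
    mean_x = sum(xs) / n
    mean_y = sum(counts) / n
    # Ordinary least-squares slope and intercept.
    slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, counts)) / sum(
        (x - mean_x) ** 2 for x in xs
    )
    intercept = mean_y - slope * mean_x
    return intercept + slope * n


# Weekly mention counts for a hypothetical hashtag.
print(forecast_next([10, 20, 30, 40]))
```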
Text analytics mechanism
[Figure: text analytics mechanism. Labels: Data Collection, Text, Surveys, Operational Systems, Customer Data, Concept Maps, Clustering, Categorization, Prediction, Attitudes, Attributes, Attract, Grow, Fraud, Expert UI, Business UI, Business User]
Supervised vs. Unsupervised Learning
Steps in Text Analytics
• Source Identification
  • Dynamic text: tweets, comments, reviews
  • Static text: wiki content, blogs, websites, reports
• Text Parsing & Filtering
  • Stemming
  • Parts of speech
  • Named entities extraction
  • Stop words
  • Filtering
• Text Transformation
  • Terms count
  • Frequency count
  • Co-occurrence metrics
• Text Mining
  • Clustering
  • Classification
  • Association analysis
  • Predictive analysis
  • Sentiment analysis
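The parsing, filtering, and transformation steps can be sketched with the Python standard library alone. This is a simplified illustration: `STOP_WORDS` is a tiny made-up list, `naive_stem` is a crude stand-in for a real stemmer, and all function names are hypothetical.

```python
import re
from collections import Counter
from itertools import combinations

# Tiny illustrative stop-word list (an assumption, not a standard list).
STOP_WORDS = {"the", "a", "an", "is", "are", "and", "of", "to", "in", "it"}


def naive_stem(token):
    """Very rough suffix stripping; a stand-in for a real stemming algorithm."""
    for suffix in ("ing", "ed", "es", "s"):
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def preprocess(text):
    """Parsing and filtering: tokenize, drop stop words, stem."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return [naive_stem(t) for t in tokens if t not in STOP_WORDS]


def transform(documents):
    """Transformation: term counts and within-document co-occurrence counts."""
    term_counts = Counter()
    cooccurrence = Counter()
    for doc in documents:
        terms = preprocess(doc)
        term_counts.update(terms)
        # Count each unordered pair of distinct terms once per document.
        for pair in combinations(sorted(set(terms)), 2):
            cooccurrence[pair] += 1
    return term_counts, cooccurrence


counts, pairs = transform(["The battery is draining", "Battery drains in the cold"])
print(counts.most_common(2))
print(pairs[("battery", "drain")])
```

The resulting counts are the kind of numeric representation that the text mining step (clustering, classification, and so on) takes as input.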
Steps in Text Analytics
• Association
• Association or association rule mining is a data-mining
technique used to find frequent patterns, correlations,
associations, or causal structures from data sets.
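As a small worked example, the sketch below computes support and confidence for one-to-one rules over sets of terms. It assumes each document has already been reduced to a set of terms; the function name `association_rules` and the thresholds are illustrative choices, and real association rule miners (for example, Apriori-style algorithms) also handle larger itemsets.

```python
from itertools import combinations


def association_rules(transactions, min_support=0.5, min_confidence=0.6):
    """Return (antecedent, consequent, support, confidence) for single-item rules."""
    n = len(transactions)
    sets = [set(t) for t in transactions]
    items = sorted(set().union(*sets))

    def support(itemset):
        # Fraction of transactions that contain every item in the itemset.
        return sum(itemset <= s for s in sets) / n

    rules = []
    for a, b in combinations(items, 2):
        pair_support = support({a, b})
        if pair_support < min_support:
            continue
        for antecedent, consequent in ((a, b), (b, a)):
            confidence = pair_support / support({antecedent})
            if confidence >= min_confidence:
                rules.append((antecedent, consequent, pair_support, confidence))
    return rules


docs = [
    {"battery", "drain", "phone"},
    {"battery", "drain"},
    {"phone", "screen"},
    {"battery", "phone"},
]
for rule in association_rules(docs):
    print(rule)
```

A rule such as drain → battery with confidence 1.0 says that every document mentioning "drain" also mentions "battery". It indicates co-occurrence and does not by itself establish causation.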
Steps in Text Analytics
• Classification
• Classification finds similarities among documents and assigns
them to predefined labels based on the themes they contain.
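One common way to assign predefined labels is a naive Bayes classifier. The sketch below is a minimal version with Laplace smoothing, offered as an illustration and not as the method the slides prescribe. The training examples, the labels "billing" and "technical", and the function names are all made up.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    return re.findall(r"[a-z]+", text.lower())


def train(labeled_docs):
    """Count words per label from (text, label) pairs."""
    word_counts = defaultdict(Counter)
    label_counts = Counter()
    for text, label in labeled_docs:
        label_counts[label] += 1
        word_counts[label].update(tokenize(text))
    vocabulary = {w for counts in word_counts.values() for w in counts}
    return word_counts, label_counts, vocabulary


def classify(text, model):
    """Pick the label with the highest log-probability (Laplace-smoothed naive Bayes)."""
    word_counts, label_counts, vocabulary = model
    total_docs = sum(label_counts.values())
    best_label, best_score = None, -math.inf
    for label in sorted(label_counts):
        score = math.log(label_counts[label] / total_docs)
        denom = sum(word_counts[label].values()) + len(vocabulary)
        for token in tokenize(text):
            if token in vocabulary:  # words never seen in training are ignored
                score += math.log((word_counts[label][token] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


training = [
    ("refund not received for my order", "billing"),
    ("charged twice on my card", "billing"),
    ("app crashes on login", "technical"),
    ("screen freezes after update", "technical"),
]
model = train(training)
print(classify("my card was charged", model))
```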
What you can do with it
Social Media Text Analysis Tools
• Lexalytics: Lexalytics (http://www.lexalytics.com/) is a social media text and semantic analysis
tool for social media platforms, including Twitter, Facebook, blogs, etc.
Social Media Text Analysis Tools
• Netlytic: Netlytic (https://netlytic.org) is a cloud-based text and social
network analytics platform for social media text that discovers social
networks from online conversations on social media sites.
• LIWC: Linguistic Inquiry and Word Count (LIWC) is a text analysis tool for
analyzing emotional, cognitive, structural, and process components
present in individuals’ verbal and written speech samples:
http://www.liwc.net/
Text Analytics Issues
Thanks