3 - Working With Text Data

Uploaded by

Ansruta Mohanty

0% found this document useful (0 votes)

4 views9 pages

Original Title

3_Working with text data

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

4 views9 pages

3 - Working With Text Data

Uploaded by

Ansruta Mohanty

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 9

Search inside document

Practice problem

Natural Language Processing

Session 3

Madhuri Prabhala
Working with text data
Practice problem
Details on the dataset

Details on the dataset:

There are Tweets collected from different ids.
The discussion is about various government policies.
We want to understand the important topics of discussion.
We want to understand what the general perception on the topics is.
Please answer the below for the excel file given

1. What are the steps involved?

2. How many rows are there?
3. How many columns are there?
4. What are the names of the variables?
5. What is the type of each variable in the dataset?
6. How will you get the overall information about the dataset?
7. What do the values of the dataset look like?
8. Are there any missing values in the dataset?
9. Which is the column of interest?
Please answer the below for the excel file given

10. How many unique words are there in the text corpus?
11. What are the 10 most frequent words?
12. What are the 10 least frequent words?
13. Create a Word Cloud.
14. What are your observations?
15. What will you do next?
Sentiment Analysis

16. What are the different sentiment scores that can be calculated?
Use textblob and Vader
17. What is your take based on the sentiment scores
Topic Model

18. What are the topics of discussion?

19. How will you choose the optimum number of topics?
20. How many topics are optimum for this dataset?
21. What are the insights you can draw from the topics?
Topic Model – Assessing optimum number of topics

❑ Latent Dirichlet Allocation

o Each topic is a mixture of underlying words
o Each document is a mixture of underlying topics

❑ Perplexity
o Speaks of how well the model works on held-out data.
o Lower perplexity scores are considered better.

❑ Coherence
o How close to human intuition the identified topics are.
o There are multiple measures of coherence such as:
c_v, c_umass, c_npmi, c_a
o We choose models with higher number of coherence score.

Academic Writing Script 1
Document67 pages
Academic Writing Script 1
drummersun
No ratings yet
CSC 204 To Check Later
Document9 pages
CSC 204 To Check Later
Bashjr01
No ratings yet
A Paragraph Is A Unit of Text That Develops One Idea or Topic in Specific Detail
Document8 pages
A Paragraph Is A Unit of Text That Develops One Idea or Topic in Specific Detail
ferdzky
No ratings yet
Discourse Markers Thesis
Document4 pages
Discourse Markers Thesis
kathrynharrisvirginiabeach
100% (2)
Lesson Plan 8.0 - 115716
Document8 pages
Lesson Plan 8.0 - 115716
JA NE JA NE
No ratings yet
Writing Activities For Writing Level 3
Document3 pages
Writing Activities For Writing Level 3
Dennisse Álvarez
No ratings yet
FOM Chapter 1
Document52 pages
FOM Chapter 1
Nantha Kumaran
No ratings yet
Topic Sentence: Parts of A Paragraph
Document6 pages
Topic Sentence: Parts of A Paragraph
Dylan Liew
No ratings yet
Sentence Based Topic Modeling Using Lexical Analysis
Document7 pages
Sentence Based Topic Modeling Using Lexical Analysis
S. M. Mazharul Hoque Chowdhury
No ratings yet
Guiding Questions Fo EE Reflective Sessions
Document4 pages
Guiding Questions Fo EE Reflective Sessions
ATID Fabi Rodríguez de la Parra
No ratings yet
The Myth and Magic of Library Systems
From Everand
The Myth and Magic of Library Systems
Keith J. Kelley
Rating: 5 out of 5 stars
5/5 (1)
Memory Thesis Statement
Document6 pages
Memory Thesis Statement
WriteMyPaperApaFormatCanada
100% (2)
Guidelines in Writing Critique Paper
Document3 pages
Guidelines in Writing Critique Paper
Alexis Gee lawat
No ratings yet
1 27 2016 Tesol Observation Field Notes
Document9 pages
1 27 2016 Tesol Observation Field Notes
api-306853222
No ratings yet
NLP-Questions Class 10 Ai
Document8 pages
NLP-Questions Class 10 Ai
kritavearn
No ratings yet
Web Development from Beginner to Paid Professional: Coding Challenges and Solutions - The smartest way to learn html and css
From Everand
Web Development from Beginner to Paid Professional: Coding Challenges and Solutions - The smartest way to learn html and css
Bolakale Aremu
No ratings yet
Applied - 11 - Research in Daily Life 1 - semII - CLAS8 - Analyzing and Drawing Out Patterns and Themes With Intellectual Honesty - v2 PNS PDF
Document15 pages
Applied - 11 - Research in Daily Life 1 - semII - CLAS8 - Analyzing and Drawing Out Patterns and Themes With Intellectual Honesty - v2 PNS PDF
Anya Liggayu
No ratings yet
Simple guide to start a thesis
From Everand
Simple guide to start a thesis
lady rodriguez
No ratings yet
Cells
Document3 pages
Cells
api-105605905
No ratings yet
Math 5 2 Day 2
Document4 pages
Math 5 2 Day 2
api-300765248
No ratings yet
5.2 Natural Language Processing
Document43 pages
5.2 Natural Language Processing
punit mishra
No ratings yet
Infosys Company Profile:, Infosys Exam Cracking KIT
Document9 pages
Infosys Company Profile:, Infosys Exam Cracking KIT
mecitfuturedreams
No ratings yet
Modes of Writing: “A Beginner’S Tool for Writing Success”
From Everand
Modes of Writing: “A Beginner’S Tool for Writing Success”
Rose Hensle
No ratings yet
Giving Good Talks
Document26 pages
Giving Good Talks
anand.santosh
No ratings yet
Maher Zain Song On Big Family
Document4 pages
Maher Zain Song On Big Family
Sadiki Flta
No ratings yet
P.S.Senior Secondary School Class X - Artificial Intelligence - 2021-22 Natural Language Processing Question and Answers
Document7 pages
P.S.Senior Secondary School Class X - Artificial Intelligence - 2021-22 Natural Language Processing Question and Answers
mrprathyu13
No ratings yet
Systematic Inquiry
From Everand
Systematic Inquiry
Robert Morasky
No ratings yet
Essay Writing Skills: Planning Your Essay
From Everand
Essay Writing Skills: Planning Your Essay
Grant Andrews
Rating: 4.5 out of 5 stars
4.5/5 (14)
Structure of A Paragraph
Document60 pages
Structure of A Paragraph
Asad Khokhar
No ratings yet
Subjective Ai 417 2023
Document43 pages
Subjective Ai 417 2023
muskprincipal.2022
No ratings yet
Discourse Analysis Worksheet
Document3 pages
Discourse Analysis Worksheet
api-511118360
No ratings yet
Examination Pattern - Docx MAPC 1 Cognitive Psychology, Learning and Memory
Document8 pages
Examination Pattern - Docx MAPC 1 Cognitive Psychology, Learning and Memory
akileshaiyer
No ratings yet
Ielts General Training Reading Task Type 4 Matching Information 2
Document8 pages
Ielts General Training Reading Task Type 4 Matching Information 2
celina Tobarez
No ratings yet
Abstract Writing Handout
Document4 pages
Abstract Writing Handout
baryal
No ratings yet
Composition and Grammar
From Everand
Composition and Grammar
ENC1101 Editorial Board
No ratings yet
Revision Worksheet in Class Activity Rhetorical Analysis 2
Document2 pages
Revision Worksheet in Class Activity Rhetorical Analysis 2
api-582855279
No ratings yet
OMAC Data Analyst
Document91 pages
OMAC Data Analyst
Ahmed Elbaz
No ratings yet
Cognitive Approach to Natural Language Processing
From Everand
Cognitive Approach to Natural Language Processing
Bernadette Sharp
No ratings yet
IS14604 Effective Army Writing
Document20 pages
IS14604 Effective Army Writing
webdog77
No ratings yet
Jdavis Communicationskills
Document98 pages
Jdavis Communicationskills
sadeghm110
No ratings yet
A 10 minute intro to Dogme Business English
From Everand
A 10 minute intro to Dogme Business English
Phil Wade
Rating: 3 out of 5 stars
3/5 (2)
CGE 1000 English For Academic Studies (A) : Lecture 9 Academic Essay III
Document22 pages
CGE 1000 English For Academic Studies (A) : Lecture 9 Academic Essay III
Laxer
No ratings yet
How to Revise and Practice: Study Skills, #3
From Everand
How to Revise and Practice: Study Skills, #3
Fiona McPherson
Rating: 5 out of 5 stars
5/5 (1)
DailyDialog - Li Et Al - 2017
Document10 pages
DailyDialog - Li Et Al - 2017
Kyra Wang
No ratings yet
Efl Teaching Methodology and Curriculum
Document4 pages
Efl Teaching Methodology and Curriculum
Ripo Putra
No ratings yet
b2 First For Schools Preparing For Exam Success - Self Study Reading Activities
Document5 pages
b2 First For Schools Preparing For Exam Success - Self Study Reading Activities
Lucía Di Carlo
No ratings yet
Coping Mechanism of Stem Students Towards Calculus in Modular Learning
Document2 pages
Coping Mechanism of Stem Students Towards Calculus in Modular Learning
Raja Juliana Toledo
No ratings yet
Three Point Thesis Statement Example
Document6 pages
Three Point Thesis Statement Example
kyzosik1kov3
No ratings yet
Possible Synthesis Essay Prompts
Document6 pages
Possible Synthesis Essay Prompts
fc5g0qm1
67% (3)
Proofreading For Common Grammatical Errors
Document5 pages
Proofreading For Common Grammatical Errors
CristinaCris
100% (1)
Assignment INSAID
Document7 pages
Assignment INSAID
ShadabAkhtar
No ratings yet
90 Second Thesis
Document7 pages
90 Second Thesis
dwrxjhgr
100% (2)
Psyu 101 Course Tips
Document9 pages
Psyu 101 Course Tips
api-546565206
No ratings yet
Preparing For Seminars
Document3 pages
Preparing For Seminars
tomas
No ratings yet
Type Your Answers in Below Each Question/prompt.: Final Examination
Document4 pages
Type Your Answers in Below Each Question/prompt.: Final Examination
Lovinlifex4
No ratings yet
Writing - Feelings
Document3 pages
Writing - Feelings
Romina Barrera
No ratings yet
RDL - Module 8
Document19 pages
RDL - Module 8
Padz Maverick
No ratings yet
Untitleddocument
Document24 pages
Untitleddocument
api-315900731
33% (3)
Permutation: Fundamental Counting Principle: (Multiplication Rule)
Document17 pages
Permutation: Fundamental Counting Principle: (Multiplication Rule)
Junard Ceniza
No ratings yet
Your Group Can Be Composed of One, Two or Three Other Students
Document5 pages
Your Group Can Be Composed of One, Two or Three Other Students
Chu Minh Duc Nguyen
No ratings yet
NP Brochures-Market Research Apr2023
Document1 page
NP Brochures-Market Research Apr2023
Ansruta Mohanty
No ratings yet
#A1. Accelarate Your Career With MST Formula v.1.2 Final
Document36 pages
#A1. Accelarate Your Career With MST Formula v.1.2 Final
Ansruta Mohanty
No ratings yet
3 Topic Models
Document15 pages
3 Topic Models
Ansruta Mohanty
No ratings yet
1 - Introduction To NLP
Document19 pages
1 - Introduction To NLP
Ansruta Mohanty
No ratings yet