Professional Documents
Culture Documents
Business
Opportunities
Varun Singhal
Swayam Prakash Pal
• Client (Manufacturer A) is a leading Food & Beverage manufacturer.
• Client wants to understand the growth patterns of consumer
preferences (themes) and evaluate positioning of their brand across
different themes.
• Client also wants to identify the sales drivers of their products.
Problem Statement
• Three major data sources have been provided to us
Social Media Data
Google Search Data
Sales Data
• Number of themes across each data source
Social Media- 194 unique themes
Google Search- 160 unique themes
Sales Data- 49 unique themes
Data Preparation
• Themes present across all data sources
Common themes – 30.
Example- sea salt, low carb, crab, salmon etc.
Data Preparation
Preference-Sales Data Preferred-Search Data
20%
23% 19%
Preferred-Social Data
Hypothesis Testing
Themes that reject null hypothesis Themes that accept null hypothesis
Hypothesis Validation
Soy Foods Mackerel
Hypothesis Proof
• Latency period is calculated for the themes that are common between all the 3 data sources i.e.
social media, google search and sales data
• Latency is calculated based on a weighted average approach using total posts, search volume
and sales units value
• Mean latency between social media to search data is around 41 days.
• Mean latency between search data to sales data is around 52 days.
• Analysis on the data sources show that some themes appear first on social media and then on
google search and vice-versa. Same follows between search and sales data
• This affects the latency period across different themes
Social-Search search-social
250 236 400 375
349
350
200
300
159 246
150 143 138 250 225 214
200 166 178 166
101 148 160
100 150 130
68 90 88
Days
100 83
50 42 46 47 46
50 38 24 20
0 0
c y r s r s c l c
s lth g al
t el na um ifi ree ian rr ga ue es are en ge ve ti rab on arb la rie ch ein ti
fo
od a ag
in s ker tu di pec o f tar ebe su bisq rtn rep hick bur ati ebio c alm w c ha calo pea rot exo
he ck se
a ac o t s gm ege blu low ch /ale o-p c am ser pr
v s lo p
so
y
ne a m w
s
no v n y -t h re lo
w of ic &
bo l -p lo l - re
f er a s g y f
e /p ce
r th
n
ica i ca e n -e be ves s ou e
th h /
e et ce iti gh
i en a dd hi
Themes en no
onv
c
sales-search search-sales
350 250
209
293 200 186 187 180
300 168
154 158
150
250
214 99
100 71
200 50
166 166 50 40
14 15 26
9 2 4
150
Days
131
0
c e n ry ar e ss re n er s el l
la rie na ch ein
100 78 77 e cifi fre aria ber sug isqu tne pa icke urg tive ker ha alo tu pea rot
p o t e b r re ch b rva ac c
46 47 t s gm ege blu low ch /ale o-p m e m w of
p
50 - no v
ren rgy sy-t f ha res lo ce
20 l f e a e /p ur
9 ica en - e be ves so
h /
0 et ce iti gh
lt en dd hi
ds al
th ng tic ab on sa rb try oti
c
um i a
oo gi io cr m ca ul di en no
y f h e
ck
a e b
s al ea w po ex so nv
so ne pa pr s lo
ic
& w co
bo l - hn lo
ca et Themes
thi
e
Modelling technique
low carb Salmon