You are on page 1of 8

t

IIR178

os
rP
Using Big Search Data

yo
ISSUE 32

FIRST
QUARTER
2017
to Map Your Market
op
Marketing in a Digital Age
tC

Bernd Skiera
Daniel M. Ringel
No

This article is part of IESE Insight Review, issue 32, first quarter 2017. DOI:10.15581/00.ART-2982-E.

Copyright © IESE Business School, 2017. All rights reserved

Articles and back issues can be purchased on the web at www.iesep.com


Do

To reproduce or transmit one or more IESE Insight by electronic or mechanical means (including
photocopying or archiving in any information storage or retrieval system) requires written
permission. Ask for permission writing us to iesep@iesep.com

Posting of full-text IESE Insight articles on openly accessible Internet sites is prohibited. To obtain
permission to post articles on secured and/or password-protected intranet sites, write your request
to iesep@iesep.com

IESE Business School - University of Navarra / www.ieseinsight.com


This document is authorized for educator review use only by Muhammad Adnan Waseem, Other (University not listed) until Oct 2024. Copying or posting is an infringement of
copyright. Permissions@hbsp.harvard.edu or 617.783.7860
DEEP
insight

t
os
rP
yo
op
MARKETING IN A DIGITAL AGE

Using Big Search Data


tC

to Map Your Market


By BERND SKIERA and DANIEL M. RINGEL
No

R
ecently, we ran a marketing work- To our surprise, the participants were
shop with an experienced business stumped. First, it took them a very long time
development manager from Ama- to convert the data into a format that would al-
deus, the tech provider for the travel low them to answer basic marketing questions,
industry. The workshop focused on Travel Audi- because the sample sizes were far too big for a
ence, Amadeus’ online advertising platform for conventional spreadsheet program. They strug-
travel companies. Participants were given four gled with separating causality from correlation.
Do

data sets, ranging in size from 770,000 to more And finally, they complained about the amount
than 9 million “observations” – which reveal of data they had to analyze – until we remind-
how users or consumers behave and interact in ed them that the data sets they had been given
a specific market category. We asked the partici- barely covered a week!
pants to analyze the data and answer a few basic This experience illustrates that, unlike yes-
marketing questions: Where does the company terday’s marketing professionals, marketers to-
make the most profit? Where should it spend day need to upgrade their data-handling skills.
more money on advertising? What is the return It’s not enough to ask a junior professional to
on investment in terms of advertising? do the work for you. Marketers who can pull

https://dx.doi.org/10.15581/002.ART-2982 ISSUE 32 FIRST QUARTER 2017 31


This document is authorized for educator review use only by Muhammad Adnan Waseem, Other (University not listed) until Oct 2024. Copying or posting is an infringement of
copyright. Permissions@hbsp.harvard.edu or 617.783.7860
Using Big Search Data to Map Your Market

t
os
Big data present distinct challenges. Marketing
professionals do not have to become IT specialists.

rP
But successful marketers will be those who also have
solid math and data-management skills.

and read the numbers themselves are in a much information boom. The internet enabled mar-
better position than those who have to run to keters to measure, on a grand scale, when an

yo
someone else each and every time they need to advertisement was shown and how consumers
retrieve key data, waiting days or weeks to get reacted to it – how many clicked and how many
the information they want from the IT depart- proceeded to purchase.
ment. Being able to read and react faster to mar- With the ubiquity of increasingly affordable
ket shifts will lend a competitive advantage. and powerful smartphones, the sea of data has
With so many brands and products and so grown exponentially. On top of all existing data,
much big data available today, where does one companies can now also collect vast quantities
begin? Interpreting the competitive relation- of location-based information on consumer be-
ships among 1,000 different products is ex- havior around the clock.
op
tremely difficult and would entail a gigantic, While having more and better information
unwieldy table. As such, we came up with a way is a good thing, big data present distinct chal-
of visualizing competitive market structure in a lenges – from poor data quality, to incomplete
single map. This makes it more manageable to data sets, to mistaking correlation for causality.
devise effective competitive strategies, pricing There are several signals that marketers are
policies and communication. behind the learning curve. It used to be that the
This article explains our approach. Using chief concerns of marketing were to develop
tC

data of large markets with more than 1,000 com- creative slogans and strategies. In fact, many
peting products, we show how companies can people went into marketing precisely because
tap into big data and visualize competitive mar- they didn’t want to do math. The digital revo-
ket structure. Various parties can benefit from lution, however, has changed all that. While
this method, which reveals valuable insights on coming up with creative content will always be
customer behavior. central to the profession, marketers now have to
take a data-driven approach to prove that their
The Big Learning Curve of Big Data ideas actually work.
No

Ever since the advent of the browser, market- This development doesn’t mean market-
ers have been trying to get to grips with the ing professionals have to become IT specialists
or set up databases and IT architecture them-
selves. Such tasks can and should stay with the
EXECUTIVE SUMMARY IT department. However, it does mean that suc-
cessful marketers will be those who also have
Markets today are flooded In this article, the authors solid math and data-management skills. The
with an increasing number of describe how they used rise of crowdsourced platforms, such as Kaggle,
products and brands, making clickstream data to visualize to carry out predictive modeling and analytics,
Do

it difficult for companies to competition in product as well as the growing popularity of online data-
track how their products categories containing more analysis courses, are testament to marketing’s
compete in the market. Big than 1,000 products. Both wider remit.
search data allow companies manufacturers and retailers
to obtain valuable insights on can profit from their approach, Market Share and Competition
market share as well as on key which yields more meaningful Recently, a manager at a price-comparison
competitors. But collecting information on customer website mentioned that his firm could quickly
and analyzing vast amounts of behavior than traditional predict the success or failure of a new prod-
data pose huge challenges. market research methods. uct by simply following the outbound clicks to

32 FIRST QUARTER 2017 ISSUE 32


This document is authorized for educator review use only by Muhammad Adnan Waseem, Other (University not listed) until Oct 2024. Copying or posting is an infringement of
copyright. Permissions@hbsp.harvard.edu or 617.783.7860
Using Big Search Data to Map Your Market

t
os
Purchase data make it easy to estimate market share,
but they won’t tell you much about the decision

rP
process of consumers before they make a purchase.
Our research aims to fill that gap.

retailers’ websites. That gave us an idea: why not LED TV market for three reasons:
use clicks to predict market share? First, LED TVs have become highly popular

yo
We also decided to do something that mar- consumer durables on a global level.
ket research firms don’t do very well. Tradition- Second, since the market consists of more
al approaches are largely based on scanner panel than 1,000 products, the use of traditional ap-
data that require multiple purchases within a proaches to analyze competition among prod-
product category to detect switching between ucts would be cumbersome, if not impossible.
products as an indicator of their substitutabil- Third, the market is well known, with easy-to-
ity. We sought to identify competition among understand attributes, allowing us to easily
durable products that consumers typically buy check our findings for face validity.
just once. Market share and competition are two Our approach consisted of five phases:
op
important areas in which big data can provide Collect data by observing consumer behavior
key insights for marketers. on an online platform that provides informa-
We used clickstream data – that is, a log of tion on diverse products.
user actions on a website – as the basis for our Consider products that were viewed together
analysis. These data are powerful for marketers, by the same consumer as being products in
because they provide high-quality information that consumer’s consideration set.
that can be sliced up across various dimen- Identify competitive asymmetry.
tC

sions. Most market research firms only collect Use a model, called DRMABS, to create a map
purchase data. These data make it easy to esti- that visualizes asymmetric competitive mar-
mate market share, but they won’t tell you much ket structure.
about the decision process of consumers before Transpose product attributes onto this map.
they make a purchase.
Imagine you want to buy a washing machine 1 PHASE 1: COLLECT DATA
for the first time. The number of options can
be daunting. Someone might recommend a For the initial step, we collected big search data
No

product to you, and although you might recall on individual consumer clickstreams. You can
the brand, it is more difficult to remember the retrieve such data from any website that pro-
exact type or product number of the washing vides information on a broad range of products,
machine. such as online retailers, price-comparison sites
The failure to remember specific products and product review sites. For this study, we col-
is the fundamental pitfall of consumer surveys. lected data from idealo.de, a leading price-com-
When market research firms survey consum- parison website in Germany.
ers on which other products they considered To collect consumers’ clickstreams, we in-
before making a purchase, consumers typically stalled a tracking pixel on each page of the web-
Do

struggle to correctly recall which exact prod- site. This tracking pixel enabled us to collect
ucts among hundreds they looked at. One of data on 105,606 consumers who used idealo.
the goals in our research was to fill this critical de to search for and compare 1,124 products of
gap, since such insight is crucial for marketers to 56 brands in our chosen product category (LED
make informed decisions. TVs) in September 2012. A tracking pixel con-
sists of a tiny, usually transparent image that is
A Five-Phase Approach embedded in the HTML code of a website to re-
For the purposes of our research, we examined cord usage by website visitors. This means that
the LED TV market in Germany. We chose the each time a consumer opens a new webpage on

ISSUE 32 FIRST QUARTER 2017 33


This document is authorized for educator review use only by Muhammad Adnan Waseem, Other (University not listed) until Oct 2024. Copying or posting is an infringement of
copyright. Permissions@hbsp.harvard.edu or 617.783.7860
Using Big Search Data to Map Your Market

t
os
the site, the browser downloads the tracking products consumers perceived as viable substi-
pixel from the web server. This procedure al- tutes, and then used those sets to uncover com-
lows the server to log the fact that the shopper petitive relationships among products.

rP
has visited that specific page. A consideration set was constructed for each
During the search process, consumers may individual consumer visiting the website by ana-
revisit sites to review products various times lyzing individual clickstreams with regard to the
over a period of days. For this reason, we also products that consumers searched for and com-
tracked the behavior of consumers who re- pared online. While one shopper may look at five
turned to idealo.de several days later to con- different LED TVs, another might look at two.
tinue their search for products. We constructed consideration sets for more
Through our research, we discovered than 100,000 consumers. In doing so, we were

yo
that 99.9 percent of all captured clickstreams able to determine how often any two products
spanned 16 products or fewer. As such, we chose appeared in the same consideration set. The
16 products as our cut-off, and eliminated click- more that any two products appeared jointly
streams containing more than 16 products, in consumers’ consideration sets, the stronger
since such large clickstreams may actually be their competitive relationship.
generated by search bots that systematically We captured all competitive relationships
collect large amounts of product data but do in a 1,124 x 1,124 matrix of joint product consid-
not provide insight into a consumer’s product eration. This gave us a very broad view of the
consideration. marketplace.
op
PHASE 3: IDENTIFY COMPETITIVE
2 PHASE 2: DETERMINE CONSIDERATION SETS 3
ASYMMETRY

From the data collected in Phase 1, we con- Next, we identified two types of competitive
structed consideration sets. We looked at which asymmetry: across all products in a market
(market-share competitive asymmetry) and
among pairs of products (local competitive
tC

ABOUT THE AUTHORS asymmetry).


The idea behind market-share competitive
Bernd Skiera holds the Chair University of California, Los asymmetry is that if a given product, A, is con-
of Electronic Commerce Angeles. sidered by more consumers than products B, C
at Goethe University, and D, then product A is the stronger competi-
Frankfurt. In addition, he is Daniel M. Ringel is tor overall. In this way, we were able to iden-
a member of the managing completing doctoral studies tify which products were competing against
board of the e-Finance Lab at Goethe University, and each other, as well as determining how con-
No

and a Professorial Fellow starting in summer 2017 sumers’ consideration was distributed across
at Deakin University in he will be an assistant competing products.
Australia. His research professor at the University The idea behind local competitive asymme-
focus is e-commerce, online of North Carolina at try is that from the perspective of one product,
marketing and interactive Chapel Hill. He studied the other product may be a stronger competi-
media, subjects on which business administration tor than vice versa. Take Apple’s iPod: from
he has published books and at Baden-Wuerttemberg the perspective of an iPod, an MP3 player from
articles in Marketing Science, Cooperative State University iRiver is less of a competitor than vice versa.
Journal of Marketing and in cooperation with IBM,
Journal of Marketing Research,
Do

and completed his Executive PHASE 4: VISUALIZE COMPETITIVE


4
MARKET STRUCTURE
among others. He has been MBA as Goethe Scholar
a visiting professor and from Goethe Business We then created a map to visualize competitive
scholar at Stanford University, School in alliance with Duke market structure in large product categories,
Cambridge Judge Business University’s Fuqua School using a combination of methods drawn from
School, New York University’s of Business. He previously network analysis and graph theory. These
Stern School of Business, worked as a management methods are often used to analyze and visual-
Duke University’s Fuqua consultant and founded an ize relationships in the fields of communica-
School of Business and the e-commerce business. tion, biology, medicine and IT applications.

34 FIRST QUARTER 2017 ISSUE 32


This document is authorized for educator review use only by Muhammad Adnan Waseem, Other (University not listed) until Oct 2024. Copying or posting is an infringement of
copyright. Permissions@hbsp.harvard.edu or 617.783.7860
Using Big Search Data to Map Your Market

t
os
We were able to identify the top 10 brands and
products by share of consumer consideration.

rP
We were also able to see how individual products
competed, and pinpoint local competitive asymmetry.

To do so, we integrated several existing and To create this map, we first removed all
newly developed algorithms into a new model competitive asymmetry information (arcs

yo
called DRMABS, which stands for Decomposi- and bubble size) as well as cluster membership
tion and Reassembly of Markets by Segmenta- (bubble color) from the map. Next, we depict-
tion. The underlying concept of DRMABS is ed product attributes by varying bubble colors
that it decomposes the total market into sub- and sizes that represent individual products
markets, maps submarkets individually, and on the competitive market structure map.
then reassembles all local submarket maps into The final visualization provides a clear view of
a global map of the whole market. what consumers searched for and the drivers of
Finally, we introduced global competitive competitive market structure. See Exhibit 2.
symmetry to the global map by using bubble Using our approach, we were able to iden-
op
size to represent the frequency of consider- tify the top 10 brands and products by share of
ation as a proxy for market share, and weighted consumer consideration. As Exhibit 3 shows,
arrows to indicate local competitive asymmetry. Samsung dominated the market, with a share
See Exhibit 1. of 43 percent. Overall the top 10 brands held
a joint share of roughly 97 percent of the Ger-
PHASE 5: TRANSPOSE PRODUCT man market, while the remaining 46 brands
5
ATTRIBUTES ONTO THE MAP
accounted for just 3 percent of the market.
tC

In the fifth phase of our research, we transposed We were also able to visualize how LED TVs
product attributes, such as brand or display size, were organized into submarkets, to see how
onto the competitive market structure map. We individual products competed with each other
visualized these attributes by using different and to pinpoint when there was strong local
colors and bubble sizes. competitive asymmetry.
Since price-comparison sites generate rev-
enue with every click on any retailer, regardless
EXHIBIT 1 of what the product is, they are not affected by
No

DRMABS Model
which products are viewed by consumers, mak-
for Visualizing Market Structure
ing them an unbiased data source for consumer
WE DECOMPOSE THE MARKET INTO SUBMARKETS,
consideration.
MAP INDIVIDUALLY, THEN REASSEMBLE.
Another benefit of price-comparison sites
1
Decomposition

is that they provide low-cost clickstream data


FIND SUBMARKETS MAP SUBMARKETS
2 GLOBALLY
generated by millions of consumers who are
comparing thousands of products. They cap-
ture individual consumer searches in real time,
unlike other sources – such as Google – which
Do

only provide summary information, such as


total keyword searches.

3 MAP PRODUCTS
4 ADD ASYMMETRY
A Typical Consumer Clickstream
Reassembly

LOCALLY, OPTIMIZE
GLOBALLY To understand the value of clickstream data,
consider a typical scenario. Imagine a shopper
– let’s call her Susan – is looking for an Apple
smartphone. She uses Google to search for this
product, and one of the first results is a link to a

ISSUE 32 FIRST QUARTER 2017 35


This document is authorized for educator review use only by Muhammad Adnan Waseem, Other (University not listed) until Oct 2024. Copying or posting is an infringement of
copyright. Permissions@hbsp.harvard.edu or 617.783.7860
Using Big Search Data to Map Your Market

t
os
Less nuanced methods would only register if Susan
clicked on an HTC product page and then purchased

rP
a smartphone there. Clickstream data offer a more
detailed picture of the customer journey.

product and price comparison site. page. She likes the phone, but is still not sure.
Susan clicks on the site and is taken to a After revisiting the Apple page, she returns

yo
product detail page, which includes technical to the HTC page. She clicks on the one with
details, pictures, other consumers’ opinions the lowest price and a new browser opens,
and ratings. She studies the information and displaying the website of the corresponding
decides to have a look at an alternative phone. online shop.
She uses the search bar to search for “Samsung In this case, Susan’s consideration set size
smartphone.” She clicks on the phone she is is three. The products she weighed are referred
interested in, but since she’s running late, she to as being “considered jointly” with each oth-
decides to continue her search another time. er, in that Apple was considered jointly with
The next day, Susan goes directly to the Samsung, Samsung was considered jointly
op
comparison site to search for “Apple phone.” with HTC, and HTC was considered jointly
She likes the phone, but wants to make sure it with Apple.
is the right choice. To reduce the list of several This is an example of the kind of informa-
hundred smartphones, she applies filters of at- tion that clickstream data can yield that tra-
tributes important to her. She clicks on a model ditional marketing research methods cannot.
from HTC and is taken to its product detail Less nuanced methods would only register
if Susan clicked on an HTC product page and
tC

then purchased a smartphone there. Click-


EXHIBIT 2 stream data offer a more detailed picture of the
Visualization of
customer journey.
Asymmetric Competitive Market
Structure Map of 1,124 LED TVs
30
1
No

28
27
29
3
2 26
25
4 24
22
14
5 17 23
6 21
16
13 19
15
7
10 12
Do

9 20
8 18
11

LEGEND competitive asymmetry Arrow weight indicates how


Bubbles represent individual (consideration frequency). intense a competitive relationship
products (SKUs). Arrows represent local is: the darker and thicker the arrow,
Bubble color indicates submarket competitive asymmetry and point the more intense the relationship
membership. at competitors of the product they Submarkets are numbered 1
Bubble size indicates global originate in. through 30.

36 FIRST QUARTER 2017 ISSUE 32


This document is authorized for educator review use only by Muhammad Adnan Waseem, Other (University not listed) until Oct 2024. Copying or posting is an infringement of
copyright. Permissions@hbsp.harvard.edu or 617.783.7860
Using Big Search Data to Map Your Market

t
os
If our method seems complex, let it serve both as an
eye-opener and an action agenda for how marketing

rP
professionals need to start reorienting their future
activities and capabilities.

EXHIBIT 3 customers. Both situations cut retailer revenue.


Top 10 Brands
When retailers have a clear picture of submar-
MARKET SHARE ACCORDING TO

yo
kets and their most popular products, they can
CONSUMER CONSIDERATION
serve a broad spectrum of consumer needs with
a relatively small number of products.
OTHER 1%
2%
0.5% LEGISLATORS AND PUBLIC POLICYMAKERS need to
keep an eye on competitive market structures to
2.5%
spot early signs of problems, such as the conse-
3%
quences of a merger or acquisition on free mar-
43% 4% kets or potential monopolies forming. In our
op
case (Exhibit 3), they would be able to see that
6% Samsung, Philips and LG have strong presenc-
es in most submarkets, and that the potential
SHARE OF merger of any of these three companies would
56 BRANDS 8%
lead to undesirable levels of market power.
STUDIED
(rounded)
Our approach can be used to process big search
tC

data taken from any website where consumers


14% search for and compare products in large prod-
uct categories. If our method seems complex,
16% let it serve both as an eye-opener and an action
agenda for how marketing professionals need to
start reorienting their future activities and capa-
bilities. As the amount of available clickstream
data continues to grow, the need for sophisti-
No

Reaping the Benefits cated analysis methods will become more acute.
Our approach for visualizing competition can Marketers who give proactive attention to this
yield valuable insights, not only for manufac- area will have an edge on their competitors.
turers and retailers but also for legislators and Overall, our approach opens the door to a
public policymakers, which cannot be obtained deeper and more expansive analysis of today’s
from other sources. large markets. It also demonstrates how market-
ers – when equipped with the right information
MANUFACTURERS can use asymmetric competi- management skills – can harness the power of
tive market structure maps to quickly see how big search data.
Do

a market is organized, how many submarkets


exist, which competitors they face in each sub-
market, and how strong these competitors are.
TO KNOW MORE
RETAILERS can use the maps to improve pur-
chasing and inventory management decisions. Q Ringel, D.M. and B. Skiera. “Visualizing
Poor decisions regarding inventory can leave Asymmetric Competition Among More Than
retailers with overstock that cannot be sold or 1,000 Products Using Big Search Data.” Marketing
shortages of hot products, leading to unhappy Science 35, no. 3 (May-June 2016): 511-34.

ISSUE 32 FIRST QUARTER 2017 37


This document is authorized for educator review use only by Muhammad Adnan Waseem, Other (University not listed) until Oct 2024. Copying or posting is an infringement of
copyright. Permissions@hbsp.harvard.edu or 617.783.7860

You might also like