Professional Documents
Culture Documents
Global Market Segmentation
Global Market Segmentation
This project is based on drivers of store image data to identify market segments as well as to determine entry strategy
for Whole Foods. After estimating mixture regression models in GLIMMIX, the segment with higher demand for
atmosphere and service (Quality Segment), is recommended for Whole Foods to target. Seven regions are chosen to
enter initially: Hamburg, Koln, Asturias, Galicia, Alsace, Friuli-Venezia Giulia and Alentejo, considering their
population and likelihood of belonging to Quality Segment. When targeting Quality Segment, recommended
positioning for Whole Foods is to offer the best atmosphere and service. More recommendations are also made for
Whole Foods to enter Europe market successfully.
A survey dataset for the household’s primary food retail outlet is identified and the most frequently visited food
store, has been collected from a sample of 1669 respondents in 105 regions across seven countries. The respondents
rate store image and its drivers including service, atmosphere and price on a likert scale of 1 to 7. Two datasets have
been provided, one is “EURetail.dat” which is used as the data input to estimate mixture regression models. The
other is “EURetail.GL1”, a file having variable definitions.
Mixture Regression Model is a model in which the most optimal number of segments can be identified. And within
each segment, a dependent variable is associated with a group of independent variables according to some form of
response model (linear or nonlinear), and the parameters of the response model differ between the segments. The
analysis has been done using GLIMMIX software, which is used for Mixture Regression Models.,
The first part is to identify the optimal number of segments. This is done by observing CAIC and by referring to ” the
plot of statistics against the number of segments” in the output result. The lower the CAIC value the better it is.
CAIC has been observed for 1 to 4 number of segments. By observing the CAIC in “statistics” in the output file, we
see that the lowest value is when the number of segments is two.(see Appendix II Figure 1 Figure 2).
The second part is to determine the response model for the segments. The data follows Normal distribution, so it is
necessary to generate mixture regression models for each segment to capture the effects of different
drivers(variables) on the overall store image. Maximum Likelihood Estimation methodology is used over here. The
parameters are different between two segments. (see Appendix II Figure 3).
The third part is to identify the probability of a region belonging to each segment. First, find the segment size, which
is determined by the probability of belonging to each segment and referring to the “coefficients and standard errors”
in the output file. Four iterations are selected to reaffirm our class size for 2 segments. The probability of being
present in the first segment is 0.305 and the probability of being present in the second segment is 0.694. So, the class
size of first segment is 30.5% and the class size of second segment is 69.4%. (see Appendix II Figure 3).
Next step is to assign each region to one of the identified segments. Each region is assigned to one of the segments
by comparing the posterior probability of lying in each of the two segments. The maximum value out of the two
values determines in what segment that particular region will belong to. (see Appendix II Figure 4).
Key Findings
In order to identify market segments based on drivers of Store Image and to choose the entry strategy for a particular
food retailer, mixture regression analysis is carried out. Store Image is analyzed with respect to factors such as
Region, Respondent, Service, Atmosphere and Price. The independent variables Service, Atmosphere and Price are
all categorical and the variables Region and Respondent and the Dependent variable Store image is continuous in
nature.
The GLIMMIX software is run with the following input for a Mixture Regression Analysis:
Store_image = a + b1*Service + b2*Atmosphere +b3*Price
Link Function - Identity
Distribution Level - Normal Distribution
Estimation Method - Maximum Likelihood Estimation (MLE)
Number of Classes - Min -1 and Max - 5
BIC/CAIC is a criterion for model selection among a finite set of models; the model with the lowest BIC/CAIC is
preferred.It has been found that the BIC/CAIC value (3410.67) and AIC value (3351.06) is the lowest for two
Segments/classes. Hence, we choose only two segments for making the analysis.
Coefficient estimates given by GLIMMIX (See Appendix) can be interpret as the importance of each driver to each
segment. Segment 2 has a size of 0.695 and are more sensitive to Service (Estimate Seg2 - 0.3528 > Estimate Seg1 -
0.2628) and Atmosphere (Estimate Seg2 - 0.3028 > Estimate Seg1 - 0.1571) than to Price (Estimate Seg2 - 0.2395 <
Estimate Seg1 - 0.2968) . Whereas, Segment 1 with a size of 0.305 is more sensitive to Price (Estimate Seg1 -
0.2968 > Estimate Seg2 - 0.2395 ) than Segment 2. In following report, Segment 2 will be referred as Quality
segment, and Segment 1 will be referred as Price segment. Whole Foods, a leading Organic food retailer who has
stringent quality standards and premium pricing, should focus more on Quality Segment (Segment 2) than Price
Segment (Segment 1). Also, the market size for Quality Segment is greater (0.695) than the Price Segment (0.305).
Henceforth, Whole Foods can focus on the Quality Segment.
Regions are assigned to one of the two segments based on the value of posterior membership probability, which is
the probability of the customers belonging to each group. A threshold value of 0.6 is chosen, for the posterior
membership probability in order to assign the regions to one of the two segments. A posterior membership
probability of 0.6 gives a clear separation between the two segments whereas a posterior membership probability of
0.5 shows that people from this region have almost similar preference for both the segments and there is no clear
distinction. Over 74 regions belong to Segment 2( Quality segment) based on the value of Posterior membership
probability.
Hamburg , Koln, Asturias, Galicia, Alsace,Friuli-Venezia Giulia and Alentejo are chosen as the regions to target first
for the following reasons. Firstly, they have a clear preference towards Quality segment (Segment 2) with a
Posterior Membership Probability of 0.999/1.000. They are insensitive to Price and are more conscious about Service
and Atmosphere. Secondly, considering the factors such as Population Density, number of Competitors, Supply
Chain efficiency, it is better to focus on regions such as Hamburg, Koln, Asturias and Galicia, as the Population
Density and the Purchasing Power are higher for people in these regions. (Refer Appendix.III) Focusing on these
regions could help to save time and efforts of the Whole Foods Marketing team as well in comparison to the time and
effort they would spend on regions having sparse population with low purchasing power. Also, Hamburg and Koln
are some of the major port cities and have well connected land routes and sea routes. Transportation of goods and
ingredients to these areas would be much easier than other places as well.
Conclusions and Recommendations
Conclusions
In conclusion, based on drivers of store image and Whole Foods’ competencies, regions that value atmosphere and
service should be taken as the target segments for Whole Foods entering Europe market. To take population as well
as the posterior probability into consideration, seven regions are chosen to enter: Hamburg, Koln, Asturias, Galicia,
Alsace, Friuli-Venezia Giulia and Alentejo.
Recommendations
According to analysis, three major recommendations are made for Whole Foods to enter into European market
effectively.
● Based on Mixture Regression Model and taking geographical factors into consideration, seven regions from the
quality segment are suggested for Whole Food to target first: Hamburg, Koln, Asturias, Galicia, Alsace, Friuli-
Venezia Giulia and Alentejo. Then, in order to expand market size, Whole Foods could turn to other regions of
Quality Segment as its next step. In addition, Whole Foods could lower its price to target at more price-sensitive
segments.
● To target at Quality Segment, Whole Food is suggested to focus on best atmosphere and service, which could
help them differentiate from other organic food store. Also, promotion is recommended for Whole Foods to
increase its brand awareness; and loyalty program is recommended to increase customers’ life value.
● More background knowledge in these regions need to be researched when targeting. There might be other
factors in these regions such as competition, customers’ shopping habits, and distribution that will affect
customers’ purchase intention.
Limitations
In terms of data limitation, firstly, based on the contents of questionnaires, the data counts on a 1-7 point likert scale,
which provides little information about the reference of these numbers since these numbers do not have any
numerical meaning behind and vary among individuals. Therefore, it is hard to understand the descriptive statistics
from this database. Secondly, the data was obtained in 2007, which might not reflect the current situation of
European market effectively, since customers’ purchase power and preference might change over time. Thirdly, it is
hard to evaluate whether these samples could represent the real situation of the whole region, since any background
information about them is unknown.
In terms of information insufficiency, internal factors such as cost structure, and external factors such as competition
and economics environment are not clear, which makes it hard to detect segments’ profitability and growth rate
accurately.
Appendices: Tables, Exhibits, Figures
Appendix.I Data Description
Variable Description
REGION ID number for a region (there are 105 regions in the data, ranging from 1-120,
with a few numbers absent).
CONSUMER ID number for a respondent within a region (= 1, 2, 3, …), each region may
have a different number of respondents.
STORE_IMAGE Overall store image rating for a respondent’s primary food retail outlet, on a
1-7 point scale (missing values are imputed with their corresponding sample
averages and thus may contain half points)
SERVICE Service quality perception of a respondent’s primary food retail outlet, on a 1-
7 point scale (a store image driver)
ATMOSPHERE Store atmosphere perception of a respondent’s primary food retail outlet, on a
1-7 point scale (a store image driver).
PRICE Price perception of a respondent’s primary food retail outlet, on a 1-7 point
scale (a store image driver). Note that higher value indicates more favorable
price perception (i.e., low prices, reasonable prices).
Appendix.II Segment
Figure 3
#classes: 2 current: 1 startnr: 1
Independent Coefficient estimates STD.ERR T-value
Service 0.268627 0.025496 10.536228
Atmosphere 0.157096 0.025302 6.208827
Price 0.296865 0.020900 14.204236
intercept 1.724799 0.127082 13.572384
Appendix. III