Professional Documents
Culture Documents
Survey and Multiviriate Analysis-HANDBOOK
Survey and Multiviriate Analysis-HANDBOOK
For further information and step–by–step guide to solving problems, please kindly refer to Tutoring Videos uploaded
on ISB Academic Team Facebook fanpage.
Under no circumstances should one copy this document without the author’s permission
Chapter 1,2,4 - ISB Academic Team
CHAPTER 1, 2, 4
1. Marketing Research
1
Chapter 1,2,4 - ISB Academic Team
2
Chapter 1,2,4 - ISB Academic Team
3
Chapter 1,2,4 - ISB Academic Team
4
Chapter 1,2,4 - ISB Academic Team
Competitive Intelligence
o The process of enhancing marketplace competitiveness through a greater
understanding of a firm’s competitors and the competitive environment.
o This process is unequivocally ethical. It involves the legal collection and
analysis of information regarding the capabilities, vulnerabilities, and
intentions of business competitors, conducted by using information
databases and other “open sources” and through ethical marketing
research inquiry.
o CI enables senior managers in companies of all sizes to make informed
decisions about everything from marketing, research and development
(R&D), and investing tactics to long-term business strategies.
5
Chapter 1,2,4 - ISB Academic Team
Examples: Examples:
While distinct, the marketing research problem has to be closely linked to the
management decision problem.
Conceptual map: A way to link the broad statement of the marketing research
problem to the management decision problem.
o Management wants to (take action) - the rationale for the question and the
project (the MDP).
o Therefore, we should study (topic). - what broader topic is being
investigated.
o So that we can explain (question). - who/how/why that needs to be
explained.
o Notes: the second and third lines define the broad marketing research
problem (MRP) .
6
Chapter 1,2,4 - ISB Academic Team
4. Data in research
Primary Data
o Originated by a researcher for the specific purpose of addressing the
problem at hand.
o Obtaining primary data can be expensive and time consuming.
Secondary Data
o Data that has already been collected for purposes other than the problem at
hand. These data can be located quickly and inexpensively.
o Advantages
Secondary data are easily accessible, relatively inexpensive, and
quickly obtained.
Secondary data can help you:
Identify the problem.
Better define the problem.
Develop an approach to the problem.
Formulate an appropriate research design (for example, by
identifying the key variables).
Answer certain research questions and test some
hypotheses.
Interpret primary data more insightfully.
7
Chapter 1,2,4 - ISB Academic Team
o Disadvantages
The objectives, nature, and methods used to collect the
secondary data may not be appropriate to the present
situation.
Secondary data may be lacking in accuracy, or they may not
be completely current or dependable.
o Rule for using secondary data
Start with secondary data. Proceed to primary data only when the
secondary data sources have been exhausted or yield marginal
returns.
8
Chapter 1,2,4 - ISB Academic Team
9
Chapter 2, 8, 9, 10 – ISB Academic Team
CHAPTER 2, 8, 9, 10
APPROACH DEVELOPMENT, MEASUREMENT & QUESTIONNAIRE
10
Chapter 2, 8, 9, 10 – ISB Academic Team
Analytical Model
o An analytical model is a set of variables and their interrelationships
designed to represent, in whole or in part, some real system or process.
o Models can have many different forms. The most common are verbal,
graphical, and mathematical structures.
Verbal Models: Analytical models that provide a written
representation of the relationships between variables.
Graphical Models: Analytical models that provide a visual picture
of the relationships between variables.
Mathematical Models: Analytical models that explicitly describe
the relationships between variables, usually in the equation form.
Research Questions
o Research questions are refined statements of the specific components of
the problem.
To develop an approach, each component of the problem may have
to be broken down into subcomponents or research questions.
o Research questions ask what specific information is required with respect
to the problem components.
o The formulation of the research questions should be guided not only by
the problem definition but also by the theoretical framework and the
analytical model adopted.
11
Chapter 2, 8, 9, 10 – ISB Academic Team
Hypotheses
o A hypothesis (H) is an unproven statement or proposition about a factor
or phenomenon that is of interest to the researcher.
o Hypotheses go beyond research questions because they are statements of
relationships or propositions rather than merely questions to which
answers are sought.
o An important role of a hypothesis is to suggest variables to be included in
the research design.
12
Chapter 2, 8, 9, 10 – ISB Academic Team
13
Chapter 2, 8, 9, 10 – ISB Academic Team
Nominal Scale
o A nominal scale is a figurative labelling scheme in which the numbers
serve only as labels or tags for identifying and classifying objects.
o When used for identification, there is a strict one-to-one correspondence
between the numbers and the objects.
Common examples include Social Security numbers and numbers
assigned to football players. In marketing research, nominal scales
are used for identifying respondents, brands, attributes, stores, and
other objects.
When used for classification purposes, the nominally scaled
numbers serve as labels for classes or categories.
For example, you might classify the control group as group
1 and the experimental group as group 2.
The numbers in a nominal scale do not reflect the amount of the
characteristic possessed by the objects.
The only permissible operation on the numbers in a nominal scale is
counting.
Only a limited number of statistics, all of which are based on
frequency counts, are permissible. These include percentages,
mode, chi-square, and binomial tests.
14
Chapter 2, 8, 9, 10 – ISB Academic Team
Ordinal Scale
o An ordinal scale: A ranking scale in which numbers are assigned to objects
to indicate the relative extent to which some characteristic is possessed.
o An ordinal scale allows you to determine whether an object has more or
less of a characteristic than some other object, but not how much more or
less.
Indicates relative position, not the magnitude of the differences
o Common examples of ordinal scales include quality rankings, rankings of
teams in a tournament, socioeconomic class, and occupational status.
The ordinal scales possess description and order characteristics
but do not possess distance (or origin).
In an ordinal scale, as in a nominal scale, equivalent objects receive
the same rank. Any series of numbers can be assigned that
preserves the ordered relationships between the objects.
Interval Scale
o A scale in which the numbers are used to rate objects such that numerically
equal distances on the scale represent equal distances in the characteristic
being measured.
o An interval scale contains all the information of an ordinal scale, but it also
allows you to compare the differences between objects.
o A common example in everyday life is a temperature scale. In marketing
research, attitudinal data obtained from rating scales are often treated as
interval data.
o In an interval scale, the location of the zero point is not fixed, i.e., these
scales do not possess the origin characteristic -> Both the zero point and
the units of measurement are arbitrary.
Ratio Scale
o A ratio scale possesses all the properties of the nominal, ordinal, and
interval scales and, in addition, an absolute zero point.
o Ratio scales possess the characteristic of origin (and distance, order, and
description).
o Ratio scales we can identify or classify objects, rank the objects, and
compare intervals or differences. It is also meaningful to compute ratios of
scale values.
Common examples of ratio scales include height, weight, age, and
money.
In marketing, sales, costs, market share, and number of customers
are variables measured on a ratio scale.
15
Chapter 2, 8, 9, 10 – ISB Academic Team
Likert Scale
o Likert scale is a widely used rating scale that requires the respondents to
indicate the degree of agreement or disagreement with each of a series of
statements about the stimulus object.
16
Chapter 2, 8, 9, 10 – ISB Academic Team
The data are typically treated as intervals. Thus, the Likert scale
possesses the characteristics of description, order, and distance.
o Advantages
Easy to construct and administer.
Making it suitable for mail, telephone, personal or electronic
interviews.
o Disadvantages
Taking longer to complete than other itemised rating scales
because respondents have to read each statement.
May be difficult to interpret the response to a Likert item, especially
if it is an unfavourable statement.
Stapel Scale
o Stapel Scale is a scale for measuring attitudes that consists of a single
adjective in the middle of an even-numbered range of values, from 5 to 5,
without a neutral point (zero).
o Respondents are asked to indicate how accurately or inaccurately each
term describes the object by selecting an appropriate numerical response
category.
o The higher number, the more accurately the term describes the object, as
shown in the department store project.
17
Chapter 2, 8, 9, 10 – ISB Academic Team
Scale Evaluation
o A multi-item scale should be evaluated for accuracy and applicability,
involving an assessment of reliability, validity, and generalizability of the
scale.
Reliability
o Reliability refers to the extent to which a scale produces consistent results
if repeated measurements are made.
o Reliability is assessed by determining the proportion of systematic
variation in a scale.
18
Chapter 2, 8, 9, 10 – ISB Academic Team
Validity
o The validity of a scale may be defined as the extent to which differences in
observed scale scores reflect true differences among objects on the
characteristic being measured, rather than systematic or random error.
o Validity can be assessed by examining content validity, criterion validity,
and construct validity.
CONTENT VALIDITY
Content validity, sometimes called face validity, is a
subjective but systematic evaluation of how well the content
of a scale represents the measurement task at hand.
CRITERION VALIDITY
Criterion validity reflects whether a scale performs as
expected in relation to other variables selected as
meaningful criteria (criterion variables).
CONSTRUCT VALIDITY
Construct validity addresses the question of what
construct or characteristic the scale is, in fact, measuring.
Construct validity includes convergent, discriminant, and
nomological validity.
Convergent validity is the extent to which the scale
correlates positively with other measures of the same
construct.
Generalizability
o Generalizability refers to the extent to which one can generalise from the
observations at hand to a universe of generalisations.
o The set of all conditions of measurement over which the investigator
wishes to generalise is the universe of generalisation.
In generalizability studies, measurement procedures are designed
to investigate the universes of interest by sampling conditions of
measurement from each of them.
For each universe of interest, an aspect of measurement called a
facet is included in the study.
19
Chapter 2, 8, 9, 10 – ISB Academic Team
3. Questionnaire:
Objectives: Any questionnaire has three specific objectives.
o First, it must translate the information needed into a set of specific
questions that the respondents can and will answer.
o Second, a questionnaire must uplift, motivate, and encourage the
respondent to become involved in the interview, to cooperate, and to
complete the interview.
o Third, a questionnaire should minimise response error.
20
Chapter 2, 8, 9, 10 – ISB Academic Team
21
Chapter 2, 8, 9, 10 – ISB Academic Team
Order of Questions
o Opening Questions: can be crucial in gaining the confidence and
cooperation of respondents. The opening questions should be interesting,
simple, and nonthreatening.
o Type of Information: The type of information obtained in a questionnaire
may be classified as
22
Chapter 2, 8, 9, 10 – ISB Academic Team
Logical Order: Questions should be asked in a logical order. All of the questions
that deal with a particular topic should be asked before beginning a new topic.
23
Chapter 6, 11, 13 – ISB Academic Team
CHAPTER 6,11,13
SAMPLING, SURVEY, AND FIELDWORK
Population: The aggregate of all the elements, sharing some common set of
characteristics, that comprises the universe for the purpose of the marketing
research problem.
o Census: involves a complete enumeration of the elements of a population.
o Sample: a subgroup of the population selected for participation in the study.
Sample characteristics (sample statistics) are used to make
inferences (estimation procedures and tests of hypotheses) about
the population parameters.
24
Chapter 6, 11, 13 – ISB Academic Team
Sample Census
25
Chapter 6, 11, 13 – ISB Academic Team
26
Chapter 6, 11, 13 – ISB Academic Team
27
Chapter 6, 11, 13 – ISB Academic Team
28
Chapter 6, 11, 13 – ISB Academic Team
29
Chapter 6, 11, 13 – ISB Academic Team
A sample of clusters is
Subpopulations All strata are included chosen
Strengths Weaknesses
30
Chapter 6, 11, 13 – ISB Academic Team
Difficult to construct
Simple sampling frame,
random expensive, lower
Easily understood, precision, no assurance of
sampling results projectable
(SRS) representativeness
Can increase
representativeness, Can decrease
Probability Systematic easier to implement than representativeness if
Sampling Sampling SRS, sampling frame not there are cyclical
necessary patterns
Imprecise, difficult to
Cluster Easy to implement, compute and interpret
sampling cost-effective results
31
Chapter 6, 11, 13 – ISB Academic Team
2. SURVEY
Survey method: A structured questionnaire given to a sample of a population and
designed to elicit specific information from respondents.
o The questioning is structured - the degree of standardization imposed on
the data collection process.
Structured data collection: Use of a formal questionnaire that presents questions
in a prearranged order.
The structured-direct survey involves administering a questionnaire. In a typical
questionnaire, most questions are fixed-alternative questions.
o Fixed-alternative questions: Questions that require respondents to
choose from a set of predetermined answers.
Survey Methods Classification:
32
Chapter 6, 11, 13 – ISB Academic Team
33
Chapter 6, 11, 13 – ISB Academic Team
34
Chapter 6, 11, 13 – ISB Academic Team
Refusals:
o Result from the unwillingness or inability of people included in the sample
to participate.
o Result in lower response rates and increased potential for nonresponse
bias.
35
Chapter 6, 11, 13 – ISB Academic Team
3. FIELDWORK PROCESS
36
Chapter 6, 11, 13 – ISB Academic Team
37
Chapter 6, 11, 13 – ISB Academic Team
38
Chapter 14 – ISB Academic Team
CHAPTER 14
DATA PREPARATION & RELIABILITY TEST
39
Chapter 14 – ISB Academic Team
Check skip pattern (check whether they read the skip pattern to answer the right
question)
o e.g. have you ever drunk beer? (If yes/no go to Q6/Q7, relatively).
o They answer yes but they go to Q7 -> wrong (maybe just randomly answer
so do not read the skip pattern) -> need to check (no skip pattern in ISB
questionnaires because it is hard for the beginners) -> delete the answer.
Little variance: spam 1 (strict research might count 2) and for all of the question.
o e.g. choose all 1 or 5 for all of the questions (Likert scale).
40
Chapter 14 – ISB Academic Team
2. Reliability Test
Definition:
o Reliability refers to the consistency of a test, often measured through the
internal consistency of a scale.
Internal consistency means to what extent various parts of the
scale truly capture the desired characteristics in a similar direction.
o Only constructs have a reliability test. If we don’t have constructs, we don’t
have a reliability test.
o Constructs: Variables that are measured by more than one question.
Example: Lecturer, Subject Complexity, Subjective Knowledge are
constructs (measured by 1+ question in the survey).
Purpose: make sure that your groups of questions measure the same constructs =
how “focus’ your questions are.
Technique: run a Reliability Test for each construct using Cronback’s Alpha.
o Cronback’s Alpha should be between 0 and 1.
Cronback’s Alpha = 0 -> questions are not any focus.
Cronback’s Alpha = 1 -> questions are totally focus.
Best Cronback’s Alpha ranges from 0.5 to 0.95 (too close to mean
the questions are “too close”).
o If Cronbach’s alpha is < 0.5 -> the questions are not focus enough.
If Cronbach’s Alpha is > 0.95 -> it likes asking 1 question 3 times, it
is not asking 3 questions about the same thing.
41
Chapter 14 – ISB Academic Team
42
Chapter 14 – ISB Academic Team
43
Chapter 14 – ISB Academic Team
o Example:
The Cronbach’s Alpha of the construct is .776, which is between 0.5
and 0.95, we can continue.
Running for Perceived Brand Quality: Cronbach’s alpha if PBQ4
is deleted is bigger than the Cronbach’s Alpha in Reliability Statistic
(.813 > .776), so we deleted the PBQ4 in the data set. If you delete
any other question, then the reliability will result in lower Cronbach
Alpha.
Notes: DO NOT delete any question of construct which just has 2
questions (if do so, only 1 question remain for the construct =>
cannot test reliability)
Do the same for other constructs.
44
Chapter 14 – ISB Academic Team
Running for Brand Image: deleting any question would lower the
Cronbach’s Alpha of Reliability Statistics
(.705>.669>.637>.634>.631), so we are keeping the same questions.
Check again without the deleted one to see whether there is any
better Cronbach Alpha.
45
Chapter 14 – ISB Academic Team
46
Chapter 19 – ISB Academic Team
1. Definition:
Factor analysis (or exploratory factor analysis - EFA) is a class of procedures
primarily used for data reduction and summarization. Factor analysis is an
interdependence technique in that an entire set of interdependent relationships
is examined.
o In marketing research, there may be a large number of variables, most of
which are correlated and which must be reduced to a manageable level.
o Relationships among sets of many interrelated variables are examined and
represented in terms of a few underlying factors.
Purpose: Factor analysis is used in the following circumstances.
o To identify underlying dimensions, or factors, that explain the correlations
among a set of variables.
o To reduce a large number of correlated variables into a smaller and
uncorrelated set of variables for subsequent multivariate analysis
o Example: How do you reduce your data from 30+ variables into a smaller
and more useful set of variables?
Applications in Marketing:
o Market Segmentation: identifying the underlying variables on which to
group the customers.
o Product Research: to determine the brand attributes that influence
consumer choice.
o Advertising Studies: used to understand the media consumption habits of
the target market.
o Pricing Studies: used to identify the characteristics of price- sensitive
consumers.
47
Chapter 19 – ISB Academic Team
48
Chapter 19 – ISB Academic Team
49
Chapter 19 – ISB Academic Team
50
Chapter 19 – ISB Academic Team
51
Chapter 19 – ISB Academic Team
52
Chapter 19 – ISB Academic Team
53
Chapter 19 – ISB Academic Team
54
Chapter 19 – ISB Academic Team
55
Chapter 19 – ISB Academic Team
56
Chapter 19 – ISB Academic Team
57
Chapter 19 – ISB Academic Team
58
Chapter 19 – ISB Academic Team
59
Chapter 19 – ISB Academic Team
60
Chapter 15 – ISB Academic Team
CHAPTER 15
FREQUENCY DISTRIBUTION, CROSS-TABULATION & HYPOTHESIS
TESTING
1. Frequency distribution:
61
Chapter 15 – ISB Academic Team
62
Chapter 15 – ISB Academic Team
2. Cross-Tabulations:
Definition:
o A statistical technique that describes two or more variables
simultaneously and results in tables that reflect the joint distribution of
two or more variables that have a limited number of categories or distinct
values.
o Purpose: To describe the relationship between two categorical variables.
o Contingency table: a cross-tabulation table. It contains a cell for every
combination of categories of the two variables.
o Bivariate Cross-Tabulation: Cross-tabulation with two variables.
o Example: People with different genders have different jobs, is that right?
SPSS Running:
o Step 1: Analyze -> Descriptive Statistics -> CrossTabs
o Step 2: Put your variables into Row(s) and Column(s) - Age into Row,
Occupation into Column
o Step 3: For Statistics, check Chi-square
o Step 4: Click Ok
63
Chapter 15 – ISB Academic Team
o Cross Tabulation Table: There are 8 participants who are under 18 and
none of them are employed, while there are 124 students and 2 employees
in the 18-25 age group.
This is not an important table, it only shows the samples statistics,
not the relationship.
o Chi-square Tests: We look at the sig (2-sided) of the Pearson Chi- square
If the Sig. is near 0.00 (<5%) -> There is a relationship between 2
variables -> The 18-25 age group is more independent than the
under 18 group.
3. Sample T-test:
A univariate hypothesis test using the t distribution, which is used when the
standard deviation is unknown and the sample size is small.
One sample: A hypothesis test using the t-test or the z-test to test about a single
variable against a known or given standard.
T-test: testing whether the population mean conforms to a given hypothesis (𝐻0).
Independent Samples T-test (for metric):
o Independent sample: The measurement of one sample has no effect on the
values of the second sample.
o Purpose: to check whether the sample mean and hypothesis mean is
statistically different.
o In a one-sample t-test, the variables we tested will be compared with the
known average values based on the hypothesis.
o Example: Does your female have a higher purchase intention than male?
o SPSS Running:
Step 1: Analyze -> Compare Means -> Independent Samples T-test
Step 2: For Test Variables, pick the factor that we want to test
(numeric/metric).
Step 3: For Grouping Variables, pick the factor of the group that we
are comparing (nominal/non-metric).
Step 4: In Defined Groups, define the group 1 and group 2 (1 for
male and 0 for female).
Step 5: Click Ok
64
Chapter 15 – ISB Academic Team
o SPSS Results:
Test Variables: Purchase Intention
Grouping Variables: Gender (1 for male, 0 for female)
65
Chapter 15 – ISB Academic Team
66
Chapter 15 – ISB Academic Team
o SPSS Results:
Paired Variables: we pick Perceived Price for Variable 1 and
Subjective Knowledge of Product for Variable 2.
67
Chapter 16 – ISB Academic Team
CHAPTER 16:
ANALYSIS OF VARIANCE & COVARIANCE
68
Chapter 16 – ISB Academic Team
69
Chapter 16 – ISB Academic Team
70
Chapter 16 – ISB Academic Team
o Two-way ANOVA:
How to run SPSS:
Step 1: Analyze → General Linear Model → Univariate
Step 2: Input the dependent variable and fixed factors (non-
metric)
Step 3: Click OK
71
Chapter 16 – ISB Academic Team
72
Chapter 16 – ISB Academic Team
73
Chapter 16 – ISB Academic Team
74
Chapter 16 – ISB Academic Team
75
Chapter 16 – ISB Academic Team
76
Chapter 17 – ISB Academic Team
CHAPTER 17
CORRELATION AND REGRESSION
1. Correlation Analysis
Definition:
o The product moment correlation coefficient, r, measures the linear
association between two metric (interval or ratio scaled) variables.
Its square, r2, measures the proportion of variation in one variable
explained by the other.
o The partial correlation coefficient measures the association between two
variables after controlling, or adjusting for, the effects of one or more
additional variables.
o The order of a partial correlation indicates how many variables are being
adjusted or controlled. Partial correlations can be very helpful for
detecting spurious relationships.
o A nonmetric correlation measure for two nonmetric variables that relies
on rankings to compute the correlation.
77
Chapter 17 – ISB Academic Team
SPSS Technique:
o Correlation Test: Test the relationships between 2 metric variables.
o Correlation (r): the degree to which a pair of variables is linearly related.
Range: -1 < r < 1
High r = a strong relationship between the two variables
Low r = the variables are weakly related
Special Values
r = 0 => no relationship between the variables
r = 1 => positive correlation (2 variables move in the same
direction)
r = -1 => negative correlation (2 variables move in the
opposite direction)
o How to run SPSS:
Step 1: Select Analyze → Correlate → Bivariate
Step 2: Put all of the metric variables in (both DVs and IVs)
Step 3: Make sure that you check Pearson (in Correlation
Coefficients), Two-tailed and Flag significant correlation
Step 4: Click Ok
o SPSS Results
78
Chapter 17 – ISB Academic Team
o Interpretation:
The diagonal of the table is 1 ← Correlation between 1 variable
and itself is always 1 (PerBrandQuality - PerBrandQuality).
The diagonal is also a “mirror” → the above and below triangle is
symmetric.
The Sig. here have no meaning (whether < 5% or not).
Compare the correlation of all IVs to each other
Look for correlations that is higher than 0.7 → IVs are too
near (multicollinearity). It’s not the case here, but if you
have it, go back to Factor Analysis.
Compare the correlation of all IVs to DV:
PerBrandQuality has the best correlation (.625 is the
highest) → However, we cannot conclude anything since
correlation matrix do not account the interacting power
between IVs (“fake relationships”).
Things are fine even if 1 IV is too close to the DVs.
2. Simple Regression
Definition:
o Correlation Test: Test the relationships between 2 metric variables.
o Regression analysis is a powerful and flexible procedure for analyzing
associative relationships between a metric dependent variable and one or
more independent variables.
o Purposes:
Determine how much of the variation in the dependent variable can
be explained by the independent variables: strength of the
relationship.
Determine the structure or form of the relationship: the
mathematical equation relating the independent and dependent
variables.
Predict the values of the dependent variable.
Control for other independent variables when evaluating the
contributions of a specific variable or set of variables.
o Bivariate regression derives a mathematical equation between a single
metric criterion variable and a single metric predictor variable. The
equation is derived in the form of a straight line by using the least-
squares procedure.
o When the regression is run on standardized data, the intercept assumes a
value of 0, and the regression coefficients are called beta weights.
79
Chapter 17 – ISB Academic Team
80
Chapter 17 – ISB Academic Team
SPSS Technique:
o Regression Test: test the relationship between independent variable X
(metric) and dependent variable Y (metric).
o Purpose: Used to predict the value of a variable based on the value of
another variable.
o Function: D: Y = aX + b + e
SPSS Running:
o Step 1: Analyze → Regression → Linear
o Step 2: Put the IV into Dependent, the DVs into the Independent box
o Step 3: Click Ok
SPSS Results:
81
Chapter 17 – ISB Academic Team
3. Multiple Regression
Definition:
o Multiple regression involves a single dependent variable and two or more
independent variables.
o The partial regression coefficient, b1, represents the expected change in
Y when X1 is changed by one unit and X2 through Xk are held constant.
o The strength of association is measured by the coefficient of multiple
determination, R2.
o The significance of the overall regression equation may be tested by the
overall F test.
o Individual partial regression coefficients may be tested for significance
using the t test or the incremental F test.
SPSS Technique:
o Regression Test for 1 IV and 1 DV:
Purpose: used when we want to predict the value of a variable
based on the value of another variable.
Function: D: Y = aX + b + e
Example: Propose that SocialInfluence lead to PurIntention -> Test
that (single relationship).
SPSS Running:
Analyze -> Regression -> Linear
Put the IV into Dependent, the DVs into the Independent box
Click Ok
SPSS Results:
82
Chapter 17 – ISB Academic Team
83
Chapter 17 – ISB Academic Team
SPSS Running:
Step 1: Analyze → Regression → Linear
Step 2: Put the new variable (BrandImage) into the
Independent box
Step 3: Click Ok
SPSS Results:
Interpretation:
If all Sig. are lower than 5% → both factors really lead to
Purchase Intention → Which one is stronger?
B is unstandardized but Beta is already standardized
(both represent the slope) → Look at the Beta in multi
regression and B in single regression
Model Summary: Adjusted R Square is .163 → higher than
single regression (.063) → reduce the error
84
Chapter 18 – ISB Academic Team
CHAPTER 18
DISCRIMINANT AND LOGIT ANALYSIS
1. Discriminant
Definition
o A technique for analysing marketing research data when the dependent
variable is non metric and independent variables are metric.
o The purpose of Discriminant analysis is to test the effect on the metric (IVs)
leading to 2 or 3 levels of non - metric variable (DV).
Examples
o If subscribing intention DV only has 2 levels: low and high, and attitude is
the most important among the others, is that right?
o Is Brand Image the most important factor affecting the 2-level purchase
intention?
SPSS Running and interpretation
o 2- groups discriminant
Example: Is Brand Image the most important factor affecting the 2-
level purchase intention? (2 levels are high and low)
How to run SPSS
Step 1: Transform dependent variables into independent
variables that have 2 levels
Transform → Recode into Different variables → Purchase
Intention
Name the new variable PI2lev
For the Old -> New box:
o Tick Range: type in 1 through 3 → New value: type
in Value: 1
o Type in 3 through 5 → New value: type in Value: 2
85
Chapter 18 – ISB Academic Team
86
Chapter 18 – ISB Academic Team
SPSS Interpretation
Eigenvalues table
We have 1 function because we have 2 groups (1 function to
categorize the results into 2 groups).
Eigenvalue= xxx, function 1 explains 100% of variance in
dependent variable.
Canonical Correlation = 0.xx → (0.xx2) = 0.xx = xx% of
dependent variable explained by independent variable.
Example: Canonical Correlation =0.551 → (0.303)2= 0.674
→30.3% of dependent variable explained by independent
variables.
Wilk's Lambda
Sig < 5% --> It's good. It means that SPSS can create the
discriminant line.
Sig > 5% --> Groups are mixed together too much --> SPSS
cannot create the Discriminant line (maybe can but it's not
effective).
87
Chapter 18 – ISB Academic Team
Classification results
88
Chapter 18 – ISB Academic Team
Example:
o Number of people predicted in group 1 and actually
in group 1: 40
o Number of people predicted in group 2 and actually
in group 2: 65
—> Correct discrimination ratio = (40+65)/
(55+94) = 0.7 → 70% > 50% →it is good because
there are only 2 groups being classified.
o 3- groups discriminant
Example: Is Brand Image the most important factor affecting the 3-
level purchase intention?
How to run SPSS:
Step 1: Transform dependent variables into independent
variables that have 2 levels
Transform → Recode into Different variables → Purchase
Intention
Name: PI3lev
In the Old -> New table
o Tick range from 1 through 2.3 → New value: value:
1
o Tick range 2.3 to 3.7 → New value: value : 2
o Tick range 3.7 to 5 → New value: value : 3
89
Chapter 18 – ISB Academic Team
90
Chapter 18 – ISB Academic Team
SPSS Interpretation
Eigenvalues table
We have 2 functions because we have 3 levels so it needs to
have 2 discriminant lines.
We do not need canonical correlation because we have 2 R,
which are considered as 2 levels. DO NOT care about
canonical correlation because there are two R and we
cannot square them like 2-level discriminants.
Look at the % of variance column:
o Function 1 can explain xx% of variance.
o Function 2 can explain xx% of variance.
Example
o Function 1 can explain 91.2% of variance
o Function 2 can explain 8.8% of variance
91
Chapter 18 – ISB Academic Team
Wilk's Lambda
Sig. of Function 1 through 2 < 5% → We should use both 2
functions to explain the model,
Sig of Function 2 > 5% → Do not use only function 2 alone
to explain the model
If both Sig. < 5% → We can explain the model with 2
functions or with function 2 only. However, with function 1
only, we don’t know.
92
Chapter 18 – ISB Academic Team
Classification results
93
Chapter 18 – ISB Academic Team
94
Chapter 18 – ISB Academic Team
95
Chapter 18 – ISB Academic Team
96
Chapter 18 – ISB Academic Team
o SPSS Interpretation
97
Chapter 18 – ISB Academic Team
98
Chapter 20 – ISB Academic Team
1. Definition:
Cluster Analysis (classification analysis, or numerical taxonomy) is a class of
techniques used to classify objects or cases into relatively homogeneous groups
called clusters.
o Objects in each cluster tend to be similar to each other and dissimilar to
objects in the other clusters.
Cluster analysis examines an entire set of interdependent relationships, which
mean it makes no distinction between dependent and independent variables.
Purpose: to classify objects into relatively homogeneous groups based on the set of
variables considered.
Both cluster analysis and discriminant analysis are concerned with classification.
o In cluster analysis, there is no a priori information about the group or
cluster membership for any of the objects -> You don’t know who or what
belongs in which group. You often don’t even know the number of groups.
o In discriminant analysis, prior knowledge of the cluster or group
membership are required for each object or case included to develop the
classification rule -> You can derive a rule for classifying other cases
based on the available cases
SPSS has three different procedures that can be used to cluster data: hierarchical
cluster analysis, k-means cluster, and two-step cluster.
o A large data file, a mixture of continuous and categorical variables -> two-
step procedure
o A small data set and want to easily examine solutions with increasing
numbers of clusters -> hierarchical clustering
o You know how many clusters you want and you have a moderately sized
data set -> k-means clustering
Applications in Marketing:
o Segmenting the Market: For example, consumers may be clustered on the
basis of benefits sought from the purchase of a product. Each cluster would
consist of consumers who are relatively homogeneous in terms of the
benefits they seek
o Example: how many segments should we divide the market into and what
are the characteristics of those segments.
99
Chapter 20 – ISB Academic Team
100
Chapter 20 – ISB Academic Team
101
Chapter 20 – ISB Academic Team
102
Chapter 20 – ISB Academic Team
103
Chapter 20 – ISB Academic Team
104
Chapter 20 – ISB Academic Team
105
Chapter 20 – ISB Academic Team
106
Chapter 20 – ISB Academic Team
107
Chapter 20 – ISB Academic Team
108
Chapter 21 – ISB Academic Team
CHAPTER 21
MULTIDIMENSIONAL SCALING & CONJOINT ANALYSIS
1. Multidimensional Scaling
Internal
o Definition
A technique for positioning, which means making a spatial map -
the visual display of our brands and our competitor positions, with
the purpose of comparing and knowing about the positioning of
current brands on these dimensions.
The purpose of this technique is to determine direct and indirect
competitors and the strength of them.
Example:
Who are your direct and indirect competitors, what are the
strengths of you and your competitors?
109
Chapter 21 – ISB Academic Team
110
Chapter 21 – ISB Academic Team
111
Chapter 21 – ISB Academic Team
112
Chapter 21 – ISB Academic Team
External:
o Definition
The technique can see how far each brand to be the perfect, also
which brand is the nearest to the perfect brand compared to the
other.
Example:
Between you and your competitors, who is the nearest to be
the perfect competitor in this market?
o SPSS Running and Interpretation
Example: As your firm is Samsung, between you and your
competitors, who is the nearest to the perfect competitor in this
market?
How to run SPSS:
Step 1: On the left bottom of the Data view -> Variable View ->
Creating the new variable (named Perfect – PerfectUni,
PerfectCompany,,..) in Name column. After that, we give the
all 5 for the PerfectFirm variablec -> Run the
Multidimensional again.
Step 2: Analyze → Scale → Multidimensional Scaling
(PROXSCAL)
Step 3: Multidimensional Scaling Data: Data format →
Create proximities from data
Step 4: Put our firm, the competitors and the Perfect into
the variables box
Step 5: Check the following:
o Model: Interval
113
Chapter 21 – ISB Academic Team
SPSS Interpretation
Object Common Table: which point is nearest to the
Perfect point. —> external spatial map.
114
Chapter 21 – ISB Academic Team
2. Conjoint Analysis:
Definition
o Conjoint analysis attempts to determine the relative importance
consumers attach to salient attributes and the utilities they attach to the
levels of attributes
o Conjoint tests estimate which attribute is the most and the least important
that leads to respondents’ preference.
o Example:
Brainstorm for ideas of a new product with 3 attributes, 2 levels
each. Collect preferences of all members in your own group to test
the new product.
115
Chapter 21 – ISB Academic Team
116
Chapter 21 – ISB Academic Team
Step 1.2: After enter all attribute → create new data file →
file and save your file name
Step 1.3: Save and Click Oke
Step 1.4 -Display the Plancards:
o Open that data file → Orthogonal → display and
choose profile for subject → we will have a card list
like this (see the image below)
117
Chapter 21 – ISB Academic Team
118
Chapter 21 – ISB Academic Team
o SPSS Interpretation
119
Chapter 21 – ISB Academic Team
Warnings
No reveals occurred → good things as there are no mistakes
Utilities
Utility estimate indicates which option is the best for your
new product.
In each attribute, which level has the highest utility estimate
is the best for that attribute.
The highest Utility Estimate → the best choice for each
attribute.
Example: With the results shown in the table, we can
conclude that manufacturer should create smartphone with
the following characteristics:
o High price (utility estimate is .750)
o Medium size (utility estimate is .563)
o Foldable phone (utility estimate is .375)
Importance value
The highest value → the most important attribute that leads
to people preference.
The lowest value → the least important attribute that leads
to people preference.
Example: It is the price that the manufacturer needs to pay
most attention to as it has the highest importance value
score.
120
Summary Table - ISB Academic Team
121
Summary Table - ISB Academic Team
Correlations Table
If Pearson
correlation
Test the relationship Do your questions
between 2 IVs is >
9 Correlation between 2 metric All Metric face the problem of
0.7 ->
variables. multicollinearity?
multicollinearity ->
return to Factor
analysis
Coefficient Table
122
Summary Table - ISB Academic Team
Canonical
Correlation (only
for 2-level)
Wilk’s Lambda
Is Brand Image the
Test effect on the Metric >
Beta: which one most important
Discriminant metric (IVs) lead to 2 Nonmetric
11 has the biggest factor affecting the 2-
Analysis or 3 levels of non - (Intention, effect level purchase
metric variable (DV). Satisfaction)
No Sig. provided - intention?
> cannot
determine if the
relationship is true
Nagelkerke R-
square
Dendrogram
using Ward How many segments
Hierarchy Cluster
13 Segmentation . Linkage in the market should
Analysis
be divided into?
Coefficient
Final Cluster
Identify the Centers Table What is the
K Mean Cluster
14 characteristics of the characteristic of each
Analysis Number of cases in
clusters. segmentation?
each cluster
123
Summary Table - ISB Academic Team
124
Flowchart - ISB Academic Team
125
REFERENCES
Malhotra, N. K. (2009). Marketing research: An applied orientation (6th ed.). Pearson College
Division.
126