Professional Documents
Culture Documents
3-Mastering Data Analysis Tools
3-Mastering Data Analysis Tools
Qualitative Data
(Categorical Data)
Important
Center Variation Distribution
Points
Mean
Median Range
Median
Mode Inter Quartile Range Skewness
Quartiles
Geometric Mean Variance Kurtosis
Percentiles
Harmonic Mean Standard Deviation
Trimmed Mean
Tabular Methods
Graphical Methods
Histogram
250
200
150
Frequency
100
50
0
15750 35750 55750 75750 95750 115750
Numerical Methods
Practice Session for Descriptive Analysis
• Import Customer’s Databas.xls into SPSS
• Label data properly
• Make One-way tables for variables (Age, gender, OwnHome
and Married). Also make pie chart and bar chart for these
variables
• Make Two-way tables (gender by OwnHome and Married by
OwnHome). Also make clustered bar chart for each variable
• Produce Detailed Numerical descriptive statistics for
variable “Purchases” (Mean, Median, ………..). Also make
histogram and stem & leaf and box-plot for variable
“Purchases”
Inferential Analysis
Parametric & Non-Parametric Inference
Normality Normality
Normality Normality
+ +
+ +
Equal Un-Equal
Equal Un-Equal
Variances Variances
Variances Variances
Comparing One Group
• Kinds of Research Questions
For the one-sample situation, the prime concern in research is
examining a measure of central tendency (location) for the
population of interest. The best-known measures of location
are the mean and median. For a one-sample situation, we
might want to know if the average waiting time in a doctor's
office is greater than one hour, or if the average growth of
roses is 4 inches or more with a certain fertilizer, or is annual
return is 10.2% for the banks that exercised comprehensive
planning.
Comparing Two Groups
Kinds of Research Questions
Perhaps the simplest comparison that we can make is between the means
of the two populations.
Comparing more than two Groups
Kinds of Research Questions
The first question that arises concerns which aspects (parameters) of the
populations we should compare. We might consider comparing the means,
medians, standard deviations, distributional shapes (histograms), or
maximum values. We base the comparison of parameter on our particular
problem.
H 0 : m = m0 , H A : m �m0
X -m
t=
s2
n
With
df = n - 1
Case Study
A manufacturer of high-performance automobiles
produces disc brakes that must measure 322 millimeters
in diameter. Quality control manager randomly selects
128 discs and measures their diameters.
The terminology, sign test, reinforces the point that the data are
converted to a series of plus and minus signs. The test is based
on the number of plus signs that occur. Zero differences are
thrown out, and the sample size is reduced accordingly.
Assumptions of the Sign Test
H 0 : m%= m%
0
H 0 : md = 0, H A : md �0
X d - md
t=
2
s d
n
With
df = n - 1
Case Study
A researcher in behavioral medicine believes that stress often makes
asthma symptoms worse for people who suffer from this respiratory
disorder. Therefore, the researcher decides to study the effect of
relaxation training on the severity of their symptoms.
H 0 : m%
1 = m2
%
H1 : m%1 �m 2
%
w - mw
Z=
sw
n ( n + 1)
mw =
4
n ( n + 1) ( 2n + 1)
sw =
24
w = �R+
Case Study
An educationist wants to see the effectiveness of
new teaching method. For this She selected 600
students and record their scores in a test of 150
marks. The scores are recorded before and after the
new teaching method.
H 0 : m1 = m2 , H A : m1 �m2
X 1 - X 2 - ( m1 - m2 )
t=
( 1 ) 1 ( 2 ) 2 �1 + 1 �
n - 1 s 2
+ n - 1 s 2
� �
n1 + n2 - 2 �n1 n2 �
With
df = n1 + n2 - 2
Case Study
• An analyst at a department store wants to evaluate a
recent credit card promotion. To this end, 500
cardholders were randomly selected. Half received
an ad promoting a reduced interest rate on
purchases made over the next three months, and
half received a standard seasonal ad.
• We can use Independent-Samples T Test to compare
the spending of the two groups.
SPSS Analytic Procedure
Independent Samples t-test
Unequal Variances
H 0 : m1 = m2 , H A : m1 �m2
2
�2 2
�
X 1 - X 2 - ( m1 - m2 ) s
� + �
1 s 2
�n1 n2 �
t= With df = � 2 �
� 2
� 2 2 2
s s � � � �
2
�s 1 � �s 2 �
� + 2�
1
�n1 � �n2 �
�n1 n2 � � �+ � �
� � n1 - 1 n2 - 1
Case Study
• A researcher wishes to compare the
expenditure behavior of the students, one of
the research question is to see the difference
in expenditures by gender.
SPSS Analytic Procedure
Mann-Whitney Test
• Mann-Whitney Test is used to compare the
two independent groups on the basis of
medians. This test does not require the
assumption of normality.
Mann-Whitney U Test Assumptions
The variable of interest is continuous. The measurement scale
is at least ordinal.
u - mu
z=
su mu =
n1n2
2
n1n2 ( n1 + n2 + 1)
su =
12
n1 ( n1 + 1)
u = w-
2
MSG
F=
MSE
MSG is the Mean Square of Group and MSE is the Mean Square
Error
Example
• This is a hypothetical data file that concerns the
popularity of a TV channel. Using a prototype, the
marketing team has collected focus group data. One
of the question of interest is to see the difference in
popularity of the TV channel in different age groups.
• This hypothesis can be tested using One Way ANOVA.
SPSS Analytic Procedure
One-Way Analysis of Variance
Unequal Variances
1 k ni �
( X i. - X .. ) �
2
�
k - 1 i =1 s 2 � �
F= i
� 2 �
2 ( k - 2 ) k � ni / s i �
1+ 2 � 1- k
� �/ ( ni - 1)
k - 1 i =1 � 2
�
� �n i / s i �
� i =1 �
-1
� �
2
� �
With � k �
2
� �
�
df = 2
3
� �
1- n n i Si
/
�/ ( ni - 1) �
�k - 1 i =1 � 2� �
� � �n i / S i � �
� � i =1 � �
Case Study
• A sales manager evaluates two new training courses.
• Sixty employees, divided into three groups, all
receive standard training. In addition, group 2
receives technical training, and group 3 receives a
hands-on tutorial. Each employee was tested at the
end of the training course and their score recorded.
SPSS Analytic Procedure
Kruskal-Wallis Test
H 0 : m%
1 = m 2 = ...... = m k
% %
H A : At least one pair of median is significantly diffrent
k
12 Ri
H= � - 3 ( N + 1)
N ( N + 1) i =1 ni
Case Study
A health scientist wishes to compare the
survival experiences after breast cancer with
different Pathological Tumor Size (Categories).