Professional Documents
Culture Documents
001 MBA Business Statistics Week 14 Parametric and Non Parametric Tests 17-06-2023 F1 SV
001 MBA Business Statistics Week 14 Parametric and Non Parametric Tests 17-06-2023 F1 SV
Programme
Faculty of Commerce and Management
University of Kelaniya - 2023
Step-01
Choosing the Appropriate Statistical Measures/Tests
Step-02
Performing Statistical Tests in Statistical Software;
SPSS, E-views, Minitab, SAS R, Stata…etc.
Step-03
Making Conclusion
How to Choose the
Right Statistical Test/s?
4) Select Case,
5) Cross-tabulation/Custom Table,
Questionnaire Designing in SPSS
Variable View
Covid-19 Pandemic Situation and Online Education of Undergraduates
17
Covid-19 Pandemic Situation and Online Education of Undergraduates
18
Covid-19 Pandemic Situation and Online Education of
Undergraduates
16. Available Internet Connection/s
Available Internet Connection Yes(1) No(0)- Quality of the
(a) Connection*(b)
1.SLT
2.Dialog
3.Mobitel
4.Hutch
5.Airtel
Other
Specify…………………………
…
Bursary
Other Specify
……………………………………………
……………………………………………
…………
20
Covid-19 Pandemic Situation and Online Education of
Undergraduates
18.Impact of Covid-19 Pandemic situation for your monthly family
income
Monthly Income(Rs.)
Income Earner Before Covid-19 After Covid-19
Father 70000 20 000
Mother
Other
Specify…………………………..
Other
Specify…………………………..
21
Covid-19 Pandemic Situation and Online Education of
Undergraduates
19. Provide your answers based on the previous semester Online
Education experience
Description 1 2 3 4 5
(a) Overall, how satisfied were you with
the course?
(b) How satisfied were you with the
content of the course?
(c) How satisfied were you with the ability
to navigate through the course?
Normality
Randomness
Absence of Outliers
Homogeneity of Variances
Independence of Observations, and
Linearity
Appropriate Statistical Test
+
Interval or ratio level data with a
Non-normal Distribution
Normal Distribution
Testing Key Parametric Assumptions;
Distribution of the Data?
Normal Distribution
Or
Non-Normal Distribution
Normality Test
Graphical Method
Statistical Tests
Graphical Method
Normal Skewed /Non-Normal
Distribution Distribution
Statistical Tests
K-S Test,
Shapiro Tests
Jarque-Bera Test
Anderson-Darling
test …etc.
How to test the Normality
38
How to test the Normality
Hypothesis test for a test of normality
Null hypothesis: The data is normally distributed
2. If 0.01 < P-value < 0.05 reject H0, So the distribution is NOT normal at
0.05 level of significance.
3. If the P-value > 0.05 No evidence to reject the Ho, So the distribution
is normally distributed.
❖It's important to note that failing to reject the null hypothesis does
not prove it to be true; it simply means that there is not enough
evidence to support the alternative hypothesis.
43
Testing Normality with SPSS
Skewness
Skewness is a measure of the asymmetry of the probability distribution of a
random variable about its mean. In other words, skewness tells you the
amount and direction of skew (departure from horizontal symmetry).
If skewness is 0, the data are perfectly symmetrical, although it is quite
unlikely for real-world data.
As a general rule of thumb:
❖If skewness is less than -1 or greater than 1, the distribution is highly
skewed.
❖If skewness is between -1 and -0.5 or between 0.5 and 1, the distribution is
moderately skewed.
❖If skewness is between -0.5 and 0.5, the distribution is
approximately symmetric.
Treatments for Non-normal Data
▪ Parametric statistics provide a correct picture only when
the metric data is normally distributed.
▪ It's best if you collect some data, check the normality, and
decide on a transformation before you run your actual
experiment/statistical test.
Data Transformations for Normality
The parametric statistical methods assume that the variable of interest
(dependent variable) should be normally distributed. The non-normal
data can be transformed to attain normality using the box-cox
transformation or power transformation, where the central limit theorem
cannot justify the small sample size.
Following transformations suitable in different situations can be
used to convert the skewed data to the normal distribution.
➢f (x) = log(x); use it if x is positively skewed.
➢f (x) = x2; use it if the distribution of x is negatively skewed.
➢f (x) = √x; use it if x has a Poisson distribution (x =0, 1, 2, …).
➢f (x) = 1/X ; use it if the variance of x is proportional to the fourth power of
E(x).
4. Nature of the
Samples; Independent or
Related Sample
Two common types of sample nature in
business research are independent samples
and related samples.
Independent Sample
▪ Independent samples refer to cases where the
observations in one group or sample are completely
unrelated or independent of the observations in
another group or sample.
In this case, the samples are independent because the customers in Sample
1 (Company A) are unrelated and independent of the customers in Sample 2
(Company B).
Assessing normality:
• Example: Use a Q-Q plot or perform a normality test (e.g., Shapiro-Wilk
test) to assess if the annual revenue variable is normally distributed. This is
important if you plan to use statistical tests that assume normality, such as t-
tests or ANOVA.
Correlation analysis:
• Example: Investigate the relationship between the annual revenue and
another continuous variable, such as advertising expenditure. Calculate
the correlation coefficient (e.g., Pearson's correlation) to quantify the
strength and direction of the relationship. Visualize the relationship using a
scatter plot.
The Pathway to the Analysis of Continuous
(Interval/Ratio) Variables
Hypothesis testing:
• Example: Formulate a hypothesis about the relationship between employee
satisfaction (continuous variable) and employee turnover rate. Conduct a t-test
or ANOVA to compare the mean employee satisfaction scores across different
turnover rate groups (e.g., low turnover vs. high turnover).
Regression analysis:
• Example: Perform a multiple linear regression analysis to examine how
variables like advertising expenditure, product price, and market size (all
continuous variables) affect the annual revenue. Evaluate the significance of
the regression coefficients and interpret their impact on the dependent
variable (revenue).
The Pathway to the Analysis of Continuous
(Interval/Ratio) Variables
Multivariate analysis:
• Example: Use factor analysis to identify underlying dimensions or factors that
influence customer satisfaction, based on multiple continuous variables like
product quality, customer service rating, and pricing perception.
Model diagnostics and assumptions:
• Example: Check the assumptions of regression analysis, such as linearity,
independence of errors, and homoscedasticity. Evaluate the presence of
influential observations or multicollinearity, which could affect the
interpretation of the results.
Interpretation and reporting:
• Example: Interpret the results of your analysis, explaining how variables like
advertising expenditure, employee satisfaction, or pricing perception impact
business outcomes such as revenue or customer satisfaction. Provide clear
and concise explanations, supported by relevant statistical measures and
visualizations.
Parametric Statistical Tests
Parametric Assumptions to Practical Hypotheses SPSS Procedure
Test be Satisfied Example Formation
Karl Linearity, Assessing the H0: Analyze -> Correlate
Pearson's independence of relationship There is no correlation -> Bivariate
observations, between between the two variables. Correlations
Correlation normality of variables employee job H1:
Tests satisfaction There is a correlation
scores and their between the two variables
performance
ratings
Self-Evaluation Questions-07
A manufacturing company notices an increase in the number of errors
made by its employees. To investigate the reasons for errors, 10
employees were selected and reported their level of stress, skills and
number of errors made by them as follows.
Employee Stress Scores(out of Skill Scores-X2 No. of
Name 1000)-X1 Errors-Y
A 168 53 5
B 186 18 7
C 125 75 3
D 170 45 7
E 132 70 4
F 130 81 3
G 190 48 7
H 100 88 3
I 175 55 6
J 180 41 7
P- Value Criteria – Decision Rule
1.If P-value < 0.01 reject H0, So there is a significant mean difference
among Groups and It is statistically significant at 0.01.
2. If 0.01 < P-value < 0.05 reject H0, So there is a significant mean
difference among Groups and It is statistically significant at 0.05.
SPSS Procedure:
Analyze -> Compare Means -> Independent Samples t-test
The independent samples t-test
Title
"The Effect of Corporate Governance on Firm Performance"
Research Objectives:
❖To examine the relationship between corporate governance practices and firm
performance.
❖To determine if firms with stronger corporate governance structures exhibit better
financial performance compared to firms with weaker governance structures.
Research Problem & Questions: Example
Research Problem:
❖ The research aims to explore whether corporate governance practices have an
impact on firm performance.
Research Questions:
❖Is there a significant difference in financial performance between firms with
strong corporate governance structures and firms with weak governance
structures?
❖Do firms with stronger corporate governance structures exhibit better financial
performance compared to firms with weaker governance structures?
Research Hypotheses
SPSS Procedure:
Analyze -> Compare Means -> One-Way ANOVA
ANOVA Test
❖The analysis of variance (ANOVA) test is a statistical method used to
determine whether there are any significant differences between the
means of two or more groups.
2. If 0.01 < P-value < 0.05 reject H0, So there is a significant mean
difference among Groups and It is statistically significant at 0.05.
SPSS Procedure:
Analyze -> Compare Means -> Paired Samples t-test
Paired Samples t-test
❖The paired samples t-test is a statistical test used to determine whether there
is a significant difference between the means of two related variables
in a sample.
▪ Step 1: Define the null hypothesis (H0) and the alternative hypothesis
(Ha)
▪ Step 2: Calculate the difference between the Pre and post-pandemic
financial performance for each company.
▪ Step 3: Calculate the mean (M) and standard deviation (SD) of the
differences
Steps to perform the paired samples t-test
▪ Step 4: Use the paired samples t-test formula to calculate the t-value:
t = (M - μ) / (SD / √n)
Where μ is the hypothesized mean difference (usually 0 for a
paired t-test), n is the number of paired samples (25 in this case),
M is the mean of the differences, and SD is the standard of the
differences.
▪ Step 5: Determine the degrees of freedom (df) for the t-distribution,
which is equal to (n - 1).
▪ Step 6: Determine the critical t-value or p-value based on your desired
level of significance (e.g., 0.05).
Steps to Perform the Paired Samples t-test
▪ If the calculated t-value is less than the critical t-value or the p-value is greater than
the chosen significance level, fail to reject the null hypothesis (H0) and conclude
that there is no significant difference between the pre and post-pandemic
financial performance.
P- Value Criteria – Decision Rule
1.If P-value < 0.01 reject H0, So there is a significant mean difference
between Two pairs( Pre and Post) and It is statistically significant at
0.01.
2. If 0.01 < P-value < 0.05 reject H0, So there is a significant mean
difference between Two pairs( Pre and Post) and It is statistically
significant at 0.05.
❖It's important to note that failing to reject the null hypothesis does
not prove it to be true; it simply means that there is not enough
evidence to support the alternative hypothesis.
99
Parametric Statistical Tests
Recommended
Assumptions to be Assumption Testing Practical Hypotheses
Parametric Test Satisfied Methods Example Formation
H0:
Studying the There is no effect
impact of of the training
Shapiro-Wilk test,
Independence of different methods over
Anderson-Darling
observations, training
Repeated test, or visual time
normality of data, methods on
ANOVA inspection of Q-Q H1:
sphericity employee
plots; Mauchly's test
assumption performance There is an effect
for sphericity
over three-time of the training
points methods over
time
SPSS Procedure:
Analyze -> General Linear Model -> Repeated Measures
Parametric Statistical Tests
Recommended
Parametric Assumptions to Assumption Practical Hypotheses
Test be Satisfied Testing Methods Example Formation
Investigating the H0:
Independence of Shapiro-Wilk test,
combined effect There is no interaction
observations, Anderson-Darling of product quality effect between the two
Two-Way normality ofTests
data, test, or visual and customer independent variables
ANOVA equal variances, inspection of Q-Q service on
interaction effect plots; Levene's test or customer H1:
assumption Bartlett's test
satisfaction in There is an interaction
different regions effect
SPSS Procedure:
Analyze -> General Linear Model -> Univariate
Parametric Statistical Tests
Parametric Assumptions to be Practical Hypotheses SPSS
Test Satisfied Example Formation Procedure
▪ Normality: After running the regression analysis, examine the histogram or probability plot of the
residuals to assess normality visually. You can use SPSS's "Graphs" menu to create these plots.
Additionally, you can perform formal tests for normality such as the Shapiro-Wilk or Kolmogorov-
Smirnov tests using SPSS's "Explore" or "Nonparametric Tests" options.
Organizing the Data Sheet
Company Sustainability Report Company Value Profitability
Disclosure
Company A 0.75 100,000 0.10
Company B 0.60 80,000 0.08
Company C 0.85 120,000 0.12
Company D 0.70 90,000 0.09
Company E 0.80 110,000 0.11
Company F 0.65 85,000 0.07
Company G 0.70 95,000 0.09
Company H 0.75 105,000 0.10
Company I 0.90 130,000 0.13
Company J 0.80 115,000 0.11
Company K 0.70 95,000 0.09
Company L 0.75 105,000 0.10
Company M 0.85 125,000 0.12
Company N 0.75 100,000 0.10
Company O 0.70 90,000 0.09
Company P 0.80 110,000 0.11
Company Q 0.65 85,000 0.07
Company R 0.70 95,000 0.09
Company S 0.75 105,000 0.10
Company T 0.90 130,000 0.13
Research Hypotheses