Professional Documents
Culture Documents
• Hypothesis :
Alpha
• Hypothesis :
Beta
Statistics is the science and practice of developing human knowledge through the use of
empirical data expressed in quantitative form. It is based on statistical theory which is a branch
of applied mathematics. Within statistical theory, randomness and uncertainty are modelled by
probability theory. (Wikipedia Encyclopaedia)
What is statistics?
The collecting, summarizing, and analysing of data.
The term also refers to raw numbers, or “stats”, and to the summarization of data.
Example: Frequencies
Allows an examination of the relationship between variables; is there a relationship between
these variables? Are they positively or negatively related?
• In other words, if we select a value of .05, findings would be deemed statistically significant if they were
found to be .05 or less.
E.g. in a trial of new Drug X, the null hypothesis might be that the new Drug X is no better
than the current Drug Y.
• H0: there is no difference between Drug X and Drug Y.
• A Type 1 error would occur if we concluded that the two drugs
• produced different effects when there was no difference between them.
Type 2 error is If Drug X and
failing to detect Drug Y
Beta is the
an association You kept the produced
probability of
when one null hypothesis different
making a Type 2
exists, or failing when you effects, and it
error when
to reject the should not was concluded
testing a
null hypothesis have. that they
hypothesis.
when it is produce the
actually false. same effects.
The test is applied when you have two qualitative variables from a single
population.
It is used to determine whether there is a significant association between the two
variables.
For example, in an election survey, voters might be classified by gender (male or
female) and voting preference (BJP, Congress or AAP).
We could use a chi-square test for independence to determine whether gender is related
to voting preference
Voting Preferences
Row total
BJP Congress AAP
Male 200 150 50 400
Female 250 300 50 600
Column
450 450 100 1000
total
When to Use Chi-Square Test for Independence
• The test procedure described in this lesson is appropriate when the
following conditions are met:
• The sampling method is simple random sampling.
• Each population is at least 10 times as large as its respective
sample.
• The variables under study are each categorical.
• If sample data are displayed in a contingency table, the expected
frequency count for each cell of the table is at least 5.
Regression analysis is used when you want to predict a continuous dependent variable from a number
of independent variables.