Part II
Dr Agarana M.C.
Hypothesis Testing
What Is a Hypothesis?
UNIVARIATE Vs BIVARIATE
REGRESSION
• Regression provides the line that "best" fits
the data. This line can then be used to
examine how the response variable changes
as the predictor variable changes.
• Regression predicts the value of a response
variable (y) for any given value of the
predictor variable (x).
What is the origin of the word Regression?
• The word regression was first used by
Sir Francis Galton in 1877 in his study of
heredity. He found that the heights of the
descendants of tall parents tended to regress
(i.e. go back) towards the average height of
the population.
• He called the mathematical line that he
developed to explain the relationship between
the height of children and the height of their
parents the line of regression.
What are the types of Regressions?
• Linear Regression
• Logistic Regression
• Polynomial Regression
• Stepwise Regression
• Ridge Regression
• Lasso Regression
• ElasticNet Regression
Types of Linear Regression
$$a = \frac{\sum Y}{n} - b\,\frac{\sum X}{n}$$
• Where,
• X is a value of the independent variable
• Y is a value of the dependent variable
• n is the number of items in the sample
Solution to above Example
• Develop the regression equation:
• i) Find b.
• ii) Find a.
• iii) Write out Y = a + bX
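The three steps above can be sketched in Python. This is a minimal illustration, not the worked example from the slides: the slope uses the standard least squares formula b = [n(ΣXY) − (ΣX)(ΣY)] / [n(ΣX²) − (ΣX)²], which does not appear in the surviving text, and the function name `least_squares_line` and the data values are hypothetical.

```python
def least_squares_line(xs, ys):
    """Return (a, b) for the least squares line Y = a + bX."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("xs and ys must have the same length (at least 2)")

    sum_x = sum(xs)
    sum_y = sum(ys)
    sum_xy = sum(x * y for x, y in zip(xs, ys))
    sum_x2 = sum(x * x for x in xs)

    denominator = n * sum_x2 - sum_x ** 2
    if denominator == 0:
        raise ValueError("all X values are identical; the slope is undefined")

    # Step i: find b (standard least squares slope, assumed here).
    b = (n * sum_xy - sum_x * sum_y) / denominator
    # Step ii: find a = (sum of Y)/n - b * (sum of X)/n.
    a = sum_y / n - b * sum_x / n
    return a, b


# Hypothetical data, chosen so that the points lie exactly on Y = 1 + 2X.
xs = [1, 2, 3, 4, 5]
ys = [3, 5, 7, 9, 11]

a, b = least_squares_line(xs, ys)
# Step iii: write out Y = a + bX.
print(f"Y = {a:.2f} + {b:.2f}X")
```

Because the illustrative points were placed on a straight line, the fitted equation reproduces that line; real sample data would scatter around the fitted line instead.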
Drawing the line of regression
• How is the regression line placed on the
scatter diagram?
• The least squares equation Y = a + bX
is used to determine the least squares line of
regression to be drawn on the scatter diagram.
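One way to place the line on the scatter diagram is to evaluate Y = a + bX at the smallest and largest X values and join the two resulting points. The sketch below assumes a and b are already known; the helper name `line_endpoints` and the numbers passed to it are hypothetical.

```python
def line_endpoints(a, b, x_min, x_max):
    """Return two (X, Y) points on the line Y = a + bX."""
    return [(x_min, a + b * x_min), (x_max, a + b * x_max)]


# Hypothetical coefficients and X range, for illustration only.
points = line_endpoints(a=1.0, b=2.0, x_min=1, x_max=5)
for x, y in points:
    print(f"X = {x}, Y = {y}")
```

Joining the two printed points with a straight edge gives the line of regression over the observed range of X.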
Exercise
• Draw the line of regression for the above
example.
CHI-SQUARE GOODNESS-OF-FIT TEST
NONPARAMETRIC
• Nonparametric or distribution-free tests are
hypothesis tests concerned with nominal or
ordinal levels of measurement.
• The term distribution-free implies that these
tests are free of assumptions regarding the
distribution of the parent population.
• They are relatively easy to apply.
• Nominal-level data are the "lowest" type of
data. They can only be classified into
categories, such as APC, PDP, and "all others",
or male and female.
• The ordinal level of measurement assumes
that one category is ranked higher than the
next one.
CHI-SQUARE TEST
• The chi-square goodness-of-fit test is one of
the most commonly used nonparametric tests.
It is appropriate for both nominal and ordinal
levels of data.
• The purpose of the goodness-of-fit test is to
determine how well an observed set of data
fits an expected set of data.
• An example can best describe the hypothesis
testing situation.
• The test statistic follows the chi-square
distribution and is given as:
$$\chi^2 = \sum \frac{(f_o - f_e)^2}{f_e}$$
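The statistic can be computed directly from its definition, reading fo as the observed frequency and fe as the expected frequency in each category. The sketch below is illustrative only: the function name `chi_square_statistic` and the frequencies are hypothetical, and the final step of comparing the result with a critical value from the chi-square table is not shown.

```python
def chi_square_statistic(observed, expected):
    """Return the sum over categories of (fo - fe)^2 / fe."""
    if len(observed) != len(expected):
        raise ValueError("observed and expected must have the same length")
    if any(fe <= 0 for fe in expected):
        raise ValueError("every expected frequency must be positive")
    return sum((fo - fe) ** 2 / fe for fo, fe in zip(observed, expected))


# Hypothetical frequencies: 80 observations across 4 categories,
# with equal expected frequencies under the null hypothesis.
observed = [18, 22, 20, 20]
expected = [20, 20, 20, 20]

chi_square = chi_square_statistic(observed, expected)
print(f"chi-square = {chi_square:.2f}")
```

A small value indicates that the observed frequencies lie close to the expected ones; a large value suggests a poor fit.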