Regression Analysis (Cases 1-3)
Muhammad Akram Naseem (iqra4ever@gmail.com) Presenter: Research Centre for Training and Development (RCTD)
3/10/2014
Model Building
Model
A model is a mathematical way of expressing a theory.
Types of Models
1. Exact Models (Mathematical Models)
2. Inexact Models (Statistical Models)
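The difference between the two model types can be sketched in a few lines of Python. This is only an illustration: the line Y = 2 + 3X, the normally distributed error, and the function names are assumptions made for the example, not part of the slides.

```python
import random

def exact_model(x, a=2.0, b=3.0):
    """Exact (mathematical) model: Y is fully determined by X."""
    return a + b * x

def statistical_model(x, a=2.0, b=3.0, noise_sd=1.0, rng=random):
    """Inexact (statistical) model: the same line plus a random error term."""
    return a + b * x + rng.gauss(0.0, noise_sd)

rng = random.Random(0)  # fixed seed so the run is repeatable
print(exact_model(4))                  # same value on every call
print(statistical_model(4, rng=rng))   # differs from draw to draw
print(statistical_model(4, rng=rng))
```

The exact model returns the same Y for a given X every time, while the statistical model scatters around that value because of the error term.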
Linearity Determination
Graphic Form
[Fig-1 and Fig-2: straight-line graphs of Y = a + bX]
Linearity Determination
Graphic Form
[Fig-3: graph of a quadratic curve]
Linearity Determination
Linear with respect to variables and parameters:
Y = α + βX + ε
Y = α + β1X1 + β2X2 + ... + βkXk + ε
9. The regression model is correctly specified; that is, there is no specification bias or error.
10. There is no perfect multicollinearity; that is, there is no perfect linear relationship among the explanatory variables.
3/10/2014 Unlock the Potential of Data Analysis 10
Regression Analysis
Regression: the dependence of one variable on one or more other variables.
Simple Regression: the dependence of one variable on a single variable.
Multiple Regression: the dependence of one variable on more than one variable.
Regression Analysis
Simple Regression
1. Blood pressure (Y) depends on age (X): Y = α + βX + ε
2. CGPA of students (Y) depends on study hours (X): Y = α + βX + ε
3. Production of a certain crop (Y) depends on the amount of fertilizer used (X): Y = α + βX + ε
Regression Analysis
Y = α + βX + ε
Y: dependent variable
α: Y-intercept
β: slope of the line (regression coefficient, or rate of change)
X: independent variable
ε: residual term
No.  Dependent variable (Y)       Independent variable (X)     Technique
1    Quantitative                 Quantitative                 Regression
2    Quantitative                 Categorical-Binary           Regression
3    Quantitative                 Categorical-Multi Category   Regression
4    Categorical-Binary           Quantitative                 Logistic Regression
5    Categorical-Binary           Categorical-Binary           Logistic Regression
6    Categorical-Binary           Categorical-Multi Category   Logistic Regression
7    Categorical-Multi Category   Quantitative                 Multinomial Logistic
8    Categorical-Multi Category   Categorical-Binary           Multinomial Logistic
9    Categorical-Multi Category   Categorical-Multi Category   Multinomial Logistic
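Read this way, the choice of technique depends only on the type of the dependent variable. A small Python sketch of that lookup follows; the function name and the type labels used as keys are hypothetical, chosen only for this illustration.

```python
def choose_technique(dependent_type: str) -> str:
    """Return the technique suggested for a given dependent-variable type."""
    techniques = {
        "quantitative": "Regression",
        "binary": "Logistic Regression",
        "multi-category": "Multinomial Logistic",
    }
    return techniques[dependent_type]

for y_type in ("quantitative", "binary", "multi-category"):
    print(y_type, "->", choose_technique(y_type))
```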
Regression Analysis
Purpose of Regression Analysis
1. To find out the rate of change
2. To estimate the dependent variable on the basis of the independent variable(s)
(b) < 0
(c) = 0
Least-squares estimates of the slope and intercept:
byx = (nΣxy - ΣxΣy) / (nΣx² - (Σx)²)
a = ȳ - byx·x̄
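A minimal Python sketch of the usual least-squares estimates, byx = (nΣxy - ΣxΣy) / (nΣx² - (Σx)²) and a = ȳ - byx·x̄. The function name and the five data points are made up for illustration (they lie exactly on y = 2 + 3x) and are not taken from the slides.

```python
def fit_simple_regression(x, y):
    """Return (a, b): least-squares intercept and slope of y on x."""
    n = len(x)
    sum_x = sum(x)
    sum_y = sum(y)
    sum_xy = sum(xi * yi for xi, yi in zip(x, y))
    sum_x2 = sum(xi * xi for xi in x)
    # Slope: (n*Sxy - Sx*Sy) / (n*Sxx - Sx^2)
    b = (n * sum_xy - sum_x * sum_y) / (n * sum_x2 - sum_x ** 2)
    # Intercept: mean of y minus slope times mean of x
    a = sum_y / n - b * sum_x / n
    return a, b

# Hypothetical data lying exactly on y = 2 + 3x
x = [1, 2, 3, 4, 5]
y = [5, 8, 11, 14, 17]
a, b = fit_simple_regression(x, y)
print("intercept:", a, "slope:", b)
```

Because the example points sit exactly on a line, the fit recovers the intercept 2 and slope 3; with real data the residuals would not all be zero.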
Scatter Diagram
[Figure: scatter plot of B.P (vertical axis, 110 to 150) against AGE (horizontal axis, 10 to 80)]
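1. Click on Analyze
2. Click on Regression, then Linear
3. Move the dependent variable (B.P) into the Dependent box and the independent variable (Age) into the Independent(s) box
4. Click on OK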
ANOVA
Model        Sum of Squares   df
Regression   1078.29          1
Residual     80.658           17
Total        1158.97          18
a. Predictors: (Constant), Age
b. Dependent Variable: B.P
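The mean squares, the F statistic, and R Square are not shown in this table, but they follow from the sums of squares and degrees of freedom by the standard ANOVA definitions. A short Python sketch, using only the figures above (the variable names are illustrative):

```python
# Figures copied from the ANOVA table for B.P on Age
ss_regression = 1078.29
ss_residual = 80.658
ss_total = 1158.97
df_regression = 1
df_residual = 17

# Mean square = sum of squares / degrees of freedom
ms_regression = ss_regression / df_regression
ms_residual = ss_residual / df_residual

# F = explained mean square / residual mean square
f_statistic = ms_regression / ms_residual

# R Square = share of total variation explained by the regression
r_squared = ss_regression / ss_total

print("F =", round(f_statistic, 2))
print("R Square =", round(r_squared, 3))
```

A large F together with an R Square close to 1 indicates that age accounts for most of the variation in blood pressure in this sample.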
Output tables
1. Summary table
2. ANOVA table
3. Coefficients table
             B         Std. Error   Beta    t        Sig.
(Constant)   112.216   1.401                80.097   0.00
Age          0.447     0.030        0.965   15.075   0.00
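These coefficients give the fitted line B.P = 112.216 + 0.447 Age. The Python sketch below uses it for a prediction; the function name and the age of 50 are hypothetical choices for illustration.

```python
INTERCEPT = 112.216   # B for (Constant)
SLOPE_AGE = 0.447     # B for Age: change in B.P per additional year of age

def predict_bp(age):
    """Predicted blood pressure from the fitted simple regression."""
    return INTERCEPT + SLOPE_AGE * age

print(round(predict_bp(50), 3))                    # hypothetical 50-year-old
print(round(predict_bp(60) - predict_bp(50), 2))   # effect of ten more years
```

The slope is the rate of change: each additional year of age is associated with an average rise of 0.447 in blood pressure.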
Practice: Case 1
An experiment was conducted to study the impact of heart rate (X) on anxiety (Y). The data relate to 12 normal adults and are given in the SPSS file. Estimate the model: Y = α + βX + ε
Multiple Regression
1. Saving of a household (Y) depends on monthly income (X1), size of family (X2), and so on:
Y = α + β1X1 + β2X2 + ... + βkXk + ε
2. CGPA of students (Y) depends on study hours (X1), IQ (X2), and so on:
Y = α + β1X1 + β2X2 + ... + βkXk + ε
3. Production of a certain crop (Y) depends on the amount of fertilizer used (X1), water (X2), and so on:
Y = α + β1X1 + β2X2 + ... + βkXk + ε
ANOVA
Model        Sum of Squares   df   F       Sig.
Regression   9.213            1    0.306   0.58
Residual     2947.547         98
Total        2956.760         99
a. Predictors: (Constant), gender
b. Dependent Variable: Marks
             B        Std. Error   Beta     t        Sig.
(Constant)   68.455   0.739                 92.569   0.00
gender       -0.610   1.102        -0.056   -0.553   0.581
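With a single predictor, the t statistic of the slope is B divided by its standard error, and its square equals the F statistic of the ANOVA table. The Python sketch below checks this with the reported figures; the 0.05 significance level is an assumed conventional threshold, not one stated in the slides.

```python
# Reported values for the gender coefficient
b_gender = -0.610
se_gender = 1.102
sig_gender = 0.581

t_gender = b_gender / se_gender   # t = B / Std. Error
f_from_t = t_gender ** 2          # equals F when there is one predictor

ALPHA = 0.05                      # assumed significance level
is_significant = sig_gender < ALPHA

print("t =", round(t_gender, 3))
print("t squared =", round(f_from_t, 3))
print("significant at 0.05:", is_significant)
```

Since Sig. is well above 0.05, the difference in average marks between the two gender groups is not statistically significant in this sample.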
Practice: Case 2
Use the Case 2.sav data file to determine the impact of gender on the salary of employees of an organization.
Dependent variable: Salary
Independent variable: Gender
Estimate the model: Y = α + βX + ε
ANOVA
Model        Sum of Squares   df
Regression   8.943E10         2
Residual     4.847E10         471
Total        1.379E11         473
             B            Std. Error   Beta     t         Sig.
(Constant)   63977.798    1106.872              57.801    0.00
D1           -36138.018   1228.281     -0.897   -29.422   0.00
D2           -33038.909   2244.280     -0.449   -14.721   0.00
Estimated model:
Salary = 63977 - 36138 D1 - 33038 D2
Average salaries from the estimated model (D1 = 1 for clerks, D2 = 1 for custodians, managers are the reference category):
Average salary of clerks: 63977 - 36138(1) - 33038(0) = 27839
Average salary of custodians: 63977 - 36138(0) - 33038(1) = 30939
Average salary of managers: 63977 - 36138(0) - 33038(0) = 63977
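The same arithmetic can be written as a small Python sketch. The dummy coding (D1 = 1 for clerks, D2 = 1 for custodians, both 0 for managers) is read off the calculations above; the dictionary and function names are illustrative.

```python
# Coefficients of the estimated model Salary = 63977 - 36138*D1 - 33038*D2
INTERCEPT = 63977
COEF_D1 = -36138
COEF_D2 = -33038

# Dummy codes (D1, D2) for each job category; managers are the reference group
DUMMY_CODES = {
    "Clerk": (1, 0),
    "Custodian": (0, 1),
    "Manager": (0, 0),
}

def average_salary(job):
    """Average salary implied by the estimated model for a job category."""
    d1, d2 = DUMMY_CODES[job]
    return INTERCEPT + COEF_D1 * d1 + COEF_D2 * d2

for job in DUMMY_CODES:
    print(job, average_salary(job))
```

The intercept is the average for the reference category, and each dummy coefficient is the difference between that category's average and the reference average.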
Multiple Regression
Y = α + β1X1 + β2X2 + ... + βkXk + ε
Y: dependent variable
α: intercept
β1, β2, ..., βk: regression coefficients
ε: residual term
Multiple Regression
To study the impact of age and weight on blood pressure, a random sample of 20 patients was collected and analyzed.
BP = α + β1 Age + β2 Weight + ε
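Outside SPSS, a model of this form can be fitted by ordinary least squares. The sketch below uses NumPy's `np.linalg.lstsq`. The six data points are hypothetical and generated exactly from BP = 10 + 0.5 Age + 1.0 Weight (made-up values); they are not the patients' data from the slides.

```python
import numpy as np

# Hypothetical illustrative data (not the sample of 20 patients)
age = np.array([45.0, 50.0, 55.0, 60.0, 65.0, 70.0])
weight = np.array([85.0, 92.0, 88.0, 95.0, 90.0, 97.0])
# Responses generated without error from made-up coefficients
bp = 10.0 + 0.5 * age + 1.0 * weight

# Design matrix: a column of ones for the intercept, then the two predictors
X = np.column_stack([np.ones_like(age), age, weight])

# Ordinary least squares: minimise the sum of squared residuals
coefficients, *_ = np.linalg.lstsq(X, bp, rcond=None)
alpha_hat, beta_age_hat, beta_weight_hat = coefficients

print("intercept:", round(float(alpha_hat), 4))
print("age coefficient:", round(float(beta_age_hat), 4))
print("weight coefficient:", round(float(beta_weight_hat), 4))
```

Because the example responses contain no error term, the fit recovers the generating coefficients; with real data the estimates would carry standard errors, as in the SPSS output.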
Multiple Regression
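1. Click on Analyze
2. Click on Regression, then Linear
3. Move the dependent variable (BP) into the Dependent box and the independent variables (Age and Weight) into the Independent(s) box
4. Click on OK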
Model Summary
R      R Square   Adjusted R Square   Std. Error of the Estimate
1.00   0.99       0.99                0.53

ANOVA
Model        df      Mean Square   F        Sig.
Regression   2.00    277.59        978.25   0.00
Residual     17.00   0.28
Total        19

Coefficients
         Standardized Coefficients (Beta)
Age      0.33
Weight   0.82
BP=-16.58+0.71Age+1.03Weight
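A short Python sketch of how the estimated equation is used. The patient values (age 50, weight 90) and the function name are hypothetical, chosen only to illustrate the calculation.

```python
def predict_bp(age, weight):
    """Predicted BP from the estimated model BP = -16.58 + 0.71*Age + 1.03*Weight."""
    return -16.58 + 0.71 * age + 1.03 * weight

base = predict_bp(age=50, weight=90)      # hypothetical patient
older = predict_bp(age=51, weight=90)     # one year older, same weight
heavier = predict_bp(age=50, weight=91)   # one kg heavier, same age

print("predicted BP:", round(base, 2))
print("effect of one more year:", round(older - base, 2))
print("effect of one more kg:", round(heavier - base, 2))
```

Each coefficient is the change in predicted BP for a one-unit increase in that variable while the other variable is held constant.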
Practice
The data given in the SPSS file were collected using a simple random sample of 20 hypertensive patients.
Y = mean arterial blood pressure (mmHg)
X1 = age (years)
X2 = weight (kg)
X3 = body surface area (sq m)
X4 = duration of hypertension (years)
X5 = basal pulse (beats/min)
X6 = measure of stress