Regression
Pearson correlation is a technique for describing and measuring the linear relationship
between two variables (e.g., SAT score and GPA).
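As a rough illustration of what "measuring the linear relationship" means in practice, the sketch below computes Pearson's r from the sums of deviations. The function name pearson_r and the SAT/GPA numbers are assumptions invented for this example; they are not data from these notes.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation between two equal-length lists of numbers."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of products of deviations: how X and Y vary together
    sp = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    # Sums of squared deviations: how X and Y vary separately
    ss_x = sum((x - mean_x) ** 2 for x in xs)
    ss_y = sum((y - mean_y) ** 2 for y in ys)
    return sp / math.sqrt(ss_x * ss_y)

# Hypothetical SAT scores and GPAs, made up for illustration only
sat = [1000, 1100, 1200, 1300, 1400]
gpa = [2.6, 3.0, 2.9, 3.4, 3.7]

r = pearson_r(sat, gpa)
print(round(r, 2))  # 0.95
```

A value of r close to +1 indicates a strong positive linear relationship; a value close to 0 indicates little or no linear relationship.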
Find the relationship between SAT & GPA
The regression line identifies the central tendency of the relationship.
The line is useful for making predictions.
The line establishes a precise relationship between each X value (SAT) and its
corresponding Y value (GPA).
Y = bx + a
Example: Y = 5x + 25
Y = Complete cost
b = 5 = Slope
a = 25 = Fixed amount
Y = bx + a
b determines how much the "Y" variable changes when "X"
increases by 1 point.
According to the equation, even if you do not play any tennis (x = 0), you
still have to pay the fixed amount of $25.
Y = 5x + 25
Y = 5(10) + 25
Y = 50 + 25
Y = $75
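The arithmetic above can be checked with a short sketch. The function name predict_cost is an assumption made for this example; the slope 5 and fixed amount 25 come from the tennis equation in these notes.

```python
def predict_cost(x, b=5, a=25):
    """Return Y = b*x + a for the tennis example (b = slope, a = fixed amount)."""
    return b * x + a

print(predict_cost(0))   # 25  (fixed amount, paid even with no tennis)
print(predict_cost(10))  # 75  (5 * 10 + 25)
# Raising x by 1 point changes Y by exactly the slope b
print(predict_cost(11) - predict_cost(10))  # 5
```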
For every X value in the data the linear equation will determine a Y value on the line.
This value is the predicted Y and is called Ŷ (Y hat).
The distance between this predicted value and the actual Y value in the data is
given by:
Distance = Y - Ŷ
Here we are simply measuring the vertical distance between the actual data point (Y) and
the predicted point on the line. The distance measures the error between the line and the
actual data.
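A minimal sketch of the distance calculation follows, reusing the line Y = 5x + 25 from the tennis example. The observed (X, Y) pairs are hypothetical values invented for illustration, and the names predict and distances are assumptions, not part of the notes.

```python
def predict(x, b=5, a=25):
    """Predicted value Y-hat on the line Y = b*x + a."""
    return b * x + a

# Hypothetical observed (X, Y) data points, made up for illustration only
data = [(2, 38), (4, 45), (6, 52), (10, 80)]

# Distance = Y - Y-hat: vertical distance from each data point to the line
distances = [y - predict(x) for x, y in data]
print(distances)  # [3, 0, -3, 5]
```

A positive distance means the actual point lies above the line, a negative distance means it lies below, and zero means the point falls exactly on the line.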