Professional Documents
Culture Documents
Introduction
CHAPTER 4
CHAPTER 6 AND 7
CHAPTER 8
PREDICTIVE ANALYTICAL MODELS
Linear function Logarithmic function Polynomial function Power function Exponential function
y=a+bx y=In(x) y=ax2+bx+c y=axb y=abx
Polynomial functions are
functions that have a
Linear functions show
quadratic, a cubic, a
steady increases or Exponential functions
Logarithmic functions quartic and other
decreases over the Power functions are come with a property
are used when the rate properties (all functions
range of x. defined by single where y increases or
of change in a variable plus, minus,
monomials (includes decreases at constantly
increases or decreases multiplication), taking
This is the simplest type number and variables increasing rate.
quickly. just non-negative integer
of function used in that are multiplied
power of x.
predictive models. together, e.g. 3xy) E.g. Continuosly
E.g. Richter scale used
where a≠0 and b>0 compounding interest
to measure earthquake E.g. Business people use
E.g. Demand function (PV and FV)
polynomials to see how
(price and quantity)
rising of a goods will
affect its sales
TYPES OF DATA
TIME SERIES
CROSS-SECTIONAL DATA
Our syllabus
For time
series data,
MODE L I NG use a line
R E L ATI ONSHI P S chart.
AND T R E NDS I N For cross-
DATA sectional
data, use a
Create scatter chart.
charts to
better
understand
data sets.
MODELING A PRICE-DEMAND FUNCTION
To obtain the best-fitted line, minimize the distance between the actual values and the predicted values
through Ordinary Least-Squares method (OLS).
Formula OLS_SLR:
y = ß0 +ß1x
Where;
y : Dependent variable
ß0 : Intercept (often labeled the constant) is the expected mean value of y when all x=0
ß1 : Slope represents the rate of change in y as x changes.
x : Independent variable
REGRESSION ANALYSIS
☺ 𝑦- 𝑦ො = 𝜀𝑖
☺ ☺ Observed error / Residuals
☺ ☺
TEXTBOOK PAGE: 70
☺
SIMPLE LINEAR REGRESSION
SIMPLE LINEAR REGRESSION
• Simple linear regression (SLR) is a statistical method that allows us to summarize and study
relationships between two continuous (quantitative) variables:
• One variable, denoted x, is regarded as the predictor, explanatory, or independent variable.
• The other variable, denoted y, is regarded as the response, outcome, or dependent variable.
$100,000.00
$100,000.00
Y
$90,000.00
Linear (Y)
$80,000.00
$70,000.00
$60,000.00
1,400 1,600 1,800 2,000 2,200 2,400 2,600
SQUARE FEET
FINDING THE BEST-FITTING
REGRESSION LINE
$80,000.00
• Market value = 32,673 + $35.036 (Square feet)
REGRESSION STATISTICS
REGRESSION STATISTICS
Analysis Details Interpretation
Adjusted R2 = 0.5231
o Will be beneficial when the present
Adjusted R-
Modified R2 model is compared with other models
Squared
that incorporate more explanatory
variables.
Standard error of the estimate is the
difference between the observed
(ACTUAL) and ESTIMATED values. SE
Standard Error will be small if the data is close to None
regression line. The SE will be big if the
data is dispersed widely from the
HOME MARKET VALUE REGRESSION
RESULTS
ANALYSIS OF VARIANCE
ANALYSIS OF VARIANCE
For coefficient:
If the house size increases by 1 square feet, the home value
increases by $35.0364.
For intercept:
If there is no change in house size, thus, the home value will
be $32,673.2199
HOME MARKET VALUE REGRESSION
RESULTS
• Residuals are the observed errors associated with estimating the value of the
dependent variable using the regression line:
𝜀𝑖 = 𝑦𝑖 − 𝑦ො𝑖
RESIDUAL ANALYSIS AND REGRESSION
ASSUMPTIONS
• Linearity
• linear trend in scatterplot
• no pattern in residual plot
CONTINUED…
25 25000
20000
20 15000
Residuals
Frequency
10000
15
5000
Frequency
10 0
-50001,300 1,500 1,700 1,900 2,100 2,300 2,500
5 -10000
-15000
0 Square Feet
-3 -2 -1 0 1 2 3 More
BIN
CONTINUED…
• Homoscedasticity
CONTINUED…
• Autocorrelation