0% found this document useful (0 votes)

53 views7 pages

Confidence and Prediction Intervals in Linear Models

Uploaded by

yusra fayyaz

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

53 views7 pages

Confidence and Prediction Intervals in Linear Models

Uploaded by

yusra fayyaz

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

2.

4 Conﬁdence intervals and prediction intervals

in Linear Models

Conﬁdence and prediction intervals

In this section, we will calculate the conﬁdence and prediction intervals for the Linear Gaussian
Models.

Conﬁdence vs Prediction Interval

Given a certain vector of predictors x∗ , we want to find a confidence interval for the conditional
mean x∗⊤ β and a prediction interval for a future unobserved observation Y ∗ = x∗⊤ β + ϵ∗ where
ϵ∗ is an error independent of ϵi, i = 1, ..., n, drawn from N (0, σ2 ). Below we give the statistical
distributions from which the confidence and prediction intervals are derived from.

Conﬁdence interval
First, note that Y ∗ is Gaussian. Its mean is

E(x∗⊤ β^) = x∗⊤ E[β

^ ] = x∗⊤ β

and its variance

Var(x∗⊤ β^) = Cov(x∗⊤ β^) = x∗⊤ Cov(β^)x∗ = σ2 x∗⊤ (X ⊤ X)−1 x∗

where X is the design matrix for the ﬁtted linear model.

So we have

x∗⊤ β^ ∼ N (x∗⊤ β, σ2 x∗⊤ (X ⊤ X)−1 x∗ )

x∗⊤ β^ − x∗⊤ β
∼ N (0, 1).
σ x∗⊤ (X ⊤ X)−1 x∗

^=
It can be shown that β(X ⊤ X)−1 X ⊤ y and y − y ^ = (I − H)y are independent (by showing
⊤ ⊤
that any row of (X X)−1 X is orthogonal to any row of I − H .

^ and (n − p)σ
It then follows that x∗⊤ β ^ 2 /σ2 has a χ2n−p
^ 2 /σ2 are independent. Since (n − p)σ
distribution, the quotient of the two has a Student-t distribution with (n − p) degrees of freedom:

(n−p)σ^2
x∗⊤ β^ − x∗⊤ β σ 2
/ ∼ tn−p
σ x∗⊤ (X ⊤ X)−1 x∗ n−p

x∗⊤ β^ − x∗⊤ β
∼ tn−p.
^
σ x∗⊤ (X ⊤ X)−1 x∗

Prediction intervals
^∗
Deﬁne Y = x∗⊤ β^ and note that E(Y ∗ − Y^ ∗ ) = 0. Since x∗⊤ β^ and ϵ∗ are independent,

Var(Y ∗ − Y^ ∗ ) = Var(x∗⊤ β^) + Var(ϵ∗ )

= σ2 x∗⊤ (X ⊤ X)−1 x∗ + σ2
= σ2 (1 + x∗⊤ (X ⊤ X)−1 x∗ ).

With a similar argument as above, it can be shown that Y ∗ − Y^ ∗ and (n − p)σ

^ 2 /σ2 are
independent. Thus

(n−p)σ^2
Y ∗ − Y^ ∗ 2
/ σ
∼ tn−p.
⊤ n−p
σ 1 + x∗⊤ (X X)−1 x∗

and upon simplifying

Y ∗ − Y^ ∗
∼ tn−p.
^
σ 1+ x∗⊤ (X ⊤ X)−1 x∗

You may wish to watch the following example video below to help you reinforce your interpretation
of this topic.

An error occurred.

Try watching this video on [Link], or enable JavaScript if it is

disabled in your browser.
How to calculate conﬁdence and prediction intervals for simple linear regression predictions in R

An error occurred.

Try watching this video on [Link], or enable JavaScript if it is

disabled in your browser.

Conﬁdence and prediction interval for Linear Gaussian Models

Example: Sydney maximum temperatures

In R, predict() is a general function while [Link]() is speciﬁcally for prediction using a linear
model object. The help ﬁle under [Link]() will give you the details of required and optional
arguments.

ﬁrst argument (required) is a ﬁtted linear model;

newdata argument: data frame, contains values at which predictions are to be made;
If newdata is omitted, predictions are made for the original predictors;
[Link] argument: if [Link] = T estimated standard errors of the predictions are returned;

The value returned by predict() for a linear model object is a list containing predictions and standard
errors if [Link] = T, otherwise just the predictions are returned.

Consider now the Sydney maximum temperature dataset [Link] :

[Link] <- [Link]("/course/data/[Link]", header=TRUE, quote="\"")

head([Link])

Maxtemp Modst Modsp Modthik

1 14.3 288.6 1000.8 5591.2
2 15.2 284.6 1010.8 5438.6
3 18.7 285.7 1011.0 5505.2
4 18.4 285.8 1004.5 5560.3
5 20.9 286.5 1002.9 5530.8
6 23.4 287.6 1003.4 5558.3

Here the response is Maxtemp and the predictors are Modst , Modsp and Modthik .

Suppose we want predictions of maximum temperature when

Modst = 285.1, Modsp = 1021.2 and Modthik=5380.4

Modst = 288.0, Modsp = 1026.8 and Modthik=5388.4

To use the [Link]() function, we set up a data frame containing our new data (one row for each
of our two predictions).

Then, using the [Link]() function we can obtain the conﬁdence and prediction intervals as
follows:

[Link] <-[Link](Modst=c(285.1,288.0), Modsp=c(1021.2,1026.8), Modthik=c(5380.4,5388.4))

[Link] <- [Link]("/course/data/[Link]", header=TRUE, quote="\"")

[Link] <- lm(Maxtemp ~ ., [Link])

[Link] <- [Link]([Link], newdata=[Link], [Link]=T, interval="confidence", level=0.95)

[Link]

[Link] <- [Link]([Link], newdata=[Link], [Link]=T, interval="prediction", level=0.95)

[Link]

$fit
fit lwr upr
1 15.50811 14.81804 16.19818
2 15.85861 15.09306 16.62416

$[Link]
1 2
0.3509169 0.3892985

$df
[1] 365

$[Link]
[1] 3.009404

$fit
fit lwr upr
1 15.50811 9.550067 21.46615
2 15.85861 9.891352 21.82587

$[Link]
1 2
0.3509169 0.3892985
$df
[1] 365

$[Link]
[1] 3.009404
Activity in R: Plotting conﬁdence and prediction intervals
Consider the following generated x -predictor and y -response vectors:

x<-rnorm(15)
x

## [1] -0.4723019 2.3502426 0.5491130 0.3013668 -1.7001447 -2.1002455

## [7] 1.0016725 1.1004800 0.5962411 -0.5250126 1.1277512 0.5301167
## [13] 0.1709285 -1.2492901 0.2313901

y<-x+rnorm(15)
y

## [1] -0.5248888 2.5948113 -1.5645586 0.9136845 -2.2149522 -2.8905325

## [7] -0.8397268 0.7740053 0.7110674 -0.8100348 0.6468322 -0.6134713
## [13] 0.5671229 -2.1764990 1.5912105

Next, deﬁne the new values of x for which your predictions will be calculated:

new <- [Link](x = seq(-3, 3, 0.5))

In this activity
1. Predict the values of y given your new values of x
2. Find conﬁdence and prediction intervals
3. Visualise your intervals using the matplot() function in R
Additional activity

Question

Which of the following is true of conﬁdence and predication intervals:

the prediction interval is narrower than the conﬁdence interval;

the conﬁdence interval refers to the conﬁdence interval for the conditional mean, in our
⊤
notation, x∗ β ;

the conﬁdence interval can be calculated in R using the predict() function.

Predictive Inference in Regression Models
No ratings yet
Predictive Inference in Regression Models
10 pages
Regression Inference Guide
No ratings yet
Regression Inference Guide
14 pages
M1T1L4-Simple Linear Regression Final
No ratings yet
M1T1L4-Simple Linear Regression Final
17 pages
Linear Regression Lecture Notes
100% (2)
Linear Regression Lecture Notes
228 pages
Exam 2 Review: Linear Regression Concepts
No ratings yet
Exam 2 Review: Linear Regression Concepts
7 pages
Understanding Simple Linear Regression
No ratings yet
Understanding Simple Linear Regression
19 pages
R Programming Student Lab Manual-52-63-3-12
No ratings yet
R Programming Student Lab Manual-52-63-3-12
10 pages
R Stastics PDF
No ratings yet
R Stastics PDF
30 pages
03 Assumptions and Gauss Markov
No ratings yet
03 Assumptions and Gauss Markov
5 pages
Econometric Methods Lecture Notes
No ratings yet
Econometric Methods Lecture Notes
152 pages
Business Analytics Unit - V Notes - 60637708 - 2025 - 05 - 15 - 02 - 16
No ratings yet
Business Analytics Unit - V Notes - 60637708 - 2025 - 05 - 15 - 02 - 16
37 pages
Linear Regression
No ratings yet
Linear Regression
13 pages
Linear Regression Analysis in R
No ratings yet
Linear Regression Analysis in R
9 pages
SC&RP - Unit 5
No ratings yet
SC&RP - Unit 5
36 pages
Simple Linear Regression Explained
No ratings yet
Simple Linear Regression Explained
43 pages
Regression Analysis Guide
100% (1)
Regression Analysis Guide
280 pages
Exam 1 Notes
No ratings yet
Exam 1 Notes
4 pages
Regression Prediction & Confidence Intervals
No ratings yet
Regression Prediction & Confidence Intervals
10 pages
Interval Estimation and Confidence Intervals
No ratings yet
Interval Estimation and Confidence Intervals
38 pages
H-311 Linear Regression Analysis With R
100% (1)
H-311 Linear Regression Analysis With R
71 pages
WST 311 Notes Part 2 2024
No ratings yet
WST 311 Notes Part 2 2024
21 pages
Regression Analysis Methods Explained
No ratings yet
Regression Analysis Methods Explained
45 pages
Econometrics: Linear Mean Model Analysis
No ratings yet
Econometrics: Linear Mean Model Analysis
8 pages
Linear Regression
No ratings yet
Linear Regression
108 pages
Advanced Econometrics PDF
No ratings yet
Advanced Econometrics PDF
58 pages
Regression Model Inference Explained
No ratings yet
Regression Model Inference Explained
4 pages
Chapter 2: Simple Linear Regression (Cont'd)
No ratings yet
Chapter 2: Simple Linear Regression (Cont'd)
37 pages
Linear Regression Analaysis - 23
No ratings yet
Linear Regression Analaysis - 23
22 pages
Statlearn PDF
No ratings yet
Statlearn PDF
123 pages
Linear Regression Analysis in R
100% (1)
Linear Regression Analysis in R
15 pages
Day 24 Supervised Learning - REgression Analysis - 2
No ratings yet
Day 24 Supervised Learning - REgression Analysis - 2
18 pages
Simple Regression in Econometrics MOOC
No ratings yet
Simple Regression in Econometrics MOOC
4 pages
Chapter 14 (14.5)
No ratings yet
Chapter 14 (14.5)
9 pages
Point Estimation and Confidence Intervals
No ratings yet
Point Estimation and Confidence Intervals
2 pages
Reading 5 A
No ratings yet
Reading 5 A
10 pages
Simple Linear Regression Overview
No ratings yet
Simple Linear Regression Overview
86 pages
Statistical Estimation and Hypothesis Testing
No ratings yet
Statistical Estimation and Hypothesis Testing
10 pages
Stat Modelling Notes
No ratings yet
Stat Modelling Notes
49 pages
Linear Regression
No ratings yet
Linear Regression
56 pages
Understanding Linear Models in Statistics
No ratings yet
Understanding Linear Models in Statistics
8 pages
ch12 0
No ratings yet
ch12 0
43 pages
Var F
No ratings yet
Var F
1 page
Chap 5
No ratings yet
Chap 5
13 pages
Inference For Regression
No ratings yet
Inference For Regression
24 pages
Classical Assumptions in OLS Regression
No ratings yet
Classical Assumptions in OLS Regression
17 pages
Simple Linear Regression Guide
No ratings yet
Simple Linear Regression Guide
34 pages
Regression Analysis Essentials
No ratings yet
Regression Analysis Essentials
55 pages
R Guide to Simple Linear Regression
No ratings yet
R Guide to Simple Linear Regression
14 pages
R Code for Simple Linear Regression
No ratings yet
R Code for Simple Linear Regression
14 pages
Linear Model
No ratings yet
Linear Model
14 pages
Confidence Intervals and Model Fit Analysis
No ratings yet
Confidence Intervals and Model Fit Analysis
8 pages
Chapter 09 Linear
No ratings yet
Chapter 09 Linear
29 pages
Statistical Analysis with R Functions
No ratings yet
Statistical Analysis with R Functions
26 pages
Classical Ols Assumption
No ratings yet
Classical Ols Assumption
6 pages
Asymptotic Theory
No ratings yet
Asymptotic Theory
43 pages
Statistical Inference Fundamentals
No ratings yet
Statistical Inference Fundamentals
63 pages
Advanced Regression Techniques
No ratings yet
Advanced Regression Techniques
3 pages
Untitled 123
No ratings yet
Untitled 123
21 pages
Applied Multivariate Statistical Analysis 6th Edition Johnson Solutions Manual
No ratings yet
Applied Multivariate Statistical Analysis 6th Edition Johnson Solutions Manual
499 pages
Understanding Time Value of Money
No ratings yet
Understanding Time Value of Money
75 pages
SHRM Impact on Performance Metrics
No ratings yet
SHRM Impact on Performance Metrics
6 pages
Forecasting Methods
No ratings yet
Forecasting Methods
11 pages
Tahoe Salt Demand Forecasting Analysis
No ratings yet
Tahoe Salt Demand Forecasting Analysis
17 pages
BE368 Lecture 4
No ratings yet
BE368 Lecture 4
28 pages
Quantitative Methods for Finance Overview
No ratings yet
Quantitative Methods for Finance Overview
21 pages
Game Theory and Operations Research
No ratings yet
Game Theory and Operations Research
43 pages
MCQ Binomial and Hypergeometric Probability Distribution With Correct Answers
83% (6)
MCQ Binomial and Hypergeometric Probability Distribution With Correct Answers
5 pages
Business Statistics MCQs by Roll Number
No ratings yet
Business Statistics MCQs by Roll Number
10 pages
Dogecoin Martingale Strategy Guide
No ratings yet
Dogecoin Martingale Strategy Guide
3 pages
Net Present Value and Cash Flow Analysis
No ratings yet
Net Present Value and Cash Flow Analysis
5 pages
Excel Financial Functions Guide
No ratings yet
Excel Financial Functions Guide
8 pages
Bayesian Stats for Students
No ratings yet
Bayesian Stats for Students
229 pages
Panduan Ancova untuk Peneliti
No ratings yet
Panduan Ancova untuk Peneliti
8 pages
P&S MCQ U 5
100% (6)
P&S MCQ U 5
8 pages
IJPER-Review Article
No ratings yet
IJPER-Review Article
11 pages
ABC Corp Sales Forecast Analysis
No ratings yet
ABC Corp Sales Forecast Analysis
9 pages
A CNN Based System For Predicting The Implied Volatility and Option Prices
No ratings yet
A CNN Based System For Predicting The Implied Volatility and Option Prices
8 pages
Decision-Making Strategies for Engineers
No ratings yet
Decision-Making Strategies for Engineers
25 pages
(PDF) Eta Squared, Partial Eta Squared, and Misreporting of Effect Size in Communication Research
No ratings yet
(PDF) Eta Squared, Partial Eta Squared, and Misreporting of Effect Size in Communication Research
16 pages
Risk and Managerial (Real) Options in Capital Budgeting Risk and Managerial (Real) Options in Capital Budgeting
No ratings yet
Risk and Managerial (Real) Options in Capital Budgeting Risk and Managerial (Real) Options in Capital Budgeting
42 pages
W3 Questions
No ratings yet
W3 Questions
2 pages
Dominant Strategies in Game Theory
No ratings yet
Dominant Strategies in Game Theory
24 pages
Handout 02 Logistic Regression
No ratings yet
Handout 02 Logistic Regression
39 pages
18 - Experimental and Quasi-Experimental Research
No ratings yet
18 - Experimental and Quasi-Experimental Research
25 pages
Bayesian Inference with INLA
No ratings yet
Bayesian Inference with INLA
14 pages
Risk Management Case Study Assignment
No ratings yet
Risk Management Case Study Assignment
2 pages

Confidence and Prediction Intervals in Linear Models

Uploaded by

Confidence and Prediction Intervals in Linear Models

Uploaded by

2.

4 Conﬁdence intervals and prediction intervals

Conﬁdence and prediction intervals

Conﬁdence vs Prediction Interval

E(x∗⊤ β^) = x∗⊤ E[β

and its variance

Var(x∗⊤ β^) = Cov(x∗⊤ β^) = x∗⊤ Cov(β^)x∗ = σ2 x∗⊤ (X ⊤ X)−1 x∗

where X is the design matrix for the ﬁtted linear model.

x∗⊤ β^ ∼ N (x∗⊤ β, σ2 x∗⊤ (X ⊤ X)−1 x∗ )

Var(Y ∗ − Y^ ∗ ) = Var(x∗⊤ β^) + Var(ϵ∗ )

With a similar argument as above, it can be shown that Y ∗ − Y^ ∗ and (n − p)σ

and upon simplifying

Try watching this video on [Link], or enable JavaScript if it is

Try watching this video on [Link], or enable JavaScript if it is

Conﬁdence and prediction interval for Linear Gaussian Models

Example: Sydney maximum temperatures

ﬁrst argument (required) is a ﬁtted linear model;

Consider now the Sydney maximum temperature dataset [Link] :

[Link] <- [Link]("/course/data/[Link]", header=TRUE, quote="\"")

Maxtemp Modst Modsp Modthik

Suppose we want predictions of maximum temperature when

Modst = 285.1, Modsp = 1021.2 and Modthik=5380.4

[Link] <-[Link](Modst=c(285.1,288.0), Modsp=c(1021.2,1026.8), Modthik=c(5380.4,5388.4))

[Link] <- [Link]("/course/data/[Link]", header=TRUE, quote="\"")

[Link] <- [Link]([Link], newdata=[Link], [Link]=T, interval="confidence", level=0.95)

[Link] <- [Link]([Link], newdata=[Link], [Link]=T, interval="prediction", level=0.95)

## [1] -0.4723019 2.3502426 0.5491130 0.3013668 -1.7001447 -2.1002455

## [1] -0.5248888 2.5948113 -1.5645586 0.9136845 -2.2149522 -2.8905325

new <- [Link](x = seq(-3, 3, 0.5))

Which of the following is true of conﬁdence and predication intervals:

the prediction interval is narrower than the conﬁdence interval;

the conﬁdence interval can be calculated in R using the predict() function.

You might also like