Nonparametric Regression
• Fit more flexible regression functions $f(X)$
• Local regression at each query point $x_0$
→ Nearest-neighbor methods
→ Kernel methods
Local average
• Only one predictor variable
• $K$-nearest-neighbor average at $x_0$: average of the $K$ closest points to $x_0$ (sketched in code below)
→ Simple and flexible estimator
→ Discontinuous (bumpy) fit
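A minimal NumPy sketch of the $K$-nearest-neighbor average; the data and function names here are illustrative, not from the slides:

```python
import numpy as np

def knn_average(x_train, y_train, x0, k=20):
    """K-nearest-neighbor average: mean of the y-values of the
    k training points whose x is closest to the query point x0."""
    dist = np.abs(x_train - x0)          # distances to x0 (single predictor)
    nearest = np.argsort(dist)[:k]       # indices of the k closest points
    return y_train[nearest].mean()       # local average

# toy data: noisy sine curve on [0, 1]
rng = np.random.default_rng(0)
x = rng.uniform(0, 1, 100)
y = np.sin(4 * x) + rng.normal(scale=0.3, size=100)

print(knn_average(x, y, x0=0.5, k=20))
```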
Example: 20-nearest neighbor average
[Figure: 20-nearest-neighbor average fit; $y$ plotted against $x \in [0, 1]$]
Kernel regression
• Resolve discontinuity
• Use local weighted fits
• Weight function $K_\lambda(x_0, x)$
• Weight decreases smoothly with distance from the target point ⇒ smooth fit
• $\hat f_\lambda(x_0) = \dfrac{\sum_{i=1}^{n} K_\lambda(x_0, x_i)\, y_i}{\sum_{i=1}^{n} K_\lambda(x_0, x_i)}$
→ Nadaraya-Watson kernel-weighted average
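A sketch of the Nadaraya-Watson average in NumPy, assuming a Gaussian kernel and an illustrative bandwidth $\lambda = 0.2$:

```python
import numpy as np

def nadaraya_watson(x_train, y_train, x0, lam=0.2):
    """Kernel-weighted average at x0: sum(K * y) / sum(K)."""
    t = (x_train - x0) / lam
    weights = np.exp(-0.5 * t**2)        # Gaussian kernel weights
    return np.sum(weights * y_train) / np.sum(weights)

rng = np.random.default_rng(0)
x = rng.uniform(0, 1, 100)
y = np.sin(4 * x) + rng.normal(scale=0.3, size=100)

grid = np.linspace(0, 1, 5)
print([round(nadaraya_watson(x, y, x0, lam=0.2), 3) for x0 in grid])
```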
Weight function
$$K_\lambda(x_0, x) = D\!\left(\frac{|x - x_0|}{\lambda}\right)$$
• Epanechnikov: $D(t) = \frac{3}{4}(1 - t^2)\, I(|t| \le 1)$
• Tri-cube: $D(t) = (1 - |t|^3)^3\, I(|t| \le 1)$
• Gaussian: $D(t) = \phi(t)$
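The three kernels sketched as NumPy functions; boolean masks implement the indicator $I(|t| \le 1)$, and the function names are illustrative:

```python
import numpy as np

def epanechnikov(t):
    t = np.asarray(t, dtype=float)
    return 0.75 * (1 - t**2) * (np.abs(t) <= 1)       # (3/4)(1 - t^2) on [-1, 1]

def tricube(t):
    t = np.asarray(t, dtype=float)
    return (1 - np.abs(t)**3)**3 * (np.abs(t) <= 1)

def gaussian(t):
    t = np.asarray(t, dtype=float)
    return np.exp(-0.5 * t**2) / np.sqrt(2 * np.pi)   # standard normal density phi(t)

print(epanechnikov([0.0, 0.5, 2.0]))   # [0.75, 0.5625, 0.0]
print(tricube(0.5), gaussian(0.0))
```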
Kernel functions
[Figure: $D(t)$ for the Epanechnikov, tri-cube, and Gaussian kernels, $t \in [-3, 3]$]
Weight function
• Epanechnikov and tri-cube: compact support
• Gaussian: noncompact support
• Tri-cube is flatter on top than Epanechnikov
→ More efficient results but more bias
Kernel-weighted average
• Continuous fit
• Uses fixed-width neighborhoods
• λ in the kernel function controls the window size
→ Bias-variance trade-off
→ λ ↗⇒ bias↗, variance↘
Example: Gaussian kernel, $\lambda = 0.2$
[Figure: Nadaraya-Watson fit with a Gaussian kernel, $\lambda = 0.2$; $y$ plotted against $x \in [0, 1]$]
NN and kernels
• Continuous fit and adaptive neighborhoods
→ Kernels with variable window width
E.g. $K_\lambda(x_0, x) = D\!\left(\dfrac{|x - x_0|}{|x_{(k)} - x_0|}\right)$
→ $\lambda(x_0) = |x_{(k)} - x_0|$: distance to the $k$th nearest neighbor of $x_0$
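A sketch of such an adaptive window width in NumPy: $\lambda(x_0)$ is set to the distance to the $k$th nearest neighbor; the Epanechnikov kernel here is an illustrative choice:

```python
import numpy as np

def adaptive_kernel_weights(x_train, x0, k=20):
    """Kernel weights with window width lambda(x0) equal to the
    distance from x0 to its k-th nearest neighbor."""
    dist = np.abs(x_train - x0)
    lam = np.sort(dist)[k - 1]               # adaptive bandwidth lambda(x0)
    t = dist / lam
    return 0.75 * (1 - t**2) * (t <= 1)      # Epanechnikov D(t)

rng = np.random.default_rng(0)
x = rng.uniform(0, 1, 100)
w = adaptive_kernel_weights(x, x0=0.5, k=20)
print(int((w > 0).sum()))                    # roughly k points receive positive weight
```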
Boundary problems
Local averages can have problems at the boundary
• Asymmetric neighborhoods
• NN: wider neighborhood ⇒ bias↗
• Kernel: fewer points ⇒ variance↗
→ Use higher-order local regression
Local linear regression
• Use local linear fits (lines)
• Reduces bias substantially
• Solve at each target x0
$$\min_{\beta_0, \beta_1} \sum_{i=1}^{n} K_\lambda(x_0, x_i)\,\bigl(y_i - \beta_0 - \beta_1 x_i\bigr)^2$$
→ $\hat f(x_0) = \hat\beta_0(x_0) + \hat\beta_1(x_0)\, x_0$
→ Different linear model at each target x0
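A sketch of one local linear fit at $x_0$ by weighted least squares, assuming a Gaussian kernel (all names are illustrative):

```python
import numpy as np

def local_linear(x_train, y_train, x0, lam=0.2):
    """Fit y ~ beta0 + beta1 * x by weighted least squares with
    kernel weights K_lambda(x0, x_i), then evaluate the line at x0."""
    t = (x_train - x0) / lam
    w = np.exp(-0.5 * t**2)                                  # Gaussian kernel weights
    X = np.column_stack([np.ones_like(x_train), x_train])    # design matrix [1, x]
    W = np.diag(w)
    beta = np.linalg.solve(X.T @ W @ X, X.T @ W @ y_train)   # (X^t W X)^{-1} X^t W y
    return beta[0] + beta[1] * x0                            # beta0(x0) + beta1(x0) x0

rng = np.random.default_rng(0)
x = rng.uniform(0, 1, 100)
y = np.sin(4 * x) + rng.normal(scale=0.3, size=100)
print(round(local_linear(x, y, x0=0.05, lam=0.2), 3))        # near-boundary point
```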
Local linear regression
• $W(x_0) = \mathrm{diag}\bigl(K_\lambda(x_0, x_i)\bigr)$
→ $\hat f(x_0) = \tilde x_0^{\,t}\,\bigl(X^t W(x_0) X\bigr)^{-1} X^t W(x_0)\, y = l(x_0)^t\, y$
• $S_\lambda^{\text{kernel}} = \bigl(l(x_1), \ldots, l(x_n)\bigr)^t$
→ $\hat{\mathbf f} = S_\lambda^{\text{kernel}}\, \mathbf y$
→ A linear operator!
Effective degrees of freedom
• $\hat{\mathbf f} = S_\lambda^{\text{kernel}}\, \mathbf y$
→ Effective degrees of freedom: $\mathrm{trace}\bigl(S_\lambda^{\text{kernel}}\bigr)$
→ Useful for selecting the tuning parameter $\lambda$
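A sketch that builds the smoother matrix row by row from the local linear fits above and takes its trace as the effective degrees of freedom; the Gaussian kernel and the bandwidth are illustrative choices:

```python
import numpy as np

def smoother_matrix(x_train, lam=0.2):
    """Rows l(x_i)^t of the local-linear smoother, so that f_hat = S @ y."""
    n = len(x_train)
    X = np.column_stack([np.ones(n), x_train])
    S = np.zeros((n, n))
    for i, x0 in enumerate(x_train):
        w = np.exp(-0.5 * ((x_train - x0) / lam) ** 2)   # kernel weights at x0
        W = np.diag(w)
        # l(x0)^t = x0_tilde^t (X^t W X)^{-1} X^t W, with x0_tilde = (1, x0)
        S[i] = np.array([1.0, x0]) @ np.linalg.solve(X.T @ W @ X, X.T @ W)
    return S

rng = np.random.default_rng(0)
x = rng.uniform(0, 1, 100)
S = smoother_matrix(x, lam=0.2)
print(round(np.trace(S), 2))        # effective degrees of freedom trace(S)
```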
Example: Gaussian kernel, linear fit
[Figure: local linear fit with a Gaussian kernel; $y$ plotted against $x \in [0, 1]$]
Local polynomial regression
• Fit a local polynomial of degree $M$:
$$\min_{\beta_0, \beta_1, \ldots, \beta_M} \sum_{i=1}^{n} K_\lambda(x_0, x_i)\left(y_i - \beta_0 - \sum_{m=1}^{M} \beta_m x_i^m\right)^2$$
→ $\hat f(x_0) = \hat\beta_0(x_0) + \sum_{m=1}^{M} \hat\beta_m(x_0)\, x_0^m$
• Further (though smaller) reduction of bias, mainly in high-curvature regions
• Increased variance
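The local linear sketch generalizes directly to degree $M$; a minimal version using a Vandermonde design matrix (Gaussian kernel assumed, names illustrative):

```python
import numpy as np

def local_poly(x_train, y_train, x0, M=2, lam=0.2):
    """Weighted least-squares fit of a degree-M polynomial around x0,
    evaluated at x0."""
    w = np.exp(-0.5 * ((x_train - x0) / lam) ** 2)          # kernel weights
    X = np.vander(x_train, N=M + 1, increasing=True)        # columns 1, x, ..., x^M
    W = np.diag(w)
    beta = np.linalg.solve(X.T @ W @ X, X.T @ W @ y_train)
    x0_row = np.vander([x0], N=M + 1, increasing=True)      # row (1, x0, ..., x0^M)
    return float((x0_row @ beta)[0])                        # f_hat(x0)

rng = np.random.default_rng(0)
x = rng.uniform(0, 1, 100)
y = np.sin(4 * x) + rng.normal(scale=0.3, size=100)
print(round(local_poly(x, y, x0=0.5, M=2, lam=0.2), 3))
```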
Example: Gaussian kernel, quadratic fit
[Figure: local quadratic fit with a Gaussian kernel; $y$ plotted against $x \in [0, 1]$]
More than 1 predictor
• d-dimensional kernel functions
• Typically radial functions
$$K_\lambda(x_0, x) = D\!\left(\frac{\|x - x_0\|}{\lambda}\right)$$
→ Standardize predictors
• More boundary problems
→ Use linear fits!
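A sketch of radial kernel weights in $d$ dimensions, standardizing each predictor first; the Gaussian kernel and all names are illustrative:

```python
import numpy as np

def radial_weights(X_train, x0, lam=1.0):
    """Radial kernel weights D(||x - x0|| / lambda) after standardizing
    each predictor (column) of X_train."""
    mu, sd = X_train.mean(axis=0), X_train.std(axis=0)
    Z = (X_train - mu) / sd                    # standardized predictors
    z0 = (x0 - mu) / sd
    t = np.linalg.norm(Z - z0, axis=1) / lam   # Euclidean distance / lambda
    return np.exp(-0.5 * t**2)                 # Gaussian D(t)

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))                  # d = 3 predictors
w = radial_weights(X, x0=X[0], lam=1.0)
print(w[:5].round(3))
```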
More than 1 predictor
More general kernel
$$K_\lambda(x_0, x) = D\!\left(\frac{(x - x_0)^t A^{-1} (x - x_0)}{\lambda}\right)$$
• A: positive semidefinite matrix
• Weighs individual components
• Can account for correlations between features
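A sketch of this structured kernel with a Mahalanobis-type distance; taking $A$ as the sample covariance of the predictors is an illustrative choice, not prescribed by the slides:

```python
import numpy as np

def structured_kernel_weights(X_train, x0, lam=1.0, A=None):
    """Weights D((x - x0)^t A^{-1} (x - x0) / lambda).
    A weighs the coordinates and captures correlations between features."""
    if A is None:
        A = np.cov(X_train, rowvar=False)      # illustrative choice of A
    A_inv = np.linalg.inv(A)
    diff = X_train - x0
    t = np.einsum('ij,jk,ik->i', diff, A_inv, diff) / lam   # quadratic form per row
    return np.exp(-0.5 * t)                    # Gaussian-type decay in the distance

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))
w = structured_kernel_weights(X, x0=X[0], lam=1.0)
print(w[:5].round(3))
```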