
Least-Squares Estimation

Robert Stengel
Optimal Control and Estimation, MAE 546,
Princeton University, 2013
Estimating unknown constants from redundant measurements:
- Least-squares
- Weighted least-squares
- Recursive weighted least-squares estimator

Copyright 2013 by Robert Stengel. All rights reserved. For educational use only.
http://www.princeton.edu/~stengel/MAE3546.html
http://www.princeton.edu/~stengel/OptConEst.html

Perfect Measurement of a Constant Vector

Given:
- Measurements, y, of a constant vector, x
- Assume that the output, y, is a perfect measurement and H is invertible

y = H x

- y: (n x 1) output vector
- H: (n x n) output matrix
- x: (n x 1) vector to be estimated

Estimate x. The estimate is based on the inverse transformation:

x̂ = H^(-1) y
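For the square, invertible case the estimate is just the solution of a linear system. A minimal NumPy sketch (H and y are made-up values for illustration):

    import numpy as np

    H = np.array([[2.0, 1.0],
                  [1.0, 3.0]])       # (n x n) invertible output matrix (assumed values)
    y = np.array([5.0, 10.0])        # perfect measurement, y = Hx

    x_hat = np.linalg.solve(H, y)    # x_hat = H^(-1) y, without forming the inverse explicitly
    print(x_hat)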

Imperfect Measurement of a Constant Vector

Given:
- Noisy measurements, z, of a constant vector, x
- Effects of error can be reduced if the measurement is redundant

Noise-free output, y:

y = H x

- y: (k x 1) output vector
- H: (k x n) output matrix, k > n
- x: (n x 1) vector to be estimated

Measurement of the output with error, z:

z = y + n = H x + n

- z: (k x 1) measurement vector
- n: (k x 1) error vector

Cost Function for Least-Squares Estimate

Measurement-error residual:

ε = z - H x̂ = z - ŷ,   dim(ε) = (k x 1)

Squared measurement error = cost function, J (quadratic norm):

J = (1/2) ε^T ε = (1/2) (z - H x̂)^T (z - H x̂)
  = (1/2) [z^T z - x̂^T H^T z - z^T H x̂ + x̂^T H^T H x̂]

What is the control parameter? The estimate of x:

dim(x̂) = (n x 1)

Static Minimization Provides Least-Squares Estimate

Error cost function:

J = (1/2) [z^T z - x̂^T H^T z - z^T H x̂ + x̂^T H^T H x̂]

Necessary condition:

∂J/∂x̂ = 0 = (1/2) [-(H^T z)^T - z^T H + (H^T H x̂)^T + x̂^T H^T H]

x̂^T H^T H = z^T H

The estimate is obtained using the left pseudo-inverse matrix:

x̂^T (H^T H)(H^T H)^(-1) = x̂^T = z^T H (H^T H)^(-1)   (row)

or

x̂ = (H^T H)^(-1) H^T z   (column)
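As a quick illustration of the left pseudo-inverse estimate, here is a minimal NumPy sketch; the matrix H, the true x, and the noise level are invented values for demonstration:

    import numpy as np

    rng = np.random.default_rng(0)
    H = np.array([[1.0, 0.0],
                  [0.0, 1.0],
                  [1.0, 1.0],
                  [1.0, -1.0],
                  [2.0, 1.0]])          # (k x n) output matrix, k > n
    x_true = np.array([3.0, -1.5])      # constant vector to be estimated
    n = 0.1 * rng.standard_normal(5)    # measurement error
    z = H @ x_true + n                  # noisy measurement, z = Hx + n

    # Left pseudo-inverse: x_hat = (H^T H)^(-1) H^T z
    x_hat = np.linalg.inv(H.T @ H) @ H.T @ z

    # Equivalent, numerically preferable solution of the same least-squares problem
    x_hat_lstsq, *_ = np.linalg.lstsq(H, z, rcond=None)

    print(x_hat, x_hat_lstsq)           # both approximate x_true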

Example: Average Weight of a Pail of Jelly Beans

- Measurements are equally uncertain
- z_i = x + n_i,   i = 1 to k

Express the measurements as

z = H x + n

Output matrix:

H = [1  1  ...  1]^T

Average Weight of the Jelly Beans

Optimal estimate:

x̂ = (H^T H)^(-1) H^T z
  = ([1 1 ... 1] [1 1 ... 1]^T)^(-1) [1 1 ... 1] [z_1  z_2  ...  z_k]^T
  = (k)^(-1) (z_1 + z_2 + ... + z_k)

Simple average:

x̂ = (1/k) Σ_{i=1}^{k} z_i   [sample mean value]
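A short check that the pseudo-inverse estimate with H = [1 ... 1]^T reduces to the sample mean (the weighings below are made up):

    import numpy as np

    z = np.array([4.1, 3.9, 4.3, 4.0, 3.8])           # hypothetical jelly-bean weighings
    H = np.ones((z.size, 1))                           # H = [1 1 ... 1]^T
    x_hat = (np.linalg.inv(H.T @ H) @ H.T @ z).item()  # (H^T H)^(-1) H^T z
    assert np.isclose(x_hat, z.mean())                 # identical to the sample mean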

Least-Squares Applications

Find the trend line in noisy data:

y = a_0 + a_1 x
z = (a_0 + a_1 x) + n

More generally, least-squares estimation is used for:
- Higher-degree curve-fitting
- Multivariate estimation

Least-Squares Image Processing

[Figures: Original Lenna; Degraded Image; Restored Image (Adaptively Regularized Constrained Total Least Squares) - Chen, Chen, and Zhou, IEEE Trans. Image Proc., 2000]

Least-Squares Linear Fit to Noisy Data

z_i = (a_0 + a_1 x_i) + n_i

Measurement vector:

z = [z_1  z_2  ...  z_n]^T = [1 x_1; 1 x_2; ...; 1 x_n] [a_0  a_1]^T + [n_1  n_2  ...  n_n]^T = H a + n

Measurement-error residual:

ε = [z_1 - (a_0 + a_1 x_1);  z_2 - (a_0 + a_1 x_2);  ...;  z_n - (a_0 + a_1 x_n)] = z - H a

Error Cost Function

J = (1/2) (z - H a)^T (z - H a)

Least-squares estimate of the trend line (the estimate ignores statistics of the error):

â = [â_0  â_1]^T = (H^T H)^(-1) H^T z

ŷ = â_0 + â_1 x
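A minimal NumPy sketch of the trend-line fit; the data points and noise level are invented for illustration:

    import numpy as np

    rng = np.random.default_rng(1)
    x = np.linspace(0.0, 10.0, 20)
    a_true = np.array([2.0, 0.5])                 # [a0, a1]
    z = a_true[0] + a_true[1] * x + 0.2 * rng.standard_normal(x.size)

    H = np.column_stack([np.ones_like(x), x])     # rows are [1, x_i]
    a_hat = np.linalg.inv(H.T @ H) @ H.T @ z      # (H^T H)^(-1) H^T z
    print(a_hat)                                  # approximately [2.0, 0.5]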

Measurements of Differing Quality

Suppose some elements of the measurement, z, are more uncertain than others:

z = H x + n

Give the more uncertain measurements less weight in arriving at the minimum-cost estimate. Let S = measure of uncertainty; then express the error cost in terms of S^(-1):

J = (1/2) ε^T S^(-1) ε

Error Cost and Necessary Condition for a Minimum

Error cost function, J:

J = (1/2) ε^T S^(-1) ε = (1/2) (z - H x̂)^T S^(-1) (z - H x̂)
  = (1/2) [z^T S^(-1) z - x̂^T H^T S^(-1) z - z^T S^(-1) H x̂ + x̂^T H^T S^(-1) H x̂]

Necessary condition for a minimum:

∂J/∂x̂ = 0 = (1/2) [-(H^T S^(-1) z)^T - z^T S^(-1) H + (H^T S^(-1) H x̂)^T + x̂^T H^T S^(-1) H]

x̂^T H^T S^(-1) H - z^T S^(-1) H = 0
x̂^T H^T S^(-1) H = z^T S^(-1) H

Weighted Least-Squares Estimate of a Constant Vector

The weighted left pseudo-inverse provides the solution:

x̂ = (H^T S^(-1) H)^(-1) H^T S^(-1) z
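A sketch of the weighted estimate in NumPy, using a made-up diagonal uncertainty matrix S (larger diagonal entries correspond to noisier, down-weighted measurements):

    import numpy as np

    rng = np.random.default_rng(2)
    H = np.array([[1.0, 0.0],
                  [0.0, 1.0],
                  [1.0, 1.0],
                  [1.0, 2.0]])
    x_true = np.array([1.0, 2.0])
    sigma = np.array([0.05, 0.05, 0.5, 0.5])          # last two sensors are noisier (assumed)
    S = np.diag(sigma**2)                             # measure of uncertainty
    z = H @ x_true + sigma * rng.standard_normal(4)

    S_inv = np.linalg.inv(S)
    x_hat = np.linalg.inv(H.T @ S_inv @ H) @ H.T @ S_inv @ z   # weighted left pseudo-inverse
    print(x_hat)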

The Return of the Jelly Beans

Error-weighting matrix:

S_A^(-1) = diag(a_11, a_22, ..., a_kk)

Optimal estimate of the average jelly bean weight:

x̂ = (H^T S_A^(-1) H)^(-1) H^T S_A^(-1) z
  = ([1 1 ... 1] S_A^(-1) [1 1 ... 1]^T)^(-1) [1 1 ... 1] S_A^(-1) [z_1  z_2  ...  z_k]^T
  = (Σ_{i=1}^{k} a_ii z_i) / (Σ_{i=1}^{k} a_ii)

How to Choose the Error-Weighting Matrix

a) Normalize the cost function according to the expected measurement error, S_A:

J = (1/2) ε^T S_A^(-1) ε = (1/2) (z - ŷ)^T S_A^(-1) (z - ŷ) = (1/2) (z - H x̂)^T S_A^(-1) (z - H x̂)

b) Normalize the cost function according to the expected measurement residual, S_B:

J = (1/2) ε^T S_B^(-1) ε = (1/2) (z - H x̂)^T S_B^(-1) (z - H x̂)
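A sketch of the weighted jelly-bean average above, verifying that the weighted pseudo-inverse collapses to Σ a_ii z_i / Σ a_ii (the weights and weighings are invented):

    import numpy as np

    z = np.array([4.1, 3.9, 4.3, 4.0])            # hypothetical weighings
    a = np.array([10.0, 10.0, 1.0, 1.0])          # diagonal of S_A^(-1); larger = more trusted
    H = np.ones((z.size, 1))

    SA_inv = np.diag(a)
    x_hat = (np.linalg.inv(H.T @ SA_inv @ H) @ H.T @ SA_inv @ z).item()
    assert np.isclose(x_hat, np.sum(a * z) / np.sum(a))   # weighted average
    print(x_hat)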

Measurement Error Covariance, S_A

Expected value of the outer product of the measurement error vector:

S_A = E[(z - y)(z - y)^T] = E[(z - H x)(z - H x)^T] = E[n n^T] ≡ R

Measurement Residual Covariance, S_B

Expected value of the outer product of the measurement residual vector, with the estimation error written as x̃ ≡ x - x̂:

S_B = E[ε ε^T] = E[(z - H x̂)(z - H x̂)^T] = E[(H x̃ + n)(H x̃ + n)^T]
    = H E[x̃ x̃^T] H^T + H E[x̃ n^T] + E[n x̃^T] H^T + E[n n^T]
    = H P H^T + H M + M^T H^T + R

where

R = E[n n^T]
P = E[(x - x̂)(x - x̂)^T]
M = E[(x - x̂) n^T]

Requires iteration ("adaptation") of the estimate to find S_B.

Recursive Least-Squares Estimation

- Prior unweighted and weighted least-squares estimators use a batch-processing approach
  - All information is gathered prior to processing
  - All information is processed at once
- Recursive approach
  - Optimal estimate has been made from the prior measurement set
  - New measurement set is obtained
  - Optimal estimate is improved by an incremental change (or correction) to the prior optimal estimate

Prior Optimal Estimate

Initial measurement set and state estimate, with S = S_A = R:

z_1 = H_1 x + n_1

dim(z_1) = dim(n_1) = (k_1 x 1)
dim(H_1) = (k_1 x n)
dim(R_1) = (k_1 x k_1)

The state estimate minimizes

J_1 = (1/2) ε_1^T R_1^(-1) ε_1 = (1/2) (z_1 - H_1 x̂_1)^T R_1^(-1) (z_1 - H_1 x̂_1),   ε_1 = (z_1 - H_1 x̂_1)

x̂_1 = (H_1^T R_1^(-1) H_1)^(-1) H_1^T R_1^(-1) z_1

New Measurement Set

New measurement:

z_2 = H_2 x + n_2

dim(z_2) = dim(n_2) = (k_2 x 1)
dim(H_2) = (k_2 x n)
dim(R_2) = (k_2 x k_2)

R_2: second measurement-error covariance

Concatenation of old and new measurements:

z = [z_1; z_2]

Cost of Estimation Based on Both Measurement Sets

Cost function incorporates the estimate made after incorporating z_2:

J_2 = (1/2) [(z_1 - H_1 x̂_2)^T  (z_2 - H_2 x̂_2)^T] [R_1^(-1) 0; 0 R_2^(-1)] [(z_1 - H_1 x̂_2); (z_2 - H_2 x̂_2)]

Both residuals refer to x̂_2. Simplification occurs because the weighting matrix is block-diagonal:

J_2 = (1/2) [(z_1 - H_1 x̂_2)^T R_1^(-1) (z_1 - H_1 x̂_2) + (z_2 - H_2 x̂_2)^T R_2^(-1) (z_2 - H_2 x̂_2)]

Optimal Estimate Based on Both Measurement Sets

x̂_2 = ([H_1^T  H_2^T] [R_1^(-1) 0; 0 R_2^(-1)] [H_1; H_2])^(-1) [H_1^T  H_2^T] [R_1^(-1) 0; 0 R_2^(-1)] [z_1; z_2]
    = (H_1^T R_1^(-1) H_1 + H_2^T R_2^(-1) H_2)^(-1) (H_1^T R_1^(-1) z_1 + H_2^T R_2^(-1) z_2)

Apply Matrix Inversion Lemma

Define:

P_1^(-1) ≡ H_1^T R_1^(-1) H_1

Matrix inversion lemma:

(H_1^T R_1^(-1) H_1 + H_2^T R_2^(-1) H_2)^(-1) = (P_1^(-1) + H_2^T R_2^(-1) H_2)^(-1)
  = P_1 - P_1 H_2^T (H_2 P_1 H_2^T + R_2)^(-1) H_2 P_1
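A quick numerical check of the matrix inversion lemma as used above, with made-up values for P_1, H_2, and R_2:

    import numpy as np

    rng = np.random.default_rng(3)
    P1 = np.diag([2.0, 0.5])                       # prior estimate-error covariance (assumed)
    H2 = rng.standard_normal((3, 2))               # new measurement matrix (assumed)
    R2 = np.diag([0.1, 0.2, 0.3])                  # new measurement-error covariance (assumed)

    direct = np.linalg.inv(np.linalg.inv(P1) + H2.T @ np.linalg.inv(R2) @ H2)
    lemma = P1 - P1 @ H2.T @ np.linalg.inv(H2 @ P1 @ H2.T + R2) @ H2 @ P1

    assert np.allclose(direct, lemma)              # both forms of P_2 agree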

Improved Estimate Incorporating New Measurement Set

With x̂_1 = P_1 H_1^T R_1^(-1) z_1 and I = A^(-1) A = A A^(-1), where A ≡ (H_2 P_1 H_2^T + R_2):

The new estimate is a correction to the old:

x̂_2 = [P_1 - P_1 H_2^T (H_2 P_1 H_2^T + R_2)^(-1) H_2 P_1] (H_1^T R_1^(-1) z_1 + H_2^T R_2^(-1) z_2)
    = x̂_1 - P_1 H_2^T (H_2 P_1 H_2^T + R_2)^(-1) H_2 x̂_1
      + P_1 H_2^T [I - (H_2 P_1 H_2^T + R_2)^(-1) H_2 P_1 H_2^T] R_2^(-1) z_2

Simplify Optimal Estimate Incorporating New Measurement Set

x̂_2 = x̂_1 + P_1 H_2^T (H_2 P_1 H_2^T + R_2)^(-1) (z_2 - H_2 x̂_1)
    ≡ x̂_1 + K (z_2 - H_2 x̂_1)

K: estimator gain matrix

Recursive Optimal Estimate

- Prior estimate may be based on a prior incremental estimate, and so on
- Generalize to a recursive form, with sequential index i:

x̂_i = x̂_{i-1} + P_{i-1} H_i^T (H_i P_{i-1} H_i^T + R_i)^(-1) (z_i - H_i x̂_{i-1})
    ≡ x̂_{i-1} + K_i (z_i - H_i x̂_{i-1})

with

P_i = (P_{i-1}^(-1) + H_i^T R_i^(-1) H_i)^(-1)

dim(x̂) = (n x 1);  dim(P) = (n x n)
dim(z) = (r x 1);  dim(R) = (r x r)
dim(H) = (r x n);  dim(K) = (n x r)
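A compact NumPy sketch of the recursive estimator above; the measurement stream, noise statistics, and initial conditions are invented for illustration:

    import numpy as np

    def rls_update(x_hat, P, z, H, R):
        """One recursive least-squares step: gain, state correction, covariance update."""
        K = P @ H.T @ np.linalg.inv(H @ P @ H.T + R)    # K_i = P_{i-1} H_i^T (H_i P_{i-1} H_i^T + R_i)^(-1)
        x_hat = x_hat + K @ (z - H @ x_hat)             # x_i = x_{i-1} + K_i (z_i - H_i x_{i-1})
        P = np.linalg.inv(np.linalg.inv(P) + H.T @ np.linalg.inv(R) @ H)   # P_i
        return x_hat, P

    rng = np.random.default_rng(4)
    x_true = np.array([1.0, -2.0])                      # constant vector to be estimated
    H = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
    R = 0.04 * np.eye(3)

    x_hat = np.zeros(2)                                 # arbitrary initial estimate (assumed)
    P = 100.0 * np.eye(2)                               # large initial uncertainty (assumed)
    for _ in range(20):
        z = H @ x_true + 0.2 * rng.standard_normal(3)
        x_hat, P = rls_update(x_hat, P, z, H, R)

    print(x_hat)                                        # approaches x_true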

Example of Recursive Optimal Estimate

Scalar measurement: z = x + n, with H = 1, R = 1.

x̂_i = x̂_{i-1} + [p_{i-1} / (p_{i-1} + 1)] (z_i - x̂_{i-1})

k_i = p_{i-1} / (p_{i-1} + 1)

p_i = (p_{i-1}^(-1) + 1)^(-1) = p_{i-1} / (p_{i-1} + 1)

index   p_i     k_i
0       1       1
1       0.5     0.5
2       0.333   0.333
3       0.25    0.25
4       0.2     0.2
5       0.167   0.167

x̂_0 = z_0
x̂_1 = 0.5 x̂_0 + 0.5 z_1
x̂_2 = 0.667 x̂_1 + 0.333 z_2
x̂_3 = 0.75 x̂_2 + 0.25 z_3
x̂_4 = 0.8 x̂_3 + 0.2 z_4
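A few lines of Python reproduce the gain and covariance sequence in the table; the measurement values z_i would come from data and are arbitrary here:

    # Scalar recursive least squares with H = 1, R = 1
    z = [4.2, 3.9, 4.1, 4.0, 4.3]      # arbitrary example measurements

    x_hat, p = z[0], 1.0               # x_0 = z_0, p_0 = 1
    for i, zi in enumerate(z[1:], start=1):
        k = p / (p + 1.0)              # k_i = p_{i-1} / (p_{i-1} + 1)
        x_hat = x_hat + k * (zi - x_hat)
        p = p / (p + 1.0)              # p_i = (1/p_{i-1} + 1)^(-1)
        print(i, round(p, 3), round(k, 3), round(x_hat, 3))
    # gains follow 0.5, 0.333, 0.25, 0.2, ..., and x_hat equals the running average of z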

Optimal Gain and Estimate-Error Covariance

K_i = P_{i-1} H_i^T (H_i P_{i-1} H_i^T + R_i)^(-1)

P_i = (P_{i-1}^(-1) + H_i^T R_i^(-1) H_i)^(-1)

- With a constant measurement-error covariance matrix, R:
  - Error covariance decreases at each step
  - Estimator gain matrix, K, invariably goes to zero as the number of samples increases
- Why?
  - Each new sample has a smaller effect on the average than the sample before

Next Time:
Propagation of Uncertainty in Dynamic Systems

Supplemental Material

Weighted Least Squares ("Kriging") Estimates (Interpolation)

- Can be used with arbitrary interpolating functions
- http://en.wikipedia.org/wiki/Kriging

Weighted Least Squares Estimates of Particulate Concentration (PM2.5)

[Figures: Delaware Sampling Sites; Delaware Average Concentrations; DE-NJ-PA PM2.5 Estimates - Delaware Dept. of Natural Resources and Environmental Control, 2008]
