M Hayes Statistical Digital Signal Proc Part 2

FINITE DATA RECORDS __187 Note that these equations are singular, indicating that a lower model onder is possible. ‘Therefore, setting a(2) =O and solving for a(1) we find re, O)/re(1, 2) = a(l) = Thus, Ag) =1tz" and, with 6(0) = 1, the model is, L Tre As a result, the model produces an exact fit to the data for Before concluding, we mention some modifications and generalizations to the covari- ‘ance method that may be appropriate in certain applications. Recall that in the development of the covariance method it was assumed that x(n) is only known or specified over the finite interval (0, N — 1]. As a result, Prony’s error was modified so that it included only those values of e(n) that could be evaluated from the given data. Specifically, the covariance ‘method minimizes the error & Yet? (4.132) However, if x(n) begins at timen = 0, ie.,x(n) =O for < 0, then e(n) may be evaluated for all n < NV. Therefore, the covariance method may be modified to minimize the error EF =) lem? = In this case, the all-pole coefficients ae found by solving the covariance normal equations in Eq (4.127) with the autocoreltions r,(k, 1) computed as follows relkD = x(n = Dx" = 8) 4.133) ‘This method is sometimes referred to as the prewindowed covariance method. In a similar vein, if x(n) is known to be equal to zero for > NV, then we may minimize the error ef = Yel? Which leads to the post-windowed covariance method. Again, the all-pole coefficients are found by solving Eq, (4.127) withthe autocorrelation sequence computed as in Eq. (4.133) with the sum extending from n = pon = 00. Each form of covariance method may also be modified to accommodate the case in which x(n) is given over an arbitrary interval, say [Z., U]. Forexample, with the covariance method the error to be minimized would become188 SiGNALMODEUNG and the normal equations that must be solved are the same as Eq. (4.127) with rey = x(n — Dx" — by Finally, although derived only for he case of all-pole modeling, the covariance method may also be used for pole-zero modeling. In this case, Eq. (4.132) would be modified as follows a= lem? saleri.p) and the coefficients are found by solving the normal equations Eq, (4.35) with rk D= YY x(n —Dxt(n— 8 max.) 4.7 STOCHASTIC MODELS In the preceding sections, we looked at different approaches for modeling deterministic signals. In each ease, the values of x(n) were known, either forall» or for values of n over some fixed finite interval. In some applications, however, itis necessary to develop models for random processes —signals whose values are unknown and may only be described probabilisticlly. Examples include signals whose time evolution is affected or driven by random or unknown factors, as isthe case for electrocardiograms, unvoiced speech, pop- lation statistics, sunspot numbers, economic data, seismograms, and sonar data. Models for random processes differ from those for deterministic signals in two ways. First, since a random process may only be characterized statistically and the values of x(n) are only known in a probabilistic sense, the errors that are minimized for deterministic signals are no longer appropriate. Recall for example, that with Prony's method the coefficients a(k) sare found by minimizing the deterministic squared error & DY lewr Y [x94 Pap@xe— of 4.134) Therefore, if x(n) is only known probebilistcally, then it izing £,. The second difference is in the characteristics of the signal that is used as the input to the system that i used to model x(n). Whereas for deterministic signals the input signal was chosen to be a unit sample, for a random process the input signal must be @ random process. Typically, this input will be taken to be unit variance white noise. With these two differences in mind, in this section we consider the problem of modeling 1 wide-sense stationary random process. As we will see, the Yule-Walker equations will play an important role in the development of stochastic modeling algorithms. We begin, in Section 4.7.1, with the problem of modeling an autoregressive moving average process. We will develop two approaches, the Modified Yule-Walker Equation (MYWE) method and the least squares MY WE method. Both of these approaches are based on solving the Yale~ Walker equations. Then, in Section 4.7.2, we consider the special case of autoregressive (all-pole) processes. Here we will see that all-pole modeling ofa random process is similar to all-pole modeling of a deterministic signal. The primary difference is in how the autocorrelations are defined in the stochastic all-pole normal equations. Finally, in Section 4.7.3, weSTOCHASTIC MODES __189 consider the problem of modeling a moving average (all-zero) process and develop two new approaches. The first, spectral factorization, requires factoring a polynomial of order 2g for a gth-order moving average process. The second, Durbin’s method, avoids polynomial rooting by modeling the moving average process with a high-order all-pole model and then. forming an estimate of the moving average coefficients from the all-pole coefficients 4.7.1 Autoregressive Moving Average Models ‘As we saw in Section 3.6.1, wide-sense stationary ARMA(p, 4) process may be generated by filtering unit variance white noise u(n) with causal linear shiftinvariant filter having poles and g zeros, HG) Therefore, a random process x(n) may be modeled as an ARMA(p, 4) process using the ‘model shown in Fig. 4.21 where v(n) is unit variance white noise. To find the filter cocffi- ccients that produce the best approximation £(n) to x(n) we could take the approach used. in Section 4.2, replacing the least squares error £5 with a mean square error Eus = E{|x(n) ~ 8m)|?} However, this would lead to a set of nonlinear equations just as it did in the least squares ‘method of Section 4.2. Therefore, we will consider another approach, In Section 3.6.1 (p. 110), we saw that the autocorrelation sequence of an ARMA(p, 4) process satisfies the Yule-Walker equations"? 2 | rk) +) ap(re(k — 1) = cg(k) (4.135) where the sequence ¢,(k) is the convolution of by (K) and h"(—k), ee cg (k) = by (k) hi (—k) = bg (1+ ATO) (4.136) vn) Ba(z) Fin) —_4 #= Ape Figure 4.21 Modeling a random process x(n) as the response ofa linear shift-invariant filter to unit variance white "Note that 2 in Eg. (3.115) has bee St equal 1 one since here wear assuming that») has unit variance.90 SIGNAL MODEUNG. and r; (k) is a statistical autocorrelation, relk) = E{xtn)x*(n —&)} Since h(n) is assumed to be causal, then c,(k) = O for k > g and the Yule-Walker equations for k > q are a function only of the coefficients ap(k), rik) + ajrk—D =0 ¢ k>g 4.137) Expressing Eq. (4.137) in matrix form fork = q+ 1g +2,....4 + pwehave r@) ng V + reg p+ TL apf) ng +d ng +l) rQ) req p+2) || aj) rg +2) reg + PW) reg tp—2 re) ap?) rg P) (4.138) which is a set of p linear equations in the p unknowns, ap(k). These equations, referred to as the Modified Yule-Walker equations, allow us to find the coefficients «p(k) in the filter H(2) from the autocorrelation sequence r,(k) for k = 4.q + 1,..-.g + p. This approach, therefore, is called the Modified Yule-Walker Equation (MYWE) method. If the autocorrelations are unknown, then they may be replaced with estimated autocorrelations #,(k) using a sample realization of the process. ‘Comparing the modified Yule-Walker equations to the Padé equations for the denom inator coefficients given in Eq. (4.13), we see that the two sets of equations have exactly the same form, In fact, the only difference between them is in the definition of the “data sequence.” In the modified Yule-Walker equations, the “data” consists of the sequence of autocorrelations r,(k), whereas in the Padé approximation, the data are the values of x(n). ‘As in the Padé approximation method, since the matrix in Eq, (4.138) is a nonsymmetric ‘Toeplitz matrix, these equations may be solved efficiently for the coefficients a(k) using the Trench algorithm [18]. Once the coefficients ap(k) have been determined, the next step is to find the MA coefficients, by(k). There are several approaches that may be used to accomplish this. If x(n) isan ARMA(p, q) process with power spectrum By(2)BS(A/2") Ap@Ag/2) Pa) = then filtering x(1) witha linear shift-invariant filterhaving asystem function 4 (2) produces, an MA(@) process, (1), that has a power spectrum (2) = By (2) BY(1/2") ‘Therefore, the moving average parameters by (K) may be estimated from y(n) using one of, the moving average modeling techniques described in Section 4.7.3. Alternatively, we may avoid filtering x(n) explicitly and solve for ,(k) as follows. Given ay(), the Yule-WalkerSTOCHASTIC MODELS ‘equations may be used to find the values of the sequence ¢y(k) for k= 0,1, .--.ds nO) FQ) 1 40) rDrx(0) p=) || apt GQ) rg) ng +0) 7.0) a(0) (a) which may be written as Reap = (4.139) Since ¢4(K) = O for k > q the sequence ¢,(k) is then known forall k > 0. We will denote the z-transform of this causal or positve-time part of c,(k) by [Co(2)] Similarly, we will denote the anticausal or negative-time partby [Cy(2)]_. Selb . Recall that cy (k) is the convolution of b,(K) with h*(—K). Therefore, [c,@], Cy(z) = By(2)H*(/" Multiplying C,(z) by 45(1/2") we have 4.140) which is the power spectrum of an MA(g) process. Since ap(k) is zero for k < 0, then 31/2") contains only positive powers of z. Therefore, with Pyle) = Cyl2VA5 (1/2 [Q@laaey+[Q@l4ya/) G14 since both [C,(z)]_ and 43(1/z*) are polynomials that contain only positive powers of 2, then the causal part of P,(2) is equal to [Re], = [lee |. As/2) (4.142) ‘Thus, although ¢, (k) is unknown for k < 0, the causal part of P,(z) may be computed from the causal part of cy(k) and the AR coefficients a,(k). Using the conjugate symmetry of P,(z) we may then determine P,(z) for all z. Finally, performing a spectral factorization92_ SIGNAL MODEUNG. of Py(2), 4.143) produces the polynomial B, (: ‘The procedure is illustrated in the following example. Example 4.7.1. The MYWE Method for Modeling an ARMA(I,1) Process ‘Suppose that we would like to find an ARMA(I,1) model for a real-valued random process x(n) having autocorrelation values, (0) = 26 re =T 3 (2) =7/2 ‘The Yule-Walker equations are 0) re() ‘i <0) Q) re() : ‘Thus, the modified Yule-Walker equations are r(Way(1) = —rg2) which gives a (1) = =r(2)/re(l) = =1/2 For the moving average coeflicients we begin by computing cy(0) and (1) using the ‘Yule-Watker equations as follows nO ne) JP 1 Jf re) re) J a(t) el) With the given values for r.(k), using (1) = ae wwe find Lo 7 [*% [S03 % -in -6 [ee], 0.52 we have ‘Multiplying by 4y(1/2*) = 21) (1-052) = M24 8-6! ‘Therefore, the causal part of P, (nel. =[leve].aia29] Using the symmetry of Py (2), we have cai Performing a spectral factorization gives Bue)BU(/=") = 2401 3 By(2) BU(1/z") = ~62 + 3 — Na - 42)STOCHASTIC MODES__193 so the ARMA(I,1) model is = 0.252"! H@) =W6 a Just as with the Padé approximation, the MYWE method only uses the values ofr, (K) fork =9.q+1,....q+ ptoestimate the coefficients a,(k). If, (R) is unknown and must be estimated from the data, then the accuracy of the model will depend on how accurately ‘ra(B) may be estimated. However, suppose that we may estimate r,(K) for k > p+q. This would certainly be possible, for example, if 3 p-+g where Nis the length of x(n). How may these estimates be used to improve the accuracy of the coefficients ay(k)? As we did in ‘extending the Padé approximation method to Prony’s method, we may consider the problem of finding the set of coefficients a(R) that produces the “best fi,” ina least squares sense, 10 a given st of autocorrelation values. For example, given the autocorrelation sequence rx(k) fork =0, 1,..., L, we may form the sct of extended Yule-Walker equations by evaluating Eq. (4.135) fork = 0, 1, ..., L as follows: rs(0) nd 77Q) (py €q(0) rl) 7) nd) ri(p-V 1 eq(l) @,(1) re) eq —) req 2) ra(q— P) 02) | =! cq) n@tD n@) n@-) req — P+ 1) 0 : : : ap(p) r(L) (LV r(L—2) +++ (Lh = pd 0 (4.145) From the last L ~ q equations we have nq) re(q— 1) nq - p+) a,(1) nq@tl) ngt 0 re + rdg—p+2 || aj) rq +2 @+) (q) (q = ) 0G ) a (9 +2) (4.146) r(L—1) rb 2) + ra(L ~ p) app) rely Ryfiy = —ty41 (4.147) Which is an overdetermined set of linear equations in the unknowns ap(k). As we did in Prony’s method (p. 154), we may find the least squares solution, which is found by solving. the equations (R/Re) RM a (4.43) ‘where (R!'R,) isa px p Hermitian Toeplitz matrix whose elements are the autocorrelations of the sequence r,(k).194 __ SIGNAL MODEUNG 4.7.2 Autoregressive Models Awide-sense stationary autoregressive process of order pisaspecial case of an ARMA(p, 4) process in which q = 0. An AR(p) process may be generated by filtering unit variance white noise, v(1), with an all-pole filter of the form H@) = —2 caus 14+ Yantyz* Just as with an ARMA process, the autocorrelation sequence of an AR process satisfies the Yule-Walker equations, which are 2 rob) + Do ap(Dre(k 1) = BOP) > k=O (4.150) ‘Writing these equations in matrix form for k = 1, 2,... p, using the conjugate symmetry, of r.(k), we have (0) na 2) rb ay(l) rl) re) (0) nad re(p—2) ay(2) (2) rx(2) r(l) 7.0) re(p—3) a,(3) | 2G) PD reP—D rp- AO) a(P) rp) asp, Therefore, given the autocorrelations r,(k) for k = 0, 1..., p we may solve Eq. (4.151) for the AR coefficients ap(k). This approach is referred to as the Yule-Waalker method. If we compare Eq. (4.151) to the normal equations for all-pole modeling of a deterministic signal using Prony’s method, Eq, (4.79), we see that the two sets of equations are identical, The only difference between them is in how the autocorrelation r; (A) is defined. In the all-pole Prony method, r-(k) is a deterministic autocorrelation, whereas in the Yule-Walker method ‘r<(k) is a statistical autocorrelation. Finally, to determine the coefficient 6(0) in the AR model we may use the Yule-Walker equation for = 0 as follows [B(O)? = r.(0) + > ap(kyre(K) (4.152) In most applications, the statistical autocorrelation r, (k) is unknown and must be esti- ‘mated from a sample realization of the process. For example, given x(n) for 0 del + lkDOZD (4.154) ra Note that, unlike the case for an autoregressive process, these equations are nonlinear in the model coefficients, by (). Therefore, even if the autocorrelation sequence were known exacly, finding the coefficients by () may be difficult. Instead of attempting to solve the Yule-Walker equations directly, another approach is to perform a spectral factorization of the power spectrum P,(z). Specifically, since the autocorrelation of an MA(q) process is ‘equal to zero for |k| > q. the power spectrum is a polynomial of the form By(2)Bs(1/2") “ “ = [6,07 [ [a - Bez") Ja - 62) (4.155) where Belz) = J b_(B)2* rd Using the spectral factorization given in Eq, (3.102), P,(z) may also be factored as follows Pela) = 09 OC) ON(A/2 of[]Ja-ash [Jaa 4.156) ‘where Q(z) is a minimum phase monic polynomial of degree q, i¢.,jax| < 1." Comparing Eqs. (4.155) and (4.156) it follows that by (0) = o9 and that (2(2) is the minimum phase version of B, (2) that is formed by replacing each zero of By (2) that lies outside the unit circle with one that lies inside the unit circle at the conjugate reciprocal location. Thus, given the autocorrelation sequence of an MA(g) process, we may find model for x(n) as, follows. From the autocorrelation sequence r, (k) we form the polynomial P,(z) and factor it into a product of a minimum phase polynomial, Q(2),and a maximum phase polynomial Q"(1/z*) as in Eq, (4.156). The process x(n) may then be modeled as the output of the ‘minimum phase FIR filter ince there is nothing that proibis B=) andths Q(=) fom having 2s onthe unit ice, here we are allowing the factors of (2) 9 have roots onthe unit circle196 siGNALMODEUNG driven by unit variance white noise. It should be pointed out, however, that this model is not unique, For example, we may replace any one or more factors (1 — axz~') of Q(z) with factors of the form (1 — a2). The following example illustrates how spectral factorization ‘may be used to find a moving average model for a simple process. Example 4.7.2. Moving Average Model Using Spectral Factorization Consider the MAI) process that has an autocorrelation sequence given by re(k) = 1788) +4[50k — 1) +80 + 1] ‘The power spectrum is the second-order polynomial P(e) = Tae! 4 de = [44 172 4 ae] Performing a spectral factorization of P,(z) we find Pea) =2(4t 214 de") = (44. 2°)(442) ‘Therefore, x(n) may be modeled as the output of an FIR filter having a system function equal to either =44+2 +4e4 ‘As an alternative to spectral factorization, a moving average model for a process x(n) may also be developed using Durbin's method [5]. This approach begins by finding a high-order al-pole model 4p(2) for the moving average process. Then, by considering the coefficients of the all-pole model a,(A) to be a new “data set.” the coefficients of a qth order moving average model are determined by finding a qth-order all-pole model for the sequence ap(k). More specifically, let x(n) be a moving average process of order q with By) = bth so that x(n) = bE — b) where w(n) is white noise. Suppose that we find a pth-order all-pole model for x(n) and. that p is large enough so that"* 1 Baz) (a.s7 4,@) ay(0)-+ Yapthye For example, if, By(2) = b(0) ~ BYE "Not that the coefficient hat normally appears in he numerator ofthe all-pole mode! has ben absorbed nto the denominator. thereby making the coefficient a (0) some value exer than one n generalogee aaa meeae te ee eeaeae OCW Motes 222197 and [6(0)| > |6(1)| then 1/8, (2) may be expanded in a power series as follows : : Sgt B@ 7 HO =o * FO LP where bu) = 50) ‘Therefore, if p is sufficiently large so that 6” ~ 0 then By(z) may be approximated by the ae 5) + Bete + BPz ‘Once ahhigh-order all-pole model for x(n) has been found, itis then necessary to-estimate the MA coefficients b, (k) from the all-pole coefficients ap(k). From Eq, (4.157) we see that since (4.158) Bia) = @*7 1 Agl2) © ee bg (0) + > by(K)2* 3 then 1/B, (2) represents a qth-order all-pole model for the “data” ap(k). The coefficients of the all-pole model for a)(k) are then taken as the coefficients of the moving average model. ‘Thus, Durbin’s method may be summarized as follows. 1, Given x(n) for n =0, 1.4. — Lor re(h) for k = 0,1, -.2,.N = Lea pth-order all-pole model is found and the coefficients are normalized, as in Eq. (4.157). by the gain (numerator coefficient) of the all-pole model. Typically, the model order p is chosen so that itis at least four times the order q of the moving average process [17] 2. Using the AR coefficients derived in Step 1 as data, a gth-order all-pole model for the sequence ap(k) is then found. The resulting coefficients, after dividing by the gain term, are the coefficients of the moving average model. A MATLAs program for Durbin's method that uses the autocorrelation method for both, all-pole modeling problems is given in Fig. 4.22. Durbin’s Method function b = durbin(x.p.@) . x =x) Ef poelength(x), ervor(’Model order too large’), end [avepsilon) = acm(x,p); Ubyepsiton] = acm(a/sart (epsilon) ,4): b = b/sart (epsilon) fends Figure 4.22 A Matas program for finding the moving average coefficients of an al-zero model fora signal x(n) using Durbin's method. Note that this program calls acm. (see Appendix).198 SIGNAL MODEUNG Example 4.7.3 The Durbin Algorithm To illustrate how the Durbin algorithm is used to find a moving average model, consider the signal x(n) that has a z-transform Xz) br where f isa real number with |B} <1. The frst step in Durbin’s method requires that we finda high-order all-pole model for x(n). Since 1 1 Say rae de then the all-pole model for x(n) using the autocorrelation method with p > 1 i approximately 1 TH Betas Bree ‘The next step is to fit a qth-order all-pole model to the sequence (4 Example 4.6.1 (p. 179) a first-order all-pole model for this sequence is va Tate H@) BA As we saw in He) where a and La pier rea ‘Therefore, assuming that 6?” < 1, so that ¢1 ~ 1, the first-order moving average model is, Note that if, instead of the autocorrelation method in the second step, we were to use the covariance method, then we would have, as we saw in Example 4.6.2 (p. 184), ac) =—6 and the first-order moving average model would be Buz) be! 4.7.4 Application: Power Spectrum Estimation Spectrum estimation is an important application of stochastic signal modeling. As we saw in Section 3.3.8, the power spectrum of a wide-sense stationary random process x(n) is Pe) = D> rele He (4.159)STOCHASTIC MODES _199 where relk) = E[x(n)x"(n ~ 1} (4.160) is the autocorrelation of x(n). Since r(k) is generally unknown, spectrum estimation iscon- ‘cerned with the estimation of P,(e/#) fromasample realization of x(n) forn = 0,1, .... N. One approach that may be used to estimate P,(e/) is to estimate r,(k) from x(n) and then use this estimate in Eq. (4.159). However, with only N+ 1 values of x(n) the autocorrelation may only be estimated for lags |k| < NY, and the power spectrum estimate would have the form Pye) D Fete (4.161) : This estimate is limited by two factors. First, since estimated autocorrelations are used instead of the true values, the accuracy of P,(e/) will be limited by the accuracy of the estimates 7, (K). Second, since P,(e/) does not include any estimates ofr, (k) fork] > N, the power spectrum estimate willbe limited in resolution unless r_() ~ 0 for [kl > N. The estimation of P, (e) may be facilitated if something is known about the process x(n) in addition to the signal values. For example, suppose that x(7) is known to be an autoregressive process of order p. Since the power spectrum of an AR(p) process is 2 Py(ei®) = Or 4.162) i Bee then we may use the Yule-Walker method with estimated autocorrelations to estimate the coefficients a)(k) and b(0), and then use these estimates in Eq. (4.162) as follows: 1B [1+ dye"? ft Assuming that the estimates of the model coefficients are sufficiently accurate, this approach ‘may result in a significantly improved spectrum estimate. ca) (4.163) Example 4.7.4 AR Spectrum Estimation ‘Shown in Fig, 4.23 are N = 64 samples of an AR(4) process that is generated by filtering unit variance white noise with the fourth-order all-pole filter 0) a@ = 20 _ 14+ Dae & where b(0) = 1 and a(1) = 0.7348; a(2)= 1.8820 ; a(3)=0.7057 ; a4) which corresponds toa filter having a pair of poles at z = 0.98¢*/*/? and a pai 0.96e*/°"/". Estimating the autocorrelation sequence for |k| < NV using 1M x xt ~h) FP)0 10 20 30 40 50 60 — “ @ 6) Figure 4.24 Power spectrum estimation. (a) The estimate of the power spectrum by using estimated ‘autocorrelaions. (b) The estimate of the power spectrum using the Yule-Walker method. In both “figures the true power spectrum is shown by the dashed line. and substituting these estimates into Eq, (4.161) we obtain the estimate of the spectrum shown in Fig. 4.24a, On the other hand, using these estimates of r,(k) in the Yule-Walker equations, Eq, (4.151), we find the following estimates for b(0) and a(t), (0) = 1.4844 ; a(1) =0.7100 ; a) = 1.6198 ; a(3) = 0.6193 ; a(4) = 0.6908 Incorporating. these estimates into Eq. (4.162) gives the spectrum estimate shown in Fig. 4.246, As we will see in Chapter 8, there are two limitations with this model-based approach to spectrum estimation. First of all, it assumes that we know how x(n) should be modeled. In Example 4.7.4, for example, it was assumed that we knew that x(n) was an AR(p) process. Ifx(n) is not an autoregressive process and cannot be sufficiently approximated as an AR process, then this approach will lead to an erroneous spectrum estimate. The second problem lies in the requirement that the order of the process be known. If p is unknown, then some criterion must be used to select an appropriate value for the model order. If this criterion does not produce a precise estimate for p, then x(n) will either be overmodeledee aaae ea 208 (p too large) or undermodeled (p to0 small), Both cases will generally result in inaccurate spectrum estimates. 4.8 SUMMARY In this chapter, we presented a number of different approaches to signal modeling. The first part of the chapter was concerned with modeling deterministic signals. Beginning with the least squares (direct) method, we found that this approach was only of limited use since it required finding the solution to a set of nonlinear equations. However, in those instances here it is necessary to find the true least squares solution, we saw that the method of iterative prefltering could be used. We then looked at the Padé approximation, which only requires finding the solution to a set of linear equations and produces a model that exactly matches the data forthe first p +q +1 values, where p is the number of poles and q is the number of zeros in the model. In most applications, however, this property of perfect matching over a fini is overly restrictive and compromises the accuracy of the ‘model for valucs of n outside this interval. We then developed Prony’s method, which relaxes the perfect matching property of the Padé approximation and finds @ model that is ‘more accurate, on the average, for all values of. AS with the Padé approximation, Prony’s method only requires finding the solution to a set of linear equations. After discussing Shanks’ method, which modifies Prony’s method in the way that the numerator coefficients, are determined, we looked at the special case of all-pole signal modeling using Prony's ‘method. The advantage of an all-pole model is that the normal equations are Toeplitz, ‘and may therefore be easily solved using the Levinson-Durbin recursion (Chapter 5). The Toeplitz structure also provides an advantage in terms of storage and reduces the amount of computations required to evaluate the autocorrelations in the normal equations. Finally, ‘we considered the autocorrelation and covariance methods, which address the issue of how to determine the model when only a finite length data record is available. Here we saw that there is a tradeoff between computational efficiency and model accuracy. Specifically, ‘whereas the autocorrelation method preserves the Toeplitz structure ofthe all-pole normal ‘equations and guarantees a stable model, it requires thatthe data be windowed, which results in a decrease in the accuracy of the model. The covariance method, on the other hand, does not apply a window to the data and is therefore more accurate in its determination of the ‘model coefficients, but the Toeplitz structure of the normal equations is lost along withthe guaranteed stability of the model In the second part of the chapter, we considered briefly the problem of modeling a random process. Beginning with an ARMA process, we saw that the Yule-Walker equations could be used to find the coefficients of the model. In the case of an autoregressive process, the Yole-Walker equations are identical to the Prony all-pole normal equations. Therefore, with estimated autocorrelations, stochastic all-pole modeling is essentially equivalent to deterministic all-pole signal modeling. Finally, for a moving average process, since the Yale-Walker equations are nonlinear in the filter coefficients we looked at two methods for deriving the model parameters, The first involved a spectral factorization of the power spectrum of the process, whereas the second derived the model through a succession of all-pole models. Before concluding this chapter it should be pointed out that an important problem in signal modeling has been, to this point, overlooked. This is the problem of model order202 SIGNAL MODEUNG estimation. In each case that we have considered thus far, we have assumed that a model of ‘given order was to be found. In the absence of any information about the correct model order, it becomes necessary to estimate what an appropriate model order should be. As one ‘might expect, misleading information and inaccurate models may result if an inappropriate ‘model order is used. At this point, however, we will simply note that although a number of different mode! order estimation techniques are available, all of them are somewhat limited in terms of their robustness and accuracy (A more detailed discussion of model ‘order estimation may be found in Chapter 8). References 1. B.S. Atal and J. R. Remda, "A new model of LPC excitation for producing natural-sounding speech at low bit rates," Proc. IEEE In. Conf. on Acoust. Speech, Sig. Proc, Patis, pp. 614-617, May 1982, 2. F. Brophy and A. C. Salazar, “Considerations ofthe Padé approximant technique inthe synthesis ‘of recursive digital ites," /EEE Trans. Audio and Electroacoust, pp. 00-505, December 1973. 3. R. V. Churchill and J. W. Brown, Introduction 10 Complex Variables and Applications, 4th ed., ‘McGraw-Hill, New York, 1984, 4,1. R, Deller, J. G, Proakis, and J, H. L. Hansen, Discrete-time Processing of Speech Signals, ‘MacMillan, New York, 1993. 5. J. Durbin, “Efficient estimation of parameters in moving-average models.” Biometrica, vol. 46, P. 306-316, 1959. 6. JH. McClellan, “Parametric Signal Modeling.” Chapter | in Advanced Topics in Signal Process- ing, Prentice-Hall, Englewood Cliffs, NJ, 1988. 17. M. Morf, B. Dickenson, T Kailath and A. Vieira, “Efficient solution of covariance equations for linear prediction.” IEEE Trans. Acoust, Speech, Sig. Proc., vol. 25, pp. 429-433, October 1977. 8, A.V. Oppenheim and R. W. Schafer, Discrete-Time Signal Processing, Prentice-Hall, Englewood lifts, NI, 1989, HE. Padé, “Surlareprésentation approchéed'une fonction pur des fractions rationelles,” Annales Scientifque de !'Ecole Normale Supérieure, vol. 9, no. 3 (supplement), pp. 1-93, 1992. 10. G.R.B, Prony, “Essai expémental et analytque sur les lois de la dilatablité de ides elatiques ‘¢t sur celles de la force expansion de ls vapeur de I’alcool, différentes températures.” Journal de Ecole Polytechnique (Paris), vol. 1, no. 2, pp. 2476, 1795. 11, L.R. Rabiner and R, W. Schafer, Digital Processing of Speech Signals, Prentice-Hall, Englewood Clifis, NI, 1978 12, R.A. Roberts and C. T. Mullis, Digital Signal Processing, Addison Wesley, Reading, MA. 1987. 13, E.A. Robinson and S. Treitel, Geophysical Signal Analysis, Prentice-Hall, Englewood Cliffs, NI, 1980, 14, J.L, Shanks, “Recursion filters for digital process 1967, 15, K.Steigltz and L. E. McBride, “A technique forthe identification of linea systems," EEE Trans. (on Automatic Control, vol. AC-10, pp. 461~464, October 1965. 16. K.Steiglitz, "On the simultaneous estimation of poles and zeros in speech analysis,” IEEE Trans. Acoust, Speech, Sig. Proc., vol. 25, pp. 229-234, June 1977. 17. C. W. Therrien, Discrete Random Signals and Statistical Signal Processing, Prentice-Hall, En- glewood Ciffs, NJ, 1992 18, W.F Trench, “An algorithm forthe inversion of nite Toeplitz matrices; pp. 512-522, 1964. * Geophysics, vol. 32, pp. 33-51, February SIAM, vol. 12, 00.3,PROBLEMS 203 4.9 PROBLEMS 4.1. Find the Padé approximation of second-order to a signal x(n) that is given by (210 101001 | ive., x0) = 2,x(1) = 1, x) the form 0, and so on. In other words, using an approximation of (0) + bz"! + BQ)z? Tea(he! FaQz find the coefficients b(0), (1), b(2), a(1), and a(2) 4.2. A third-order all-pole Padé approximation to a signal x(n) has been found to be 1 Tea te? +3 ‘What information about x(n) can be determined from this model? H@) HG) 4.3. Suppose that a signal x(n) is known to be of the form Yacaytum where the A,’s are distinct complex numbers, x(n) (@) Show thatthe Padé approximation method can be used to determine the parameters, and Ay for k = 1,2,..., L. 1s the answer unique? (b) The first eight values of a signal x(n), which is known to be of the form given above with [32, 16, 8, 12, 18, 33, 64.5, 128.25)" Determine cy and Ay for k = 1, 2, 3. 4.4, Aconsumer electronics device includes a DSP chip that contains a linear shift-invariant, digital filter that is implemented in ROM. In order to perform some reverse engineering on the product, it is necessary to determine the system function ofthe filter. Therefore, the unit sample response is measured and itis determined that the first eight values of h(n) are as listed in the following table. Unit sample'response mele o ina aa sgy: ay [Fsieeatigeea tain Having no knowledge of the order of the filter, itis assumed that /1(z) contains two poles and wo zeros. (@) Based on this assumption, determine a candidate system function, H(z), forthe filter. (b) Based on the solution found in (a) and the given values for /(n), is it possible to determine whether or not the hypothesis about the order of the system is correct? Explain.DO soni cei ees er etre ue aeaeai 4.5. The Padé approximation models a signal as the response of a filter to a unit sample input, 6(). Suppose, however, that we would like to model a signal x(n) asthe step response of a filter as shown in the following figure un) pre) gy 20 sa aiaaG) In the following, assume that H(z) is a second-order filter having a system function of the form (0) + b(1)27 + BQ)z" T+ a()e? +aQ)2" (a) Using the Padé approximation method with a unit step input, derive the set of equations that must be solved so that H@) (nm) = x(n) for n= 0,1,...4 (b) Ifthe frst cight values of x(n) are 1,0,2, 1, 2,0, 1, 2)" find (0), b(1), 6(2), a(1), and a(2). 4.6, With a real-valued signal x(n) known only for m = 0, 1, ...,V, the backwards covariance method finds the coefficients of the all-pole model that minimize the backward prediction error = Sie onr where ep (n) = x(n — p) + Ya(Wx(n +k = p) (a) Show that the coefficients a,(A) that minimize & satisfy a set of normal equations of the form Ry where Ay = [ap(1), 0902), «-. app]? and find explicit expressions for the entries in Ry and F. (b) Is the solution to the backwards covariance method the same as the solution to the covariance method? Why or why not? (©) Consider a new error that is the sum of the forward and backward prediction errors, eha Yfejmr +le,P|gaa ace ee] where ¢; (1) i the backwards prediction error defined above, and (n) i the forward prediction error used in the covariance method, p ef) = x00) + Dagx(n — 8) Derive the normal equations for the coeffi known as the Modified Covariance Method.) (@) Consider the signal Ef. This approach is BPs m=O With p = 1 find the fist-rder all-pole model that minimizes £2 and determine the value of £2. For what values off is the model stable?” What happens to the model and the modeling error as N > 00? x(n 4.7, Suppose that we would like to derive a rational model for an unknown system S using the approach shown in the following figure, xn) vin) en) s Aw Be) For a given input x(n) the output of the system, y(n), is observed. The coefficients of the two FIR filters A(z) and B(2) that minimize the sum of the squares of the error signal e(t) are then to be determined. Assume that the sum is for all n > Oas in Eq. (4.73) (a) Derive the normal equations that define the optimal solution for the coefficients of A(z) and B(2), (b) The philosophy ofthis method is that ifthe error is small then B(2)/ A(z) is areasonable ‘model for S. Suppose that S is a linear shift-invariant system with a rational system function. Show that this method will identify the parameters of S exactly assuming that the orders of the filters A(z) and B(2) are chosen appropriately (15) Consider a signal, x(n), which is the unit sample response of a causal all-pole filter system function ‘We observe x(n) over the interval 0 1. (a) Using the covariance method, we determine a third-order all-pole model for x(n) ‘What, if anything, can you say about the location of the poles in the model? Do the pole locations depend on NV? If so, where do the poles go as N > 00? (b) Repeat part (a) for the case in which you use the autocorrelation method. 4.9, Equation (4.129) may be used to reduce the amount of computation required to set-up the covariance normal equations.206 SIGNAL MODEUING (a) Show that the elements along the main diagonal may be computed recursively beginning with r5(1, 1) (b) Show how the elements along the lower diagonals may be computed recursively beginning with rs (&, 1). How may the terms along the upper diagonals be obtained? (€) Determine how many multiplies and adds are necessary to set-up the covariance normal ‘equations (do not forget the evaluation of the vector on the right-hand side). 4.10, We want to model a signal x(n) using an all-pole model ofthe form nee (0) Derive the normal equations that define the coefficients a(k) that minimize the Prony error Yew? where e(n) = x(n) + Yap(Dx(n and derive an expression for the minimum error. 4.11. Suppose that we would like to model a signal x(n) as shown in the following figure. 3(n) Ra) ———| 4H - where h(n) is an allpole filter that has a system function ofthe form 2) 14 Daye fi H@, Modify the Prony normal equations so that one can determine the coefficients a,(k) in H(z) from a sequence of signal values, x(n) 4.12, Suppose that we would like to model a signal x(n) that we believe tobe quasiperiodic. Based on our observations of x(n) we estimate the autocorrelations through lag k = 10 to be re) =[L0, 04, 04, 0.3, 0.2, 0.9, 0.4, 04, 0.2, 0.1, 0.7)" (@) In formulating an all-pole model to take into account the suspected periodicity let usPROBLEMS 207 consider a two-coefficient model of the Form 50) T+ aG)z3 + a(l0y-” Find the values for the coefficients a(5) and a(10) that minimize the all-pole modeling HG@)= (b) Compare the error obtained with the model found in (a) to the error that is obtained with a model of the form EO} Trae! +aQe (©) Now consider an all-pole model of the form HQ) = bo) T+ a) ‘where both a (NV) and NV are considered to be model parameters. Find the value for a(V) ‘and N that minimize the all-pole modeling error and evaluate the modeling error. He= 4.13, We would like to build a predictor of digital waveforms. Such a system would form an estimate of a later sample (say np samples later) by observing p consecutive data samples. 2 ‘The predictor coefficients ak) are to be chosen to minimize 01+ 19) Ep =D [x + 9) — $0 + 0) = (a) Derive the equations that define the optimum set of coefficients ap(. (b) If'ny = 0, how is your formulation of this problem different from Prony’s method? 4.14. You are told that itis always possible to determine whether or not a causal all-pole filter is stable from a finite number of values of its unit sample response. For example, if H(2)isa pth-order all-pole filter, given h(n) for m = 0, 1, ..., N then the stability of H(z) may be determined, If this is rue, explain the procedure and list any conditions that must 'be placed on p or N. If false, explain why it cannot be done. 4.18. Let H(z) be a first-order model for a real-valued signal x(1), and let et i: Eis = De) — hy? be the error that is to be minimized. By setting the derivatives of Es with respect to b(0) and a(1) equal to zero, try to find an analytic solution for the values of (0) and a(1) that ‘minimize £1. (This problem illustrates how difficult the direct method of signal modeling may be, even for a first-order model.)208 SIGNAL MODEUNG 4.16, We have a signal x(n) for which we would like to obtain an all-pole model of the form 0) +a@z ‘method, find explicit formulas for 6(0), a(1), and a(2) in terms H@ =— ® ie Using the autocorrelat of (0), Pe (1), and rs (2). 4.17. If one is modeling a signal x(n) whose transform, X(2), contains zeros, then an all-pole model may be used to effectively model a zero with an infinite number of poles In this problem we look at how a zero is modeled with the autocorrelation method. Let, x(n) = 6(n) — a8(n ~ 1) where fa} < 1 and or is teal (a) Determine the pth-order all-pole model 4 p(z) for x(n) where p is an arbitrary positive integer, and find the value of the squared error E>, (b) For the all-pole model determined in part (a), what is the limit of A(z) as p + 007 What does £, converge to as p —> 00? Justify your answers. (©) Repeat parts (a) and (b) for ar] > 1 4.18, Find a closed-form expression for the FIR least squares inverse filter of length NV for each of the following systems. a} <1 4.19. An important application of least squares inverse filtering is deconvolution, which is concerned with the recovery of a signal d(n) that has been convolved with a filter g() x(n) =d(n) * gn) ‘The problem isto design a filter fiy(n) that may be used to produce an estimate of d(n) from x(n), din) = x(n) * hy(n) One of the difficulties, however, is that noise in the observed signal may be amplified by the filter. For example, if we observe yn) = d(n) * a(n) + u(n) then the filtered observations become in) + vlan) # hyn) = din) + un) Yin) Kyl) = where s(n) = vin) * hy(n) is the filtered noise, One way to reduce this noise is to design a least squares inverse filter ‘that minimizes E=) lel? +2 {lumP} =prosiems 209 where [(n) = 8(n — m0) — y(n) * tn) and 2, > is a parameter that is to be selected, Note that for large values of A, minimizing E will force a large reduction in the filtered noise at the expense of a decrease in resolution, i.e, larger e(n), whereas smaller values of 2 lead to higher resolution and larger noise. (2) Assume that »(n) is zero-mean white noise with a variance @2. Show that El |u(n)i?} = ofbY by ‘where hy is a vector containing the coefficients of the filter hy (7). (©) Derive the normal equations that result from minimizing the error E=elle+io2hhy where [e(O), e(1), ...]". and show that they may be written in the form (Re+al)hy = 2%, where @ > is prewhitening parameter that depends upon the values of 4 and gf, is the vector onthe right-side of Eq (4.101) 4.20, We are given a signal, x(n), that we want to model as the unit sample response of an all-pole filter. We have reason to believe that the signal is periodic and, consequently, the poles of the model should lie on the unit circle. Thus, assuming a second-order model for the signal, the system function is constrained to have the form 6 Trae" ee" With |a(1)| <2 this model produces a pair of poles on the unit circle at an angle @ defined, by H@) 2cos@ = -a(I) (a) Using the autocorrelation method, derive the normal equations that define the value of a(1) that minimizes the error f= Lem (b) Find an expression for the minimum error, {&}min- 4.21. Voiced speech may be modeled as the output of an all-poe filter driven by an impulse train Pas(n) = 8(n— kr) fat where the time between pulses, 9, is known as the pitch period. Suppose that we have 1a segment of voiced speech and suppose that we know the pitch period, mo. We extract a subsequence, (1), of length N = 2no and model this signal as shown in the following. figure210 SIGNAL MODEUNG. Past) 60) Hin) TEE ante ‘where the input, pp (1), consists of two pulses, Pry(t) = 5(n) + 5( — no) Find the normal equations that define the coefficients ap(k) that minimize the error y yew & & where efit) = ap(n) x(n) — Bln) # Ppa) and b(n) = b(0)8(n). 4.22. You would like to design a linear predictor of a signal x(n) but, due to hardware constraints, are limited to a two-tap predictor. However, since delays can be tolerated in the predictor, you decide to design a predictor ofthe form S(n) = a(l)x(n — Ni) + aQ2)x(n — Na) where Nj and 3 are positive integers. The goal is to find the values of a(1), a(2), Mi. and No that minimize the mean-square error £{e?(n)} where en) = x(n) ~ $(n) Assuming that x(n) is a zero mean wide-sense stationary process, you estimate the autocorrelation of x(n) and find that the values of r,(k) for k = 0,1,...,7 are as given in the following table. For k > 7, it is determined that the autocorrelation is approximately er. ‘Autocorrelation values ie seed eet Qos See rss eg eT, ri) [10 01 00 05 02 06 02 02 (a) If you were to design an optimum predictor of the form $(n) = a(1)x(n — 1), what ‘would be the mean-square error in your prediction of x(n)? What about forthe predictor 2(n) = a(I)x(n = 3)? (b) Derive a general expression for the minimum mean-square error for a predictor of the form (1) = a(1)x(1 — Mi) with your expression given only in terms of r-(k). What value of M; minimizes the mean-square error? (©) Find the values of a(1), (2), Nand Np in the two-coefficient linear predictor defined above that minimize the mean-square error E{e(n)} (@) Find the value for the mean-square prediction error for your predictor designed in part (6).COMPUTER EXERCISES 211 4.23. If r.(0) = 1, re(1) = 0.5, and 742) = 0.75, find the values of a(1), a(2), and b(0) in the following AR(2) model for x(n), x(n) + a(1)x(n — 1) + a(2)x(n — 2) = bO)w(n) where w(m) is unit variance white noise. 4.24, Use the method of spectral factorization to find a moving average model of order 2 for a process whose autocorrelation sequence is r= [3 15, 1)" 4.25. Suppose that the first five values in the autocorrelation sequence for the process x(n) ry = [3, 9/4, 9/8, 9/16, 9/32 ...]” (a) Use the modified Yule-Walker equation method to find an ARMA(I,1) model for x(n). (0) Are the given values in the autocorrelation sequence consistent with the model that you found in pat (a)? i computer Exercises (C4.1. In this problem, you are to consider the design of lowpass filters using the Padé and Prony methods. (Suppose you would like to design a lowpass filter with a cutoff frequency we = 77/2. Using the Padé approximation with p+ + 1 = 20, compare the designs that result when p = 0,2,4,....20. Which design is the best? (Note: one of the parameters that ‘you will need to experiment with in your design is the delay ng ofthe ideal unit sample response.) (©) Repeat part (a) fora narrowband lowpass filter that has a cutoff frequency «2. = 7/16. Which design isthe best and how do these filters compare to those designed in par (a)? (©) Repeat part (a) using Prony’s method and compute the minimum error €».« for each filter. Which filter is the best? Compare your designs to those obtained using Padé approximation (4) Using the m-file e11p.m in the Signal Processing Toolbox, design a tenth-order elliptic lowpass filter with a cutoff frequency « = 2/2, and evaluate the Prony error. ‘Compare this filtertothe best Prony filter found in part (€) and explain what you observe. ‘C4.2.. We have seen that the direct method of signal modeling leads to a set of nonlinear ‘equations that need to be solved for the model coefficients. Iterative prefiltering, however, js an approach that may be used to avoid having to solve these nonlinear equations. In this exercise we look at the method of iterative prefiltering and compare it to Prony’s method. (a) Let 1-09-71 +08 T1978" 2.8532? — LTTE + 0.9086— be the system function of a linear shift-invariant system. Generate the first 100 samples Of the unit sample response f(n) of this filter.212 ___ SIGNAL MODEUNG (©) Using the method of iterative prefitering, find a two-zero, four-pole model for (1). How many iterations are required for the coefficients to converge? What happens if ‘h(n) is over-modeled using p = q = 4? What about p = q = 5? (©) The model found in part (b) assumes exact measurements of the unit sample response ‘(n). Suppose that the measurements of h(n) are noisy and you observe y(n) = h(n) + v(a) where v(n) is white Gaussian noise with a variance o2. Repeat part (b) using these noisy measurements with 02 = 0.0001, 0.001, 0.01. Comment on the accuracy of your models and the sensitivity of the coefficients tothe noise variance. (@) Repeat parts (b) and (¢) using Prony's method and compare your results with those obtained using iterative prefitering. Which method works the best? C43. In this problem, we look briefly at the problem of deconvolution using FIR least squares inverse filters. Suppose that we have recorded a signal, »(n), that is known to have been blurred by a filter having a unit sample response cos(0.2{n — 25])exp{—O.01[n—25P] : O x(k)3(n — me) a ‘where the values of x() and ny are as listed in the following table. m [25 40 55 65 85 95 110 130 140 155 x [1 08 07 05 07 02 09 05 06 03 (@) Make a plot the observed signal y(n) = g(n) # x(n) and determine how accurately the amplitudes and locations of the impulses in x(n) may be estimated by simply looking atthe peaks of y(n). (b) Using the m-file spike.m, design the least squares inverse filter hy(n) of length 1N = 50 that has the optimum spiking delay. (©) Filter y(n) with your optimum spiking filter and plot the output of the filter £(m) = ‘iy (n) « y(a). What are your estimated values for the amplitudes and locations of the impulses in x(n)? (@) Your results in part (c) assume noise-free observations of y(n) = g(n) * x(n). Suppose these measurements are noisy, Hn) = a(n) # x(n) + v(m) Where v(n) is white Gaussian noise with variance 02. Repeat part (¢) using j(m) with 92 = .0001 and a2 = .001 and comment on the accuracy of your estimates of x(k) and nyCOMPUTER ExERCISES 213, (©) As discussed in Problem 4.19, the effect of measurement noise may be reduced by incorporating a prewhitening parameter a in the design of the least squares inverse filter. Write a MATLAB m-file or modify spike .m to allow for noise reduction in the least squares inverse filter design. Using this m-file repeat your experiments in part () using different values for the prewhitening parameter. Comment of the effectiveness of @ in reducing the noise. What values for: seem to work the best? (©, Your results in parts (b) and (c) assume perfect knowledge of g(n). Repeat the design of ‘your least squares inverse filter assuming that g(a) has been measured in the presence of noise, ie, you are given &la) = ain) + win) where w(n) is white noise that is uniformly distributed between (~.005, .005]. Filter _»(a) with your optimum spiking filter and plot the output ofthe filter y(n) = hiv(2) y(n). How accurate are your estimates of the amplitudes and locations ofthe impulses inxeny? C44, In this exercise, we consider the problem of finding a moving average model for a signal x(n). (a) In Fig. 4.22 is an m-fileto find the moving average coefficients b,(k) of a process x(n) using Durbin’s method. Write an m-file to find these coefficients using the method of spectral factorization. (b) Generate WV = 256 samples of the process w(n) + 0.9w(n = 2) where 1(n) is unit variance white Gaussian noise. Using the method of spectral factorization, find a second-order moving average model for x(n) (©) Repeat part (b) using Durbin’s method, Compare your results with the method of spectral factorization and discuss how the accuracy of your model is affected by the order of the all-pole model used in the first step of Durbin’s method, (@) Modify the m-file durbin.m by replacing the autocorrelation method in the second step of Durbin’s method with the covariance method. Find a second-order moving average model for x(n) and compare your results with those obtained in part (b). Repeat for other moving average processes and discuss your findings. (©) Replace the autocorrelation method in both steps of Durbin's method with the covariance method and repeat part (d). xe C4.S. Inthis exercise, we consider the problem of finding an autoregressive moving average model for a process x(n). (a) Write an m-file to find an ARMA(p, q) model for a process x(n) using the modified ‘Yule-Walker equation method, given the autocorrelation sequence r; (k), (b) Generate 100 samples of an ARMA(4,2) process x(n) by filtering unit-variance white Gaussian noise with 1-921 408127 11.9782 4 2.85322 — 187723 + 0.90824 and make a plot of the power spectrum of x(n). H@)214 SIGNAL MODEUNG (©) Using your m-file for the modified Yule-Walker equation method in part (a), find an ARMA(4,2) model for x(n).Compare your model with the coefficient of 1(2)-Repeat for ten different realizations of the process x(n) and examine the consistency of your model coefficients, Are your estimates of dp(K) and by (k) close to the correct values, on the average? How mich variation is there from one realization to the next?THE LEVINSON RECURSION 5. INTRODUCTION In Chapter 4, we saw that several different signal modeling problems require finding the solution to a set of linear equations of the form Ra, =b 6.1) where R, isa Toeplitz matrix. Inthe Padé approximation method, for example, the denom- nator coefficients ap (k) which are represented by the vector a are found by solving 2 set of Toeplitz equations ofthe form (5.1) where R, is a non-symmerric Toeplitz matrix containing the signal values x(g), x(@ +1). «...x(q+ p— 1) in the first column and the signal values x(q). *(g — I, -.-.8(q — p +1) inthe frst row. In addition, the vector b contains the signal values x(q + 1), x(q +2), ...,x(9 + p) ands therefore tightly constrained by the values in the matrix R,. A similar set of linear equations is also found in the modified YYle-Walker equations used in modeling an ARMA process. We saw that Toeplitz equations also arise in all-pole modeling of deterministic signals using either Prony's method oF the autocorrelation method and in all-pole modeling of stochastic processes using the ‘Yule-Walker method. Unlike the Padé approximation, however, in these cases R, is a Her- ‘mitian Toeplitz matrix of autocorrelation values r,(0),r4(I).---,Fx(p — 1)- In addition, since b = ~[r.(1),....re(p)]", the vector on the right side of Eq. (5.1) is again tightly constrained by the values in the Toeplitz matrix... In Shanks’ method for finding the numerator coefficients. we again find a set of Hermitian Toeplitz equations. In this ease, however, unlike the previous examples, the vector bis not constrained by the values in the matrix R,. In Chapter 7, Toeplitz equations will again be encountered when we consider the design of FIR Wiener filters. As in Shanks’ method, R, will be a Hermitian Toeplitz matrix but the veetor b willbe, in general, independent of the values in R. ue to the importance of solving Toeplitz equations in a variety of different problems, inthis chapter, we look at efficient algorithms for solving these equations. In the process of deriving these algorithms, we will also discover a number of interesting properties ofthe solutions to these equations and will gain some insight into how other approaches to signal ‘modeling may be developed, We begin, in Section 5.2, withthe derivation ofthe Levinson- Durbin recursion, This recursion may be used to solve the Prony all-pole normal equations and the autocorrelation normal equations. The Levinson-Durbin recursion will also lead us216 __ THE LEVINSON RECURSION to several interesting results including the lattice filter structure, the Schur-Cohn stability test for digital filters, the Cholesky decomposition of a Toeplitz matrix, and a procedure for recursively computing the inverse of a Toeplitz matrix. In Section 5.3, we develop the Levinson recursion for solving a general set of Hermitian Toeplitz equations in which the vector b is unconstrained. The Levinson recursion may be used in Shanks’ method, and it may be used to solve the general FIR Wiener filtering problem developed in Chapter 7. Finally, in Section 5.4 we derive the split Levinson recursion. This recursion is slightly more efficient than the Levinson-Durbin recursion and introduces the idea of singular predictor polynomials and line spectral pairs that are of interest in speech processing applications. 5.2 THE LEVINSON-DURBIN RECURSION In 1947, N. Levinson presented a recursive algorithm for solving a general set of linear symmetric Toeplitz equations R,a = b. Appearing in an expository paper on the Wiener linear prediction problem, Levinson referred to the algorithm as a “mathematically trivial procedure” {16]. Nevertheless, this recursion has led to a number of important discoveries including the lattice filter structure, which has found widespread application in speech processing, spectrum estimation, and digital filter implementations. Later, in 1961, Durbin improved the Levinson recursion for the special case in which the right-hand side of the ‘Toeplitz equations is a unit vector (7). In this section we develop this algorithm, known as, the Levinson-Durbin recursion. In addition, we will explore some of the properties of the recursion, show how it leads to a lattice filter structure for digital filtering, and prove that that the all-pole model derived from the autocorrelation method is stable. 5.2.1 Development of the Recursion Alepole modeling using Prony's method or the autocorrelation method requires that we solve the normal equations which, for a pth-order model, are rk) + Dap Ork-D=0 5 k=1,2.. (82) at where the modeling error is. = 160) + apr (5.3) ‘Combining Eqs. (5.2) and (5.3) into matrix form we have rr) ZQ) RO) 1 1 nQ 6) ra) R=) |] at) 0 rQ) (1) (0) + 1-2) |] a2) | =e,| 9 | cay rp) ndp— 1) p=) 14(0) a,(P) 0 Which isa set of p+ 1 linear equations in the p+ 1 unknowns ap(1), dp(2),..-+@p(p) and ép- Equivalently, Eg, (5.4) may be written as [® , i] 63)THE LEVINSON-DURBIN RECURSION 217 where Ry is a (p + 1) x (p + 1) Hermitian Toeplitz matrix and uy = [1,0,..., 0] isa unit vector with | in the fist position. In the special case of real data, the Ris asymmetric ‘Toeplitz matrix. “The Levinson-Durbin recursion for solving Ea, (5:5) is an algorithm that i recursive in the model order. In other words, the coefficients of the (j + 1)st-order all-pole model, a) +1, are found from the coefficients of the j-pole model, a. We begin, therefore, by showing how the solution to the jth-order normal equations may be used to derive the solution to the (j + Dst-order equations. Let a;(2) be the solution tothe jth-order normal equations 7 MQ nn) 1 6 re) (0) ng=) || aa 0 nQ nO + FU-2D |] a@ Ja} O | 6) ri) rG—D GA 2) + re) a) ° which, in matrix notation is Ria) =u, 67) Given aj, we want to derive the solution to the (j + I)st-order normal equations, Risa = esi (5.8) ‘The procedure for doing this is as follows. Suppose that we append a zero to the vector a and multiply the resulting vector by Rj. The result is nO A) Qo AG) GED] 1 6 rere) IG =D AG) YT gy 0 nQ A) AO) G2) AG—D |] ge o : a lS 69) nd) GD 2 re rt) | | GW 0 GED BG) GD rel) 7400) 0 % where the parameter y is ya=ri t+ Dan +1-) (5.10) Note that if yj = 0, then the right side of Eq, (5.9) isa scaled unit vector and as = 1, 4j(1),....4j()),01" is the solution to the (j + I)st-order normal equations (5.8). In general, however, y) # Oand (1, aj(1),....4;(J), 01" is not the solution to Eq, (5.8). The key step in the derivation of the Levinson-Durbin recursion is to note that the Hermitian Toeplitz property of R.1allows us to rewrite Eq. (5.9) in the equivalent form 0) (1) re) rei + WY o ¥ TQ) re) re) GD ral) ai(j) 0 AQ AO oe nG-DueG—d]faG-v|_}o 2) RU » ne goge ae AD AG-DRG=2-- OQ |] wy 0 AGED RG) AGI A) re) 1 4.2 THE LEVINSON RECURSION Taking the complex conjugate of Eq. (5.11) and combining the resulting equation with Eq, (5.9), it follows that, for any (complex) constant F) 1. 1 0 s yy (1) ai) 0 0 4,(2) aj) 0 0 Ry 2 | eRe = +0]. | 612 ati) al) ° 0 0 1 y gE Since we want to find the vector a; which, when multiplied by R41, yields a scaled unit ‘vector, note that if we set (5.13) then Eq, (5.12) becomes Ray = Gsm where 1 0 a) a 4;Q) gG-) a= +01 6.14) aii) a) 0 1 which is the solution to the (j + I)st-order normal equations. Furthermore, + Tf =6[1 uP] 5.15) is the (j + I)storder modeling error. If we define aj(0) = 1 and aj(j + 1) = 0 then Fg, (5.14), referred to asthe Levinson order-update equation, may be expressed as oi +t | 616) an O=qW+TaaU-i+D 5 Allthatis required to complete the recursion is to define the conditions necessary to initialize the recursion. These conditions are given by the solution for the model of order .49(0) “© (0) 6.17) In summary, the steps of the Levinson-Durbin recursion are as follows. The recursion is first initialized with the zeroth-order solution, Eq. (5.17). Then, for j = 0, ,....p — Is the (J + I)st-order model is found from the jtn-order model in thee steps. The fist step is to use Eqs. (5.10) and (5.13) to determine the value of I'j.1, which is referred to as "since «i al then the complex conjugate in Ea, (5.13) may be dropped.THE LEVINSON: DURBIN RECURSION. ‘Table 5.1 The Levinson-Durbin Recursion | nitialize the recursion 2 @ want D+ LDina@nd i+) © Ca =—y le, © For 12.) aie () = off) + Farag E41) @ aad +D =F le eae inh) 3. the (J + Ist reflection coefficient. The next step of the recursion isto use the Levinson order-update equation to compute the coefficients aj41() from ay (2). The final step of the recursion isto update the error, using Eq, (5.15). This error may also be written in two ‘equivalent forms that will be useful in later discussions. The frst is rf [1-1 (5.18) and the second, which follows from Eq, (5.3), is jet eat = 10) + Do ayes lirs(i) (5.19) ‘The complete recursion is listed in Table 5.1 and a MATLAB program is given in Fig. 5.1? Example 5.2.1 Solving the Autocorrelation Normal Equations Let us use the Levinson-Durbin recursion to solve the autocorrelation normal equations and find a third-order all-pole model for a signal having autocorrelation values 7) =1, 141) = 08, (2) =05, 43) =0.25 ‘The normal equations for the third-order all-pole model are 1 0S 057 ac) os os 1 05 || a |=-| os 05 05 1 33) 0.25 Using the Levinson-Durbin recursion we have 2 The convention usd in he Levnson-Durbin eeusin varies abit from one author to another. Some authors for example, we —T. in Eq. (5.12), wile otbers use. This esl of course, in diferent sign conventions for220 __THELEVINSON RECURSION The Levinson-Durbin Recursion function [a,epsilon]=rtea(r} > pelength(r) 17 avi: epeiton-r (1) + for j=2:p+1 (ganna=-r (2:4) *£1{pud (a) / a2[2/0) + gamma’ (0;con3 (f1spud(a)) epsilon=epsilon* (1 ~ abs (gasma}“2): ond Figure 5.1.4 MatLAs program for solving a set of Toeplitz equations using the Levinson-Durbin Re- ‘cursion. This file provides a mapping from the autocorrelation sequence rk) othe filter coefficients 4, (B), hence the name 20a. 1. First-order model: [1 =F? w-[h [a2] n=nQ)-+a (nO) = 4 nlo=-$ 3 esali-r and 2. Second-order model: he [pho 8] 3. Third-order model: 22 = reQB) + a2(1)r5(2) + 227-1) = and e=ell-r]=% and 0 1 1-18 -3/8 8| -18 -3/8 1 18 Finally, with 00) = f= veTHE LEVINSON-DURBIN RECURSION 221 the all-pole model becomes HQ) = Having found the third-order all-pole model for x(n), let us determine what the next value in the autocorrelation sequence would be if Hs(z) were the correct model for x(n),i¢., if x(n) is the inverse z-transform of H(z). In this case, the model will remain unchanged fork > 3,i.e., Ty =O and, therefore, 7 = 0 for & > 3. Thus, it follows from Eq, (5.10) by setting j = 3 that rd) + Yasir =i which may be solved for (4) as follows r(8) = — Daxlidr(4— i) = 7/32 Successive values ofr, (k) may be determined in a similar fashion, For example, if k > 4, then rel) = — Prav@r(k - 1) which are the Yule-Walker equations for an AR() process. It follows from our discussions in Section 3.6.2 of Chapter 3 that if white noise with a variance of o2 is filtered with fistorderall-pole filter H@) : then the output will be an AR(1) process with an autocorrelation sequence of the form In the following example, the Levinson-Durbin recursion is used to find the sequence of reflection coefficients P, and model errors ¢, that are associated with this process. Example 5. 2 The Reflection Coefficients for an AR(I) Process Let x(n) be an AR(1) process with an autocorrelation Te = alae... 07)" ‘To find the reflection coefficients associated with this vector of autocorrelations we may use the Levinson-Durbin recursion as follows. Initializing the recursion with m=1 5 =O) =222 __THELEVINSON RECURSION it follows that the first reflection coefficient is and that ‘Thus, for the first-order model For the second-order model Yi = P42) +a (I)rQ(1) = 0 and 0 the coefficients of the model are Continuing, we may show by induction that P, suppose that at the jth step of the recursion F; and ¢) = 02 forall j > 1. Specifically, Oand oF Since Han ++ PaO -i+) =n +) and) \we see from the form of the autocorrelation sequence that y; = 0. Therefore, Pj.1 = Oand ‘a;41 is formed by appending a zero to a, and the result follows. In summary, for an AR(1) process, We now look at the computational complexity of the Levinson-Durbin recursion and compare it to Gaussian climination for solving the pth-order autocorrelation normal equations. Using Gaussian elimination to solve a set ofp linear equations in p unknowns, approximately |p° multiplications and divisions are required. With the Levinson-Durbin recursion, on the other hand, tthe jth step ofthe recursion 2j + 2 multiplications, 1 dvi- Sion, and 27 + | additions are necessary, Since thre are p stops in the recursion, the totalTHE LEVINSON:DURBIN RECURSION _ 223 ‘number of multiplications and divisions is’ et Dei+a +2p and the number of additions is Lei+n = ‘Therefore, the number of multiplications and divisions is proportional to p? for the Levinson- Durbin recursion compared with p? for Gaussian elimination. Another advantage of the Levinson-Dusbin recursion over Gaussian elimination is that it requires less memory for data storage. Specifically, whereas Gaussian elimination requires p? memory locations, the Levinson-Durbin recursion requires only 2(p + 1) locations: p + 1 forthe autocorrelation sequence r,(0),...re(p), and p for the model parameters dp(I),...dp(p), and one for the error ep In spite of the increased efficiency of the Levinson-Dusbin recursion over Gaussian climination, i should be pointed out that solving the normal equations may only be a small fraction of the total computational cost in the modeling process. For example, note that, for a signal of length V, computing the autocorrelation values r4(0), -...re(P) requires approximately N(p + 1) multiplications and additions, Therefore, if NV >> p, then the cost associated with finding the autocorrelation sequence will dominate the computational requirements of the modeling algorithm. 5.2.2. The Lattice Filter One of the by-products of the Levinson-Durbin recursion is the latice structure for digital filters Lattice filters are used routinely in digital filter implementations as a result of number of intresting and important properties that they possess. These properties include ‘A modular structure, low sensitivity to parameter quantization effects, and a simple method to ensure filter stability. In this section, we show how the Levinson order-update equation may be used to derive the lattice filter structure for FIR digital filters. In Chapter 6, other latice filters structures will be developed, including the all-polelatice and the pole-zer0 Iatice filters, ad it wil be shown how these filters may be used for signal modeling ‘The derivation ofthe latice filter begins with the Levinson orde-update equation given in Eq. (5.16), Fist, however, it will be convenient to define the reciprocal vector, denoted by af which i the vector that is formed by reversing the order of the elements in a, and taking the complex conjugate 1 aj) a) a=) 42) ai -2) = a(j-0 aj) aj(4) 1 Recall the summations given in Table 2.3.16224 __ THE LEVINSON RECURSION afi) = af ~1) (5.20) for i = 0,1,...,.j. Using the reciprocal vector in the Levinson order-update equation, Eq. (5.16) becomes azai(i) = aii) + Tyna @ ~ 1) (5.21) With A(z) the z-transform of aj(i) and Af(z) the z-transform of the reciprocal sequence ‘af(), it follows from Eq. (5.20) that A;(z) is related to AR(2) by ARQ) = 2FAZQ/2°) (522) Rewriting Eq. (5.21) in terms of A,(z) and Af (2) gives Ajai le) = Aj(2) + Pye AR) (523) which is an order update equation for (2). ‘The next step is to derive an order-update equation for af, (7) and Af, (2). Beginning with Eg, (5:21), note that if we take the complex conjugate of both sides ofthe equation and replace‘ with j i + 1, we have ait D=aG it DFT GO (5.24) Incorporating the definition of the reversed vector, Eq. (5.20), into Eq. (5.24) gives the desired update equation for a (i). af =afG — +0 g@ (5.25) Expressed inthe z-domain, Bq, (5.25) gives the desired update equation for AA, (2). ARG) = 2 TARG) + Pf AVG 526) In summary, we have a pair of coupled difference equations, ajaa(n) = a(n) + Pyar — 1) afi y(n) = a(n — 1) +P} yayln) 627) for updating a(n) and af (n), along with pair of coupled equations for updating the system functions Aj(z) and AN(2) which may be written in matrix form as Ais 1 Ty Ale vt) ] He ve) 628) AR Ls ARG) Both ofthese representations describe tothe two-port network shown in Fig. 52. This wo- port represents the basic module used to implement an FIR lattice filter. Cascading, p such lattice filter modules with reflection coefficients Py, P2,.... Fp forms the pth-order lattice filter shown in Fi. 5.3. Note that the system function between the input (n) and the output ‘ap(n) is A(z), whereas the system Function relating the input tothe output af(n) is AB (2)THE LEVINSON:DURBIN RECURSION 225, atm at) fn Tit ea ah Figure 5.2 A two-pon netnork relating a(n) and aka) 0 ay() ad af). Ths network ingle Stage of an FIR latice filter extn) avin) ain) apaitn) agin) an) a n n > akin) afin) afin) af sm) af) Figure 5.3 A prl-order FIR lace filter built up asa cascade of p two-port networks 5.2.3 Properties In this section, we will establish some important properties of the reflection coefficient sequence that is generated by the Levinson-Durbin recursion and we will establish an important property of the solution to the autocorrelation normal equations. Specifically, we will show that the roots of A,(z) will be inside the unit circle if and only if the reflection coefficients are bounded by one in magnitude or, equivalently, if and only if R, is positive definite, We will also show that the all-pole model obtained from the autocorrelation method is guaranteed to be stable. Finally, we will establish the autocorrelation matching property, which states that if we set b(0) = /& in the autocorrelation method and if h(n) is the inverse z-transform of 2) Aye) then the autocorrelation of h(n) is equal to the autocorrelation of x(n). ‘The first property ofthe reflection coefficients is the following. H@= Property 1. The reflection coefficients that are generated by the Levinson-Durbin recursion to solve the autocorrelation normal equations are bounded by one in magnitude, [P| <1 ‘This property follows immediately from the fact that since ¢; is the mi = (Ednin with,226 __ THE LEVINSON RECURSION where e(n) isthe jth-order modeling erro, then ¢; > 0. Therefore, since if -1,P] it follows that if ¢, > O and €;; > 0, then [Ty] < 1. . Is important to keep in mind that the validity of Property 1 relies on the nonnegativity of ¢. Implicit in this nonnegativty isthe assumption thatthe autocorrelation values r,(k) in R, are computed according to Eq, (4.121). I is not necessarily true, for example, that 6 = Oand [P| < 1 if an arbitrary sequence of numbers r5(Q), F< (1)... Fs(P — 1) are used in the matrix R,. To illustrate this point, consider what happens if we set r<(0) 1 and r,(1) = 2 im the normal equations and solve for the first-order model. In this case, Ty = =1(D)/r(0) = 2 which has a magnitude that is greater than one. This example is not in conflict with Prop- erty 1, however, since for any valid autocorrelation sequence, |r,(0)| > |rs(1)]. As we will soon discover (see Property 7 on p. 253), the unit magnitude constraint on the reflection coefficients I’, and the nonnegativity of the sequence ¢; are tied to the positive definiteness of the matrix R,. The next property establishes a relationship between the locations of the roots of the polynomial A,(z) and the magnitudes of the reflection coefficients 7, Property 2. If ap(k) is aset of model parameters and I’; is the corresponding set of reflection coefficients, then Ap) = 1+ )ap(k)z* i will bea minimum phase polynomial (all of the roots of A(z) wil be inside the unit circle) if and only if [P| < 1 forall j. Furthermore if [P| <1 forall j then the roots of Ap(z) mus lie either inside or on the unit eirele ‘There are many different ways to establish this property [14, 20, 24]. One way, as demonstrated below, is to use the encirclement principle (principle of the argument) from complex analysis [3]. Encirclement Principle, Given a rational function of z ae) Pa) “ A(z) let C be a simple closed curve in the z plane as shown in Fig. 5.4. As the path C is traversed in a counterclockwise direction, a closed curve is generated in the P(z) plane that encircles the origin (WV: — N) times in a counterclockwise direction where 'N; is the number of zeras inside C and N, is the number of poles inside C*THE LEVINSON-DURBIN RECURSION ‘227° Ime) Im Pe) Re) REA Figure 5.4 A simple closed curve in the z-plane encircling a simple zero and the corresponding ‘contour in the P(2) plane. ‘The encirclement principle may be interpreted geometrically as follows. Since C is simple closed curve, beginning and ending at zo, then P(z) will also be a closed curve, beginning, and ending at the same point, P (zo). If there is a zero contained within the contour then the angle @ from the zero to a point on the contour will increase by 27 as the contour is traversed. In the P(2) plane, therefore, the curve that is generated will encircle the origin in a counterclockwise direction. Similarly, if there is a pole inside the contour, then the angle will decrease by 2rr as C is traversed, which produces a curve in the P(z) plane that encircles the origin in the clockwise direction. With N, zeros and Np poles inside C, the change in angle is 2x(N, — Np) and the encirclement principle follows. Although not stated in the encirclement principle, it should be clear tha if there isa zero of (2) on the contour CC, then the curve in the P(2) plane will pass through the origin. ‘With the encirclement principle, we may use induction to establish Property 2. Specif- ically, for a first-order model, Au) =14T2! and it follows that A, (2) will be minimum phase if and only if || < 1. Let us now assume that 4j(z) is minimum phase and show that Aj, (z) will be minimum phase if and only if IPysil < 1. To accomplish this, we will use the z-domain representation of the Levinson ‘order-update equation given in Eq, (5.23), which is repeated below for convenience Ajeil2) = Aj(2) + Pyne ARO Dividing both sides of this update equation by A,(z) we have ARG) A@ (5.30) iis nepative then the curve P(2) ences the origin na clockwise diection pl times.228 THE LEVINSON RECURSION We will now show that if Cis a closed contour that traverses the unit circle then the number of times that P(z) encircles the origin in a clockwise direction is equal to the number of zeros of ,+1(z) that are outside the unit circle. Since A,(z) is assumed to be minimum phase, A;(z) has j zeros inside the unit circle and j poles at z = 0. Now let us assume that Aji (2) has | zef0s outside the unit circle and j +1 ~1 zer0s inside. Since Aj4(2) will have j +1 poles at z = 0, it follows that the number of times that P(z) encircles the origin in a counter clockwise direction is Ne=Np=G+1-D-G41) of, 1 times in a clockwise direction. Now, note that since the contour C is the unit circle, then z =e and Afte*)| = yal Therefore, it follows from Eq. (5.30) that P(z) traces out a curve which is a circle of radius [P11 that is centered at z = 1 (see Fig. 5.5). Thus, if (P'j+i| < 1, then P(e) does not encircle or pass through the origin. Conversely, if |Ij+;| > 1 then P(z) encircles the origin and Aj+i(z) will not have all of its zeros inside the unit circle. Conse- quently if Aj(z) is minimum phase, then Aj.1(z) will be minimum phase if and only if Iyeal <1 . ‘There is an intimate connection between the positive definiteness ofthe Toeplitz matrix, R,. and the minimum phase property of the solution to the normal equations. In particular, the following property states that A(z) will be minimum phase if and only if Ry is positive definite (see also Section 5.2.7). Property 3. If ay isthe solution to the Toeplitz normal equations Ry@ = €t, then Ap(z) will be minimum phase if and only if R, is positive definite, R,, > 0. Imre) Figure 8.5 The curve P(:), which isacircle of radius C411 that is centered at 2 = |THE LEVINSON-DURBIN RECURSON __ 229 To establish this property, let a be a root of the polynomial Ap(z). In generale will be complex and we may factor A,(z) as follows Taz") + biz + beet Aple) = 1+ Sapo bp ee) ‘We will show that if Ry > 0, then ja < 1. Writing the vector a, in factored form as 1 Teo a,(1) bod a=] m2) |] b&b [eae (30) i 0 it follows that (5.32) Multiplying Eq. (5.32) on the let by BY we have waa t]o[8] Since B has full rank, if R, is positive definite, then B”R,B will be positive definite (see p.40). Therefore, B’R,B [ ] >0 which implies that Isol? > Isl? (5:33) txa[ 2 ]-[ 2722] Since then sya = sf and (5.34) ‘Thus, Eq. (5.33) implies that ja] < 1 and A(z) is minimum phase. Conversely, if Ap(2) is not minimum phase, then A(z) will have a root a with lal > 1. Therefore, from Eq, (5.34) it follows that |si| > [sol and Ry. is not positive definite. . Property I states that the reflection coefficients that are generated in the course of solving the autocorrelation normal equations are bounded by one in magnitude. Property 2 states that a polynomial Ap(z) will have all of its roots inside the unit circle if its reflection coefficients are less than one in magnitude. Combining Properties 1 and 2 leads to the following property.230 __ THE LEVINSON RECURSION Property 4. The autocorrelation method produces a stable all-pole model. In the next property, we consider what happens to the roots of the polynomial A(z) when [Tp] = 1 with the remaining terms in the reflection coefficient sequence being less than one in magnitude. Property 5. Let ap(k) be a set of filter coefficients and I’ the corresponding set of reflection coefficients. If [Pj] <1 for j = 1,...,p— land (Ppl = 1, then the polynomial Ap(2) = 1+ Dap (hie has all of its roots on the unit circle. This property may be established easily as follows. Let [be a sequence of reflection coefficients with [| < 1 for j < pand|T,| = 1. Since [P| < 1 forall j, it follows from Property 2 that all of the zeros of p(z) are on or inside the unit circle. If we denote the zer0s of Ap(z) by 24 then A,(2) Tle az") By equating the coefficients of z~? on both sides of the equation, we have a,(p) =|] (5.35) However, since ap(p) then Tis ‘Therefore, if there isa zero of A(z) that has a magnitude that is less than one, then there must be at least one zero with a magnitude greater than one, otherwise the product of the roots would not equal one. This, however, contradicts the requirement that all of the roots must lie on or inside the unit circle and Property 5 is established. . ‘This property may be used to develop a constrained latice filter for detecting sinusoids in noise (see computer exercise C6.1 in Chapter 6). It also forms the basis for an eff- cient algorithm in spectrum estimation for finding the Pisarenko harmonic decomposition (Chapter 8) ofa stationary random process [11] ‘The next property relates the autocorrelation sequence of x(n) to the autocorrelation sequence of h(n) the unit sample response ofthe al-pole filter that i used to model x(n) Specifically, as we demonstrate below: if b(0), the numerator coefficient in H(z) is selected 636)THE LEVINSON-DURBIN RECURSION _231 so that x(n) satisfies the energy matching constraint, r,(0) = r4(0), then b(0) = /& and the autocorrelation sequences are equal for |k| < p. This is the so-called autocorrelation ‘matching property Property 6—Autocorrelation matching property. If b(0) is chosen to satisfy the ‘energy matching constraint, then b(0) = /€5 and the autocorrelation sequences of x(n) and h(n) are equal for [kl hen — HM =D = 60) Y-HempHU =) cd fat tat as = bO)A(-D (5.39) With r= CAMA =D = HD = we may rewrite Eq. (5.39) in terms of r,(/) as follows ral) + Yap bral — k) = OAD ot Since h(n) is causal, then h(n) = 0 for n < O-and h(0) = b(0). Therefore, for ! > 0, it follows that r+ Soap Rnd =k) = [bP 5 120 (6.40)232 __ THE LEVINSON RECURSION Using the conjugate symmetry of rx(I) we may write Eq. (5.40) in matrix form as rn) 1) rR) Oo) 1 1 ma) 0) rg 1) | | ap) 0 rH) ral) 40) eo —2) 1] a2) | = wor} 9] Gan rR) ra(0— 1) ral —2) 4) } Lagtr 0 Raa = 1600)! (5.42) Therefore, ay isthe solution to each of the following sets of Toeplitz equations, Rady = bol?uy Reay = eputy 6.43) Now, suppose that (0) is chosen to satisfy the energy matching constraint, 1a) = Son = 7H? = 140) os) By induction we may then show that r(K) = ra(k) for k > 0 as follows. Using the Levinson-Durbin recursion to solve Eq, (5.43) we have ro) __ra() (0) r4(0) Since r,(0) = r(0), it follows that r_(1) = re(1). Now, assume that r_(k) = ra(k) for = 1,2... J, Using Eqs, (5.10) and (5.13) we have a(t) nG+D = Tyg — Da @nG +l nG+D = Tyng — an +1-d (5.45) ‘Therefore, since r(i) = ra(i) ford = from Eq, (5.41) note that j then rs(j + 1) = ra(j + D. Finally, VCO)? = ra(0)-+ Ya (kdra(k) With r4(k) = re(k) it follows that rl0)-+ ap r(8) 5.46) ‘Therefore, b(0) = /ép and the autocorrelation matching property is established. . 5.2.4 The Step-Up and Step-Down Recursions ‘The Levinson-Durbin recursion solves the autocorrelation normal equations and produces an all-pole mode! for a signal from the autocorrelation sequence r,(k). In addition to the ‘model parameters, the recursion generates a set of reflection coefficients, P),P2,..-, PpsTHE LEVINSON-DURBIN RECURSION __233 along with the final modeling error, ¢p. Thus, the recursion may be viewed as a mapping tay J 4-Day), ---sap(P), BO) {rs rey .--ore(p)) Pra taes from an autocorrelation sequence to a set of model parameters and to a set of reflection coefficients. There are many instances in which it would be convenient to be able to derive the reflection coefficients, I, from the filer coefficients ap(k), and vice versa, In this, section, we show how one set of parameters may be derived from the other in a recu fashion. The recursions for performing these transformations are called the step-up and step-down recursions. ‘The Levinson order-update equation given in Bq. (5.16) isa recursion for deriving the filter coefficients ap (k) from the reflection coefficients, T. Specifically, since ay = ay) + Tjeragj -F+D (647 then the filter coefficients aj, (i) may be easily found from ay(i) and T)1. The recursion is initialized by setting ay(0) = | and, afterthe coefficients.a,(k) have been determined, the recursion is completed by setting b(0) = /é- Since Eq, (5.47) defines how the model parameters fora jth-order filter may be updated (stepped-up) oa (j + I)storder filter given P} Eq, (547) is referred to as the step-up recursion. The recursion, represented pictorally as (Pa Paves Pps ep} SES” (ap(1). ap (2), sap). 60) is summarized in Table 5.2 and a MATLAB program is given in Fig 5.6. Table 5.2 The Step-up Recursion 1. Initialize the recursion: 29(0) 2 Forj=Ol..pal Pri = 12 aye) = y(t) + Tyas 4D) Wain) 3. bO=VG The Step-Up Recursion a=[8;0] + gamma (j-1)*10;conj (fLipud(a}) 1: end Figure 5.6 A MATLAB program for the step-up recursion to find the filter coefficients a,(k) from the reflection coefficients.234 __THELEVINSON RECURSION Example 5.2.3 The Step-Up Recursion Given the reflection coefficient sequence n=[F.m) the first- and second-order all-pole models may be found from the step-up recursion as follows. From Eq, (5.14), the first-order model is *-[oo]-[o 0 ]-[4] ‘Again using Eq, (5.14) we have for a [ashe] [ilel ‘Therefore, in terms ofthe reflection coefficients P and Tp, the general form for the second- ‘order all-pole model is Arle) = 14 (Cy + PGP ade" + Mae? Given a second-order model, az, note that if a third reflection coefficient, Ps, is appended to the sequence, then as may be easily found from a2 as follows. 1 1 o =| am |_|] ea 432) %=) aa |=] a [+] aay 433) o 1 1 1 = | 2+ Prax) |_| + PPD) + Z =} ayers) |=] try +rirp mh rs ‘Thus, it is always possible to increase the order of the filter without having to solve again for the lower-order filters. ‘The step-up recursion is a mapping, £ : Tp —> ap, from a reflection coefficient sequence I’, 10 a filter coefficient sequence a,. In most cases, this mapping may be inverted, £7! ay —> Land the reflection coefficients determined from the model parameters, ay ‘The mapping that accomplishes this i called the step-down or backward Levinson recursion. ‘The procedure for finding the reflection coefficients is based on the fact that since Ty =a) 6.48) then the reflection coefficients may be computed by running the Levinson-Durbin recursion backwards. Specifically, beginning with a, we set Up = ap(p). Then, we recursively findeach of the lower-order models, aj, for j = p— 1p 2... illustrated below a, = afl) a2) a,(p 2) a,(p ~ 1) Bp = Mp ll) aps) ++ aya(p—2) p21) ay2(2) Tp = aw [h = fh ‘Tose how the jth-order model may be derived from the (j +1)st-order model, let us assume that the coefficients a, .i(#) are known. The Levinson order-update equation, Eq, (5.16), expresses aj,(i) in terms of a)(i) and a(j ~ i + 1) as follows ay) =) + Tyna 7+ D 6.49) which provides us with one equation in the two unknowns, a,(i) and af(j—i +1). However, if we again use the Levinson order-update equation to express aj, (J — i + 1) in terms of, aj(j ~i + 1) and a3 (i) we have aa it Daj i+ +l ygi7@ which, after taking complex conjugates, GG -i+ Dag -i+ D4 TQ @ (5.50) gives us a second linear equation in the same two unknowns. Writing Eqs. (5.49) and (5.50) in matrix form we have qu ft tn ati) [endren |-[ 1 [Loc] cy If [Pysl # 1, then the matrix in Eq. (5.51) is invertible and we may solve uniquely for 4aj(0)2 The solution is (6.52) which is the step-down recursion. This recursion may also be written in vector form as follows aj(1) ay,1(1) GG) 42) 412) an) ° v@) |p| gad (8.53) a(i) ay) a.) Once the sequence aj(i) has been found, we set T = aj(j) and the recursion continues by finding the next lower-order polynomial. The step-down recursion, represented pictorially SI Fjs11 = I. then Ea. (5.51) isa singular se of equations and the mapping isnot uniquely invertible,236 __ THE LEVINSON RECURSION {ap(1). 4902), ---+4p(p), b(0)) SS" (Py, Pa, Pp ep) is summarized in Table 5.3, ‘Another and perhaps more elegant way to derive the step-down recursion is to use Eq, (5.28) to solve for the jth-order polynomial A, (2) in terms of (2). The solution is easily seen to be A) ] Ake |o Therefore, Bare! Paya x sae is) Met Afi) [ante - Tafa] (555) A= H Tak which is the z-domain formulation of Eq. (5.52). A MATLAB program for the step-down recursion is given in Fig. 5.7. ‘Table 5.3. The Step-down Recursion () Seer; = aj) The Step-Down Recursion function gennanatog (a) j=1) = gamma (3) *#1pudteons (a(L:4-1019) «/ = abs (gamma (5))°2) ganna j-1)=8(3-1); ena Figure 5.7 A MATLAB program for the step-down recursion to find the reflection coefictents from a set of filter coefficients a,(k).THE LEVINSON-DURBIN RECURSION __237 Example 5.2.4 The Step-Down Recursion ‘Suppose that we would like to implement the third-order FIR filter HG) = 14052" 0.127 - 052% using a lattice filter structure. Using the step-down recursion, the reflection coefficients may be found from the vector [1. 05, -0.1, -0.5]” and then implemented using the structure shown in Fig. 5.3. The step-down recursion begins by setting P'3 = ax(3) = —0.5. Then, the coefficients of the second-order polynomial @ = [1,a2(1), a2(2)]" are found from a3(i) using Eq. (5.53) with j = 2 as follows feo] 1 a) ]_ pf 2) a2) |~ 1-73 IL a2) *L asl) From the given values for a; and I, we find ax(1) 1 0s -01 ]] _[ 06 (38) ]>r=0as{[ 21 }+°3[ 05 ]}- [08 ] and Py = a(2) = 0.2. Having found a;, Eq (5.52) may again be used to find a) as follows 1 a(t) [a2(1) = P2a(1)] = 0.5 " Thus, 7) = a)(1) = 0.5 and the reflection coefficient sequence is r=[05, 02, -05]" Therefore, a latice filter having the given system functi is as shown in Fig. 5.8. Oe 052% An interesting and useful application of the step-down recursion is the Schur-Cohn stability test for digital filters [27]. This test is based on Property 2 (p. 226), which states that the roots of a polynomial will lie inside the unit circle if and only if the magnitudes of the reflection coefficients are less than one. Therefore, given a causal, linear shift-invariant filter with a rational system function, Boe) A@ the filter may be tested for stability as follows. First, the step-down recursion is applies to the coefficients of the denominator polynomial A(z) to generate a reflection coefficient HO238 THE LEVINSON RECURSION sequence, ['. The filter wil then be stable if and only if all ofthe reflection coefficients are Jess than one in magnitude. The following example illustrates the procedure. Example 5.2.5 The Schur-Cohn Stability Test Let us use the Schur-Cohn stability test to check the stability of the filter 1 Tha Re He) First note that the leading coefficient in the denominator polynomial is not equal to one. ‘Therefore, we begin by rewriting H(z) as follows os 152? +05" 4O= Ta With Ts = a3(3) = 0.5 we then solve for the second-order polynomial A3(z) = 1 + a,(I)2"! + a3(2)2-? using Eq, (5.53) as follows [22 J= re (L238 ]-e [28 J} s{[ is }-o[ -[43] =10/3, then |['2| > 1 and the filter is unstable. Therefore, since P= a2(2) 5.2.5. The Inverse Levinson-Durbin Recursion We have seen how to derive the reflection coefficients and the model parameters from an autocorrelation sequence. We have also derived recursions that map a sequence of reflection coefficients into a sequence of model parameters and vice versa. It is also pos- le to recursively compute the autocorrelation sequence from the reflection coefficients {1 ,,1'2..., Tp} and ép or from the coefficients of the pth-order model dp(k) and (0). ‘The recursion for doing this is referred to as the inverse Levinson recursion. To see how the recursion works, assume that we are given the reflection coefficient sequence I for J = lees p and the pth-order error ¢,, Since €, =n] [= IP) (5.56) then the first term in the autocorrelation sequence is Oo)THE LEVINSON-DURBIN RECURSION 239 ‘Table 5.4 The Inverse Levinson-Durbin Recursion 1. nitilize the recursion (a) 7900) = 9/ [TC — PD ©) a(0)=1 2. For} =0,1,..., 7 (@) Fori=1,2, ayali) = afi) +P jrafj 1+ ©) gi + D= Tp © nt D=-Lih ann +1-9 3. Done This, along with the zeroth-order model (5.58) initializes the recursion, Now suppose that the first j terms of the autocorrelation sequence are known, along with the jth-order filter coefficients a, (i). We will now show how to find the next term in the sequence, r,(j + 1). Fist we use the step-up recursion to find the coefficients a4) from T)41 and a/(i). Then, setting p = j +1 and k = j +1 in Eq. (5.2) we have the following expression for rs(j + 1) 4 n+) =-Fayng +1-) (5.59) Since the sum on the right only involves known autocorrelation values, this expression may beusedto determine r, (j+1), which completes the recursion. The inverse Levinson-Durbin, recursion, which may be represented pictorally as asvy! {Pi Pas Py {reO),re(D), «+. rep) } is summarized in Table 5.4, Example 5.2.6 The Inverse Levinson-Durbin Recursion Given the sequence of reflection coefficients, Py = T2 = Py = $ and a model ervor of 6 = 2), let us find the autocorrelation sequencer, = [re(0).re(1). (2). Initializing the recursion with rx(0) = —— a o The -1?)240 THE LEVINSON RECURSION ‘we begin by finding the first-order model, which is given by *-[8 ]-[05] re{l) = =r.) = 1.0 Updating the model parameters using the step-up recursion we have 1 ° Lela ie 1 a=| a fer] am |=] 2 ]+5] 12 |=] 3 0 1 ° 1 12 Therefore, Thus re(2) -ax(1)re(1) = @3(2)r,(0) = 3/4 — 1 = —1/4 Applying te step-up recursion to wehave 1 0 1 0 1 ae [*] ao [=f ae |*3] 3a [=| oe 0 1 3 Ty Lip Finally, for r,(3) we have re) = ~as(I)re(2) = asQ)r,(1) — a5(3)re(0) = 1/4 + 7/8 and the autocorrelation sequence is 18 re =[2, -1, -1/4, 1/8)" The inverse Levinson recursion described above provides a procedure for finding the autocorrelation sequence from the reflection coefticients and ¢,. If, instead of the reflection coefficients, we were given the filter coefficients a,(K), then the autocorrelation sequence may still be determined by using the step-down recursion to find the sequence of reflection coefficients and then using the inverse Levinson recursion. MATLAB programs for the inverse Levinson recursion to find r,(k) from either the reflection coefficients I’ or the filter coefficients a (k) are given in Fig, 5.9 In summary, combining the results of Sections 5.2.4 and 5.2.5 we see that there is an equivalence between three sets of parameters: the autocorrelation sequence r,(k), the all- pole model a,(4) and b(0), and the reflection coefficients P along with. This equivalence is illustrated in Fig, 5.10, which shows how one set of parameters may be derived from another. 5.2.6 The Schur Recursion* ‘The Levinson-Durbin recursion provides an efficient solution tothe autocorrelation normal equations. Fora pth-order model, this recursion requires on the order of p? arithmetic operations compared to an order of p? operations for Gaussian elimination. Therefore, with a single processor that takes one unit of time for an arithmetic operation, the Levinson- Durbin recursion would require on the order of p units of time to solve the normal equa tions. With the increasing use of VLSI technology and parallel processing architecturesTHE LEVINSON-DURBIN RECURSION. 241 The Inverse Levinson-Durbin Recursions function r-ator (gamma, epsilon) pelength (gamma) aaegama(1!7 re(1 gana (1): a= (aa;0) +gamma (3)* {eons (£1ipud (aad) 117 rely -f1ipir() aa): gamnaratog (a): Fegter (gamma) : it margin == 2, = rragrt (b) /prod(1-abe (ganma) .“2); ena: Figure 5.9 Marcas programs o find the autocorrelation sequence r,(k) from either the reflection oeficients °, and the modeling error ¢y, oF the filter coefficients ap(k) and b(0). F(O).Fe(Dsv-0te(P) - Levinson Dutsin Schur Pi Paye Pp + pon Step-Down ‘Step-Up 4g (1),p(2), spp), 60) < Figure 5.10 The equivalence benseen the autocorrelation sequence, the all-pole ‘model, and the reflection coeffciens. in signal processing applications, an interesting question to ask is how much time would bbe necessary to solve the normal equations if We were to use p processors in parallel. Although one would hope that the computation time would be proportional to p rather than p?, this is not the case. In fact, it is not difficult to see that with p processors the242 __THE LEVINSON RECURSION Levinson-Durbin recursion would solve the normal equations in atime that is proportional to p logs p instead of p. The reason for this is the requirement to compute the inner product 1 i ai(1) YH =HGEDAD G@OruG+1=-) = [nG+Dsi wD, oo oD] (5.60) a) which is the primary computational bottleneck in the Levinson-Durbin recursion. Al- though the j multiplications may be performed in one unit of time using j processors, the j additions require log, j units of time. Therefore, with p processors, finding the solution to the pth-order normal equations requires on the order of plog, p units of time. In this section, we derive another algorithm for solving the normal equations known, as the Schur recursion. This recursion, originally presented in 1917 as a procedure for ‘a polynomial to see if it is analytic and bounded in the unit disk (9, 22}, avoids the inner product in the calculation of y, and is therefore better suited to parallel processing. Unlike the Levinson-Durbin recursion, which finds the filter coefficients a, (k) in addition to the reflection coefficients I, the Schur recursion only solves for the reflection coefficients. ‘Thus, the Schur recursion is a mapping from an autocorrelation sequencer (k) toareflection coefficient sequence (re), re), «=. re(p)} 8 By avoiding the computation of the filter coefficients, ap(k), the Schur recursion is slightly ‘more efficient than the Levinson-Durbin recursion in terms ofthe number of multiplications, even in a single processor implementation of the algorithm. ‘The development of the Schur recursion begins with the autocorrelation normal equations fora jth-order model 8 EPL Pa, es Pps 6p) 661) i rl + Park —D 20 RELA 6.02) IF we st (0) =| then the jthorder normal equations may be writen as Yadnk-p=0 + k } 663) hich we wil refer to asthe orthogonality condition, Noe thatthe left side of Eq (563) is the convolution ofthe fit length sequence a) withthe aulocoreaton sequence ,(L). ‘Therefore, let us define g(k) oe the sequence tat i formed by convolving ay(k) with rh) Blk) = Sarak =D) (5.64) i ‘Thus, as shown in Fig. 5.1], gj(A) is the output ofthe jth-order prediction error filter (2) when the input isthe autocorrelation sequence r,(k) and the onhogonality condition statesBk) a 0 l ll Ae cocecel G ( 4 say rb) shy Liha AR) 1 I Td Figure 5.11 The sequences g(k) and gh(k) expressed a the output ofthe filter A, (2) and A (2). respectively when the inputs the autocorrelation sequence (k) Due tothe orthogonality condition, voosed and gh) = Ofork= 0.000 j =I. that g)(k) is equal to zero for k ay(k) Led (5.65) In addition, since wo-Eeuno=6 where the last equality follows from Eq. (5.3), then (0) is equal tothe jth-order modeling (prediction) error. Now consider what happens when a (k) is replaced with af(k) and g/(k) is the convolution of af (k) with re(k). i (&) = ah Or 1D = af) wre) (5.67) tat “Thus, gf (k) isthe response ofthe filter A(z) to the input r,(k). Since a(K) = af (j — Kk), then i 1 8 = asd -Drek-) = Yay Using the conjugate symmetry of the autocorrelation sequence, Eq. (5.68) becomes Ore = §+D (5.68) sh) = Pan -H-) =e -8 (5.69) ts244 __THE LEVINSON RECURSION Therefore, it follows from Eqs. (5.65) and (5.66) that Bh) =0 5 k=ON FHT (8.70) and sf=e (7) ‘These constraints on g(k) are illustrated in Fig. 5.11. ‘The next step is o use the Levinson-Durbin recursion to show how the sequences gy (k) and gf) may be updated to form gy4(k) and gi ,(K). Since gy41(k) = ay43(K) # rs(K)e using the Levinson order-update equation for a; 1(k) we have gj) = 474108) #728) = [ay(k) + Pinsahk — D] + re) gk) + Ty sighk =D 6.72) Ina similar fashion, since gf (k) = afk) « (4), application of the Levinson order- update equation gives Bhi a8, (8) rel) = [ah — 1) +}, ,a)(8)] #78) = kD +E e® 673) Equations (5.72) and (5.73) are recursive update equations for gj(k) and g¥(k) and are ‘dential in form to the update equations given in Bq, (5.27) for a(k) and af). The only difference between the two recursions is in the initial condition. Specifically, for aj(k) we hhave ao(k) = af'(k) = 8(K), whereas for g)(k) we have go(k) = gh (k) = re(k). Taking the z-transform of Eqs. (5.72) and (5.73) and putting them in matrix form gives 1 Eye! 2 Gr | _ GQ) cig oo] Lr, ot Lee A lattice filter interpretation ofthese update equations is shown in Fig. 5.12. Recall that our goal is o derive a recursion that will take a Sequence of autocorrelation values and generate the corresponding sequence of reflection coefficients. Given the first j teflection coefficients, the lattice filter shown in Fig, 5.12 allows one to determine the Sequences g,(k) and gk) from the autocorrelation sequence. Al that remains isto derive a method to find the reflection coefficient, ['-+1, from gj(k) and g(k). This may be done as follows, Since g).1(j + 1) = 0, evaluating Eq. (5.72) fork = j + | we have ga J +) = Bi + DF Tingh) =0 ‘Therefore, G+) af) and the recursion is complete. In summary, the Schur recursion begins by and afk) for k = 1,2... p with r(k), Bo(k) = gf 00 = rel) then computed from the ratio aol) rat 880) 720) ‘The first reflection coefficientTHE LEVINSON-DURBIN RECURSION __245 2) gn) Tis Te sho sh (2) Latice ter module for updating (A) and g(k). ab ae) 2h) folk) rl) a ry tn br — |. ee ahh) ah i) rp) ahi (®)A pub-oderatice fe for generating gp(k and gf (from 7.8). Figure 8.12 The latice filter used in the Schur recursion to generate the sequences gp(k) and g(k) from the autocorrelation sequence. Given T;, a lattice filter is then used to generate the sequences g,(k) and gf (k). The second reflection coefficient is then computed from the ratio = 812) FD which, in tum, is used in the lattice filter to generate the sequences g2(K) and gf(k) and so on. In updating the sequences g;41(k) and g,(k), note that since gj41(k) = 0 for k= 1,2,...,j + Land gf,(k) = 0 for k = 0,1,..., , not all p values need to be updated, Specifically, all that is required isto evaluate gy41(k) for k > j + Land gi ,(k) fork > j. The complete recursion is given in Table 5.5. ‘Additional insight into the operation of the Schur recursion may be gained if itis formulated using vector notation. Let g, and gf be the vectors that contain the sequences j(b) and g(k), respectively, and consider the 2 x (p + 1) matrix that has gy in the frst Table 8.5 The Schur Recursion 1 Seth) = 60 =r) fork = 0,1 2 Perf =Ohyeen p= (@)SetTja1 =H + DAD () Fork j +2... gyoilk) = 4) + Tyas ~ 1) © Fok=j+l, Bh =a - DFT 0 a)246 __THE LEVINSON RECURSION row and gin the second row gg ]_[ es a B(P) (af)" FO) FCI) ah p) From Eqs. (5.65), (5.66), and (5:70), we see that this matrix has the form® 7 ie O0 Doe BD ¥ )[° 2 wen 8) ca OO Bhi BhU+D fo) We are now going to perform a sequence of operations on this matrix that correspond to those expressed in the z-domain by Eq. (5.74). These operations will comprise one iteration of the Schur recursion. The first operation is to shift the second row of the matrix tothe right by one with g*(—1) entering as the first element of the second row (this corresponds to a delay of ¢(k) by one or a multiplication of G*(z) by z~'). Note that after this shift, the ratio of the two terms in column number (j + 2) is equal to —I'j.1. Therefore, evaluating, the reflection coefficient we may then form the matrix 1 Th ou-[ at, 8] om and multiply the shifted matrix by ©. to generate the updated Sequences gj and gf as follows Bia [ft San 0 BG +1) alr) ho Lt 1 JLo 0 Bk) ghp= ga 0 0 ga #2) = gale) 0 0 etd ah G+2 Bhs) This completes one step of the recursion. We may, however, simplify these operations slightly by noting that the first column of the matrix in Eq. (5.76) never enters into the calculations. Therefore, we may suppress the evaluation of these entries by initializing the first element inthe fist column to zero and by bringing in a zero into the second row each time that itis shifted tothe right, Note that since ¢, = g(j) then the error isnot discarded ‘with this simplification. In summary, the steps described above lead to the following matrix formulation of the Schur recursion, Beginning with co=[ 8 0 Qo ° 40) rel) (2) rsp) ] «~» (5.79) which is referred to as the generator matrix, a new matrix, Go, is formed by shifting the second row of Go to the right by one ra 0 re 72) re(p) G=lo nO nM) ney | Setting equal to the negative ofthe rato ofthe two terms in he second column of Go, We could aso use Bg, (5.71) 1 set Cj) = «in this matrix, ba this would make the nest step of computing the veeton coeficien Fusing Bq, (5.75) lithe more obscure.THE LEVINSON:DURBIN RECURSION __ 247 o-[s io] [3 0 812) ai(p) ] 0 sf) 82) at(p) ‘The recursion then repeats these three steps where, in general, at the jth step we 1, Shift the second row of G; to the right by one, 2. Compute P'j41 as the negative of the ratio of the two terms in the (j +2)nd column, 3. Multiply G) by @j4, t0 form Gy. ‘The following example illustrates the procedure. we then form the matrix and evaluate G; as follows 6, = OG (5.80) Example 5.2.7 The Schur Recursion Given the autocorrelation sequence ry = [2, 1, —1/4, 1/8)", let us use the Schur recursion to find the reflection coefficients Py, T'2, and I's 1, Step 1: We begin the Schur recursion by initializing the generator matrix Go to Gul 0 2 n@ nO ]_fO -1 14 us ° 70) re) re(2) (3) JL 2-1 -1/4 1/8 2. Step 2: Forming the shifted matrix &u[ “1-4 8 i [8 eet | 1 os o=[es This step only requires one division. 3. Step 3: Forming the product @ Go we find ~e&-| | oS }fo -1 -14 8 G=et=[o5 TF )[o 2 ot An] _fo 0° -3/4 0 =Lo 32 -98 3/16 Since we know thatthe second element in the first row of G, will be equal tozeroafter multiplying Gp by @), this step of the recursion only requires five multiplications. 4, Step 4: From the shifted matrix OW ae S [oe ap on 1 os e=[o5 | ‘Again, in this step, we have one division. it follows that = 0.5 and wwe see that [= 0.5 and248 __ THE LEVINSON RECURSION 5. Step S: Forming the product @26 we have a[ 1 95][0 0 -34 0 ]_fo0 0 -916 cmem=[ os ][0 0 at 3 ]=[0 0 om Wom For this step we have three multiplications. 6. Step 6: Finally, forming the shifted matrix 00 0 -916 000 98 1 os e=[as | 1 0S][0 00 -916]7_[o000 0 os 1 jlo 00 98 |=Lo 0 0 27/32 ‘Therefore, the error is we find that P3 = 0.5, es = 27/32 In this last step, we have one division and one multiplication, ‘Counting the number of arithmetic operations we find that this recursion required 9 multiplications and 3 divisions, Unlike the Levinson-Durbin recursion, the Schur recursion is well suited to parallel implementation. In particular, Kung and Hu [13] developed the pipelined structure shown in Fig. 5.13 for implementing the Schur recursion. This structure consist ofa cascade of p ‘modular cells, each containing an upper processing unit that computes the sequence values 2y(&) and a lower processing unit that computes the values g/(k). The upper processing Unit in cell | differs from the others in that i isa divider cell for computing the reflection coefficients I. The operation ofthis pipelined architecture is as follows 1, The upper processing units are initialized with the autocorrelation values rx(I),..-. 1r<(p) and the lower units with r, (0), ....r«(p—1) as shown in Fig. 5.13a. This step loads the cells with the values of the matrix Go, 2. The divider cell computes the first reflection coefficient by dividing the contents in the upper unit of cell | with the contents of the lower unit, Py = =r (1/10) This reflection coefficient is then propagated simultaneously to each of the other cells. 3. The upper units of each cell are updated as follows. a=W FT ink-D 5 kab and the lower units updated as BPO) =relk— +P rO + PeTHE LEVINSON-DURBIN RECURSION __ 249 o#@ oT. Jt od Ti a ry ro (2) Iniaiztion ofthe processing unis with the atocorelation sequence r(k). The divider cell computes the fat election coeficient which is then simsltaneouly propagated to each of te oter units ry ry ry _t ._t ie It UU ror oT 7 (b) Stat ofthe Schur pipeline after the fst pdt. Figure 5.13 A pipelined structure for implementing the Schur recursion. ‘This operation comesponds tothe updating of by multiplying by @, to form the new generator matrix G 4. The contents ofthe upper units in each cell are then shifted to the lft by one cll Note tha the last cell the top row i et vacated 5. The nex reflection coefficient is then computed in the divider cell by dividing the contents in the upper unit of cell | with the contents of the lower unit. This reflection. coefficients then propagated simultaneously to cach of the other cells. AC this point, the pipeline structure is nthe state shown in Fig, 5.13, 6. The upper units ofeach cell are again updated using 8k) = 8/(b) + ysis he — 1250 __ THE LEVINSON RECURSION and the lower units using Bh) = ahh — +0}, 18%) Steps4, Sand 6 are repeated until all ofthe reflection coefficients have been computed We conclude this discussion of the Schur recursion with an evaluation of the computational requirements ofthe algorithm. Note that at the jth stage of the recursion, one division isrequired to determine I and p— j—1 multiplications along with p— j—1 additions are necessary to evaluate the terms in the sequence g,41(k) (step 2b in Table 5.5). In addition, (p— 3) multiplications and (p — j) additions are necessary in order to evaluate the terms in the sequence gf, (k) (step 2c in Table 5.5). With p steps in the recursion, the total number of multiplications and divisions is P+2y (-)-p=P +p along with p? additions. Thus, in comparing the Schur recursion to the Levinson-Durbin recursion (p. 223), we see that although the same number of additions are required with the ‘Schur recursion, there are p fewer multiplications, 5.2.7 The Cholesky Decomposition We have seen that the Levinson-Durbin recursion isan efficient algorithm for solving the autocorrelation normal equations. I may also be used to perform a Cholesky decomposition of the Hermitian Toeplitz autocorrelation matrix R,.. The Cholesky (LDU) decomposition ‘of a Hermitian matrix Cis a factorization ofthe form c=LDL" 68) where L is a lower triangular matrix with ones along the diagonal and D is a diagonal matrix. Ifthe diagonal terms ofthe matrix D are nonnegative (Cis positive semidefinite), then D may be split into a product of two matrices by taking square roots D=p'"p'2 6.82) and the Cholesky decomposition of C may be expressed asthe product of an upper triangular and a lower triangular matrix as follows c= LD") (D'7L") 583) With a Cholesky decomposition of the autocorrelation matrix we will easily be able t0 establish the equivalence between the positive definiteness of R,, the positivity of the eror sequence ¢,, and the unit magnitude constraint onthe reflection Coefficients I'..In addition, ‘we will be able to derive a closed-form expression for the inverse of the autocorrelation ‘matrix as well asa recursive algorithm for inverting a Toeplitz matrix To derive the Cholesky decomposition of Rp, consider the (p + 1) x (p + 1) upper triangular matcix. 1 af) a3) +» — af(p) Oa iela teas) a3(p — 1) o 0 ! a;(p — 2) (5.84) 0 0 Oe 4THE LEVINSON-DURBIN RECURSION 251. ‘This matrix is formed from the vectors ay, ay, ..., @y that are produced when the Levinson- Durbin recursion is applied to the autocorrelation sequence r,(0),..., r«(p). Note that the {th column of A, contains the filter coeicients, a, padded with eros. Since Raf =u; (585) where uj = (0, 0, ..., 1] is aunit vector of length j +1 with a one in the final position, then &@ 00 ° te tol o RpA,=| * * @ 0 (586) shang which is a lower triangular matrix with the prediction errors ¢, along the diagonal (an asterisk is used to indicate those elements that are, in general, nonzero). Although we have not yet established Eq. (5.85), it may be inferred from the Hermitian Toeplitz property of R,. For example, since JR3J = R, then Ryay = GR;a; = eu, (5.87) Multiplying on the left by J and using the fact that JP = Vand Ja, = (af')*, Bq. (5:85) follows from Eq, (5.87) after taking complex conjugates. Since the product of two lower triangular matrices isa lower triangular matrix, if we multiply R,Ap on the leftby the lower triangular matrix A/! then we obtain another lower ‘wiangular matrix, A//R,A,, Note that since the terms along the diagonal of A, are equal to one, the diagonal of AM R,A.y will be the same as that of RpAp, and AYR, A, will also hhave the form «@ 0 0 0 * 4 0 0 ABR Ay aut . (5.88) Even more importantly, however, is the observation that since A//R, A, is Hermitian, then the matrix on the right side of Eq (5.88) must also be Hermitian. Therefore, the terms below the diagonal are zero and (5.89) where D, isa diagonal matrix D, = diagleo.er. Since Ai! isa lower triangular matrix with ones along the diagonal, then A! is nonsingular with det(A,) = 1 and the inverse of Al is also a lower triangular matrix. Denoting the inverse of A!” by Lp, if we multiply Eg, (5.89) on the left by Ip and on the right by Lf252 THE LEVINSON RECURSION we have R,=L,D,L! 5.90) Which isthe desired factorization. ‘An interesting and important property that may be established from Eq, (5:89) is that the determinant of the autocorrelation matrix Ris equal to the product of the modeling detRp = [Te 91) io To see this, note that if we take the determinant of both sides of Eq. (5.89) and use the fact that det(A,,) = 1, then det (AZR,A,) (det Ai!) (det Rp) (det Ap) deR, 692) and Eq, (5.91) follows since acto, = [ex is Using Eq (5.91) we may also show tat the ostvty of the autocorrelation matix Ry is tied to the positivity of the error sequence . In particular, note that R, will be pos definite if and only if Bt >0 5 F= Oe 6.93) Therefore, since det Ry = [Je (6.94) is it follows that R, will be positive definite if and only if ¢¢ > 0 for k = 0,1... 7. Furthermore, Ry will be singular, det(R,) = 0, if ¢ = 0 for some k. This relationship between the positive definiteness of R, and the positivity ofthe error sequence é may also be established using Sylvester's law of inertia (21), which states that if A, is a nonsingular matrix, then A//R, A, has the same number of positive eigenvalues as R, the same number of negative eigenvalues, and the same number of zero cigenvalucs. Since A! R,A, = Dy with A, being nonsingular, then Rp and D, have the same number of positive, negative, and zero eigenvalues. Since the eigenvalues of D are equal to ¢, it follows that R, > Oif and only if ¢, > 0 for all k, In summary, we have established the following fundamental property of Hermitian Toeplitz matrices.THE LEVINSON-DURBIN RECURSION _253 Property 7. For any (p + 1) x (p + 1) Hermitian Toeplitz matrix R,. the following are equivalent: 1 Rp>0 2g >O, fhe 3B Ml 2-1 ‘Therefore, the allowable values for a and f lie within the shaded region shown in Fig. 5.14. 5.2.8 The Autocorrelation Extension Problem In this section, we consider the autocorrelation extension problem, which addresses the following question: Given the frst (p + 1) values ofan autocorrelation sequence, r4(X) for k= 0.1.....p. how may we extend (extrapolate) this partial autocorrelation sequence fork > p insuch a way that the extrapolated sequence is a valid autocorrelation sequence? In Section 3.3.5 of Chapter 3 we saw that in order for a finite sequence of numbers to be a valid partial autocorrelation sequence, the autocorrelation matrix formed from this sequence must be nonnegative definite, R, > 0. Therefore, any extension must preserve this nonnegative definite property, ic.,.Ry+1 2 0, and Ry+2 > 0, and so on. In Section 3.6.1 of Chapter 3 we discovered one possible way to extrapolate a partial autocorrelation sequence. Specifically, given r(k) for k = 0, 1,.... p, if re(k) is extrapolated for k > p according to the recursion Lark) (5.95) f where the coefficients ap() are the solution to the Yule-Walker (normal) equations rb (5.96)THE LEVINSON:DURBIN RECURSION 255 then the resulting sequence will be a valid autocorrelation sequence. This follows from the fact that this extrapolation generates the autocorrelation sequence of the AR(p) process that is consistent with the given autocorrelation values, ie., re(K) for |k| 0? The answers to these questions may be deduced from Property 7 (p. 253), which states that if R, is nonnegative definite, then R41 will be nonnegative definite if and only if |Tp+1| = 1. Therefore, by expressing rs(p + 1) in terms of the reftection coefficient [p41 we may place @ bound on the allowable values for (p+ 1). The desired relationship follows from Eqs. (5.10) and (5.13), which gives, for rept Ds rep +1) =—Ppniép =) ap(kre(p —k +1) (5.97) With Fp41 a complex number that is bounded by one in magnitude, (P"y41| < 1,Eq. (5.97) implies that r, (p + 1) is constrained to lie within or on a circle of radius €, that is centered at C = ~ SP. ag(k)re(p ~ k + 1), as shown in Fig. 5.15, In the rel case, the range of allowable values is Cn K+) SPD sep— La Wr(p~k+V (5.98) ‘There are two special cases of autocorrelation sequence extension that are worth some dis~ cussion. The first occurs when we set I, = 0. In this case, the extrapolation is performed according to Eq. (5.95) with re + = Dap Dre(p +1 =D Furthermore, if the extrapolation of r,(k) is continued in this manner forall k > p, then the resulting autocorrelation sequence will correspond toan AR(»p) process that has a power Infsto +] Lay ayihisio—k+ 0] Refrip + D) Re[- Diss apitirtp A+) Figure 5.15 The allowable values for the autocorrelation coefficient r4(p + 1) given the partial ‘autocorrelation sequence r4(k) for k = 0, 1.-.+4 p are shown to lie within the circle of radius €y centered at C.256 THE LEVINSON RECURSION spectrum equal to loo? 7 699) Sap teye- ‘The second case occurs when |Pyai] = 1. When the magnitude of Tp. is equal to one, (p+ 1) lies on the circle shown in Fig. 5.15, the (p+ 1)st-order model error is zero, eon = 6[! Pen] =0 (6.100) the matrix R,,41 is singular, and the predictor polynomial A,s1(z) has all of its roots on the unit circle (see Property 5, p. 230). As we will see in Section 8.6 of Chapter 8, this case will be important for frequency estimation and for modeling signals as a sum of complex exponentials P(e) Example 5.2.9 Autocorrelation Extension Given the partial autocorrelation sequence r,(0) = 1 and r4(1) = 0.5, let us find the set of allowable values for r,(2), assuming that r,(2) is real. For the first-order model we have rel) 1 =-05 0) and ay(1) =P) = =0.5. Thus, with a first-order modeling errr of 1 = re (O)[1 — [Py] = 0.75 we have 1,2) = —Prer — a4 (1)r4(1) = -0.7572 +0.25 ‘Therefore, with |P2| < 1 it follows that 05 <7,2) <1 In the special case of FP: = 0, r5(2) = 0.25 and, in the extreme cases of 2 = +1, the 0.5 and r(2) autocorrelation values are r,(2) 5.2.9. Inverting a Toeplitz Matrix’ In this section, we will derive a recursive algorithm for inverting a Toeplitz matrix R,, and show how this recursion may be used to derive the Levinson recursion for solving a general set of Toeplitz equations Ryx=b ‘We begin by showing how the decomposition derived in Section 5.2.7 may be used 10 express the inverse of a Toeplitz matrix in terms of the vectors ay and the errors ¢;. The Levinson-Durbin recursion will then be applied to this expression for R,;' to derive the Toeplitz matrix inversion recursion. Let R, be a nonsingular Hermitian Toeplitz matrix. Using the decomposition given in Eq, (5.89), taking the inverse of both sides, we have (AUR,A,)' = A;'R;'A,” = D;! (5.101) Multiplying both sides of this equation by A, on the left and by A! on the right gives theTHE LEVINSON-DURSIN RECURSION __ 257 desired expression for A,D;tAH 6.102) Since D, is a diagonal matrix, D5" is easily computed, Therefore, finding the inverse of R, simply involves applying the Levinson-Durbin recursion tothe sequence r,(0)y..-yFx(P)> forming the matrix A,,. and performing the matrix product in Eq, (5.102). Example 5.2.10 Inverting a Toeplte Matrix Let us find the inverse of the 3 x 3 autocorrelation matrix 201 p=] 020 ae) We begin by using the Levinson-Durbin recursion to find the vectors aj and the errors « for j =0, 1,2. For j = 0, we have a = 1 and éq = r,(0) = 2. With Os o-[ asell-1F R; the first-order model is with an error For the second-order model, 74(2) +ar(1)re(1) and Thus, and o=all- Finally, using Eq. (5.102) we have for the inverse of Ra, 10 -12]712 0 0 1 Ry =Apylar=|0 1 0 0 12 0 0 O20 a1 0 0 2 Jl-i2 20-13 Lo 0 12 0 oo. o 0 73 || -12 0 23 0-1/3 Omatyz eso “13 0 23 -oo |258 THE LEVINSON RECURSION Note that the inverse of a Toeplitz matrix is not, in general, Toeplitz. However, as discussed in Section 2.3.7 (p. 38) and as illustrated in this example, the inverse ofa symmetric Toeplitz, ‘matrix is centrosymmetrc [17]. In the next example, Eq, (5.102) is used to find the inverse autocorrelation matrix of a first-order autoregressive process. Example 5.2.11 The Inverse Autocorrelation Matrix for an AR(I) Process Let R, be the autocorrelation matrix of an AR(I) process that is generated by the difference equation x(n) =ax(n~ 1) + win) ‘where [a < 1 and w(7) is zero mean white noise with a variance o. Thus, the autocorrelation sequence of x(n) is and the autocorrelation matrix is 1 @ a? oP @ 1 @ spec Sere 1 ar? eye i From Example 5.2.2 (p.221) we know thatthe jth-order model for x(n) is a = [1-21 0,...,0]" and that the sequence of modeling errors is and ‘Therefore, 1-2 0 00 ou = 00 G0ue1 00 Gr ou0 1 -a 0 0 0 o1 andTHE LEVINSON-DURBIN RECURSION _ 259 From Eq. (5.102) it follows that 1 0 0 0 00 0 ~ 0 1 00 1] 0 1 0 0 00 Riaa : : i 0 0 1 Onn0 10 Oun01 20: 0 Ot) on TiO: 00 1 0 0 -a 00 0: LO: 0: 0: -a a TiO. 0 0 1 aol 00 0 Onna 0 0 0 er Secerceeer tae 0 (ea) -al Loe -a lta 0 -« O20) +a? <0 O220; eae: Thus, we see that R;' is centrosymmetric and tri-diagonal. Equation (5.102) provides a closed form expression for the inverse of a Hermitian ‘Toeplitz matrix in terms of the matrices A, and D. However, since these matrices may be built up recursively using the Levinson-Durbin recursion, we may fold this recursion into the evaluation of Eq. (5.102) and derive a method for recursively computing the inverse of R,. In particular, given Ry! =Apyay (6.103) we may find Ry = AD Ay i Al 6.100)260 __ THE LEVINSON RECURSION To see how this may be accomplished, let us partition Ay; and Dy_1 as follows? (6.105) (6.106) 6.107) | (6.108) we see that R;" appears in the upper-left corner of the first term and that the second term may be written as an outer product (5.109) ah caf)" Note othe reader: Although there is nothing difficult about the following derivation from a mathematical point of view, froma notational point of view its bit eompex, Since there isnot much tobe gained in going through tis derivation, othe than 3 feeling of accomplishment, the eader may wish to jump to the main results which are contained in Eqs, (5.110) and (3.114. Although the discussion following Eq, (54) uses these results to derive the general Levinson recursion, this eeursin i re-derved in the following scton, without using these results.THE LEVINSON-DURBIN RECURSION 261 ‘Therefore, we have 6.110) ‘whichis the desired update equation for 0 =re(0), and ay = 1. Then, from Rj, ¢), and a), the Levinson-Durbin recursion is used to determine a, and ¢;; which, in turn, are then used in Eq. (5.110) to perform the update of R;!. The following example illustrates the procedure. Example 5.2.12 Recursive Evaluation of an Inverse Autocorrelation Matrix In this example we show how to recu matrix ly compute the inverse of the 33 autocorrelation 1 172 12 R=| 12 1 12 12 172 1 ing the recursion we have Bm =1 3 co=re(=1 y Rol = ‘Then, using the Levinson-Durbin recursion to calculate the first-order model Py = =re(D)/r9(0) = =1/2 (an)262 __THELEVINSON RECURSION Using the Levinson-Durbin we next generate the second-order model. With = re) + a(Dra() = 1/4 Py = -yfe=-1/3 a =all-r]=23 we find, for as, 1 0 1 m=] -12 |-3] -12 |=] -18 ° 1 -18 ‘Therefore the inverse of Re is te Ri Ry o 4/3 -2/3 0 =| 2 43 0 |+ Geo. 1 3/2 -1/2 -172 =| -12 32 -1n (5.112) -12 -172 3/2 which is the desired result, slight variation of the recursion given in Eq. (5.110) may be derived by exploiting the ‘Toeplitz structure of Rp. Specifically, since IR,J= Ry where J is the exchange matrix, taking the inverse of both sides of this equation, using the fact that J-! = J, we find R5')S ‘Applying this transformation othe expression for given in Eq. (5.110) we have RE SURI = Fea®,,)*af.,)'F (6.113)THE LEVINSON-DURBIN RECURSION __ 263 ‘Simplifying the expression on the right by multiplying by the exchange matrices, we find (s.114) which isthe desired recursion, In the next section, we derive the Levinson recursion which may be used to solve a ‘general set of Toeplitz equations of the form R,x, = by (6.115) where b, = [B(0), b(1), -... b(p)]” is an arbitrary vector. Here, we will derive an ‘equivalent form of the recursion that is based on the recursion given in Eq, (5.110). The derivation proceeds as follows, Suppose that we are given the solution to the Toeplitz equations Rx, = by and that we want to derive the solution to Ryde = Djs (5.116) ~-[ Applying Eq, (5.110) to Eq, (5.116) we have where 6.117) or (5.118) where ay = (a8) = SG WanGtl-H | G19 =} | Equations (5.118) and (5.119) along with the Levinson-Durbin recursion for performing the updates of a, constitute the Levinson recu264 __ THE LEVINSON RECURSION 5.3 THE LEVINSON RECURSION We have seen how the Toeplitz structure of the all-pole normal equations is exploited inthe Levinson-Durbin recursion to efficiently solve a set of equations of the form Rya, = eth 6.120) where R, isa (p-+1) x (p+ 1) Hermitian Toeplitz. matrix and uy is a unit vector. There are many applications, however, that require solving a more general set of Toeplitz equations Ryx, = b 6.121) Where the vector on the right-hand side is arbitrary, For example, as we saw in Chapter 4, Shanks’ method requires solving a set of linear equations of this form. In addition, we will see in Chapter 7 that the design of an FIR digital Wiener filter requires finding the solution tothe Wiener-Hopf equation Ryty = Tax where R,, is a Toeplitz matrix and ray is a vector of cross-correlations that is, in general, unrelated to the elements in R. Therefore, in this section, we derive the original recursion developed by N. Levinson in 1947 for solving a general set of Hermitian Toeplitz equations, of the form given in Eq. (5.121) where b is an arbitrary vector. This recursion, known as the Levinson recursion, simultaneously solves the following two sets of linear equations © nd) f@ nd) 1 6 AQ) ®t) nG-0 |} ga o Qn) nO rG-2 |} a2 |=] 0 | 6.122) re) GM) eG 2) (0) aD o and re) 7Q) mg) (0) >) re) AG) |} »@ bay re) ra) FO) AGB |] 9@ |=] b@ | 6.123) n@) n= nG-2 (0) iD a for j = 0, 1,..., p. Since Eq, (5.122) corresponds to the all-pole normal equations, we will find that, embedded within the Levinson recursion, is the Levinson-Durbin recursion, In Section 5.2.9 we used the Toeplitz. matrix inverse recursion to derive one form of the Levinson recursion. Here, we will rederive the recursion without using this matrix inverse recursion. The derivation, in fact, wil closely resemble that used for the Levinson-Durbin recursion in Section 5.2 ‘The Levinson recursion begins by finding the solution to Eqs. (5.122) and (5.123) for J =0. The result is clearly seen to be Go =re(0) and x0(0) = B(O)/r4(0) Now let us assume that we know the solution to the jth-order equations and derive the solution to the (J + 1)st-order equations. Given the solution a; to Eq. (5.122) the Levinson-THE LEVINSON RECURSION 265, Durbin update equation (5.14) gives us the solution ay, to the (j + 1)st-order equations rn) F20) nO nU+D 1 oat nO) GD rt) ayaa) ° = 6.124) nO nD nO) ray ail) ° nG+D r@ © JLaau+D ° Exploiting the Hermitian Toeplitz property of Ry.1, if we take the complex conjugate of Eq, (5.124) and reverse the order of the rows and columns of R1, Eq. (5.124) may be written in the equivalent form nO Mo AG AG+D) Paden o QO) BUD RGD ah) o im (5.125) mG) mG AO) AD aj.) 0 nGt) re) rl) re(0) 1 du Since it is assumed that the solution x; to Eq. (5.123) is known, suppose that we append a zero tox, and then multiply this extended vector by R,.1, as we did in the Levinson-Durbin recursion. This leads to the following set of linear equations OD NG) G+) |] 5 50) nO) nO RGD AG) |] oa i =] 5 | 6.126) nO BUD nO na [paw] | oa GAD rd) re) rx) 0 4 where 5 is given by §= Dn l= dy i Since 8, is not, in general, equal to b(j + 1), the extended vector [x70] is not the solution to the (j + 1)st-order equations. Consider what happens, however, if we form the sum x0) a+) b@) x) aa) ba) Ras bl bape i = (5.127) 0) a.) bu) ° 1 BAGGTHE LEVINSON RECURSION where gyi is an arbitrary complex constant. If we set then B+ ayiefes =O +0) and the vector x) G40 4) a) Xie = tae 6.129) i) a}, 0 1 is the desired solution. Note that since ¢; is real, the complex conjugate in Eq. (5.128) may be neglected. Thus, the Levinson recursion, as summarized in Table 5.6, is defined by Eqs. (5.128) and (5.129) along with the Levinson-Durbin recursion, Example 5.3.1 The Levinson Recursion Let us use the Levinson recursion to solve the following set of Toeplitz equations, 421 ][ xo 9 24 2/] xa) |=] 6 124]/] @ 12 ‘The solution is as follows. 1, Initialization of the recursion. O=4 90) = b(0)/r(0) = 9/4 2. For | =0, yw =r) —w/e Therefore we have, for the first-order model, [ ] [ | “ - a -2 Coniming wehwe = eo! iru] = 4-1/4) =3 bo ra(O)r(1) = 2(9/4) = 9/2 a = [60 = a]/er = 6-9/3 = 172 -1/2THE LEVINSON RECURSION _'267 ‘Table 5.6 The Levinson Recursion 1. Initialize the recursion @ (0) (b)—x0(0) = b(09/r5 (0) © «=r: 2 Forj=Ol..pod @) yen DFC aOnG =i +d © T= ale () Feri = Nad al) =a) +P ai 1+) @ Gut D=Pt © Ger Sell 1 pt?) © = Li gato 14D @ — gn =BU+D-Slen () Ford 0.4-0) sel 10+ gai ~14 0 wo ay +1) = aint which leads tothe first-order solution of? eel? [2] Faia : | 0 1 0 1 12 3. For j = 1 we find that n= r(2) +a(I)r(1) = 14 2-1/2) =0 ‘Therefore, ) = 0 and the second-order model is simply In addition, €» = 1 and rQ) 4 [x0 ne] (2) J=20202 3 a = [b@ -si)/o = 2-9/3 =3 “Thus, the desired solution is x10) 4x2) 2 ° 2 m=| nc) pte} acm [=] 2 [+3] -v2 f=] -1 0 1 0 1 3268 __ THE LEVINSON RECURSION The General Levinson Recursion function x-glev(r,b) (2:51/*#15puaiay: germa=-g/ epsilon; (a0) + ganma* {0;conj (f2ipud (a) 1): fepsilonsepsiions (2 - absigamma) 2): Gortasr (2:3) /*fLipud 2): g=(b(3} delta) /epstion: x= (x70) + q* [eons (£1 ipudiay)] ond Figure 5.16 A MArLAs program for solving a set of Toeplitz equations of the form RX, the general Levinson recursion, ‘We conclude this section by looking atthe computational complexity of the Levinson recursion. From Table 5.6 we see that at jth step of the recursion there are two divisions, 4(j +1) multiplications, and 4j +3 additions. Since there are p steps in the recursion, the ‘oial number of multiplications and divisions that are required is Yi +6 = 2p? +47 i and the number of additions is (4j +3) = 2p + is “Therefore, the number of multiplications and divisions is proportional to p? for the Levis son recursion compared with p? for Gaussian elimination, and twice that required for the Levinson-Durbin recursion. A MATLAB program for the general Levinson recursion is pro- vided in Fig. 5.16. 5.4 THE SPLIT LEVINSON RECURSION* ‘The Levinson-Durbin recursion provides an efficient algorithm for solving the pth-order autocorrelation normal equations given in Eq, (5.5) using p? + 2p multiplications and p* additions. Itis possible, however, to reduce the number of multiplications by approximately factor of 2 while maintaining approximately the same number of additions using what is known as the Split Levinson Recursion [5]. The key to achieving this reduction isto replace the predictor polynomials a; with symmetric vectors, ;, and use these instead of a; in a “Levinson-like” recursion. Due to the symmetry of), only half of the coefficients need toTHE SPUT LEVINSON RECURSION” _ 269 bbe computed. What needs to be done to make this procedure work isto develop a recursion for the symmetric vectors along with a procedure to find a, from the symmetric vectors. ‘The split Levinson algorithm was originally developed by Delsarte and Genin for the case of real data [5]. Later they extended this work by developing a split Schur recursion along with a split lattice filter structure [6]. Although split algorithms have also been developed for the complex case (1, 2, 12) these algorithms are a bit more complicated. Therefore, in this section we will restrict our attention to the case of real data and follow the development “originally presented in [5]. In the following chapter, we derive the split lattice filter structure. From our development of the Levinson-Durbin recursion in Section 5.2.1, we have seen thatthe solution to the pth-order normal equations is built up recursively from the sequence of reflection coefficients I for j = 1... p andthe prediction error filters A, (z) as follows Aj) = AAT AAL 5 Aol) = 1 (5.130) where AR(2) = 24A,(2~!) is the reciprocal polynomial® If we replace z with 2~! i Eq, (5.130) and multiply both sides by ~! we find, as we did in Section 5.2.1, that AR( satisfies a similar recursion given by AR) = TAR) +E /Ajue) ¢ ARG)=L 6.1 Let us now define a second set of polynomials, 5)(z) and $c), that are formed by setting the reflection coefficient in Eq. (5.130) equal to F = 1 and I; = —1, respectively, SO) = Ayre) + 71AL (5.132) S$@ = Apa) — 2A (5.133) ‘These polynomials ae referred toa the singular predictor polynomials: By adding together Eqs. (5.130) and (5.131) we may relate $)(z) to A,(2) and AR(z) as follows, (L415) S)(2) = Ay(2) + AR) (5.134) Similarly if we subtract Eq. (5.131) from Ba. (5.130) we find that the singular polynomial SP(2) is related to 4,(2) and AR(:) by (= rp ste = A@ AR) 6.135) If we denote the coefficients ofthe singular predictor polynomials S,(z) and SF(z) by 5) and s (0), respectively, Yswet 3 and bre: ‘Recallthat we ae assuming that the coficions (are rel. Therefore, Ay (2) is conjugate symmetic, A (2) .A3(2" and, 25a rel, the reciprocal polynomial hss he Form given bere [eompare hist the definition of (2) sven in Eg (52 is ntresting to noe that the singular predictor polynomials ae identical tothe line spectral pairs that are of imerestin speech processing [423270 __ THE LEVINSON RECURSION and let §; = (54(0),...»5()] and sf = [3f(),....8f(j)}? be the singular predictor vectors, then it follows from Eqs (5.134) and (5.135) that 7 aj) a) aj-)) atte} f+ (5.136) aG-) a) 4) 1 and 1 a) a) aG-) (-Tpst= -| 3 6.137) aj -1) a) (i) ' Therefore, ; isa symmetric vector that is formed by taking the symmetric (even) part of the vector a, and s* is an antisymmetric vector corresponding to the antisymmetric (odd) part of a Thus, $;(2) is symmetric polynomial SQ) = 24S" ‘and S$ (z) is an antisymmetric polynomial, S@= and asa result, both have linear phase." We now want to show that 8) and sf satisfy a set of Toeplitz equations. Clearly from Eq. (5.136) it follows that ste) C4T)R Sy = Ria + Rak Therefore, since 6f1,0,...,077 and then (5.138) "1 for his reason tat the algorithm we are developing scaled a “spi Levinson recursion, it splits the predictor polynomial a Int is symmetric and antisymmetric par "We may aso see this rom the definitions of 52) and S¥(<) given in Eqs (5.132) and (6.139THE SPUT LEVINSON RECURSION" _27 where we have defined yee 5.139) [Note that x, may be evaluated from (i) and the coeficent ofthe singular predictor vector 5; a follows (8.140) 6.141) where = 6.142) Given that 5, and s} satisfy a set of Toeplitz equations, we may now derive a recursion for the singular predictor polynomials §;(2), similar to what we did for A,(z) and A(z) in the Levinson-Durbin recursion. To do this, we must first show how to express Aj(z) in terms of 5)(2) and S;41(2). Beginning with Eq. (5.134) we have Aj(2) = (1 +P) S;@) — AR) (5.143) Replacing j by (j + 1) in Eq, (5.132) and solving for AP (z) gives ARG) = 25j01(@) ~ 24j(2) (6.148) Substituting Eq, (5.144) into Eq, (5.143) we find Ayla) = 0+ E))S)) ~ 25.4102) +24) “Therefore, solving for Aj(2), (1= Aj) = +P) S@)— 25.10) 5.145) which is the desired relation. Using Eq. (5.145) we may now derive a recursion for $)(2). We begin with the Levinson-Durbin recursion Ay(z) = Aj) + Pj2 TAR Cz) (5.146) and note from Eq, (5.132) that AP (2) may be expressed in terms of A,-.(2) and $)(2) as follows, Af (2) = 25)(2) — 2Aj-1(2) Substituting this expression for AP , (2) into Eq, (5.146) we have Ay(2) = Aya) +2j2-"[25)(2) ~ 24-1] T= T)A 1G) +P S5@) (s.147)272 __THELEVINSON RECURSION Now, as we saw in Eq. (5.145), A, (z) may be written in terms of S}(z) and S;,(z). Therefore, multiplying both sides of Eq, (5.147) by (1 — 2) (= DA) = =F) = DA, 1) FEV = DS) and using Eq, (5.145) 0 eliminate (1—2)A,(z) and (1—z)A;-1(2) wehave, after combining together common terms, Sie) 1295) 5215 4) 5.148) ‘where we have defined § = Td FT) (5.149) In the time-domain, Eq (5.148) becomes S10) = 5) +50 = 1) = 55-40 = 1) (8.150) which is referred to as the three-term recurrence." Expressed in terms of the singular predictor vectors, this three-term recurrence is sj) 50) o 0 si(l) 5) 50) 5-10) = + -% : (5.151) san) 5) 5G-) si -D san +) o 5) 0 Equation (5.148) or, equivalently, Eq. (5.151), gives us a method for computing the singular predictor polynomial 5},.(z) given the previous two polynomials, S)(z) and )-1(z), and the parameter 5,. What is needed to complete the recursion is a method for computing 4,. Since the reflection coefficients I are not known, we cannot use Eq, (5.149) directly. However, using Eq. (5.139) we see that g_ ltt T+ ¢. However, since 5 -(1 —F9), then y _ O-Td +h. oa 1+T) ‘which, from Eq (5.149), is equal to ;. Therefore, 5; may be evaluated using 1-1) +0 j-1) (5.152) (5.153) "2s relation is called a thee-term recurence since there are thee terms that are used to update the singular predictor vector By contrast, noe thatthe Levinson Durbin order upat equation is 3 rwt-tem recurrence.THE SPLIT LEVINSON RECURSION” _273 slog with (140 Eauatons (5.40) (5.150) and (153 consi the spit Levinson tuto for gesagt singular redcor polyoma Te nil eden ued toes the reunion 1 Tr ] Siz) = Ao(z) +271 AS @) = 14271 (6.154) Now that we have a recursion for S;(z), all that is left is to show how to derive the prediction error filter A(z) from the singular predictor polynomials, From Eq, (5.145) with j = p we have Se) = [Avte + ARG a- Multiplying both sides by DAp() = (1+ Py )Sp(2) ~ 2S 12) 6.155) -2~' and solving for Ap(z) gives Ap (2) = 2°! Ap(2) + Span(2) — 27M + Pp) Sp(2) (5.156) which, in the coefficient domain becomes api) » 6.157) With the initial condition a,(0) = I we thus have a recursion for the coeflicients ap(j). All that is missing is an expression for (1 + I",). However, this may be computed from S)4.1 and §, as follows. Setting z = 1 in Eq, (5.155) we have C+ PPS (6.158) Therefore, since S| then (6.159) ‘The complete Split Levinson Recursion i summarized in Table 57. Iti presented, however, ina form that doesnot take full advantage ofthe symmetries inthe polynomials §(z). Init ‘mostefficient form, for example, the evaluation of the coefficients t; would be accomplished as follows vena Y [po+nd-o]so@ 3 joad ueip Ye [nW tnd] +rG2qU2 5 jeven274 __THELEVINSON RECURSION ‘Table 5.7_The Split Levinson Recursion 1. Initialize the reeursion (@) =2ands) = (1.177 2 Forj @ oy Psi) ale © 59 O=5 nF +D=1 @) Fork=led 8) = 918) + 54k D=540k~D Set 141) = TE syal/ Tha) A For fad.2oep ()ap(J) = pl = D+ 5p) — + Fy = 1D 5. Done In addition, due to the symmetry of s), only half of the coefficients need to be evaluated, thereby reducing the number of multiplications and additions by approximately a factor of | two, Although not included as apart ofthe split Levinson recursion in Table 5.7, iti also possible to compute the reflection coefficient recursively from the coefficients 5. Specifically, solving Eq. (5.152) for T we find that t= ey (5.160) ‘The initial condition for this recursion is My = 0. Example 5. Let us use the split Levinson recursion to find the predictor polynomial corresponding to the autocorrelation sequence The Split Levinson Recursion r=[4,1, 1, -2]" In order to make the discussion easier to follow, we will not employ the symmetry of 5; in ‘our calculations. 1. Initialization of the recursion,THE SPUT LEVINSON RECURSION" _275 2. For j n=[ al : |=: naa o/4 Now, using E9.(5.151) we have 1 ° ° 1 s={a1 f+] a |-s]2]=] -12 ° 1 o 1 3. For j = 2we have 1 neni] -12 |=92 : & =n =0/)0/5)=9/10 1 Again using Eq.(5.151) to finds; we have 1 0 ° 1 -1n 1 |_| as n= a + -9/10 = . 1 -12 [>] as ° 1 ° 1 4. Finally, for j = 3 we have \ 1s peau} 79 | aos 4s 1 by = n/n = 6/9Q/9) = 4/15, Again using Ea (5.151) to finds we have 1 ° 0 1 4s 1 \ 1p sua] as |+] -2s |-4n5] -12 |=] -25 1 2/5 1 3 ° 1 ° 1 Finally, we compute the predictor polynomial with the help of Eqs. (5.157) and (5.159).276 __THELEVINSON RECURSION First, using Eq, (5.159) we have Youti) (ars =5/3 Lao ‘Then, from Eq, (5.157) we have 1 0 1 0 ax 1 sa(l eee aoe ash) 432) ax(1) 54) s() 4303) 4302) 543) $302) Incorporating the values for the singular predictor vectors gives 1 0 1 ax I 4/3 wD | eliaae a3(2) axl) 0 a3(3) a3(2) I Solving this equation recursively for the coefficients as(i) we find _| 123 = -3 2 5.5 SUMMARY Inthis chapter, we have looked at several diferent algorithms for recursively solving asetof Toeplitz equations. Each ofthese algorithms exploits the symmetries and redundancies of the Toeplitz matrix. This chapter began with a derivation of the Levinson-Durbin recursion forsolving aset of Hermitian Toeplitz equations in which the right-hand side i @ unit vector, Rya, = 6m (5.161) ‘Compared with O(p3) operations that are required to solve a set of p linear equations using Gaussian elimination, the Levinson-Durbin recursion requires O(p*) operations. Us- ing the Levinson-Durbin recursion we were able to establish some important properties of the solution a, to these equations. It was shown, for example, that a, generates a minimum phase polynomial if and only ifthe Toeplitz matrix R, is positive definite and, inSUMMARY 277 this case, the reflection coefficients [that are generated by the Levinson-Durbin recursion are bounded by one in magnitude, [P| < 1. As a result, we were able to establish that the all-pole models that are formed using Prony’s method and the autocorrelation method are guaranteed to be stable, We were also able to demonstrate that if R, > 0, then re(K) for k = 0, 1,..-. p represents a valid partial autocorrelation sequence and may be extended for k > p. This extrapolation may be performed by setting [P| < 1 for k > p and finding the corresponding autocorrelation sequence using the inverse Levinson-Durbin In addition to the Levinson-Durbin recursion, we derived a number of other “Levinson- like" recursions. For example, with the view thatthe Levinson-Durbin recursionis a mapping froma set of autocorrelationsr,(K) to a set of filter coefficients a (k) and a set of reflection coefficients I, we saw that it was possible to derive recursions to find the filter coefficients from the reflection coefficients (step-up recursion), the reflection coefficients from the filter coefficients (step-down recursion), and the autocorrelation sequence from the reflection coefficients and the modeling error ¢, (inverse Levinson-Durbin recursion). Two additional recursions that were developed for solving equations of the form given in Eq. (5.161) are the split Levinson recursion and the Schur recursion. The split Levinson recursion reduces the computational requirements of the Levinson-Durbin recursion by approximately 1 factor of two by introducing the set of singular vectors s,. The Schur recursion, on the ‘ther hand, achieves some modest computational savings by generating only the reflection coefficients from the autocorrelation sequence. The main advantage of the Schur recursion ‘over the Levinson-Durbin and split Levinson recursions, however, is that itis suitable for parallel processing in a multiprocessor environment. Finally, we derived the general Levinson recursion for solving a general set of Toeplitz equations ofthe form Ryay =D (5.162) where b is an arbitrary vector. As with the Levinson-Durbin recursion, the general Levinson recursion requires O(p) operations. Inaddition to the recursions developed in this chapter, there are other fast algorithms that ‘may be of interest, With respect to the problem of signal modeling, one particular algorithm of interest is the fast covariance algorithm for solving the covariance normal equations. given in Eq, (4.127). As with the Levinson and the Levinson-Durbin recursions, the fast covariance algorithm requires on the order of p? operations. Although the approach that used to derive the fast covariance algorithm is similar in style to that used for the Levinson recursion, it is considerably more involved. For example, the fast covariance algorithm requires a time-update recursion in addition to the order-update recursion that is found in the Levinson recursion, This time-update recursion relates the solution to the covariance normal equations over the interval [L, U’] to the solution that is obtained over the intervals [L +1, U] and [L,, U — 1]. The details of this recursion may be found in (18, 19]. Another recursion of interest is the Trench algorithm, which generalizes the Levinson recursion by allowing the Toeplitz matrix to be non-symmetric (10, 25]. The Trench algorithm therefore ‘may be used to solve the Padé equations and the modified Yule-Walker equations. In its most general form, the Trench algorithm gives both the inverse matrix along with the solution x, to Fg. (5.162). Finally, it should be noted that, in addition to the “Levinson-like” recursions, there are other fast algorithms for solving linear equations that have other types of structured ‘matrices such as Hankel and Vandermond matrices (8, 15]278 __THE LEVINSON RECURSION References 1. ¥. Bistritz, H. Lev-Ari, and T. Kailath, “Complexity reduced lattice filters for digital speech processing,” Proc, 1987 Int. Conf. on Acoust, Speech, Sig. Proc., pp. 21-24, April 1987. 2. ¥. Bistitz, H. Lev-Ari, and T. Kailath, “Immittance-type three-term Schur and Levinson recursions for quasi-Toeplitz and complex Hermitian matrices.” SIAM Journal on Matrix Analysis ana Applications, vo. 12, no. 3, pp. 497-520, July 1991. 3. R.V. Churchill and J. W. Brown, Complex Variables and Applications, McGraw-h 1990. J. R, Deller, J. G. Proakis, and J. H. L. Hansen, Diserete-time Processing of Speech Signals, MacMillan, New York, 1993. 8. P. Delsarte and ¥. V, Genin, “The split Levinson algorithm,” IEEE Trans. Acoust, Speech, Sig Proc., vol. ASSP-34, no. 3, p. 470-478, June 1986, 6. P. Delsarte and ¥. V. Genin, “On the spliting of classical algorithms in linear prediction theory.” IEEE Trans. Acoust, Speech. Sig. Proc, vol. ASSP-35, no. 5, pp. 645-653, May 1987. 7. J. Durbin, “The fitting of time series models” Rex. Internat. Statist. Inst, Vol. 23, pp. 233-244, 1960. 8. 1. Gohberg, T. Ksilath, and I. Koltrcht, “Efficient solution of linear systems of equations with recursive structure.” Lin. Alg. Appl, vol. 80, no. 81, 1986. 9. L.Gohberg, ed. Schur Methods in Operator Theory and Signal Processing, vol. 18, Bikhiuser, Boston, 1986, 10. G.H, Golub and C. F Van Loan, Matrix Computations, Johns Hopkins University Press, Balt- ‘more, 1989, |. M. H. Hayes and M. A. Clements, “An efficient algorithm for computing Pisarenko’s harmonic p then I, = 0 for Ik > p. (b) Given an autocorrelation sequence, r, (A) for k ‘Toeplitz, matrix + Prif the (p +1) x (p +1) Toep {rs (0), re(1),.--4rp(P)} is positive definite, then rx), +5726 P)-0,0,-. J” Will always be a valid autocorrelation sequence, ie, extending r,(K) with zeros is a valid autocorrelation extension. (©) Ifr,(k) is periodic then P; will be periodic with the same period. 5.10, In our discussions of the Levinson-Durbin recursion, we demonstrated the equivalence between the following three sets of parameters + rere (Du ooo te) © ap (V), 492)... dy(p), BO) HT Pas cee Prey For each of the following signal transformations, determine which of these parameters change and, if possible, describe how. (a) The signal x(n) is scaled by a constant C, x(n) = Cx(n) (b) The signal x(n) is modulated by (—1)", Xn) =(-1)"xe) |. The autocorrelation ofa signal consisting of a random phase sinusoid in noise is 1e(k) = P cos(kan) + 035(k) $i where ay is the frequency of the sinusoid, P is the power, and o3 is the variance of the noise. Suppose that we fit an AR(2) model to the data. (a) Find the coefficients ay o3,and P (b) Find the reflection coefficients, and Ts, corresponding to the AR(2) model (©) What are the limiting values of the AR(2) parameters and the reflection coefficients as a3 0? 1, a2(1), a2@2)]" of the ARQ) model as a function of ay, 5.12, Given that r,(0) = 1 and that the first three reflection coefficients are Ty Py =05, and Pr = 0.25, (a) Find the corresponding autocorrelation sequence r_(1), re(2). F<(3). (b) Find the autocorrelation sequence r,(1), r<(2).rx(3) for the case in which the reflection coefficient I; is a free variable, ic., solve for the autocorrelation values as a function of Ps, (©) Repeat part (b) when both I, and I’ are free parameters.282 THE LEVINSON RECURSION ‘5.13. Using the autocorrelation method, an all-pole model of the form EO) Trae +aQe?* +a@)e hhas been found for a signal x(n). The constant in the numerator has been chosen so that the autocorrelation of the signal matches the autocorrelation of the model, ie., rekk) = rib) where 1, (k) is the autocorrelation of h(n). In addition, you know the following about the signal and the model: 1 r(0) =4 20s =05 3. P1 > Oand 2 > 0 4, x) 5. €1 = 11/16, where eis the third-order modeting error. 6. det(R:) = 27, where Re = Toep{rs(0), 7: (1), te(2)h Find the values for the model parameters a(), aQ), a), and (0) ‘5.14, ‘The frst seven values of the unit sample response, h(n), of a 3rd-order linear shift- invariant filter HO = +66)27 ae 1+ bz"! + b@)z MO= ATL Ge tae ae given by h=[l, 1/4, 1/2, 1, 0, 0, 7/8)” Determine whether or not the filter is stable. If more information is needed, state what is required and make the necessary assumptions. 5.15. Theextendibility problem in power spectrum estimation concems the issue of whether ‘or not a finite length sequence, 16D Q) Brees) may be extended (extrapolated) into legitimate autocorrelation sequence so that Pel) = Yo rei (P5.15-1) isa valid power spectrum. In other words, is it possible to find values of r_(k) for [kl > p such that P, (e/®) in (PS.15-1) is a non-negative real function of (a) Develop a procedure that uses Levinson’s recursion to test the extendibility of a sequence. (b) Use your procedure developed in (a) to determine the constraints on a and b that are necessary and sufficient in order for the sequence nO) =1, rl) =a, rx(2) to be an extendible sequencePROBLEMS 283 (©) Assuming that the sequence in (b) is extendible, find two different legitimate extensions. 5.16. Which of the following autocorrelation sequences are extendible? For those that are extendible, find an extension and determine whether or not the extension is unique. @) , =[1.0, 06, 0.6)” ©) Fr, =[1.0, 06, -0.6]" (© ry =[1.0, 0.0, 1.0)” 5.17. Let Ry be the symmetric Toeplitz matrix formed from the autocorrelation sequence 7, (0), re). Fx), and 7-(3). If the reflection coefficients that result from applying the son-Durbin recursion to Rs are and if r.(0) = | find the determinant of Rs. 5.18. Let r,(k) = 028(k) + 1. Find the reflection coefficients My for all k > 0 and find the all-pole models A (k) for all p > 1 5.19. A pth-order all-pole model fora signal x(n) is parameterized by the p+ 1 parameters
O. Again your method should have less than 4.N multiplications and divisions at each step. (©) Write an expression fr the error Ey, at the noth step of the recursion in terms of the coefficients g(n) and the coefficients of the least squares inverse filter hy) (4) Write a MATLAB program that implements the Simpson sideways recursion. (©) How can this recursion be used to find the inverse of a Toeplitz matrix? © computer Exercises (5.1, Modify them-filer tog -mtofind the Cholesky (LDU) decomposition ofa Hermitian Toeplitz matrix R,. Compare the efficiency of your mile withthe MATLAB Cholesky decomposition progam choi .n (C52, ‘The inverse Levinson-Durbin recursion isa mapping from he sequence of refction coefficients I) tothe autocorrelation sequence r;(k). Beginning with , (0), the autocorrelation F(t) fork > Ore determined recursively In this problem, we derive and implement another approach forthe recovery ofthe autocortclation sequence (a) Begining with the normal equations R, rewrite these equations in the form pty where Ay, is a matrix containing the pth-order model coefficients a,(&) and rp is the286 THE LEVINSON RECURSION vector containing the autocorrelation values that are to be determined (assume thatthe data is complex). What isthe structure of the matrix A, and how may it be generated? (&) Write a MATLAB program that will set up these equations and solve forthe autocorrelation sequence (©) Compare the complexity of this solution to the inverse Levinson-Durbin recursion. C53. ‘The Schur recursion is a mapping from the autocorrelation sequence r,(k) to a set of reflection coefficients (a) Write a MATLAB program that implements the Schur recursion. Using the Flops com- ‘mand, compare the efficiency of the Schur recursion to the Levinson-Dusbin recursion to find the reflection coefficients I’ from an autocorrelation sequence rs(K).. (6) Derive the inverse Schur recursion that produces the autocorrelation sequence r(k) from the reflection coefficients I, and write a MATLAB program to implement this recursion, Compare the efficiency ofthe inverse Schur recursion othe inverse Levinson- Durbin recursion to find the autocorrelation sequence from the reflection coefficients. (C5.4, In the derivation of the split Levinson recursion, we defined a new set of coefficients, 5). As we will see in Chapter 6, these coefficients are used in the implementation of a split lattice filter (a) Show that these two sets of coefficients are equivalent in the sense that one set may be derived from the other, (apes esaeene pt Ree (Hay Hesse (b) Write a MaTLAs program gtod .m that will convert the reflection coefficients I into the split Levinson coefficients 8, (©) Write a MATLAB program dtog .m that will convert the split Levinson coefficients 5, into a set of reflection coefficients I. (@) Use these m-files to study the relationship between the coefficients 'y and 6). For example, what does the constraint [P| < L imply about §;? What happens if T, = +1? Is it possible to determine whether or not |F'| < 1 by simply looking at 5;? 55, The line spectral pair (LSP) coeficients were introduced inthe 1980's as an alter native othe filter coefiicientsap(k) and the reflection coeicients I) in the representation of an alk-pole model for a signal x(n). Given the prediction error filter A(z), the LSP representation is as follows. With 5)(z) and $¥(z) the singular predictor polynomials that were introduced in the derivation ofthe spit Levinson recursion, note that A(z) may be represented in terms of Spy1(2) and Sf, (2) as follows, 1 Es Ant = HY puter4 Sh] Since the roots of $,:(z) and Sf, ,(2)lieon theunitcircle, the angles of these zeros uniquely define the singular predictor polynomials and, thus, the prediction error filter A(z). (a) Generate several different singular predictor polynomials, Sp.y(c) and Sf, (2). and Jook atthe locations ofthe roots ofthese polynomials. What relationship do You observe ‘between the locations of the roots of S);(z) and the roots of S*,,(<)?(b) © @ COMPUTER EXERCISES 287 Each zero of A(z) maps into one zero in each of the polynomials 5,,(z) and S%, (2). Investigate what happens when a zero of Sp4.1(z) is close to a zero of S%,,(z). Does this imply anything about the spectrum A,(e/*)? Let 64 denote the angles of the roots of the polynomials $,.1(c) and Sf, (2) that lie within the interval [0, x]. Assume that the angles are ordered so that 4,1 = 6x, and let ‘Af = 6441 — 9. Write @ MATLAB program 1sp.m that will produce the prediction error filter A,(z) from the angles A04. Generate a number of different AR(12) processes by filtering unit variance white noise with a 12th-order all-poe filter. Place the poles ofthe filter close to the unit circle, at approximately the same radius, and at angles that are approximately harmonically related to each other. Compare the accuracy in the model for x(n) when quantizing the model coefficients ap(k) to 16 bits and quantizing the LSP coefficients A to 16 bits.LATTICE FILTERS 6. INTRODUCTION In Chapter 5 we discovered how the reflection coefficients from the Levinson-Durbin recursion may be used in a lattice filter implementation of the inverse filter A, (z). This structure hhas a number of interesting and important properties including modularity, low sensitivity to parameter quantization effects, and a simple test for ensuring that 4,,(z) is minimum phase. As a result, lattice filters have found their way into many different and important signal processing applications and have been used both for signal analysis and synthesis. In this chapter, we look again at the lattice filter structure, derive some alternative lattice filter forms, and explore how these fillers may be used for signal modeling. ‘This chapter begins with a derivation of the FIR lattice filter structure in Section 6.2 where we explore the relationship between the lattice filter and the problems of forward and backward linear prediction. In Section 6.3 we then develop the split lattice filter whi based on the singular predictor polynomials and the split Levinson recursion. All-pole |: filters, allpass lattice filters, and pole-zero lattice filters are then considered in Section 6.4. In Section 6.5 we then turn our attention to the use of lattice filters for all-pole signal modeling. We derive three approaches that are based on the sequential estimation of the reflection coefficients. These methods are the forward covariance method, the backward covariance method, and Burg’s method. We also.present a nonsequential method known as the modified covariance method. Finally, in Section 6,6 we look at how the lattice methods of signal modeling may be used to model stochastic signals. 6.2 THE FIR LATTICE FILTER ‘To derive the FIR lattice filter structure, we will proceed exactly as we did in Section 5.2.2, ‘beginning with the problem of all-pole signal modeling. Therefore, letxr(m) bea signal that is to be modeled as the unit sample response of an all-pole filter of the form _ 60) eae) 289290 varie rurees ‘AS we saw in Section 4.4.3 of Chapter 4, with Prony’s method the coefficients of A,(z) are found by minimizing the squared error,’ Liev? where “ p(n) = x(n) # apn) = x(n) + Lowe —k with (0) = 1 Nt tha by minimizing we ae soving forth eons g(t) that minimize the difference between x(n) and what may be considered to be an estimate or prediction of x(n), Hn) = = Paypyxem —b) ra ‘This estimate is formed by taking a linear combination of the previous p values of x(n) and is referred to asthe forward predictor of x(n) and epi) = x(n) — (0) is referred to as the pth-order forward prediction error. In order to distinguish between the forward prediction error and the backward prediction error, which will be introduced shortly, we will use the following notation, ef) = x(n) + Yap x(n — 1) fi In addition, we ill use Eto denote the sum of the squares of ef (n), 2) Equation (6.1) allows us to write the pth-order forward prediction error in the z-domain as follows Ef(@) = Ap@XG@) (63) where . Ap(2) = 1+) ap(kyz* (64) Therefore, ¢-(n) may be generated by filtering x(n) with the FIR filter Ap(2) which is referred to asthe forward prediction error filter. This relationship between ¢7-(n) and x(n) is illustrated in Fig. 6.14 ‘As we saw in Section 4.4.3, the coefficients of the forward prediction error filter are the solution to the normal equations Ryay = pth oo) where R, is a He tian Toeplitz matrix of autocorrelations. As we have seen in Chapter 5, Note that we have added a subserpt p toe) indicate the order of the model.THE FIRLATICE FLTER __291 x(n to) x(n etn = Apt) ee o AR) oe. + @ © Figure 6.1 (a) The forward prediction error. ef), expressed as the output of the forward prediction error filter. Ay(c)and(b) the backward prediction error; (n), expressed asthe output (of the backsvard prediction error filter AN(2), these equations may be solved recursively using the Levinson-Durbin recursion. Thus, the coefficients ofthe (+ 1)storder prediction error filter djs), are related othe coefficients of the jth order prediction error filter through the model-order update equation given in Eg. (5:16) which is repeated below for convenience, aqui) = a + Ppa E+) 66) Expressing Eq. 6.6) in the z-domain, we may relate the prediction error filter Aj,.(2) to Ay(z) as follows Ayan (2) = Aj(2) + Djs [2-9 PAZCA/2")] (6.7) Multiplying both sides of Eq, (6.7) by X(z) and using Eq, (6.3) we obtain the following relationship between E¥, (2) and Ef (2), 68) ES) + where we have defined E> (z) as follows Ep @) =2IX@AVA/2) 69) If, a5 in Chapter 5, we introduce the notation AR(2) writen as follows -FAS(I/2"), then Ey (2) may be Ej (2) = AR @)X(@) (6.10) ‘Taking the inverse z-transform of both sides of Eq. (6.8), we obtain the time-domain recur- » 1D that relates e},(n) to ef (n) and e5 (n), A signal processing interpretation of the signal e; (n) may be derived from Eq. (6.9) by taking the inverse z-transform as follows, Gn) = x(n = j) + Draikyxtn — j +h) (6.12) ‘As we did with the forward prediction error, e7 (n) may be expressed as the difference between x(n — j) and what may be considered to be the estimate or prediction of x(n — j) that is formed by taking a linear combination of the j signal values x(n)..x(n — 1),.--5 x(n J+ Vie, eft) = x(n j) = 80 j)292 _vamice nurses where bh Lax -j+0 What is interesting about this interpretation is that if we minimize the sum of the squares of; (n). F=Lieor then we find thatthe coefficients that minimize E> are the same as those that minimize &* And therefore are found by solving the normal equations given in Eq. (6.5). Thus, ¢;(n) is referred toas the jth-order backward prediction error and AR(:) is known as the backward prediction error filter. The relationship between x(n) and ¢; (n) is illustrated in Fig. 6.16. Equation (6.11) provides a recursion for the (j + 1)st-order forward prediction error in terms ofthe jth-order forward and backward prediction errors A similar recursion may be derived for the backward prediction error as follows. Taking the complex conjugates of both sides of Eq. (6.6) and substituting j —i + 1 fori we have ai G E+) =a - E+ DFT aD (6.13) Expressing Ea. (6.13) in the z-domain we find that UMAR U2") = UMAR 2) + Pf AVC) (6.14) Multiplying both sides of Bg, (6.14) by X 2) and using the definitions for (2) and E (2) ings. (6.3) and (6.9), respectively, yields Eq @) 221) + EFO) (6.15) Finally, taking the inverse z-transform we obtain the desired recursion D+ Een) 6.16 Equations (6.11) and (6.16) represent a pair of coupled difference equations that correspond. to the two-port network shown in Fig. 6.22. With a cascade of two-port networks having. reflection coefficients I'j, we have the pth-order FIR lattice filter as shown in Fig. 6.2b. Note that since eS (m) = ep (n) = x(n) 6.17) then the two inputs to the first stage ofthe lattice are the same. Itis interesting to note that with a forward prediction error filter of the form @.18) the system function ofthe backward prediction error filter is ABQ) = 2-3/2) = [](<" -a7) (6.19) Therefore, the zer0s of A(z) are at the conjugate reciprocal locations of those of AR(2), ie. If there is zero atz = ay in Ap(z) then there isa 2ero at z = I/ar in A(z). Consequently,THEFRLATICE FLIER __ 293 go Sam) pon —of Lee eum Ta on jm sue) | fe sum (Sing sage ofa FIR atic fee ein. go gm hm em xo «im ean eo om (Oy pre FIR aie fier Figure 6.2 The FIR lattice filer since the forward prediction error filter that is obtained from the autocorrelation method is ‘minimum phase (assuming R,, > 0) then the backward prediction error filter is maximum ‘Phase. Another interesting relationship to note is that since E}@) = A,@X@ and Ez (2) = AR@X(2) then =) = APO pe “+ + FO= BOBO ER (2) = Hop(ZEf(2) (6.20) where 21) is an allpass filter. Thus, as illustrated in Fig. 6.3, the backward prediction error may be generated by filtering the forward prediction érror with an allpass filter? Note that when ‘Ay(z) is minimum phase, all of the poles of the allpass filler are inside the unit circle and all ofthe zeros are outside. Therefore, Hzp(z) is a stable and causal filter. ‘There area numberof advantages ofalatice filter over adirect form filter tha often make it a popular structure o use in signal processing applications. The first is the modularity of the filter. It is this modularity that allows one to increase or decrease the order of a lattice filter in a linear prediction or all-pole signal modeling application without having to recompute the reflection coefficients. By contrast, when the prediction error filter is implemented in direct form and ifthe filter order is increased or decreased then all ofthe filter coefficients, in general, will change. Lattice filters also have the advantage of being easy to determine whether or not the filter is minimum phase (all ofthe roots inside the unit circle). Specifically the filter will be minimum phase if and only if the reflection 2m allpas atice filter for computing () from «} (x) wil be derived in Seeton 6.1294 __vamce ures a Figure 6.3 Alipas fier for generating the backward prediction error e, (n) fram the for ward prediction error €3(n} coefficients are bounded by one in magnitude, Inthe case of IIR lattice filter, which will be discussed in the following section, this allows one to easily ensure that a filter is stable by simply imposing the constraint that [T;| < 1. This is particularly useful in an adaptive filtering environment in which the reflection coefficients are changing as a function of time (See Chapter 9). This minimum phase condition thus allows one to easily update the filter coefficients in time in such a Way that, foreach value of n, the roots of the filter remain inside the unit circle. Finally, compared to other filter structures, the lattice filter tends t0 be less sensitive to parameter quantization effects [13]. Therefore, a latice filter is frequently used ‘when it is important to minimize the parameter quantization effects or when a signal is to bbe coded in terms of the filter coefficients of the model that provides the best approximation to the signal. There are other forms of FIR lattice filter, including the one-multiplier and the normalized latice filters (7] Inthe following section, we derive the split lattice filter. 6.3. SPLIT LATTICE FILTER* In Section 5.4 of Chapter 5, the singular predictor polynomials were introduced as a way ‘of imposing symmetry in the prediction error filters A,(z), thereby allowing for an increase in the efficiency of the Levinson-Durbin recursion. These polynomials were defined to be S)@) = Aja) +21 AR ‘and they were shown to satisfy the 3-term recurrence Si = (1 +2 YG (2) ~ 271 GHz) We now show how this recurrence leads toa spit lattice filter structure [4]. Asin Section 5.4 ‘where we derived the split Levinson algorithm, here we restrict ourselves tothe case of real signals. Paralleling our development ofthe FIR lattice filter in the previous section, note that if we were to filter x(n) with §)(z), then the filter's response s(n) #.x(n) will be the forward prediction error that would result if, at the jth stage of the Levinson- Durbin recursion, I, = 1. Therefore, we will refer toe, (n) as the jth-order singular forward prediction error. Using the 3-term recurrence for 5j(z), it follows that ¢;(n) satisfies the difference equation en epan(ad) = (ln) + e(n — 1D) ~6)-10 - This relationship, shown in block diagram form in Fig. 64a, represents one stage of a spit lattice filter. The next stage, which takes the two inputs ¢;41(1) and e,(n~ 1) and produces the outputs ¢;42(n) and &) (1 — 1), is defined by the equation e420) = ejgal) + 6410 = 1) —jxrey(~ 1)SPUTLATTICE FUER’ 295 a(n) gin-1) g-a=1) seit) y (a) A network corresponding to the 3m recurence forthe singular predictor polynomials atin ee) eat) ete) stn) (©) A network that produces ¢42(7) from én) and e(n—1), eu em) gaat) eum, aun) “1 rt (6) A second-order split lattice module tht is formed by cascading the net works in (a) nd () x(n) os 2 et eta) (4) A 3rd-order split atc filter that produces the singular prediction eros e(n) and ey(n — 1) Figure 6.4 A split lattice filter,296 _Lamice rurers and is shown in block diagram form in Fig. 6.4. Note that this network is the mirror image of the previous one. Cascading these two sections together produces the second-order split lattice module shown in Fig. 6.4c. What is needed to complete the split lattice filter is an initial seetion to transform x(n) into o(n) and ex(n), and a final section to convert the singular prediction errors ¢p41(”) and ¢p(n) into the forward prediction error (n). The initial conditions in the split Levinson recursion are So(z) = 2 and S\(z) = 1+ 2~! (see p. 273), which require that we form the signals eo(n) = 2x(n) ex(n) = x(n) +x(0 — 1) ‘These signals may be generated as illustrated in Fig. 6.4d which shows a 3rd-order split lattice filter for producing the singular prediction errors e4(n) and es(n — 1). Note that @ ‘multiplication may be saved if we combine the factor of 2 with 8. Finally, to convert the singular prediction errors ¢p41(n) and ¢p(n) at the output of the split lattice filter into the forward prediction error ef (n), recall that Ap(z) is related to the singular predictor polynomials as follows Ap(e) = 2! Ape) + Sparle) — 271 + Pp) Sp(2) ‘Therefore, the pth-order forward prediction error may be generated from the singular pre- ion errors ép(n) and ép41(n) as follows, ep (a) = ep (nV) + Epil) — (LF Ppp = 1) (6.22) ‘The following example illustrates the procedure. Example 6.3.1 Split Lattice Filter Consider the third-order all-pole filter He! In order to implement this filter using a split lattice filter structure, itis necessary that we derive the coefficients 8, from the coefficients a(). This may be done in a straightforward ‘manner as follows. First, we convert the coefficients a(k) into reflection coefficients Tj Using the m-file atog .m we find r=[-1/4, -1/s, 2/3)" Second, using the relationship between I’, and 8; given in Eq, (5.149), ie. 8 =(-Tpd +E) y Po we have 4 5/4 & = (=P y+ Ty) = 9/10 $5 = (=P +P) = 4/15 Finally, with P's = 2, using the termination specified in Eq. (6.22) we have the spit lattice filter shown in Fig. 6.5MRLATIICE FUTERS __ 297 xm et 5 fa mans 0) st 3j10 " as e Figure 6.5.4 third-order split lattice filter 6.4 IIR LATTICE FILTERS In Section 6.2 we saw how the reflection coefficients Ij could be used to implement an FIR filter Ap(z) using a lattice filter structure. In this section we develop lattice filter structure for IR filters. We begin with a derivation of the “standard” two-multiplier all-pote lattice filter for implementing the system function 1/Ap(z). At the same time we show how this all-pole lattice may also be used as an allpass filter Hap(z) = A®(2)/Ap(2). We then look at some other structures including the Kelly-Lochbaum lattice filter, the normalized all-pole lattice, and the one-multiplerall-pole lattice. As with the FIR lattice filter, these structures enjoy the same advantages of modularity, simple tests for stability, and decreased sensitivity to parameter quantization effects. Finally, we will look at how the all-pole lattice filter may bbe modified to implement a general pole-zero filter of the form H(z) = By (2)/Ap(2). 6.4.1 Alk-pole Filter A pth-order FIR lattice filter is shown in Fig. 6.2b. The input tothe filter is the zeroth-order forward prediction error ej (n) and the outputs are the pth-order forward and backward prediction errors, ¢¥(n) and e; (n), respectively. The system function ofthis filter is E@) Ap) ER@) ‘The all-pole filter Ap@) arene (ey rie 1 Eee yeawe fon the other hand, would produce a response of ej (n) to the input e7 (n). Therefore, as illustrated in Fig. 6.6, whereas the FIR lattice filter builds up the prediction errors ¢}(n) and, ¢7 (n) from ¢@ (n), the all-pole lattice filter produces the lower order prediction errors, e (n) and ¢; (n), from e(n). Thus, in order to implement a pth-order all-pole lattice filter, we must determine how the lower-order prediction errors ¢!(n) and e; (n) may be generated frome? (n). Note that if we solve Eq. (6.11) for e} (n) we have, along with Eq, (6.16), the following pair of coupled difference equations FO) = ef) —Pynep@—1) (623) Gal) = ef DFT rere 624) ‘These two equations define the two-port network shown in Fig, 6.7a and represent a single stage ofan all-pole lattice filter. With a cascade of p such sections we have the pth-order298 Lame rurers go go fo) 1 gw of Ape - + pf) x (A pthceder al 0 Ber fo goneting em) (OVA p-rderall pole er for enering ef) from $0 from eft Figure 6.6 Generating the pth-order forward prediction error ef (n) from @(n) and vice versa a go stat» of ee etn Fat a go ee ee (Single sage ofan at oe tice am sm sm qo ao 5 Pet n eae am aon em (0) ponder pole atice iter ands pt-rder alas iterthatiefomed by xseading p malas fe form given in (2 Figure 6.7 A lattice filter for implementing all-pole and allpass filters. all-pole filter shown in Fig. 6.7b. Recall that since ej (n) = ej (n) = x(n) then the lattice filter is terminated by feeding e? (n) back into the lower input of the last module, While the system function relating ¢ (n) to ef (n) isthe all-pote filter FE) _ ot EX@ ~ 4,@) note that, as we saw in Section 6.2, the system function relating ¢*(n) toe; (n)is the allpass filter (6.25) HG) © _ ASAI") _ ARO) E@* ~Ap@ ~ Ap@ Therefore, this structure may also be used to implement allpass systems (6.26) Example 6.4.1 All-pole Lattice Filter Let us implement the third-order all-pole filter 0328 T1081 + 0642-7 — 0.5122 H@mmamice FureRs 299 (oy Analg atic fer Figure 6.8 All-pole and allpass latice iter implementations using a lattice filter structure. With the step-down recursion we find that the reflection coefficients corresponding to the filter coefficients a(k) are P= [-0.4878, 0.3123, -0.512]” ‘Thus, the all-pole lattice implementation of H (z) is asillustrated in Fig, 6.8a. This structure ‘may also be used to implement the allpass filter as illustrated in Fig. 6.86. 6.4.2 Other All-pole Lattice Structures* Beginning with he all-pole lattice filter in Fig. 6.7, we may develop some addtional al-pole latice filter structures. One such structure may be derived from Eqs. (6.23) and (6.24) as follows. Note that Eg, (6.23) expresses the forward predietion error ¢(n) in terms ofthe two latice filter inputs e7,(n) and e; (a — 1) whereas Bq, (6.24) expresses the backward prediction error e>, (n) in terms of the output 2 (n) and the input e; (n ~ 1). However, we may also express ¢;,,(n) in terms of the two inputs e,,(n) and e; (n ~ 1) by substituting Eq, (6.23) for ef (n) into Eq, (6.24) as follows elm) = FMD +P yet n) =F -D+T haar) — Hiner - d] = [Ie inate D4 tpi 627) Equation (6.27) along with (6,23) define the two-port network shown in Fig, 6.9. Note that, unlike the all-pole lattice in Fig. 6.7, this implementation requires three multiplies per stage.900 Lame runes Although nota particularly useful structure, through a sequence of flowgraph manipulations described below we may derive some interesting and important all-pole lattice structures ‘The first is the Kelly-Lochbaum model that has been used in speech modeling as well as ‘models for the propagation of waves through a layered media [11,12]. Next, we derive the normalized all-pole lattice filter which, although requiring four multiplications per section, hhas an advantage in terms of computational accuracy using fixed-point arithmetic [10] Finally, we derive the one-multiplier lattice which is the most efficient structure when ‘measured in terms of the number of multiplications per section. Beginning with the three-multiplier lattice module in Fig. 6. suppose we scale the ‘input tothe upper branch by @ constant 1/a+1 and scale the outputs of the branch point by the inverse of this scale factor, aj,1. This produces the equivalent two-port network shown, in Fig. 6.10a, Next, if we divide the two inputs to the adder in the lower branch by ay and multiply the output of the adder by a1 then We have the equivalent network shown in Fig, 6.10b. Now, recall that both of the outputs ¢}(n) and ¢;, () are formed by taking a linear combination of the two inputs e*, (n) and e; (n). Therefore, if we scale the two inputs by a1 and rescale the outputs by I/a» then the network will remain the same. The net effect, as shown in Fig. 6.10c, is to bring the two scale factors on the left of the two-port network over to the right side. Forming a cascade of these modules, moving all ‘of the scale factors to the end of the network, results in the equivalent all-pole lattice filter shown in Fig. 6.10d where Since ef (n) = 5 (n) then the network may be terminated by feeding the output ofthe final stage of the lattice back into the input. With the scale factors of a and 1/ar canceling each ‘other atthe input to the last module, we are then left with only a scale factor of Ir at the ‘output that is applied to ¢¢ (n). This leads to the structure shown in Fig. 6.11 consisting of a cascade of modified all-pole lattice modules. Having dropped the final scale factor |/a from the network, the system function between the input x(n) and the output y(n) is y@) _ Mer Az) = (6,28) = Fe = “As 68) andthe system function relating x(n) t0 wn) i the allpas iter 2) An we AR@ 629 X@) ~ A,@ eu so Tha Ty Ga) go =i? Figure 6.9 A chree multiplier all-pole lattice filter module.mame nurers 30’ Vays aut ef) — +» + sm auth y 4 Py a) ea) 1a Iya? (4) Scaling the input in the upper branch of the allpole latice module by 1/1 and rescaling the outputs ofthe fist branch point by a1 Vayst ast fat —e ty Thay 4 Tint Gy) ——_o0—__+—_+ yen = ID P)/ ay (@) Scaling the inputs tothe adder inthe lower branch by I/a.., and rescaling the ourpu by 4.1 Ot Tay fa) >> 9 + et) Th | A Ty Sue) ——_4—_—_l| em : Cai ae ajaet (©) Moving the scale factors a4; and 1a. on the left wo the right ise gin am tem y Tet rn eo) ea) am ae) (4) A cascade of latice filter modules of the form given in (c) with all of the scale factors moved to the final stage Figure 6.10 A sequence of latce filter lowgraph manipulations that lead 10 equivalent al-pole lattice filter structures302 __vamce nurses fae Cx) gail) gm re > i : Iya P/ayt (@) The modified al-pte ace module. xo) vn) t Pt n wo) (©) A cascade of modified all-poteltice modules. Figure 6.11 A modified all-pole lance filter built-up as a cascade of modified all-pole lattice ‘modules shown in (a) Given the all-pole module in Fig. 6.11, we may now derive three important all-pole lattice filter structures. In each case, we assume that the reflection coefficients are real- valued. The first is based on the factorization (I~ F,,) = (1+ Pj4i)(0 — P41) and is obtained by setting ay S14 Tia 630) This results in the lattice filter module shown in Fig. 6.12a which is known as the Kelly- LLochbaum model. Tis structure has been used as a model for speech production in which the vocal tract is approximated as an interconnection of cylindrical acoustic tubes of equal length and differing cross-sectional areas where the reflection coefficients I correspond to the amount of acoustic energy that is reflected atthe interface between two adjacent tubes having a differing acoustic impedance [3,6]. The difference equations for this structure are ef) = [1+ Fyaletaen — hep —) a) = [1 —PjpiJep @ ~ + Pipef a) (631) ‘and the system function is T.a+n) Ape ‘This structure requires four multiplications, two additions, and one delay for each stage The next all-pole lattice structure is derived by introducing a new set of variables, 41, ‘where 61 is related tothe reflection coefficients I 41 a8 follows. 841 = Sin“ Pye (633) HH, (6.32)mmamnce rurers 303 Le Di eu) ——— fm) Tyas 4) —+~—_+_+_1 em (@)Kelly-Lochbaum late fer. cos 6j1 = ea) —>— ef sin da 4 = sins e440) —+——_+— ea) : cos +1 d (©) Normalized lace fier. Fey fm eau) em (6) One-moltiplier atice fer Figure 6.12 Three all-pole luce filter structures with real-valued filter coefcients. Fach of these lattice modules is equivalent 10 within a scale factor Assuming thatthe filter is stable so that |T'j4:| < 1, this transformation is well-defined, With Tyst = sind 634) 1) = 605? Ban (635) let ws define the sale factor 1 by aj41 = 08645 636)304 vamce rurers This leads to the normalized all-pole lattice filter shown in Fig. 6.12b. The difference equations for this network are ef (n) = [00s A+, Jez, ,(m) — [sin 8,41 Jes ( — 1) ejqulnt) = [005 6) 1]es On = 1) + [sine si Jef, (m) (6.37) and the system function is {Tome HQ (6.38) Ap(z) In this implementation, there are four multiplications, two additions, and one delay for each stage as in the Kelly-Lochbaum structure. However, what makes this structure intresting is the fact that it requires only one complex multiplication, Specifically, note that if we form the complex signal, (1) + je (n—1) from the wo input signals, andthe complex signal 6} (2) + ej4(n) from the two Ourput signals, then these wo signals ae related as follows? Pn) + jer eit [etm tien v] (6.39) In particular, note that Eq. (6.37) corresponds to the real and imaginary parts of both sides of {his equation. This relationship also demonstrates thatthe complex signals are normalized in the sense that their magnitudes do not change from one section tothe next, lePon + erase] = lets) + jer m— | (6-40) “Thus, this structure is referred to as the normalized late. One of the advantages of the normalized latice filter is its robustness to round-off errors when using fixed-point arithmetic [10]. ‘The last structure is the one-multplir lattice which, as its name implies, requires ony one real multiplication per module [5]. This structure may be derived by ve-wrting the Kelly-Lochbaum equations as follows, ef) = Pyrifepim —eF@— D] + ef) (41) Fails) = Vysifep. sy — ep (n— D] + ep @— 1) (6.42) Since the reflection coefficient [',,1 multiplies the same signal in both equations, forming the sum ef, (n)—eF (n— 1) prior to multiplying by 1 leads toa structure requiring only cone multiplication per stage. The one-multiplcr lice structure is shown in Fig. 6.12c. In addition to one multiply, this structure requires three additions and one delay. 6.4.3 Lattice Filters having Poles and Zeros In the previous section, several different all-poe lattice structures were derived from the basic all-pole lattice given in Fig. 6.7. From this structure we may also develop a lattice filter that realizes a general rational system function of the form Bg(z) _ by(O) + bo(1)z7! +--+ + byl) H@ Te a(et +o agp) 6.43) Note that we have changed the subscript indices on the forward and backward prediction etors fom j oF ia ‘order to avoid confusion with the imaginary number J = J/=1am LATTICE FUTERS 305 bg (Rw(n — k) is the output of the FIR filter By(z), Since the zeros of H(z) are introduced by taking linear combination of delayed outputs ofthe all-poe filter, withthe all-poe lattice filter in Fig. 6.7, rational system function may similarly be realized with a linear combination of the signals ef (n — k) fork = 0,1, ..., q. However, a more efficient way isto take a linear combination of the backward prediction errors ¢;(n) yin) = Dey Ade; (n) (6.44) 4s shown in Fig. 6.13 (in this figure we assume that q = p). To show that ths filter does, ‘in fact, have a rational system function, we begin by expressing Eq, (6.44) in the z-domain as follows YO = Nee (645) with it follows that Ay A, ve = ra) “Therefore, the system function relating the input £'+(z) to the output ¥ (c) is Vaart frag): ‘which has p poles and q zeros. Note that the zeros of H(z) are the roots of the polynomial HC (6.46)306 Larice rurers B= GAF@) 647) which is a function of not only the tap weights cq(j) but also the reflection coefficients P To see how the coefficients by(/) ae related tothe coefficients cj), we substitute the following expression for A®(), J ARG) = AG(/e*) = Yo aziz"! {nto Eq. (6.47) as follows By(2) = eg) Vapimper! (6.48) With the substitution k = j — m, Eq. (6.48) becomes LV Veaary - oe jab eb Interchanging the order of the summations and making the appropriate changes in the indexing we have 2-3 [5 ea -»]e ado (6.49) lia fad Finally, equating powers of z on both sides of Eq. (6.50) gives the desired expression balk) = Yea Dapj —®) (6.50) i ‘This equation shows how the coefficients b,(k) may be found from the coefficients ¢y(k) and T'. Specifically, from the reflection coefficients T of the all-pole filter we may use the step-up recursion to find the all-pole coefficients for model orders j = 0,1, ....4 With a,() along with the coefficients c, (k), Eq. (6.50) may then be used to determine the ‘coefficients by (E). Equation (6.50) may also be used to find the coefficients cy (K) required toimplement a given rational system function with a numerator B, (2). Todo this, Eq. (6.50) is rewritten as follows rik =e + DO eaDaf sy Sh (here we have used the fact that 2(0) = 1). Thus the eoeicints c(k) may be computed recursively as follows glk) = byl) ~ YO eg Daz —& (6.52) for k =~ 1,q-~2,-..0. This recursion is initialized by setting 4g) = boa) (653) ‘The following example illustrates the procedure.LATTICE METHODS FOR ALL-POLE SIGNALMODEUNG 307. x) Figure 6.14 Lice fier waza ofa tv ore lon-pss li fier, Example 6.4.2 Lattice Filter Implementation of a Low-Pass Elliptic Filter ‘A third-order low-pass elliptic filter with a cutoff frequency of w = 0.51 has a system function given by 0.2759 + 0.51212"! +0.51212~? +0.27592~ T= 0.001021 + 0.654627 — 0.077527 To implement this filter using a lattice filter structure, we first transform the denominator coefficients into reflection coefficients. Using the step-down recursion we find r H@ = [0.0302, 0.6584, —0.0775]" with the second-order coefficients given by ay = [1, 0.0501, 0.6584)" and the first-order coefficients equal to a, =[1, 0.0302)" Next, we find the coefficients ¢3(k) that correspond to the given numerator using the recursion in Bq.(6.52). Beginning with 6303) = bs(3) = 0.2759 we then find 642) = by(2) ~ e4(3)as(1) = 0.5124 ex(1) = by(1) = ex(2)aa(1) ~ e9(3)as(2) = 0.3058 6x(0) = by(0) — es(1)au(1) ~ €9(2)a2(2) = e9(3)ax(3) = ‘This leads to the lattice filter implementation illustrated in Fig. 6.14. 6.5 LATTICE METHODS FOR ALL-POLE SIGNAL MODELING In Chapter 4 we developed several different approaches for finding an all-pole model for a signal x(n). In each case, we first defined an error that Was to be minimized and then solved the minimization problem for the optimum set of transversal filter coefficients a(K). Since the lattice filter provides an alternate parameterization ofthe all-pole filter, ie. in terms of its reflection coefficients, we may also consider formulating the al-pole signal modeling308 __Lamce rurers problemas one of finding the reflection coefficients that minimize some erro. In this section ‘we look at several such lattice methods for signal modeling including the forward covariance method, the backward covariance method, Burg's method, and the modified covariance ‘method. Except for the modified covariance method, each of these methods is a sequential ‘optimization procedure. That is to say, given an error that is to be minimized, such as the sum of the squares ofthe forward prediction error, the first reflection coefficient, I, is found that minimizes this error. Then, with [ fixed, the optimum value of Fis determined, Thus, in general, with Py, ...,T)—1 held constant, the value of P is found that minimizes some error, We begin, inthe following section, with the forward covariance method. 6.5.1 The Forward Covariance Method Let x(n) form = 0,1,...,.N be a signal, either real or complex, that is to be modeled as the unit sample response of an all-pole filter of order p. For an all-pole lattice filter, it is necessary to determine the reflection coefficients which, for reasons soon to become apparent, we will denote by [7-In the forward covariance method the reflection coefficients are computed sequentially with te jth reflection coefficient, determined by minimizing the covariance-type error = Lilet? (6.54) a Note that since &}' is a function only of the signal values x(n) over the interval [0, N], as with the covariance method described in Section 4.6.2, itis not necessary to window the signal or make any assumptions about the values of x(7) outside the given interval To begin the sequential minimization of &;" using the forward covariance method, we ‘must first find the reflection coefficient that minimizes the first-order error Det? (655) ‘This minimization may be performed by setting the derivative of €;' with respect to (I}?)* equal to zer0 as follows yy Oe ae arn -he Oar lefmy =o (6.56) Since ef) = eft) + PFeg@—1) 657) then afermy . Sar 7 koe UT and Eq. (6.56) becomes Veronese vt (638)

M Hayes Statistical Digital Signal Proc Part 2

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

M Hayes Statistical Digital Signal Proc Part 2

Uploaded by

Copyright:

Available Formats

You might also like