Professional Documents
Culture Documents
m-81
$3.00 0 1994 I E E
0-7803-1775-0/94
RLS algorithms. Hence, we use the deviation in the esti- due to finite-precision quantization, and find the average
mation error due to finite-precision as the figure of merit. value of the same. The errors are propagated down the
Thus, we find expressions for the squared average devia- systolic array for all three algorithms and the deviation in
tion in the estimation error for the three algorithms for the estimation error is found. The intermediate results are
fixed-point and floating-point arithmetic. not shown here. The expected values of the square of the
deviation in the estimation error for STAR-RLS, PSTAR-
RLS and QRD-RLS are shown in that order in Table B
I1 BACKGROUND below.
TABLE B
The QRD-RLS, STAR-RLS and PSTAR-RLS algo-
rithms are described in [3],[4],[6]. To obtain simplify
the finite-precision performance expressions we exploit the
steady-state properties of the algorithms. We find expres-
sions for the steady-state properties and these are shown
in Table A.
TABLE A
111-82
and this greatly simplifies the comparisons. First we com- inputs are quantized to the given format (fixed-point or
pare the floating-point and fixed-point implementations for floating-point) and the wordlength. The actual operation
the same algorithm. It should be kept in mind that the is carried out and the result is then quantized. In the case
variance U,’ is different for the two implementations. For of floating-point, each number is represented as a vector of
STAR-RLS and QRD-RLS, we find that the fixed-point length 2, one for the exponent and the other for mantissa.
implementation is always better than floating point. The The handling of the mantissa and exponent is done aa in
same is true for PSTAR-RLS if hardware. The rounding scheme used is the round-to-the-
nearest scheme.
The simulations are run on a 5000 long data stream and
the average results are computed by averaging over the last
With p = 5 and A = .98, this condition is equivalent to 1250 samples. This ensures that the results are calculated
M 2 4. after steady-state has been reached. The infinite precision
Next, we compare the different algorithms with the same results are also calculated alongside, and the deviation is
arithmetic being used. In the case of fixed-point arithmetic determined. The plots show the deviation in estimation
it can be shown that PSTAR-RLS always performs better error as a function of the number of bits for the fractional
than STAR-RLS. The same would be true for floating- part k. The theoretical expressions are plotted with dashed
point implementation if line while the simulation results are shown by ”+”. Fig.1
shows the results for the fixed-point case while Fig.2 shows
the results for the floating-point case.
References
The above would be satisfied if p is not too small (p > 3).
Thus, in general PSTAR-RLS has better numerical prop- [I] W. M. Gentleman and H. T. Kung, “Matrix triangu-
larization by systolic arrays,” Proc. SPIE Real Time
erties than the STAR-RLS algorithms. This would not be Signal Processing IV, vol. 298, pp. 298-363, 1981.
expected since more computations are involved in PSTAR- [2] J. G. McWhirter, “Recursive least-squares minimiza-
RLS. But it should be noted that the computation of tan- tion using a systolic array,” Proc. SPIE Real Time
gent in PSTAR-RLS is simpler, and the effect of forgetting Signal Processing IV, vol. 431, pp. 105-li2, 1983.
factor is to multiply the delayed version of cell content by [3] S. Haykin, Adaptive Filter Theory. Englewood Cliffs,
Xp instead of A. This factor controls the finite-precision
- NJ: Prentice Hall, 1986.
deviation and thus helps PSTAR-RLS. Thus, the extra ad- [4] K. J. Raghunath and K. K. Parhi, “High-speed RLS
dition operations needed in PSTAR-RLS get compensated using scaled tangent rotations (STAR),” Proc. of
due to these reasons. IEEE Intl. Symp. on Circuits and Systems (ISCAS-
Next, we compare STAR-RLS and QRD-RLS. In the 93), pp. 1959-1962, May 1993.
case of fixed-point it can be shown that STAR-RLS per- [5] K. K. Parhi and D. G. Messeeschmitt, “Pipeline inter-
forms better than QRD-RLS if leaving and parallelism in recusive digital filters- part
I: Pipelinin usin scattered look-ahead and decom-
X ( l - 1.5XZM-’) position,” fEEE %“actions on Acoustics, S ech,
M > - (5.3) and Signal Processing, pp. 1099-1117, July 1 9 8 r
2.5(1 -X)(2XzM-’ - 1 ) ’
[6] K. J. Raghunath and K. K. Parhi, “Pipelined imple-
The right-hand-side (RHS) is positive only when .5 < mentation of hi h-speed STAR-RLS adaptive filters,”
Proc.. SPIE, Akanced Si nal Processin A Lgorithms,
xZM-1
- < .67. The condition (5.3) will hence be satis- Architectures, and Implkentatrons b, vol. 2027,
fied for most of time except when X2M-’ is close to and 1993.
greater than .5. In the case of floating-point arithmetic, [7] H. Leung and S. Haykin, “Stability of recursive QRD
STAR-RLS has a better performance than QRD-RLS if LS algorithms ;sin finite precision systolic array
.25 < Since is close to 1, this condition would implementation IBEE Transactions on Acoustics,
S ech, and Signal Processing, pp. 760-763, May
be satisfied for most applications. lK9.
[8] S. H. Ardalan and S. T. Alexander, “Fixed-point er-
ror analysis of the exponential windowed RLS algo-
VI SIMULATION RESULTS rithm for time-varying systems,” IEEE Transactions
on Acoustics, Speech, and Signal Processing, vol. 35,
pp. 770-783, June 1987.
Next, we show results of the simulations conducted to
verify the theoretical expressions developed in this paper. [9] H. Dedieu and M. Hasler, “Error propagation in re-
cursive QRD LS filter,” Proc. of IEEE Intl. Conf. on
We use the 11 tap equalizer example of [3], which hass also Acoustics, Speech and Signal Processing (ICASSP),
been used in [4] and [6]. pp. 1841-1844, 1991.
To simulate the finite-precision behavior of the algo- [lo] K. J. R. Liu, K. Yao, and C. T. Chiu, “Dynamic
rithms we carry out the computations as would be done range, stabilit and fault-tolerant capability of finite-
in an actual circuit. We have developed subroutines in C precision RLgsystolic array based on Givens rota-
language to mimic the operations of finite-precision arith- tion,” IEEE Transactions on Circuits and Systems,
pp. 625-636, June 1991.
metic. For every operation of add, multiply, etc., the
III-83
(4 STAR-RLS
0-
+
Q -20:'. t
*. +
40.
"*
'4 +.
1
w -80
+.
+-+Y
Y
Y
,g -la,. -+-t
**. t
3
d-120.
+.+.+
'
-140'
(d) PSTAR-RLS@=lo)
Or
-140' -1401 .
10 I5 20 10 15 20
Numbcrofbiu fobadimd pltol) Numtu dbiu for fnstionl pn(k)
Or
111-84