Professional Documents
Culture Documents
0002 10.1049@el.2014.3861
0002 10.1049@el.2014.3861
a3 = (a1 + c1 )/2
a1 = a + d
b1 = b c
b3 = (d1 + b1 )/2
c =b+c
c = (d1 b1 )/2
1
3
d1 = a d
d3 = (a1 c1 )/2
(1)
a1 = (a d)/2
a2 = 23/32a1 + 156/256b1
b2 = 23/32a1 3/4b1
b1 = (b + c)/2
c = c1
c =bc
1
2
d2 = d1
d1 = a + d
a3 = a2 + d2 /2
b3 = b2 c2 /2
c = b2 + c2 /2
3
d3 = d2 /2 a2
a2 = x11 a1 + x12 b1
a1 = (a d)/2
b = x21 a1 + x22 b1
b1 = (b + c)/2
step 1 2
c = x31 c1 + x32 d1
c1 = b c
2
d2 = x41 c1 + x42 d1
d1 = a + d
a3 = a2 + d2 /2
b = b2 c2 /2
step 2 3
(4)
c = b2 + c2 /2
3
d3 = d2 /2 a2
where xij are the corresponding coefcients of the second module, i
{1, 2, 3, 4}, j {1, 2}. Therefore, we can describe the algorithm as
three modes, and each mode has its coefcient matrix. The coefcient
matrices of Hadamard transform, 1D rotation and 2D rotation are
given as
1 0
55/64 3/8
0 1
3/8
1
Xh =
1 0 , X odd = 55/64 3/8 ,
0 1
3/8
1
(5)
23/32 156/256
3/4
23/32
Xoddodd =
1
0
0
1
For different modes, the sequence of data ow should be changed. In
step1, let a1 and d1 exchange their values, and so do b1 and c1 for 1D
rotation mode. In step2, let a2, b2, c2, d2, respectively, correspond to
d2, c2, a2, b2 for Hadamard mode and 1D rotation mode.
VLSI architecture: The computational attractiveness of the previous
operations is that they require additions, and/or right shifts and/or multiplications. However, these operators are implemented by three architectures. Here, we consider that these operations are in the same
category. According to the above algorithm, a unied VLSI architecture
with three stages of pipeline, denoted as dotted lines, is presented as
shown in Fig. 1. The simple multiplexers are applied to the exchanges
of data ow.
a
d
b
a2 = 55/64a1 + 3/8b1
a1 = a + d
b2 = b1 3/8b1
b1 = b c
z
c = 55/64c1 + 3/8d1
c = (b + c)/2
1
2
d2 = d1 3/8c1
d1 = (a d)/2
a3 = a2 /2 + c2
b3 = b2 /2 d2
c = c2 a2 /2
3
d3 = d2 b2 /2
(2)
(3)
x11
x
21
x 12
+ >>1
+
x22
x31
x
>>1
41
x 32
+ >>1
x42
>>1
adders, but prolong the critical path latency to 3Ta with the same stage of
pipeline.
+
a1
>>2
>>1
x11a1
+
>>2
>>1
+
0
>>1
x21a1