Professional Documents
Culture Documents
2
Adders circuits….
Ripple-carry
Mirror
Carry by-pass
Linear carry select
Carry Looked Ahead
Transmission gate based
Manchester carry chain
3
Data paths in Digital Processor Architectures
Components of digital processor data paths, memory,
control and input/output blocks.
4
The design of the arithmetic operators is the topic
of our discussion.
5
6
Data paths are often arranged in a bit-sliced
organization as shown in Figure.
8
The Binary Adder: Definitions
A & B are adder inputs
Ci carry input
S sum
C0 carry output
9
10
11
(1)
(2)
(3)
12
(4)
13
G & P are only functions of A & B are independent
upon Ci.
14
For some i/p signals, no rippling effect occurs at all,
while for others, the carry ripples all the way from its
LSB to MSB.
The propagation delay of such a structure (critical
path) is defined as the worst case delay over all
possible input patterns.
In case of ripple carry adder worst case delay occurs
when a carry generated at LSB position propagates
all the way to MSB position.
This carry will be finally consumed in the last
stage to produce the sum.
15
The delay is then proportional to the number of
bits in input words N is given by:
(4)
16
Two important conclusions can be drawn
from Eq (4):
1. The propagation delay of ripple carry adder is
linearly proportional to N.
17
Inverting property of Full Adder
Statement: “Inverting all inputs to a full
adder results in inverted values for all
outputs”
It is expressed in equations as follows:
(5)
18
This property is useful in optimizing the speed of
ripple-carry adder.
The reorganized eq. set here is:
(6)
19
C0
20
The corresponding adder using
complementary static CMOS requires
28 transistors, consuming large area
and also it is slow.
Tall PMOS transistor stacks in both
carry & sum generation circuits.
Intrinsic load capacitance of C0 is
large
(2 diffusion, 6 gate capacitance &
wiring capacitances)
Signal propagates through two
inverting stages in carry generation
circuit.
Sum generation ckt needs one extra
logic stage.
21
Mirror Adder Design
Improved adder ckt ….
Operation based on Eq. (6)
Carry generation ckt is imp. To analyzed
1. Carry-inverting gate is eliminated
2. PUN & PDN are not duals instead they form
the cleaver implementation of P/G/D functions.
This results in considerable reduction in both area
and delay.
22
6
12 12 4 4
4 4 6
12 4 4 6
6 2 2 3
6 2
6 2 2 3
2
3
23
Observations from mirror adder design
1. This FA cell requires only 24 transistors
2. The NMOS & PMOS chains are completely
symmetrical.
3. Maximum of two series transistors in carry-
generation circuitry. (tall transistor stacking in
PUN is reduced)
4. The transistors connected to Ci are placed
closest to o/p of the gate.
5. Only transistors in carry stage have to be
optimized for speed keeping transistors in sum
stage for minimum size (W/L)
24
5. Transistor sizing for minimum delay:
25
Carry Looked-ahead Adders
Algorithm based on prediction of carry.
Benefits of CLA circuit:
1. Speed improvement due to pre-computation of carry
before it is actually calculated at all the stages of adder
circuits.
26
27
CLA LOGIC
28
Carry Looked-ahead Adders
Implementation issues in CLA circuit:
1. Irregular gate structures
2. Needs extra logic (means extra H/W) for speed
improvement.
3. Restrictions on Fan-in parameter….since the
designer can’t go beyond adding too many i/p’s to a
particular gate.
4. Non-modularity of CLA structure
5. Hence, layout will not be symmetric.
29
Comparison of CLA with conventional FA
representation
In conventional FA, we
get regular structure i.e
the design follows the
concept of Modularity
Layout will be
symmetric
30
Transmission Gate XOR
B
M2
A
A
F
M1 M3/M4
B
31
Another ex of effective
use of txion gate is the
popular XOR ckt.
The complete
implementation requires
only 6 transistors
(including inverters to
generate B’) compared
with the 12 transistors
required for its CMOS
implementation
32
To understand the operation of the ckt we need to only analyze B=0 & B=1 cases
separately.
For B=1, M1 & M2 acts as inverter, M3 & M4 OFF
----o/p function F1= A’B
For B=0, M1 & M2 are disabled , M3 & M4 is operational
----o/p function F2=AB’
Overall function F= F1+F2
F=A’B+AB’
33
Transmission-Gate-Based Adder
A FA implementation using transmission gate is
shown in Fig. below and uses 24 transistors.
34
Based on propagate signal, o/p carry is either set
to i/p carry or either one of the i/ps A or B.
35
36
Benefits of Txion gate adder ckt
Compact area
Similar delays in both sum and carry generation
circuits
37
Fig: Manchester carry gates (a) Static, using P, G & K
38 (b) dynamic using only P & G signals
A Manchester carry chain adder uses a cascade of
pass transistors to implement carry chain.
39
The dynamic implementation
(Fig b above) makes this ckt
more simple.
40
41
During pre-charge (phi=0), all intermediate nodes
of the pass transistor carry chain are precharges
to VDD.
42
43
Carry by-pass adders OR Carry-skip adders
44
The Carry-Bypass Adder
Consider 4-bit adder block of Fig.(a).
45
In other words,
If (P0P1P2P3=1) then C0,3=Ci.0
Else either DELETE or GENERATE occurred
This speeds up the adder operation.
When BP=P0P1P2P3=1, the incoming carry is
forwarded immediately to the next block through
the by-pass transistor Mb- hence name carry-
bypass adder or carry-skip adder.
If this is not the case the carry is obtained by way
of the normal route.
46
47
48
Carry Bypass in Manchester Carry-Chain
Adder
Fig shows the possible carry propagation paths
when FA circuit is implemented in Manchester-
carry style.
49
Carry By-pass in Manchester carry chain
adder
Fig shows the possible carry-
propagation paths when full-
adder is implemented in
Manchester-carry style.
This fig shows how the by-
pass speeds up the addition:
The carry propagates either
thru. By-Pass path, or a carry
is generated somewhere in
the chain.
In both ths cases delay is
smaller than normal ripple
conf.
50
Area overhead incurred by adding by-pass
transistor is small typically between 10 % to 20 %.
51
Worst case delay in carry-skip adders
To compute the delay of N-bit adder.
52
The approximate total propagation delay time is given
by:
55
Carry Select Adder
In ripple adder, every FA cell has to wait for the
incoming carry before outgoing carry can be
generated.
The approach here is to anticipate both possible
values of carry i/p and evaluate the result
(i.e. sum) for both possibilities in advance.
Once the real value of incoming carry is known, the
correct result is easily selected with a simple MUX
stage.
This idea is called carry-select adder and is shown
in Fig below.
56
57 Fig: Four bit carry select module (Topology)
Instead of waiting on the arrival of o/p carry of
bit k-1, both 0 and 1 possibilities are analyzed.
59
60
Class Assignment :-
1. Comparison of CMOS, NMOS & Transmission gate logic
circuits.
2. Comparative analysis of adder circuits wrt Speed, power
and area parameters (prepare the tabular comparative
analysis )
61
SHIFTERS
Combinational vs sequential circuits
63
SHIFT REGISTERS
64
One bit shifter
Right nop Left
Ai Bi
Ai-1 Bi-1
Bit-Slice i
...
66
Fig shows a basic 1-bit shifter.
67
Thus, a more structured approach is needed for
shifter designs.
1. Barrel shifter 2 Logarithmic shifter
68
Barrel shifter
Definition
Structure
Operation
69
It is a type of shifter which plays important role in
data shifting and rotation in a single clock cycle.
70
But for some specific application there may be the
need to shift several bits of information in one
clock cycle and also to vary the length of the clock
cycle.
71
Structure of Barrel shifter
It consists of an array of transistors.
72
4×4 barrel shifter
73
Barrel Shifter: Area dominated by wiring
A3
B3
Sh1
A2
B2
Sh3
A0
B0
74
4×4 Barrel Shifter Layout
A3
A2
A1
A0
76
An important property of this ckt is the the layout
is dominated by the no. wires running through the
cell and not the transistors.
77
Operation
Barrel shifter can be designed to perform foll 6-
types of operations:
78
Designing of a barrel shifter is almost symmetrical
and can be done using repetitive combinational
logic blocks.
It can be designed using MUX.
Trees of 2:1, 4:1, 8:1 MUX are used for this
purpose.
79
80
Logarithmic Shifter
While the Barrel shifter implements the whole shifter
as a single array of pass transistors, the Logarithmic
shifter uses a staged approach.
81
An ex of shifter with a max shift value of seven bits is shown
in Fig below
For ex to shift over five bits, the first stage is set to shift
mode, second to pass mode and third again to shift mode.
82
83
Furthermore, the series connection of pass transistors
slows the shifter down for larger shift values.
We conclude that, a barrel shifter is appropriate for
smaller shifters.
For larger shift value, logarithmic shifter becomes
more effective, in terms of both area and speed.
Further, the logarithmic shifter is easily parameterised,
allowing for automatic generation.
84