Professional Documents
Culture Documents
For Radix- 4, Can Be Used to Generate 3a No Booths Slight Delay Penalty from CSA 3 Gates
For Radix-2, Better Use in Keeping Cumulative Product in Redundant Form for First k -1 Cycles Then Use a CPA in the Last Cycle
Each Cycle Faster Due to CSA Sign of sa, s2a Incorporated Directly in Recoder/Selector Instead of Add/Subtract Signal Generation
Booth Recoder/Selector
Circuitry Shown on Following Slide Negative Multiples a, -2a in 2s Complement a, 2a Aligned at Right with Position i Must be Padded with i Zeros to Right Bitwise Complement (when a, -2a Needed) Converts zeros to ones Followed by LSb add of 1 Converts Back to zeros Causes a Carry-in of 1 into Position i Can Ignore Positions 0 through i -1 (in neg. multiples) Insert carry-in directly (dot)
Radices > 4
Radix-8 (3 bits at a time-k/3 multiples) Requires 3-Level CSA Tree
Might as Well Use Radix-16 (4 bits at a time) Still 3-level tree with one more CSA
MUXes Can Be Replaced with Booth Recoder/Selector Circuits in Higher Radix Multipliers Can Continue to Increase Radix (256-8bits) Leading to Wider Trees Tradeoff is Speed Versus Area
Radix-16 Multiplication
Classification of Multipliers
Tree Results in Product in Redundant Form (2 Values Carry-Store for Example) Final Product Formed With Converter (Fast CPA for Exmaple)
If Redundant Result in Carry-Save Form, Converter is Just a CPA Could Use Other Redundant Adders Such as Signed Binary (4:2 Compressors) High Radix Multipliers Lead to Fewer Values to Accumulate
Sequential Design Fewer Cycles Parallel Design Smaller Tree Tradeoff Tree Complexity Versus Multiple Forming Circuit
4 v 4 Example
16 AND Gates Used to Form xiaj Terms (dots)
1 2 3 4 3 2 1
Wallace Example
1 2 3 4 3 2 1
Dadda Examples
1 2 3 4 3 2 1 1 2 3 4 3 2 1
Implementation Issues
Logarithmic Depth Tree Irregular Structure Design/Layout Difficult Various Length Signal Propagation Paths Hazards and Signal Skew Need Iterated Recursive Structures Automatic Synthesis and Layout Motivates Search for Alternative Reduction Tree Structures
Converter Stage is Fast CPA Can Also Use SBD With SBD the Converter Stage is a Fast Subtractor
Array Multipliers
Can Eliminate Top CSA With 0 Input Can Replace 0 With y to Compute ax+y
Array Multipliers
Tree is One-Sided Longest Delay is 4 CSA Plus k-bit CPA Slower than Wallace/Dadda Tree Regular Structure
short wires in horiz., vert., diag. positions simple, efficient layout easily pipelined (latches after each CSA row)
5 by 5 Array Multiplier