You are on page 1of 6

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/366240978

Design and Analysis of 8-Bit Multiplier for Low Power VLSI Applications

Conference Paper · October 2022


DOI: 10.1109/MysuruCon55714.2022.9972421

CITATION READS

1 138

3 authors, including:

Nagabushanam Munivenkatappa Parameshwara M C


Ramaiah Institute of Technology Vemana Institute of Technology
24 PUBLICATIONS 86 CITATIONS 29 PUBLICATIONS 90 CITATIONS

SEE PROFILE SEE PROFILE

All content following this page was uploaded by Nagabushanam Munivenkatappa on 16 August 2023.

The user has requested enhancement of the downloaded file.


2022 IEEE 2nd Mysore Sub Section International Conference (MysuruCon)

Design and Analysis of 8-Bit Multiplier for Low


Power VLSI Applications
Divyansh Jangalwa1 M. Nagabushanam2 M. C. Parameshwara3
Department of Electronics and Department of Electronics and Department of Electronics and
Communication Engineering Communication Engineering Communication Engineering
2022 IEEE 2nd Mysore Sub Section International Conference (MysuruCon) | 978-1-6654-9790-9/22/$31.00 ©2022 IEEE | DOI: 10.1109/MysuruCon55714.2022.9972421

M. S. Ramaiah Institute of Technology M. S. Ramaiah Institute of Technology Vemana Institute of Technology


Bangalore, India Bangalore, India Bangalore, India
djangalwa123@gmail.com nagabhushanam1971@msrit.edu pmcvit@gmail.com

Abstract—Multiplier is the most essential and primitive element performance of the system. In the literature, we can find
of multiply and accumulate (MAC) unit, and is typically found different types of adders and multiplier architectures such
in many digital signal processing (DSP) applications. as an array multiplier, Booth's multiplier, Booth-Wallace
Conventionally the multiplication operation is carried out Modified multiplier, and Wallace tree multiplier (WTM),
through repeated addition; hence multipliers use extensively the Dadda tree multiplier (DTM) [3-4][11-15].
full adders (FAs) for the addition process. The energy efficiency
of the multiplier is determined in terms of power delay In this study, we present the study of an 8-bit AMM based
consumed per bit operation. Thus, FA plays a vital role in on different FA architectures that are reported in the
determining the overall efficiency of the multiplier. In this current literature. The organization of the paper is as
paper, a study, design, and simulation of an 8-bit additive follows. Section I is Introduction. Section II presents the
multiply module (AMM) using different FA circuit architectures
is presented. Further, the designed AMM is compared against
literature survey; Section III presents the concept of
the traditional Wallace and Dadda tree multipliers in terms of AMM multiplication process. Section IV discusses the
design metrics. To perform comparison of all the multipliers, design and results of AMM and its counterpart
they are described using RTL codes, simulated and verified architectures. Finally, Section V concludes this paper.
using Cadences’ EDA tools. To extract power and area metrics,
the verified designs are further synthesized under common II. LITERATURE SURVEY
constraints using Cadences’ generic 180 nm technology file.
Significant efforts are being made around the world to
Keywords—Additive Multiply Module, Dadda Tree Multiplier, develop multipliers for low-power VLSI applications.
Wallace Tree Multiplier, full adder architecture. This section provides an overview of the recently
proposed multipliers aimed at improving the design of
I. INTRODUCTION low-power multipliers. A thorough discussion of previous
The FA and multiplier circuits (FMCs) being the work is included in the literature survey. The referred
primitive the blocks of multiply and accumulate circuit works provide an insight limitation and propose a novel-
(MAC) determines the overall performance of the DSP based multiplier to further reduce power for VLSI
system [1-3]. These FMCs consume a significant amount Applications, the following are the most recent works
of power in any DSP-based system and hence it requires mentioned in the overview.
optimizing the performance of FMCs for targeted P. Choppala et al.[2] proposed a multiplier design that is
applications [4]. In today's world of rapidly expanding area efficient, low power, and Full Swing Hybrid
soft computing and digital technology, there is a huge Multipliers. This research paper focuses on Gate Diffusion
demand for power-efficient hand-held gadgets. These Input based hybrid FA of a 1-bit with only 14 transistors,
gadgets are essentially battery-operated and need to within the standard array and WT multipliers. This work
process computationally intensive operations. These implements a hybrid adder and Wallace Tree multipliers
highly intensive operations consume a lot of power and within the array; thus the names H-A and H-WT are kept.
hence battery may soon dry out. Therefore, the power The use of WT raises power consumption and is
consumption of a device is an important consideration [5- compensated by the use of Gate Diffusion Input
10] that the designers need to give immediate attention. technology, which also decreases the area. But as a
Thus, the power optimization task constrains the limitation that the GDI structures fail to achieve full
designers to design and develop energy-efficient gadgets. swing, it can be overcome by the use of H-A & H-WT
multipliers. Ultimately partial products are added by CSA.
Most portable electronic gadgets, such as cellular Tanner EDA tool was used to implement the proposed
telephones, iPods, and laptops, as well as the present designs with 250 nm technology and power consumption
Internet of Things (IoT) devices, need to consume low for the designs is 583 µW and 693 µW.
power without any compromise in performance. A A. Jain et al. [3] implemented an 8×8 Multiplier which is
significant power-consuming element in microprocessor an integration of Vedic Mathematics and Booth WT
ALU seems to be the multiplier. The device's power Multiplier. It uses the Radix-4 Booth algorithm, Wallace
consumption can be lowered at many design levels, Tree algorithm, and fusion of Modified Booth and
including the logic, circuit, gate, and transistor levels. To Wallace algorithms. It also uses the Vedic multiplication
increase the speed of multiplication, it is necessary to algorithm and it combines the Vedic-Booth-Wallace
speed up partial product accumulation. The performance algorithms and the new proposed multiplier are designed
of the multiplier is critical, in determining the overall which results in a reduction in delay and area.

978-1-6654-9790-9/22/$31.00 ©2022 IEEE

Authorized licensed use limited to: M S RAMAIAH INSTITUTE OF TECHNOLOGY. Downloaded on December 14,2022 at 03:52:37 UTC from IEEE Xplore. Restrictions apply.
M. Munawar et al. [4] implemented a modified version of
the DT multiplier which uses a carry select adder and
binary to an excess-1 converter, as a result, it increases its
efficiency in terms of power and speed. The modified
multiplier is Implemented and compared with existing
multipliers in terms of delay, dynamic power, and PDP.
The analysis depicts that the proposed multiplier
outperforms other multipliers in terms of performance.
M. C. Parameshwara and M. Nagabushanam [5] proposed
a 4 novel reversible FAs which are based on cutting-edge Fig.1. An abstract model of4×2 AMM
FA architecture design. The merits of proposed RFAs are
compared to those of reported reversible FAs on the basis (1)
of QGMs. According to the result comparison, the The output of each AMM submodule, in general, is
proposed RFAs are efficient in terms of the number of represented as in Eqn. 2 (from Fig.1), where Pi is an
constant inputs/garbage outputs and the quantum cost. intermediate partial product
Che-Wei Tung et al. [6] proposed pipeline MAC
(2)
architecture with high speed and low power design.
Where carry propagations of additions frequently result in All the generated intermediate PP are shifted and added as
higher power consumption and path delay in a per their respective weights as shown in Fig.2. An
conventional MAC. Additions to the partial product example of M=187×235=43945 using a binary 8×8 AMM
reduction process were incorporated to address these is illustrated in Fig.3. The process of addition and
issues. The implemented MAC design does not perform pipelining of all intermediate partial products generated
the operation of addition and accumulation of most from each sub-module of AMM is pictorially represented
significant bits until the partial product (pp) reduction in the block diagram shown in Fig.4
process of the following multiplication. To properly deal
with an overflow in the PP reduction process, an adder
with lower bit size is used to accumulate the carries.
Experiment analysis reveals that the implemented MAC
design can significantly decrease both the power
consumption of the circuit and the area occupied by it
under the same timing constraint as compared with
existing works.
Kokila Bharti Jaiswal et al. [7] implemented a modified
version of FA using multiplexers and an EX-OR gate.
When the modified FA is used in the reduction stage of
the WT multiplier, the delay, area, and average power are Fig.2. A pictorial representation of an 8 × 8 multiplicationvia 4 × 2
reduced by 17.65 percent, 45.75 percent, and 37.45 AMM
percent respectively, when compared to existing methods.
The synthesis results indicate the proposed design
dissipates 1.106 mW of power for an 8-bit multiplier and
is appropriate for low power, area applications.

III. MULTIPLICATION PROCESS USING AMM

The multiplication of two eight numbers to get a final


product M=X×Y is explained as follows [12]. To multiply
X with Y using the AMM, the multiplicand X is halved
into two equal parts namely lower half bits (XL) and
higher half bits (XH) respectively. The multiplier is also
divided into four equal parts viz. Y1, Y2, Y3, and Y4,
where the weight of the bits will increase from Y1 to Y4.
The multiplication of XL and XH with each part of Y is
carried out using an independent additive multiply block
as shown in Fig.1.The pictorial representation of the
process of multiplying two numbers using AMM is Fig.3. An example of 8×8 multiplication using AMM module
illustrated with an example of 8×8 multiplication in Fig.2,
along with an example shown in Fig.3. In Fig.2 and Fig.3,
the X and Y represent multiplicand and multiplier
respectively. The final product using AMM multiplier in
general is represented as in Eqn.1.

Authorized licensed use limited to: M S RAMAIAH INSTITUTE OF TECHNOLOGY. Downloaded on December 14,2022 at 03:52:37 UTC from IEEE Xplore. Restrictions apply.
multiplier is less than WTM and DTM and could be used
for various low-power VLSI applications.

TABLE 1 AREA & POWER COMPARISON OF ALL THE


MULTIPLIERS

Multi FA
Area in Leakage Dynamic Total Power
plier archite
µm2 Power in nW Power in μW in μW
cture
WTM 4916.42 249.25 419.48 419.73

DTM Generic 4497.29 233.65 373.86 374.09

AMM 4716.84 242.20 362.74 362.98


XOR-
5229.1 210.05 371.16 371.37
MUX
XNOR-
5229.1 204.62 318.20 318.40
WTM MUX
XOR/XN
OR- 4916.42 249.25 419.48 419.73
MUX
XOR-
4816.63 193.92 336.30 336.49
MUX
XNOR-
4816.63 188.36 283.52 283.71
DTM MUX
XOR/XN
OR- 4497.29 233.65 373.86 374.09
MUX
XOR-
Fig.4. Block diagram of binary 8×8 AMM 5036.17 202.42 310.97 311.17
MUX
XNOR-
AMM 5036.17 196.90 268.81 269.00
Beginners are taught how to multiply long-term on base- MUX
XOR/XN
10 things using this way. An 8-bit AMM (Additive OR- 4716.84 242.20 362.74 362.98
Multiply Module) multiplier was built using Verilog's 4x2 MUX
AMMs. The primary goal of this work is to optimize the
power consumption of 8-bit multiplier.
Table 1 shows the comparison of parameters like area,
power of WTM, DTM, and AMM with all three FA
IV. RESULTS, AND DISCUSSION architectures viz. XOR-Mux, XNOR-Mux, and
XOR/XNOR-Mux. It can be seen clearly from Table 1that
AMM with XNOR-MUX FA has the least power
The 8×8-bitWTM, DTM, and AMM multiplier consumption and hence it can be used for several low-
schematics are designed using Verilog RTL codes based power VLSI applications.
on generic FA architecture [1,5] and tested for various
input data combinations and the functionality is verified,
the proposed designs are also synthesized with Cadence
Genus tool (180 nm standard cell library).The proposed
multipliers are then implemented with different FA
architectures viz. XOR-Mux, XNOR-Mux, and
XOR/XNOR-Mux [5] to further study the power
behaviour of the multipliers under different FA
architectures.

The performances are compared by extracting area,


leakage power, and dynamic power of various multipliers Fig.5. Area of estimation of various multipliers
under different architectures of FAs, which are listed
inTable1.In this table, the lowest and highest values are Fig.5 shows the graphical representation of area
highlighted using ‘Blue’ and ‘Red’ text. The following estimation for different multipliers. In summarizing that
results show that the power consumed by the AMM the area of DTM using generic FA consumes the least
area compared to other multipliers.

Authorized licensed use limited to: M S RAMAIAH INSTITUTE OF TECHNOLOGY. Downloaded on December 14,2022 at 03:52:37 UTC from IEEE Xplore. Restrictions apply.
REFERENCES

[1] Parameshwara, M.C., “Approximate full adders for energy


efficient image processing applications" Journal of Circuits,
Systems and Computers, vol. 30, issue 13, pp.1-17, Oct. 2021.
[2] P. Choppala, V. Gullipalli, M. Gudivada and B. Kandregula,
"Design of Area Efficient, Low Power, High Speed and Full
Swing Hybrid Multipliers," 2021 International Conference on
Computing, Communication, and Intelligent Systems (ICCCIS),
2021, pp. 929-934.
[3] A. Jain, S. Bansal, S. Khan, S. Akhter and S. Chaturvedi,
“Implementation of an Efficient N×N Multiplier Based on
Vedic Mathematics and Booth-Wallace Tree Multiplier” 2019
International Conference on Power Electronics, Control and
Automation (ICPECA), 2019, pp. 1-5.
Fig.6. Power consumption of various multipliers based on generic [4] M. Munawar et al., “Low Power and High Speed Dadda
FA Multiplier using Carry Select Adder with Binary to Excess-1
Converter,” 2020 International Conference on Emerging Trends
Fig.6showsthe comparison of power consumption of in Smart Technologies (ICETST), pp. 1-4, 2020.
various multipliers based on generic FA, the summary of [5] Parameshwara, M.C., Nagabushanam, M. “ Novel low quantum
cost reversible logic based full adders for DSP applications. "
this is as follows: International Journal of Information Technology. 2021, vol. 13,
pp.1755–1761.
• Among the leakage power comparison, the [6] C.W.Tung and S.H. Huang, “A High-Performance Multiply-
Accumulate Unit by Integrating Additions and Accumulations
DTM has the lowest leakage power and WTM
into Partial Product Reduction Process,” in IEEE Access, vol. 8,
has the highest leakage power consumption. pp. 87367-87377, 2020.
• Among dynamic and total power consumption [7] K. B. Jaiswal, Nithish Kumar V, P. Seshadri and
Lakshminarayanan G, “Low power Wallace tree multiplier
the AMM has the least power consumption and using modified full adder,” 2015 3rd International Conference
WTM has the highest power consumption. on Signal Processing, Communication and Networking
(ICSCN), pp. 1-4, 2015.
[8] S. Venkatachalam and S. Ko, “Design of Power and Area
Efficient Approximate Multipliers,” in IEEE Transactions on
Very Large Scale Integration (VLSI) Systems, vol. 25, no. 5, pp.
1782-1786, May 2017.
[9] P. Yadav, A. Pandey, M. R. K., R. P. K.J., V. M.H. and N. K.
Y.B., “Low Power Approximate Multipliers with Truncated
Carry Propagation for LSBs,” 2018 IEEE 61stInternational
Midwest Symposium on Circuits and Systems (MWSCAS), pp.
500-503, 2018.
[10] Xiong, X., Lin, M. “Low Power 8-Bit Baugh–Wooley
Multiplier Based on Wallace Tree Architecture,” In: Sobh, T.,
Elleithy, K. (eds) Emerging Trends in Computing, Informatics,
Fig. 7 Power consumed by various multipliers using different FA Systems Sciences, and Engineering. Lecture Notes in Electrical
architectures. Engineering, vol. 151. Springer, New York, NY, 2013.
[11] N. Kumar M., R. S. Adithyaa, B. Kumar D. and T. Pavithra,
“Design Analysis of Wallace Tree based Multiplier using
Fig. 7 shows the comparison of power consumption of Approximate Full Adder and Kogge Stone Adder,” 2020 6th
various multipliers based on XOR-Mux, XNOR-Mux, and International Conference on Advanced Computing and
XOR/XNOR-Mux FAs, the summary of this is as follows: Communication Systems (ICACCS), pp. 612-616, 2020.
[12] Behrooz Parhami, “Variations in Multipliers,” in COMPUTER
• Among the leakage power comparison, the DTM ARITHMETIC Algorithms and Hardware Designs II. New
York, NY, USA: Oxford University Press, pp. 242-244, 2010.
has the lowest leakage power and WTM has the [13] Z. Gu and S. Li, “Optimized Interpolation of Four-Term
highest leakage power consumption. Karatsuba Multiplication and a Method of Avoiding Negative
• Among dynamic and total power consumption Multiplicands,” in IEEE Transactions on Circuits and Systems I:
the AMM has the least power consumption and Regular Papers, vol. 69, no. 3, pp. 1199-1209, March 2022.
[14] M. Heidarpur and M. Mirhassani, “An Efficient and High-Speed
WTM has the highest power consumption. Overlap-Free Karatsuba-Based Finite-Field Multiplier for FGPA
Implementation,” in IEEE Transactions on Very Large Scale
V. CONCLUSION Integration (VLSI) Systems, vol. 29, no. 4, pp. 667-676, April
2021.
In this paper, a design of three multiplier architectures [15] Venkatachalam, Suganthi, Elizabeth Adams, Hyuk-Jae Lee and
namely Wallace tree, Dadda tree, and Additive multiplier Seok-Bum Ko. “Design and Analysis of Area and Power
module have been designed with various FA circuit Efficient Approximate Booth Multipliers.” IEEE Transactions
architectures using Verilog RTL codes, simulated, and on Computers 68 pp: 1697-1703, 2019.
[16] C. V. Gowdar, M. C. Parameshwara and S. Sonoli, “Approximate
synthesized using Cadences’ Genus tool based on
Full Adders for Multimedia Processing Applications,” 2020 IEEE
standard cell library (180 nm). Based on the synthesis International Conference for Innovation in Technology (INOCON),
results the design parameters such as power and area have 2020, pp. 1-4.
been extracted compared, and important inferences have [17] Nitin Krishna V, “Performance Analysis of MAC Unit using
been drawn. Booth, Wallace Tree, Array and Vedic multipliers,” International
Journal of Engineering Research and Technology, Vol. 09, Issue
09,Sept. 2020.

Authorized licensed use limited to: M S RAMAIAH INSTITUTE OF TECHNOLOGY. Downloaded on December 14,2022 at 03:52:37 UTC from IEEE Xplore. Restrictions apply.
[18] Kadi Jaya Ramesh, Chepuri Pravalika, Dr.Rajkumar Sarma,
“Multiply and Accumulate Architectures for Digital signal
Processing and digital image Processing,” Turkish Journal of
Computer and Mathematics Education, Vol.12, No. 11, pp:3797-
3804,2021.
[19] Karthik S Kulkarni, H V Ravish Aradhya,“MAC Unit optimization
for Area, Power and Timing Constraints” International journal for
research in applied science and engineering technology,
Vol.9,Issue V, May 2021.
[20] Z. Gu and S. Li, “Optimized Interpolation of Four-Term Karatsuba
Multiplication and a Method of Avoiding Negative Multiplicands,”
in IEEE Transactions on Circuits and Systems I: Regular Papers,
vol. 69, no. 3, pp. 1199-1209, March 2022.
[21] M. Heidarpur and M. Mirhassani, “An Efficient and High-Speed
Overlap-Free Karatsuba-Based Finite-Field Multiplier for FGPA
Implementation,” in IEEE Transactions on Very Large Scale
Integration (VLSI) Systems, vol. 29, no. 4, pp. 667-676, April 2021.
[22] S. Aruna, S. Venkatesh and K. Srinivasa Naik, “A Low Power and
High-Speed Array Multiplier Using On-The-Fly Conversion,”
International Journal of Recent Technology and Engineering
(IJRTE) ISSN: 2277-3878, Volume-7, Issue-5S4, February 2019.
[23] K. Rajesh, G. Umamaheswara Reddy, “FPGA Implementation of
Multiplier-Accumulator Unit using Vedic multiplier and Reversible
gates,” International Conference on Inventive Systems and Control
(ICISC 2019).
[24] S. Venkatachalam, E. Adams, H. J. Lee and S. -B. Ko, “Design
and Analysis of Area and Power Efficient Approximate Booth
Multipliers,” in IEEE Transactions on Computers, vol. 68, no.
11, pp. 1697-1703, 1 Nov. 2019.

Authorized licensed use limited to: M S RAMAIAH INSTITUTE OF TECHNOLOGY. Downloaded on December 14,2022 at 03:52:37 UTC from IEEE Xplore. Restrictions apply.
View publication stats

You might also like