You are on page 1of 6

ISSN 2321 3361 © 2020 IJESC

Research Article Volume 10 Issue No.3

Design and Analysis of 16X16 Multiplier using Kogge Stone Adder


and Approximate 15-4 Compressor
R. Agnes Lilly1, V. Abinaya2
B.E Student1, 2
Department of Electronics and Instrumentation Engineering
St. Joseph’s College of Engineering, Semmancheri, Chennai, India

Abstract:
The major role of electronics device is to provide compact area with high speed performance. The most complex module in digital
building blocks system is multiplier. So, Multiplier is chosen and approximate computing is applied on it so that it can be used in
electronic applications like multimedia etc. A design approach of 16-bit Wallace Tree approximate multiplier with 15-4
compressor is considered to provide more reliability. The 16x16 Wallace tree multiplier is synthesized and simulated using Xilinx
ISE 14.7 software. Multiplier involves generation of partial products and addition of partial products which takes longer time for
computation. Using compression technique in Wallace tree approach, the number of partial products are reduced before addition
is performed. This paper focuses on Design and Analysis of 16x16 Multiplier to optimize parameters like Area and Delay.
Optimization is done using 15-4 approximate compressor. Here, we modify the design by replacing the 4-bit Parallel Adder by
4-bit Kogge Stone Adder in approximate 15-4 compressor and also compare the performance of both the designs of the 16x16
Multiplier. Thus, the better optimization of area and delay is identified.

Keywords: Area, Compressor, Delay, Kogge Stone Adder, Multiplier, Parallel Adder, Wallace tree.

I. INTRODUCTION Add the multiplicand into an accumulator, if the LSB of


Multiplier is ’1’. Shift the Multiplicand one bit to the left and
Two major concerns in optimization have been delay and area Multiplier one bit to the right. Stop when all the bits of the
in VLSI circuit design. Multiplier circuit is chosen to achieve Multiplier are zero.
reduction in the area and delay of circuits. Multipliers involve
generation and addition of partial products. Owing to large Where,Y=Yn-1Yn-
2………………...Y2Y1Y0(Multiplicand).X=Xn-1Xn-
number of partial products, the computation time has become
high. In order to reduce the computation time, Wallace Tree 2………………………….X2X1X0 (Multiplier)
Approach was developed. Accurate compressors have been
used in Wallace tree Approach to reduce the number of partial
products. Though accurate compressors give error-free results
area and delay does not reach the best level of optimization.
To achieve better level of optimization, approximate
compressors came into existence. These compressors generate
approximate results and also they reduce the area and delay.
This idea can be utilized in error tolerant applications. In
Image Processing, the size can be reduced by using
multipliers. Kogge Stone Adder is incorporated in the existing
design and implanted in the compressor for improved speed
and for better optimization. This is carried out in simulation
with the help of Xilinx software. So, the main objectives are to
achieve better optimization in area and delay of the multiplier
circuit, to design the 16X16 multiplier using Wallace tree Figure.1. nXn Multiplication Process
Approach and approximate compressor, to obtain the outputs
of the design using simulation, to analyze the performance of XY multiplied terms are the intermediate partial products. P
existing design and proposed design. terms are the final output of the multiplication which is
nothing but the addition of partial products. The number of
II. MULTIPLIER rows of partial products are n and the number of bits in the
result would be 2n where n is the number of bits in the
Multipliers have an important role in various applications. multiplier and multiplicand.
With many advancements in technology, many researchers
have been tried for creating new designs for multiplier with the III. COMPRESSORS
best features like high speed, low power consumption, layout
regularity, less area or even combination of any of these Number of partial products to be added in the Multiplier is the
features in a multiplier to make the best design which paves main parameter that strongly proves it’s performance. So, the
way for compact VLSI implementation. The most common performance of a multiplier can be improved by reducing the
multiplication algorithm used is “add and shift” algorithm. number of partial products which can be done using

IJESC, March 2020 24921 http:// ijesc.org/


compressors. Compressors are a critical component in a operating at ultra-low voltages are presented. The architecture
Multiplier since it can add several partial products at once to lays emphasis on the use of multiplexers in arithmetic circuits
reduce logic depth of the generated partial product. In 3-2 that result in high speed and efficient design [8]. In 2008,
compressor, it compresses the 3 bits of the input into 2bits Compressor was designed which uses bit sliced adder
which is what taking place in Full adder circuit. In 4-2 architecture to exploit the parallelism in the computation of
compressor, it uses both the full adder as well as the half adder sum of 15 input bits by 5 full adders. This compressor is
circuits. Therefore, compressors can also be said that it centered around the design of a novel 5-3 compressor. Here,
constitutes full adder circuit or half adder circuit or a the compressor makes use of minimum hardware resources
combination of both. With this idea many compressors were and is tested using 14 transistor and 10 transistor adder designs
developed that is big compressors can be constructed using [9].
small compressors. There is a major division in compressors
such as accurate compressors and approximate compressors. In 2009, 16X16 bit multiplier was developed using high speed
Accurate compressors give accurate results whereas compressor. It reduces the number of adders by introducing
Approximate compressors give approximate results with special kind of adders that are capable of adding 5 or 6 or 7
which better optimization of parameters is done which cannot bits per decade [10].
be done in accurate compressors. Here, 15-4 approximate In 2010, a new design concept that engaged accuracy as a
compressor is used. design parameter in multipliers was proposed. By introducing
accuracy as a design parameter, the bottleneck of conventional
IV. WALLACE TREE APPROACH digital IC design techniques can be breakthrough to improve
on the performances of power consumption and speed [11]. In
It multiplies two integers in a digital circuit where hardware is 2011, novel architecture of multiplier with error tunable
efficiently implemented. It was invented by the Australian characteristics was designed. It highlighted a fact that power
Computer Scientist Chris Wallace in 1964. Compression savings can be done in a multiplier based on designs [12]. A
technique is used in the Wallace Tree so that the number of novel approximate multiplier with low power consumption
partial products can be reduced before addition is performed. and a shorter critical path than traditional multipliers was
They partition the partial products into columns and in each designed [13].
columns, they group up the bits and compress them.
In 2012, Wallace Tree Carry Save Adder structures were used
V. LITERATURE SURVEY to sum the partial products in reduced time. Speed of the
Wallace tree multiplier was improved by using compressor
Microprocessors and Digital Signal Processors (DSP) are technique [14]. Approximate computing has gained significant
playing a significant role to handle the complexity of digital attention due to the popularity of multimedia applications. A
signal. In 2008, Dr. Liu researched that 95% of the processors novel inaccurate 4-2 counter that can effectively reduce the
in the market are based on digital signal [1]. K . K . P a r t h I a partial product stages of the Wallace Multiplier was designed
dds that digital signal processors take care of convolution, in 2013 [15]. An improved version of tree based multiplier
correlation and filtering of digital signal [2]. Multipliers, architecture called Wallace tree multiplier was developed. It
shifters and adders are mainly used to accomplish these tasks. used Carry Save Addition Algorithm to reduce the latency
Among the three modules, multiplier is the most complex one. [16]. In 2014, Multipliers based on Wallace reduction
The fact that multipliers take more time and consume higher provided an area efficient strategy [ 17]. A new power and
power than other two modules is mentioned in the year 2013 area efficient Approximate Wallace Tree Multiplier for error
by Y.Kim, Y.Zhang, P.Li [3]. tolerant applications was presented. It used bit width aware
approximate multiplication algorithm for optimal design of the
Multipliers have three phases - generation of partial products, multiplier [18]. To give fruitful results for unsigned integers,
reduction of partial products and final stage addition. to reduce delay and to enhance speed Wallace tree multiplier is
Reduction of partial products take much time and power in the used [19].
multiplier. Many techniques were proposed to reduce the
critical path in the multiplier. Among them, the use of In 2015, a design approach was developed for compressor
compressors in partial product reduction stage is the most based approximate multipliers. It also highlighted a fact that it
popular one. Compressors are basic circuits which are made of is attractive for error resilient applications and for arithmetic
full adders or half adders to count the number of “ones” in the applications [20]. Analysis of approximate compressors for
input. Several compressors are required in the partial product Multiplication was done. It proposed a fact that inexact
reduction stage [4]-[5]. computing is an attractive paradigm for digital processing at
nanometric scales and also for computer arithmetic designs.
Recently developed method is to design the Approximate This analysis dealt with two new approximate 4-2 compressors
15-4 compressor by using approximate 5-3 compressor. This for utilization in a multiplier [21]. Approximate Multiplier was
is implemented in 16X16 multiplier Approximate 5-3 designed where both the approximation and truncation were
compressors are used in 4 different ways in 15-4 compressor. considered. In this it proposed three compressors with an
Simulation results are compared with the accurate 15-4 accuracy constraint for the partial product reduction [ 22]. By
compressor [6]-[7]. comparing the hardware complexity of designs, critical path
delay, and error rate and power consumption for the three
Various compressors such as 3-2, 4-2, 5-2 and 5-3 were designs of approximate compressors, study of approximate
proposed by researchers in the last 20 years. The 3-2, 4-2 and compressors for multiplication using FPGA is done [23].
5-2 compressors are the basic components in many
applications, in particular partial product summation in VI. EXISTING SYSTEM
multipliers. In 2007, Novel architectures and designs of high
speed, low power 3- 2, 4-2 and 5-2 compressors capable of In this, multiplier used 15-4 approximate compressor which

IJESC, March 2020 24922 http:// ijesc.org/


constitutes 4-bit parallel adder for adding the partial products. VII. PROPOSED SYSTEM
It uses 5 full adders to get 15 inputs to it. Every Full adder
gives 2 inputs. Here two approximate 5-3 compressors are In this multiplier, almost same design is used but with the
used where one compressor gets all the sums of the full adders small alteration. In the existing system, each adder used in the
and the other one takes all the carries generated by the full 4-bit parallel adder gets carry input only when previous adder
adders. Since each compressor gives 3 outputs, a null bit is finishes it’s operation and it causes delay. So, instead of 4-bit
added to it for 4-bit parallel addition using 4- bit parallel parallel adder 4-bit Kogge Stone Adder is used. Therefore, it
adder. Following figure will show all of its contents and how involves 16X16 Multiplier where Wallace Tree approach is
they are constructed together. used in which various compressors are used. Highest form of
compressor used is approximate 15-4 compressor in which 4-
bit Kogge Stone Adder is used for final stage addition.

Figure.2. 15-4 Approximate Compressor using 4-bit


Parallel Adder

Figure.5. 16x16 Multiplier using Wallace tree approach

Figure.3. Design of 5-3 approximate compressor

Here, output 2 is approximated and following are the output


equations of this figure 3.

Figure.6. 15-4 Approximate Compressor using 4-bit Kogge


Stone Adder

4-bit Kogge Stone Adder is a parallel prefix adder which is a


form of carry look ahead adder. Using 3 stages of operation it
computes the carry. Equations used in each stages is given
Figure.4. Output Equations of the approximate 5-3 below. At pre-processing stage, P=Ai XOR Bi, G=Ai AND Bi
compressor At prefix stage, P=Pi AND Piprev,

IJESC, March 2020 24923 http:// ijesc.org/


G=(Pi AND Giprev) OR Gi
At final computation, Gi=Ci ,Si=Pi XOR Ci-1

Figure.9. Interior View of the RTL Schematic

Figure.7. Working of Kogge Stone Adder

It gets the inputs and produces generate and propagate signals.


Then, it takes up two pairs of generate and propagate signals
and continues the computation and finally it founds the carry
using which it results the sum output. This process takes place
by using the different equations at three different stages.

VIII. RESULTS AND COMPARISON

Using Xilinx ISE 14.7, simulation output is obtained. RTL


Figure.10. Simulation Output Waveform
schematic and the simulation output (if inputs are same) are
same for both the existing and the proposed. RTL Schematic
But the Device Utilization Summary (Area) and Delay
may look the same from outside for both the systems but on
Analysis (Delay) differ in both the systems.
the inside, it is actually different since alteration in the circuit
is done. The circuits used can be clearly seen when it is
zoomed in.

Figure.11. Device Utilization Summary of Existing System

In the Device Utilization Summary, Area in terms of LUTs.


(Look up Table). Following figure gives the details about
delay involved in the existing system circuit.

Figure.8. RTL Schematic Figure.12. Delay Analysis of Existing System

IJESC, March 2020 24924 http:// ijesc.org/


IX. CONCLUSION

The approximate 16x16bit Wallace tree multiplierusing15- 4


compressor architecture has been designed and synthesized
using on Spartan 6 XC3S100E board and simulated in Xilinx
ISE 14.7. The performance of proposed Multiplier with Kogge
Stone Adder is compared with the same architecture of
multiplier using parallel adder. It can be inferred that 16x16
multiplier architecture using 15-4 compressor with Kogge
Stone Adder is faster compared to multiplier with parallel
adder. In future the performance of the proposed multiplier can
Figure.13. Device Utilization Summary of Proposed System be improved and applied in applications like video and image
processing.

X. REFERENCES

[1]. D.Liu, “Embedded DSP Processor Design”, 1st ed. Morgan


Kaufmann Publishing, 2008.
Figure.14. Delay Analysis of Proposed System
[2]. K. K Parthi, VLSI Digital Signal Processing Systems:
Simulation input and output is also implemented in FPGA. Design and Implementation, 1st ed. John Wiley and Sons,1999.
Spartan 6 Board is used. Inputs can be given through switches
and outputs can be observed from led lights. [3]. Y. Kim, Y. Zhang, and P.Li,“An energy efficient appro
ximate adder with carry skip for error resilient neuromorphic
Input i[7:0]=01011101, Input j[7:0]=01010101 VLsI systems,” inproc. Of International conference on
Output sum[15:0]=0001011001100001 Computer-Aided Design (ICCAD), Nov.2013, pp.130-137.

[4].A.Pishvaie, G.Jaberipur, and A.Jahaniam,“Improved CMOS


(4:2) compressor designs for parallel multipliers”, Computers
and Electrical Engineering, vol. 38,no.6, Nov.2012, pp. 1703-
1716.

[5]. D. Baran, M. Aktan, and V.G.Oklobdziia, V.G, “Energy


Efficient Implementation of Parallel CMOS Multipliers with
Improved Compressors,” in proc. ACM/IEEE International
Symposium on Low-Power Electronics and Design (ISLPED),
Aug.2010, pp.147-152.
Figure.15. FPGA Implementation
[6]. Marimuthu, R., Elsie Rezinold, Y., &Malick, P.S., “Design
and analysis of multiplier using approximate 15-4 compressor”,
Table.1. Result Comparison Table
IEEE Acess, 5, 2017, pp. 1027-1036.
PARAMETER EXISTING PROPOSED
SYSTEM SYSTEM [7]. Pabithra, S., &Nageswari, S., “Analysis of Approximate
Compressor Approximate Approximate Multiplier Using 15-4 Compressor for Error Tolerant
type 15-4 compressor 15-4 compressor Application”, International Conference on Control Power,
Type of Adder 4- bit parallel 4- bit Kogge Communication and Computing Technologies, pp.410-415.
used in adder Stone adder
approximate [8]. Veeramachaneni, S., Kirthi Krishna, M., Avinash, L,
15-4 Puppala, S.R., & Srinivas, M.B., “Novel architectures for high-
compressor speed and low power 3-2, 4-2 and 5-2 compressors”, In
Number of Used Available Used Available Proceedings of the IEEE International Conference on VLSI
Slice LUTs 682 5720 507 5720 Design, 2007, pp. 324-329.

Number of 0 682 0 507 [9]. Chowdhury, S.R., Banerjee, A., Roy, A., &Saha, H.,“
fully used Design, simulation and testing of a high speed low power 15-4
LUT-FF pairs compressor for high speed multiplication applications”, In
Number of 65 102 65 102 Proceedings - 1st International Conference on Emerging
bonded IOBs Trends in Engineering and Technology, 2008, pp. 434–438
Delay 202.873ns 128.910ns
[10]. Dandapat, A., Ghosal, S., Sarkar, P., & Mukhopadhyay,
Levels of Logic 177 112 D., “A 1.2-ns16×16-bit binary multiplier using high speed
compressors”, World Academy of Science, Engineering and
Since Area in terms of LUTs is reduced from 682 to 507 and Technology, 39,2009, pp. 627–639
Delay is reduced from 202.87ns to 128.910ns, the proposed
approximate compressor can be identified as a better method to [11]. Kyaw, K. Y., Goh, W. L., & Yeo, K. S., “Low-power
optimize area and delay. high-speed multiplier for error-tolerant application”, IEEE

IJESC, March 2020 24925 http:// ijesc.org/


International Conference of Electron Devices and Solid-State
Circuits, 2010.

[12]. Kulkarni, P., Gupta, P., &Ercegovac, M., “Trading


accuracy for power with an under designed multiplier
architecture”, In Proceedings of the IEEE International
Conference on VLSI Design, 2011, pp. 346–351

[13]. Liu, C., Han, J., & Lombardi, F., “A low-power, high-
performance approximate multiplier with configurable partial
error recovery”. In Proceedings -Design, Automation and Test
in Europe, 2014.

[14]. Gahlan, N. K., Shukla, P., & Kaur, J., “Implementation of


Wallace Tree Multiplier Using Compressor”, International
Journal of Computer Technology & Applications, vol.3, Isuue
3,2012, pp. 1194–1199.

[15]. Lin, C. H., & Lin, I. C., “High accuracy approximate


multiplier with error correction”, IEEE 31st International
Conference on Computer Design, 2013, pp. 33–38.

[16]. Sureka, N., Porselvi, R., & Kumuthapriya, K., “An


efficient high speed Wallace tree multiplier”, International
Conference on Information Communication and Embedded
Systems, 2013, pp. 1023–1026.

[17]. Asif, S., & Kong, Y., “Low-area Wallace multiplier”,


VLSI Design, 2014.

[18]. Bhardwaj, K., Mane, P. S., & Henkel, J., “Power- and
area-efficient approximate Wallace tree multiplier for error-
resilient systems”, In Proceedings - International Symposium
on Quality Electronic Design, ISQED,2014 pp. 263–269.

[19]. Ramya, A. S., Ramesh Babu, B. S. S. V., Raju, K. S. N.,


Ravi Chandra, B., Sirisha, A., &Srikala, E., “On the reduction
of partial products using Wallace tree multiplier”, In Smart
Innovation, Systems and Technologies, Vol. 78, 2018, pp. 525–
532.

[20]. Maheshwari, N., Yang, Z., Han, J., & Lombardi, F., “A
Design Approach for Compressor Based Approximate
Multipliers”, In Proceedings of the IEEE International
Conference on VLSI Design, 2015, pp.209–214.

[21]. Momeni, A., Han, J., Montuschi, P., & Lombardi, F.,
“Design and analysis of approximate compressors for
multiplication”, IEEE Transactions on Computers, Vol.64,
Issue 4, 2015.

[22]. Yang, Z., Han, J., & Lombardi, F., “Approximate


compressors for error-resilient multiplier design In Proceedings
of the IEEE International Symposium on Defect and Fault
Tolerance in VLSI and Nanotechnology Systems, pp. 183–186,
2015.

[23].Mody, J., Lawand, R., Priyanka, R., Sivanantham, S.,&


Sivasankaran, K., “Study of approximate compressors for
multiplication using FPGA”, In IC-GET- Proceedings of Online
International Conference on Green Engineering and
Technologies, 2015.

IJESC, March 2020 24926 http:// ijesc.org/

You might also like