Professional Documents
Culture Documents
Report FINAL B
Report FINAL B
INTRODUCTION
In domain of digital signal processing the number representation is either in
fixed-point or floating-point form. The contribution deals with a binary
representation of real numbers. The advantage of floating-point representation
over fixed-point and integer representation is that it can support a much wider
range of values. To increase the speed and efficiency of the real-
number computations, computers or Floating Point Unit (FPU) typically used
binary floating-point format to represent real numbers. These representations
play major role in digital and radar imaging to reduce the complexities during
the processing and to achieve the accuracy and efficiency.
There are many processors with fixed or floating-point representation and there
are also several blocks used for arithmetical operations. Floating-Point (FP) Fast
Fourier Transform (FFT) processors are widely used in high resolution radar
imaging applications for the task of pulse compression.
1
1.1 BASIC CONCEPTS OF FLOATING POINT
3
2) Double Precision
4
increase Elites, (in this case, E4, or 2) by 5 So that Elites, equals E, (in this case,
E, or 7). In normalization of the sum, if Ms., is not already in normalized form,
then it is normalized to yield normalized sum S. In other words, if required, the
binary point of Ms., is shifted left or right as appropriate until there is only one
significant digit to the left of the binary point to yield normalized mantissa sum
Ms. Then, E, is adjusted by y to yield the exponent Es of normalized sum S. In
the example above, Ms., 10.0 is normalized by shifting the binary point one
place to the left to yield Ms 1.0. Then, E, 7 is increased by 1 to yields Es of 8.
Exceptions: Overflow, Underflow, and Zero.
An exception occurs when a floating-point operation yields a result which
cannot be represented in the floating-point numbering system used. Three
common exceptions are overflow, underflow, and Zero. Overflow and
underflow exceptions occur when addition results in a Sum, the absolute value
of which is either too large (overflow) or too small (underflow) to be
represented in the floating-point numbering system used. For example, IEEE
754 32-bit single-precision format is not capable of representing a positive
number
greater than (2-2°)x2' (positive overflow) or less than 2' (positive underflow), or
a negative number the absolute value of which is greater than (2-2)x2'''
(negative overflow), or less than 2'' (negative underflow). Furthermore , IEEE
754, with its implied leading digit of 1, is incapable of naturally representing 0
(zero exception).
1.5 APPLICATIONS
OFDM Systems.
4G communication technology.
5
Generating sine and cosine which will reduce the usage of large
Rom.
Space craft and launching rockets.
Deep Learning
Media processing
Security
Big data
Analytic processing.
6
Fig.1.1.4.Architecture of floating point addition
CHAPTER 2
7
DESIGN ARCHITECTURE OF FLOATING POINT
ADDITION
2.1 STEPS TO DESIGN FLOATING POINT ADDITION
ARCHITECTURE
According to the architecture of floating point addition it having many
blocks which can be individually designed using
Model simulator and get correlated to form the overall architecture. This
architecture can be build by step by step
Pipelining process, following steps are:
8
Round the result if the logical condition R”(M0 + S’’) is true, where M0
and R’’ represent the pth and (p + 1)st bits from the left end of the
normalized significant. New sticky bit (S’’) is the logical OR of all bits
towards the right of the R’’ bit. If the rounding condition is true, a 1 is
added at the pth bit (from the left side) of the normalized significant. If p
MSBs of the normalized significant are 1’s, rounding can generate a
carry-out. in that case normalization (step 4) has to be done again.
2.2 LZA
9
2.1.1Figure Le0/ading zero/one anticipator operation
10
CHAPTER 3
LITERATURE REVIEW
[1] J. Sohn, J. Earl E. Swartzlander Improved architectures for a
fused floating-point add-subtract unit gives the information about the fused
floating point add and subtract unit with the employment of pipeline and dual
path algorithm. The fused point 40 percent of area and power consumption and
the dual path design reduces the latency by 30 percent compared to the discrete
floating point designs. The fused dual path floating point add and subtract unit
can be split into two pipelines and with this the throughput increased by 80
percent.
11
[4] Xing Wei , Haigang Yang , Wei Li , Zhihong Huang , Tao Yin, Le
Yu A reconfigurable 4-GS/s power-efficient floating-point FFT processor
design and implementation based on single-sided binary-tree
decomposition. The design of the floating point operations (fp) with fast
Fourier transform to attain high throughput .To improve the efficiency and
reduce the power consumption, area consumption. The above aim is achieved
by designing the proposed design using silicon based on
SMIC’s(Semiconductor Manufacturing International Corporation) 28 nm
CMOS technology with the active area 1.39mm^2.In this proposed work and
existing work the process reduced from 45nm,1.25V to 28nm,1.05V,working
frequency reduced to 1.49GHZ to 0.5 GHZ. area consumption reduced from
100% t0 80.3% and the power consumption reduced from 2.47 to 1.45.
12
mode QPdDPqSP (quadruple precision with dual double precision and quad
(four parallel) single precision) arithmetic. These have two prime targets, first to
provide quadruple precision arithmetic support and second is aimed towards the
inclusion of SIMD (single instruction multiple data) processing support for
double precision and single precision arithmetic in it.
CHAPTER 4
PROBLEM STATEMENT
Thus the floating point addition is designed to low power
consumption, and with increased efficiency and minimum power delay.
In previous work the following conditions mentioned above cannot be
achieved. To overcome this LZA logic block is included. LZA logic plays
major role and it is used to speed up the operation and reduces the delay
introduced by normalization. And using the pipelining methods it can be
done step by step sequentially and the required minimum delay, minimum
13
power and less area consumption area.
CHAPTER 5
COMPARATOR:
A magnitude digital Comparator is a combinational circuit
that compares two digital or binary numbers in order to find out whether one
binary number is equal, less than or greater than the other binary number. We
logically design a circuit for which we will have two inputs one for A and other
for B and have three output terminals, one for A > B condition, one for A = B
condition and one for A < B condition.
14
Fig:5.1.1 comparator diagram
5.1CHARACTERISTICS:
Linear characteristics of scale
Quick in results
High magnification
Versatility
Compensation from Temperature effects
1. If A3 = 1 and B3 = 0
2. If A3 = B3 and A2 = 1 and B2 = 0
3. If A3 = B3, A2 = B2 and A1 = 1 and B1 = 0
4. If A3 = B3, A2 = B2, A1 = B1 and A0 = 1 and B0 = 0
Similarly the condition for A<B can be possible in the following four cases:
15
1. A3 = 0 and B3 = 1
2. If A3 = B3 and A2 = 0 and B2 = 1
3. If A3 = B3, A2 = B2 and A1 = 0 and B1 = 1
4. If A3 = B3, A2 = B2, A1 = B1 and A0 = 0 and B0 = 1
The condition of A=B is possible only when all the individual bits of one
number exactly coincide with corresponding bits of another number.
16
A>B
A<B
17
A=B
CHAPTER 6
SHIFTER:
6.1 Operation of shifter:
19
6.1.2 Left arithmetic shift
20
Right shifter
21
Left shifter
A Left Arithmetic Shift of one position moves each bit to the left by
one. The vacant least significant bit (LSB) is filled with zero and the most
significant bit (MSB) is discarded. A Right Arithmetic Shift is one position
moves each bit to the right by one. The LSB is discarded and the vacant MSB is
filled with zero.
CHAPTER 7
Multiplexer:
22
7.1.1 Figure of 4:1multiplexer
23
4:1 Multiplexer
CHAPTER 8
ADDER:
8.1 Operation of adder:
24
8.1.1 Truth table of adder
. A full adder logic is designed in such a manner that can take eight inputs
together to create a byte-wide adder and cascade the carry bit from one adder to
the another.
25
CHAPTER 9
Exponent Difference And Exponent Increment:
9.1 EXPONENT DIFFERENCE:
This is used to take difference between the two inputs given to the
perform floating point arithmetic. But it is necessary to be the two input
exponent values same. For that purpose we take difference between the given
input exponent values .if the value becomes zero ,further process to be proceed
otherwise it value should be increment or decrement to get exponent difference
zero. This is achieved by using formula ,
D = Y-Z;
where , Y= (2**a);
Z= (2**b);
26
9.1.2 Simulated result Exponent increment:
27
For Exponent decrement :
Y= (a-1);
When input is a ^ y , b^z ,where ( y>z)
9.2 Bit inverter:
An inverter circuit outputs a voltage representing the opposite
logic-level to its input. Its main function is to invert the input signal applied.
Digital
electronics circuits operate at fixed voltage levels corresponding to a logical 0
or 1.
.
28
CHAPTER 10
REFERENCES
29
Conference on Electronics, Circuits and Systems (ICECS), 2011, pp.
153–156, https://doi.org/10.1109/ICECS.2011.6122237.
30
31
32
33
34
35
36
37