You are on page 1of 20

Registers

Today well see another common sequential device: registers. Theyre a good example of sequential analysis and design. They are also frequently used in building larger sequential circuits. Registers hold larger quantities of data than individual flip-flops. Registers are central to the design of modern processors. There are many different kinds of registers. Well show some applications of these special registers.

Registers

What good are registers?

Flip-flops are limited because they can store only one bit. We had to use two flip-flops for our two-bit counter examples. Most computers work with integers and single-precision floatingpoint numbers that are 32-bits long. A register is an extension of a flip-flop that can store multiple bits. Registers are commonly used as temporary storage in a processor. They are faster and more convenient than main memory. More registers can help speed up complex calculations. Well discuss RAM next time, and later well also see how registers are used in designing and programming CPUs.

Registers

A basic register

Basic registers are easy to build. We can store multiple bits just by putting a bunch of flip-flops together! A 4-bit register from LogicWorks, Reg-4, is on the right, and its internal implementation is below. This register uses D flip-flops, so its easy to store data without worrying about flip-flop input equations. All the flip-flops share a common CLK and CLR signal.

Registers

Adding a parallel load operation



The input D3-D0 is copied to the output Q3-Q0 on every clock cycle. How can we store the current value for more than one cycle? Lets add a load input signal LD to the register. If LD = 0, the register keeps its current contents. If LD = 1, the register stores a new value, taken from inputs D3-D0.

LD 0 1

Q(t+1) Q(t) D3-D0

Registers

Clock gating

We could implement the load ability by playing games with the CLK input, as shown below. When LD = 0, the flip-flop C inputs are held at 1. There is no positive clock edge, so the flip-flops keep their current values. When LD = 1, the CLK input passes through the OR gate, so the flipflops can receive a positive clock edge and can load a new value from the D3-D0 inputs.

Registers

Clock gating is bad



This is called clock gating, since gates are added to the clock signal. There are timing problems similar to those of latches. Here, LD must be kept at 1 for the correct length of time (one clock cycle) and no longer. The clock is delayed a little bit by the OR gate. In more complex scenarios, different flip-flops in the system could receive the clock signal at slightly different times. This clock skew can lead to synchronization problems.

Registers

A better parallel load

Another idea is to modify the flip-flop D inputs and not the clock signal. When LD = 0, the flip-flop inputs are Q3-Q0, so each flip-flop just keeps its current value. When LD = 1, the flip-flop inputs are D3-D0, and this new value is loaded into the register.

Registers

Shift registers

A shift register shifts its output once every clock cycle.


Q0(t+1) = SI Q1(t+1) = Q0(t) Q2(t+1) = Q1(t) Q3(t+1) = Q2(t)

SI is an input that supplies a new bit to shift into the register. For example, if on some positive clock edge we have: SI = 1 Q0-Q3 = 0110 then the next state will be: Q0-Q3 = 1011

The current Q3 (0 in this example) will be lost on the next cycle.


Registers 8

Shift direction
Q0(t+1) = SI Q1(t+1) = Q0(t) Q2(t+1) = Q1(t) Q3(t+1) = Q2(t)

The circuit and example make it look like the register shifts right.
Present Q0-Q3 ABCD SI X Next Q0-Q3 XABC

But it really depends on your interpretation of the bits. If you consider Q3 to be the most significant bit instead, then the register is shifting in the opposite direction!
Present Q3-Q0 DCBA SI X Next Q3-Q0 CBAX

Registers

Shift registers with parallel load

We can add a parallel load, just like we did for regular registers. When LD = 0, the flip-flop inputs will be SIQ0Q1Q2, so the register shifts on the next positive clock edge. When LD = 1, the flip-flop inputs are D0-D3, and a new value is loaded into the shift register, on the next positive clock edge.

Registers

10

Shift registers in LogicWorks

Here is a block symbol for the Shift Reg-4 from LogicWorks. The implementation is shown on the previous page, except the LD input here is active-low instead.

Registers

11

Other types of shift registers

Logical shifts Standard shifts like we just saw. In the absence of a SI input, 0 occupies the vacant position. Left: 0110 -> 1100 Right: 0110 -> 0011 Circular shifts (also called ring counters or rotates) The shifted out bit wraps around to the vacant position. Left: 1001 -> 0011 Right: 1001 -> 1100 Switch-tail ring counter (aka Johnson counter) Similar to the ring counter, but the serial input is the complement of the serial output. Left: 1001 -> 0010 Right: 1001 -> 0100 Arithmetical shifts Left shifting is the same as a logical shift. Right shifting however maintains the MSB. Left: 0110 -> 1100 Right: 0110 -> 0011; 1011 -> 1101
Registers 12

Serial data transfer



One application of shift registers is converting between serial data and parallel data. Computers typically work with multiple-bit quantities. ASCII text characters are 8 bits long. Integers, single-precision floating-point numbers, and screen pixels are up to 32 bits long. But sometimes its necessary to send or receive data serially, or one bit at a time. Some examples include: Input devices such as keyboards and mice. Output devices like printers. Any serial port, USB or Firewire device transfers data serially. Recent switch from Parallel ATA to Serial ATA in hard drives.

Registers

13

Receiving serial data

To receive serial data using a shift register: The serial device is connected to the registers SI input. The shift register outputs Q3-Q0 are connected to the computer. The serial device transmits one bit of data per clock cycle. These bits go into the SI input of the shift register. After four clock cycles, the shift register will hold a four-bit word. The computer then reads all four bits at once from the Q3-Q0 outputs.

serial device

computer
Registers 14

Sending data serially

To send data serially with a shift register, you do the opposite: The CPU is connected to the registers D inputs. The shift output (Q3 in this case) is connected to the serial device. The computer first stores a four-bit word in the register, in one cycle. The serial device can then read the shift output. One bit appears on Q3 on each clock cycle. After four cycles, the entire four-bit word will have been sent.

computer

serial device

Registers

15

Registers in Modern Hardware


Registers store data in the CPU Used to supply values to the ALU. Used to store the results. If we can use registers, why bother with RAM?
CPU Pentium 4 Athlon XP Athlon 64 PowerPC 970 (G5) Itanium 2 MIPS R14000 GPR's 8 8 16 32 128 32 Size 32 bits 32 bits 64 bits 64 bits 64 bits 64 bits L1 C ach e 8 KB 64 KB 64 KB 64 KB 16 KB 32 KB L2 C ach e 512 KB 512 KB 1024 KB 512 KB 256 KB 16 MB

Answer: Registers are expensive! Registers occupy the most expensive space on a chip the core. L1 and L2 cache are very fast RAM but not as fast as registers.
Registers

16
Dunnington. Source INtel

Shift versus Multiply and Divide



You can multiply by powers of two by shifting to the left Example: 3 * 2 = 6; 3*22 = 12 0011 << 1 = 0110; 0011 << 2 = 1100 You can divide by powers of two by shifting to the right Example 8 / 2 = 4; 8 / 22 = 2; 4 / 2= 2; -4 / 2 = -2 1000 >> 1 = 0100; 1000 >> 2 = 0010; 0100 >> 1 = 0010; 1100 >> 1= 1110 -4 en twos complement -2 en twos complement

Registers

17

Why should I user shift instead of multiply?


Program 1: Multiplication #include .. Program 2:Shift #include ..

main() { scanf(%d,&mul); start = _rdtsc(); for (i=0;i<60;i++){ b=b*mul; } end = _rdtsc(); printf("It took %llu cycles\n", end-start); return 0; }
Output: It took 1056 cycles

main() { scanf(%d,&mul); start = _rdtsc(); for (i=0;i<60;i++){ b=b << mul; } end = _rdtsc(); printf("It took %llu cycles\n", end-start); return 0; }
Output: It took 896 cycles

Registers

18

Why should I user shift instead of integer division?


Program 1: Division #include .. Program 2:Shift #include ..

main() { scanf(%d,&mul); start = _rdtsc(); for (i=0;i<60;i++){ b=b/mul; } end = _rdtsc(); printf("It took %llu cycles\n", end-start); return 0; }
Output: It took 29328 cycles

main() { scanf(%d,&mul); start = _rdtsc(); for (i=0;i<60;i++){ b=b >> mul; } end = _rdtsc(); printf("It took %llu cycles\n", end-start); return 0; }
Output: It took 912 cycles

Registers

19

Registers summary

A register is a special state machine that stores multiple bits of

data. Several variations are possible: Parallel loading to store data into the register. Shifting the register contents either left or right. Counters are considered a type of register too! One application of shift registers is converting between serial and parallel data. Most programs need more storage space than registers provide. Well introduce RAM to address this problem. Registers are a central part of modern processors, as we will see in coming weeks.

Registers

20

You might also like