Professional Documents
Culture Documents
Chapter 2
Computer Evolution and Performance
William Stallings : Computer Organization and Architecture, 9th Edition
+ 2
Objectives
Objectives
After studying this chapter, you should be able to:
Present an overview of the evolution of computer
technology from early digital computers to the
latest microprocessors.
Understand the key performance issues that
relate to computer design.
Explain the reasons for the move to multicore
organization, and understand the trade-off
between cache and processor resources on a
single chip.
+ 4
Contents
Base number 10 2 16
Set of digits { 0, 1, 2, …, 9 } { 0. 1 } { 0, 1, 2, …, 9, A, B, C, D, E, F }
Basic operations +, -. *, / +, -. *, / +, -. *, /
Number Systems:
Representing a quantity
Choose a system
37d = ?b = ?h
69d = ?b =?h
42d = ?b= ?h
+ 9
1001100b = ?h 11001110b = ? h
2AFh = ?b 49Ch= ?b
BF7h = ?b 7EAh = ?b
+ 11
Typical computers:
ENIAC (Electronic Numerical Integrator And Computer)
EDVAC (Electronic Discrete Variable Computer) and John Von
Neumann
IAS computer (Princeton Institute for Advanced Studies)
Commercial Computers: UNIVAC ((Universal Automatic
Computer)
IBM Computers ( International Business Machines)
+ First Generation: ENIAC Computer 13
(Read by yourself)
Electronic Numerical Integrator And Computer
Designed and constructed at the University of Pennsylvania
Started in 1943 – completed in 1946, by John Mauchly and John Eckert
Its first task was to perform a series of calculations that were used
to help determine the feasibility of the hydrogen bomb
Major
Memory drawback
consisted
was the need
Occupied of 20
Contained Capable
1500 Decimal accumulators,
more of for manual
Weighed square 140 kW rather each
than 5000 programming
30 feet Power than capable
18,000 additions by setting
tons of consumption binary of
vacuum per switches
floor machine holding
tubes second and
space a
10 digit plugging/
number unplugging
cables
+ 15
IAS computer
Princeton Institute for Advanced Studies
Prototype of all subsequent general-purpose computers
Completed in 1952
16
data
Instruction
One word contains 2 instructions
+
Structure
of
IAS
Computer
AC: Accumulator
MQ: Multiplier Quotient
MBR: Memory Buffer Register
IBR: Instruction Buffer Register
PC: program counter
IR: Instruction register
MAR: Memory Address Register
+ 19
Table 2.1
The IAS
Instruction
Set
Hexadecimal Code:
+ 010FA210FB
20
Backward compatible
22
+
Was the major manufacturer of
punched-card processing
equipment
IBM
7094
Configuration
Read by yourself
Microelectronics
+ A computer consists of gates,
29
and
Gate
Relationship
+ Chip Growth 31
Number of
transistors
Year m: million
bn: billion
Moore’s Law 32
Generations
VLSI
Very Large
Scale
Integration
ULSI
Semiconductor Memory Ultra Large
Microprocessors Scale
Integration
+ Semiconductor Memory 37
In 1974 the price per bit of semiconductor memory dropped below the price per bit
of core memory
There has been a continuing and rapid decline in Developments in memory and processor
memory cost accompanied by a corresponding technologies changed the nature of computers in
increase in physical memory density less than a decade
Each generation has provided four times the storage density of the previous generation, accompanied
by declining cost per bit and declining access time
+ 38
Microprocessors
The density of elements on processor chips continued to rise
More and more elements were placed on each chip so that fewer
and fewer chips were needed to construct a single computer
processor
• Image processing
• Speech recognition
• Videoconferencing
• Multimedia authoring
• Simulation modeling
+ Microprocessor Speed 42
Performance
Balance
Adjust the organization and Increase the number
of bits that are
architecture to compensate retrieved at one time
by making DRAMs
for the mismatch among the “wider” rather than
“deeper” and by
capabilities of the various using wide bus data
paths
components
Reduce the
Architectural examples frequency of memory
access by
include: incorporating
increasingly
complex and
efficient cache
structures between
the processor and
main memory
Increase the
Change the DRAM interconnect
interface to make it bandwidth between
more efficient by processors and
including a cache or memory by using
other buffering higher speed buses
scheme on the DRAM and a hierarchy of
chip buses to buffer and
structure data flow
Typical I/O Device Data Rates 44
+ 45
Improvements in Chip
Organization and Architecture
Increase hardware speed of processor
Fundamentally due to shrinking logic gate size
More gates, packed more tightly, increasing clock rate
Memory latency
Memory speeds lag (slow down) processor speeds
47
+ Processor Trends
+ 48
Multicore
CPU: CPU has some cores running
concurrently.
MIC: Many integrated core
GPGPU: General Purpose Graphical
Processing Unit
The use of multiple 49
MIC GPU
Leap (fast growth) in Core designed to perform parallel
performance as well as the operations on graphics data
challenges in developing
software to exploit such a large Traditionally found on a plug-in
number of cores graphics card, it is used to encode
and render 2D and 3D graphics as
The multicore and MIC strategy well as process video
involves a homogeneous (same
kind) collection of general Used as vector processors for a
purpose processors on a single variety of applications that require
chip repetitive computations
Read by Yourself 51
Some definitions:
CISC: Complex Instruction Set Computer, CPU is equipped a
large set of instructions
RISC: Reduced Instruction Set Computer, CPU is equipped basic
instructions only based on the thinking: A high instruction is
created using some basic instructions.
ARM: Advanced RISC Machine
+ 52
Factors
Clock Speed and Instructions per Second
Instruction execution rate
Methods: Benchmarks
Some laws: Read by yourself
Amdahl’s Law
Little’s Law
+ 53
System Clock
Benchmark
Benchmark
-The design of fair benchmarks is something of an art,
because various combinations of hardware and software
can exhibit widely variable performance under different
conditions. Often, after a benchmark has become a
standard, developers try to optimize a product to run that
benchmark faster than similar products run it in order to
enhance sales (MS Computer Dictionary)
Beginning in the late 1980s and early 1990s, industry
and academic interest shifted to measuring the
performance of systems using a set of benchmark
programs
+ 58
SPEC
An industry consortium
Defines and maintains the best known collection of
benchmark suites
Performance measurements are widely used for
comparison and research purposes
Best known SPEC benchmark suite
+
yourself) Illustrates
the problems facing
industry in the development of
multi-core machines
Software must be adapted to a
highly parallel execution
environment to exploit the
power of parallel processing
Can be applied to almost any system that is statistically in steady state, and
in which there is no leakage
Queuing system
If server is idle an item is served immediately, otherwise an arriving item
joins a queue
There can be a single queue for a single server or for multiple servers, or
multiples queues with one being for each of multiple servers
2.2 What are the four main components of any general-purpose computer?
2.3 At the integrated circuit level, what are the three principal constituents of a
computer system?
Computer Evolution
and Performance
Chapter 2
Multi-core
First generation computers
MICs
Vacuum tubes
Second generation computers GPGPUs
Transistors Performance assessment
Third generation computers Clock speed and instructions
Integrated circuits per second
Benchmarks
Performance designs
Amdahl’s Law
Microprocessor speed
Little’s Law
Performance balance
Chip organization and
architecture