Professional Documents
Culture Documents
Understanding Performance
class Ideone
{
public static void main (String[] args) throws java.lang.Exception
{
System.out.println( 123.456 * 0.001 );
}
}
Processing Unit
Control Unit
Input/Output System
Dept. Arquitectura de Computadores 8
History of computing hardware
Third generation, transistors substitute
vacuum tubes:
◦ Smaller, require less power, more reliable, …
◦ Disk data storage units are built
◦ A secondary processor controls the
input/output system: multitasking is possible
◦ The first family of computers, the IBM
System/360, has a great success
Assembly language
◦ Textual representation of
instructions
Hardware representation
◦ Binary digits (bits)
◦ Encoded instructions and data
Dept. Arquitectura de Computadores 19
Abstraction
Abstraction helps us deal with complexity
◦ Hide lower-level detail
Instruction set architecture (ISA)
◦ The hardware/software interface
Application binary interface
◦ The ISA plus system software interface
Implementation
◦ The details underlying and interface
Seymour Cray
Clock (cycles)
Data transfer
and computation
Update state
Performance improved by
◦ Reducing number of clock cycles
◦ Increasing clock rate
◦ Hardware designer must often trade off clock rate
against cycle count
Relative frequency
Sequence 1: IC = 5 Sequence 2: IC = 6
Clock Cycles Clock Cycles
= 2×1 + 1×2 + 2×3 = 4×1 + 1×2 + 1×3
= 10 =9
Avg. CPI = 10/5 = 2.0 Avg. CPI = 9/6 = 1.5
Dept. Arquitectura de Computadores 42
Performance Summary
Performance depends on
◦ Algorithm: affects IC, possibly CPI
◦ Programming language: affects IC, CPI
◦ Compiler: affects IC, CPI
◦ Instruction set architecture: affects IC, CPI, Tc
+NOO M
+ "KH L M 4+ NOO M
P Q %&
@ & 9 9; &<?%
Z@$T
)* & + > 10[ 9$@ > 10[
]
\ )* & + ?%
B
In CMOS IC technology
Power = Capacitive load × Voltage 2 × Frequency
×30 5V → 1V ×1000
10 10
Overall ssj_ops per Watt = ∑ ssj_ops i ∑ poweri
i=0 i=0