Professional Documents
Culture Documents
Temporal
access)
Locality
Spatial
Locality
Time
Donald J. Hatfield, Jeanette Gerald: Program
Restructuring for Virtual Memory. IBM
Systems Journal 10(3): 168-192 (1971)
10/17/2019 CS294-73 – Lecture 14
Memory Hierarchy
Processor
• Some examples:
• Cray T3E eliminated one cache to speed up misses
• IBM uses a level of cache as a victim cache which is cheaper
• There are other levels of the memory hierarchy
• Register, pages (TLB, virtual memory), … (Page (memory))
• And it isn t always a hierarchy
Embed
Games
SPEC
HPC
DB
ML
Health Image Speech Music Browser
1 Finite State Mach.
2 Combinational
3 Graph Traversal
4 Structured Grid
5 Dense Matrix
6 Sparse Matrix
7 Spectral (FFT)
8 Dynamic Prog
9 N-Body
10 MapReduce
11 Backtrack/ B&B
12 Graphical Models
13 Unstructured Grid
• CodeAnalyst
• developed by AMD for profiling on Intel systems
• Linux and Windows versions.
• Intel Vtune package
• free versions are available for students
• complicated to navigate their web pages…
You can notice that the resolution of gprof is pretty poor. things under 10ms
are swept away
You can see that I put the main program inside it’s own loop for 200 iterations of the whole solver.