Professional Documents
Culture Documents
Clearly, the answer to the question is “The system that does the job with
the lowest cost-of-ownership”.
Cost-of-ownership includes
Benchmarking examples
1. Before
After
2. Before
between 165 and 205 under the Sysmark 2004 overall office
productivity benchmark
Purpose of Benchmark
As computer architecture advanced, it became more difficult to compare the
performance of various computer systems simply by looking at their specifications.
Therefore, tests were developed that allowed comparison of different architectures.
Ideally benchmarks should only substitute for real applications if the application is
unavailable, or too difficult or costly to port to a specific processor or computer
system. If performance is critical, the only benchmark that matters is the target
environment's application suite
Challenges of benchmarks
Properties of benchmarks
There are seven vital characteristics for benchmarks.These key properties are:
[1] Relevance: Benchmarks should measure relatively vital features.
[2] Representativeness: Benchmark performance metrics should be broadly
accepted by industry and academia.
[3] Equity: All systems should be fairly compared.
[4] Repeatability: Benchmark results can be verified.
[5] Cost-effectiveness: Benchmark tests are economical.
[6] Scalability: Benchmark tests should measure from single server to multiple
servers.
[7] Transparency: Benchmark metrics should be easy to understand
Types of benchmark
1. Real program
o word processing software
o tool software of CAD
o user's application software (i.e.: MIS)
2. Component Benchmark / Microbenchmark
o core routine consists of a relatively small and specific piece of code.
o measure performance of a computer's basic components [5]
o may be used for automatic detection of computer's hardware
parameters like number of registers, cache size, memory latency, etc.
3. Kernel
o contains key codes
o normally abstracted from actual program
o popular kernel: Livermore loop
o linpack benchmark (contains basic linear algebra subroutine written
in FORTRAN language)
o results are represented in Mflop/s.
4. Synthetic Benchmark
o Procedure for programming synthetic benchmark:
take statistics of all types of operations from many application
programs
get proportion of each operation
write program based on the proportion above
o Types of Synthetic Benchmark are:
Whetstone
Dhrystone
5. I/O benchmarks
6. Database benchmarks
o measure the throughput and response times of database management
systems (DBMS)
7. Parallel benchmarks
o used on machines with multiple cores and/or processors, or systems
consisting of multiple machines
SPEC
SPEC Members:
SPEC Associates:
SPEC Tools
SPEC SERT Suite 2.0 .The SERT suite 2.0 adds a single-value metric,
reduces runtime, improves automation and testing, and broadens device
and platform support.
SPEC SERT Suite 1.1.1. The SERT suite 1.1.1 is the most current SERT
version supported by the U.S. EPA Energy Star v2.0 program. Designed to
be simple to configure and use via a comprehensive graphical user
interface, the SERT suite uses a set of synthetic worklets to test discrete
system components such as processors, memory and storage, providing
detailed power consumption data at different load levels.
SPEC Chauffeur WDK Tool. The Chauffeur™ WDK (Worklet Development
Kit) Tool was designed to simplify the development of workloads for
measuring both performance and energy efficiency
PTDaemon. The power temperature daemon (also known as PTDaemon) is
used to offload the work of controlling a power analyzer or temperature
sensor during measurement intervals to a system other than the SUT.
SPEC benchmarks
CPU
Graphics/Applications
HPC/OMP
Java Client/Server
Mail Servers
Network File System
Web Servers