You are on page 1of 4

Precision Workstation

Storage Classification
By: Alex Shows, Dell Client Performance Engineering

This paper provides an overview of the method used for classifying


disparate storage devices in order to differentiate high performance
storage options for Dell Precision workstations.

Why workstation storage Storage classifications


performance matters provide performance tiers
Workstations are often equipped with some of the fastest Storage devices, even those of the same capacity,
components available to client systems. When it comes to mechanical form factor, physical electrical connection, and
storage, it’s important to ensure that high-end professional protocol may deliver significantly different performance
applications aren’t spending most of their time waiting for the levels. By classifying storage, performance tiers can be used
storage subsystem. Workstation performance is only as a to compare across storage devices which might otherwise
strong as its weakest link, and productivity lost waiting for lack clear specifications which denote performance. This is
reads and writes to the storage subsystem can significantly especially important for solid-state drives (SSDs), which can
reduce the overall value of a workstation. have dramatically different storage performance depending
their internal architecture.

THIS WHITE PAPER IS FOR INFORMATIONAL PURPOSES ONLY, AND MAY CONTAIN TYPOGRAPHICAL ERRORS AND
TECHNICAL INACCURACIES. THE CONTENT IS PROVIDED AS IS, WITHOUT EXPRESS OR IMPLIED WARRANTIES OF ANY
KIND.
Copyright © 2017 Dell Inc. or its subsidiaries. All Rights Reserved. Dell, EMC, and other trademarks are trademarks of Dell Inc. or
its subsidiaries. Other trademarks may be trademarks of their respective owners.
By classifying storage devices using the results of measured
performance, Dell provides premium performance for
professional workstation applications and the engineers that
need it the most.

Before SSD technology became


mainstream, hard drive performance
could often be ascertained by the so- Benchmarking Storage for
called “back of box” specifications. That Drive Classification
is, one could flip over the proverbial box
and look at the spec sheet to read
PCMark Storage
important performance factors like
rotational speed, cache size, and  Variety of consumer and
average seek time. A detailed hard drive office productivity workloads
specification might even include other  Measures using recorded
interesting factors like peak sustained traces of real applications
throughput values and the number of which are played back
“Without platters inside the drive. While these
specifications may not correlate directly
 Provides both an asymptotic
score and an average
measuring drive with an individual application’s bandwidth measurement
performance, they do provide some
model basis for comparison and contrast across SPECwpc
performance in different drives in the rotational drive
ecosystem.
 Targeted workstation
some systematic Enter SSD technology: SSDs may have
applications, separated by
descriptive information in their product
way, it can be sheets, but it has become increasingly
vertical or industry
 Scale is based on a
difficult to difficult to correlate SSD interface, flash
memory type, and flash controller model reference platform including
differentiate SSD to application performance. SSDs

an SSD
Difficult to optimize as
provide significant advantages over hard
models.” disk drives, but without measuring drive scoring is inversely related to
model performance in some systematic average drive latency rather
way, it can be difficult to differentiate than peak throughput
SSD models based on their “back of box”
specifications. IOmeter

Measurement Methods  Robust scripting mechanism


Performance measurements are enables targeted
essential to differentiating SSDs. But measurements under tightly
what should be measured? Reads and controlled access
writes, both random and sequential, to specifications
be sure. Unfortunately however, if we  Provides insights into
were to answer this question by looking storage performance scaling,
at some of the more popular client for example with block size
storage benchmarks, we might draw the and queue depth
conclusion that strictly peak performance  Enables targeting of
should be measured. That is, in what performance metrics that
ways can I measure the highest possible yield the most user benefit
values for bandwidth or IO operations
per second (IOPs).

2 Storage Classification | Dell Inc., 2017


These peak values and the methods by TLC-based SSDs, while
used to measure them may or may not the “performance” cluster
correlate to one’s preferred professional included the MLC and even
application. This is especially true of a few SLC-based SSDs
benchmarks which measure peak such as high-performance
performance under ideal application and add-in cards (see Figure 1).
environmental conditions such as high Ultimately, we chose a
queue depth (the number of outstanding centroid-based clustering
IOs in the storage pipeline). algorithm to classify
In order to reflect a wide variety of individual storage devices.
workstation applications and usage Even though this created a
models, the storage classification bit of overlap between SATA
method we use incorporates multiple SSDs with MLC flash and
different types of measurements, to NVMe SSDs with TLC flash,
reflect both application-based as well as it enabled us to simplify the
targeted synthetic performance. classification system by
deriving minimum
Cluster Analysis performance metrics the
The results of performance high-performance classes.
measurements across all the various SSD Figure 1 - Clustering of NVMe SSDs
drives are used as inputs into a cluster Simplified Classification
analysis. A number of different clustering System
algorithms were applied, including In order to provide clear guidance to SSD
centroid-based and density-based, and technology providers, it was important for
the results of those were compared for Dell to establish some minimum
fitness. performance guidance for the
It became immediately clear that, likely “performance” clusters. It
due to the advantages of the PCI Express was in this step that the
bus, NVMe SSDs easily formed a distinct storage classes came to be.
grouping separate from SATA SSDs. This Class 20 drives denoted
supported our earlier assumptions, and mainstream SATA SSDs.
we continued to split the clusters within These are the typical SSDs
each of these interfaces. found in many client
SATA SSDs showed two clusters: a platforms and are suitable for
“mainstream” cluster centered on the workstation customers who
average performing triple-level-cell (TLC) want quiet, fast, and reliable
flash module, and a “performance” cluster drives but don’t necessarily
centered on the average performing multi- need the highest
level-cell (MLC) flash (see Figure 2). Here performance.
again, this division substantiated our Class 30 drives represent the
earlier assumptions that MLC-based highest performing SATA
SSDs outperform TLC-based SSDs. SSDs. Initially when we
However, in reviewing the SATA SSD introduced the classification
clusters, significant overlap was present. system, there were quite
That is, the lowest-performing drives in many models of Class 30
the high performing cluster might be the SSD, but due to a number of
same or slightly lower performing in some factors including Figure 2 - Clustering of SATA SSDs
measurements than the highest manufacturing costs and
performing drive in the mainstream increasing performance of TLC-based
cluster. drives, there are very few Class 30 drives
NVMe SSDs also showed two clusters, today. Workstation users desiring higher
and these were much more distinct than performance than Class 20 should look to
their SATA counterparts: the NVMe devices.
“mainstream” cluster was clearly occupied

3 Room for text | Dell Inc., 2017


Class 40 drives are predominantly centroids of the clusters move upward,
mainstream NVMe devices that this can sometimes affect the minimum
Storage Classifications incorporate TLC flash. There are a required performance. This is why the
significant number of different models number and variety of Class 30 drives is
Class 10
available in Class 40, which tends to lead shrinking, while Class 40 continued to
to performance variation as newer grow.
 Rotational disk drives generations replace older generations.
 Typically high capacity and Class 50 drives are the highest Conclusions
low cost performing SSDs offered on Precision Solid-state drives provide significant
 Typically SATA workstations. They provide some performance advantages compared to
significant performance gains over Class rotational hard drives. This increased
Class 20 40. Many are based on MLC flash, while performance presents a challenge to
some incorporate newer, faster identifying and distinguishing high
technologies such as 3D Xpoint. performing SSDs from their mainstream
 SATA mainstream SSDs
equivalents. Dell’s solution to this
 Significant performance
Class 30 Performance challenge is a classification system,
increase over Class 10
In order to be classified as a high based on measured performance, to
 Comprised primarily of TLC-
performance SATA SSD, a specific drive determine the minimum performance
based SSDs
model (at a specific capacity point) must requirements for a drive to be classified
meet or exceed 8 out of 11 of the as a performance SSD.
Class 30 requirements listed in the following table:

 SATA performance SSDs


 Performance increase over
Class 20
 Comprised primarily of MLC-
based SSDs
 Fewer and fewer models
available in this class

Class 40
Class 50 Performance
In order to be classified as the highest
 PCIe mainstream SSDs
performing SSD, regardless of device
 Comprised primarily of TLC- connection, a specific drive model (at a
based SSDs specific capacity point) must meet or
 Significant number of exceed 8 out of 11 of the requirements
different SSD models listed in the following table:
available in this class
 Performance variation
across different models

Class 50

 PCIe performance SSDs


 Significant performance
increase over Class 40
 Comprised of MLC-based
SSDs as well as newer Classification Updates
flash-based technologies Storage technology continues to advance,
such as 3D Xpoint and the classification system is only
useful if it advances with the technology.
Every six months, we revisit the
classification and look at variance within
classes as well as outliers. As the

4 Storage Classification | Dell Inc., 2017

You might also like