You are on page 1of 5

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/329953006

Yodea: Workload Pattern Assessment Tool for Cloud Migration

Conference Paper · December 2018


DOI: 10.1109/CloudCom2018.2018.00019

CITATION READS
1 134

2 authors, including:

Rukma Talwadker
Games24x7
9 PUBLICATIONS 150 CITATIONS

SEE PROFILE

All content following this page was uploaded by Rukma Talwadker on 17 July 2020.

The user has requested enhancement of the downloaded file.


2018 IEEE International Conference on Cloud Computing Technology and Science (CloudCom)

Yodea: Workload Pattern Assessment Tool for


Cloud Migration
Rukma Talwadker Cijo George*
NetApp Inc. (rukma@netapp.com) cijogeorge.in@gmail.com

Abstract—As the news around cloud repatriations gets real,


many cloud technologists associate them with poor understanding
of the applications and their usage patterns by the enterprises.
Our solution, Yodea, is a tool cum methodology to analyze work-
load patterns in the light of cloud suitability. We bring forward
compute patterns which can benefit from cloud economics with
on-demand compute scaling. Yodea further ranks workloads in
terms of their cloud suitability on the basis of these metrics. After
the fact analysis of storage workloads for a customer install-base,
features 38% of the “already in cloud” volumes in the top 100
ranked list by Yodea.
Index Terms—cloud migration, workload analysis, pattern
extraction, time series decomposition, workload agility Fig. 1: Cloud workload types: (source:
www.iamondemand.com/blog/5-key-essentials-of-cloud-
I. I NTRODUCTION workloads-migration/)
A. Background and Motivation
CIO’s are keen on moving to the cloud primarily because of
performance bottlenecks prior to earmarking the applications
two reasons; 1) cost efficiency: cloud offers a pay-as-you-go
for the cloud. For instance, at peak demand, scaling the
model which has the premise to beat the cost of ownership
compute capacity by spinning up additional VM’s in the cloud
of an on premise infrastructure and 2) agility: incentivizing
might just not work if the underlying shared storage is the
on the endless and quick capacity on demand. However, a
bottleneck [2]. Yodea implements a complete workflow (from
recent study by 451 Research [3] mentions that over two-
data collection, to workload pattern classification, to cloud
thirds of customers hosting one or more workloads in the
recommendations) to analyze workloads in the light of these
public cloud are repatriating back to a private or on premises
four patterns. The analysis ranks the observed workloads with
cloud or are planning to do so within a year. Dropbox,
quantitative as well as visual illustrations to help CIO take an
TapJoy being a couple of the known examples.The cloud
informed decision.
technologists attribute these failures primarily to: 1) inability
1) Yodea Criteria: High Growth: If the read and write
to change legacy processes, and 2) limited understanding of
operations show a positive linear or an exponential trend,
the applications, processes and the usage patterns.
indicates that over a period of time the demand has been
Today workload analysis mostly happens at a coarser gran-
increasing. This presents an opportunity with cloud agility;
ularity, considering only the recent trends in the workload.
expand capacity on demand. For storage, one needs to further
The capability of identifying the right workloads taking into
analyse if the storage used capacity is also changing. If the
consideration their fine granular patterns and back in time
used capacity is on the rise, efficient partitioning of the data
trends is heavily sought after. Premise of Yodea is to provide
with compute scale out can be considered. On the other hand,
such a capability in the form of a self-help tool, which also
if the data is always overwritten with used capacity remaining
recommends workloads which can benefit from the dynamic
the same, this increasing trend may not be good for the cloud
compute scaling available in the cloud and meet the expected
from the storage perspective. One would need a storage re-
ROI.
configuration prior to the shift.
B. How can Yodea Help? On-Off: Workload pattern which shows an On and Off at
[1] recommends a few compute patterns being good for the storage reflects well on the actual application compute
the cloud for incentivising on agility and economics. These pattern too. Quite unlikely the application would be serving
patterns are illustrated in Figure 1 with the compute demand all of its data from a hot cache for the entire Off period of the
on the y-axis and the time on the x-axis. We believe that, storage, which could be as large as 10-12 hours in a day. For
only analyzing the compute patterns is not enough, one must such cases, a suitable cloud option can be considered (like
also analyze the corresponding storage workload patterns for shutting down the infrastructure during “Off” period), after
validating a similar trend for the compute.
*This work was done while the author was with NetApp A-Periodic and Periodic bursting: Many applications

978-1-5386-7899-2/18/$31.00 ©2018 IEEE 17


DOI 10.1109/CloudCom2018.2018.00019
Fig. 3: Time series characteristics

Fig. 2: Read counter time series and its decomposed con-


stituents of the counter and the x axis denotes the time. Time series
decomposition produces three distinct constituents: the trend,
the seasonal and the residual.
(especially web apps) are notorious for an unpredictable and Trend (T): This component reflects the progression of the
dynamic demand spikes, referred to as the a-periodic burst. In data. If there is a steady increase or decrease in the values of
the cloud, there is an opportunity to capitalize on this load the data, the trend progression will reflect this behaviour. We
and monetize it by dynamic compute scale up and down. only consider the additive trend.
Also, if an application exhibits a predictable demand rise, Seasonal (S): Any predictably repeating behaviour in the
several orders of magnitude from an average demand, cloud time series is captured by the seasonal constituent. Residual,
presents a unique opportunity too. In both these cases, an over- remainder or the noise (R): Any randomly occurring pattern
provisioned on premise (CAPEX) loses to a dynamically sized of counter values is accounted into the residual constituent.
cloud instance (OPEX). In an additive decomposition, the decomposed time series
The periodic and a-periodic bursts of compute may not can be reconstructed as: raw = T + S + R. Each of these
mirror read and write demands at the storage. However, constituents are also time series.
repeating the same analysis helps in avoiding a situation where
compute scale out is bottlenecked by the storage performance. B. The Constituents and the Metrics
Finally, this paper is not a detailed survey of the workload We discuss metrics which are used by Yodea to score each
patterns suitable for the cloud. Yodea only covers the patterns workload pattern along the following three categories: High
mentioned in Figure 1. growth, Periodic and A-Periodic. Identifying On and Off is
relatively simpler and we do not discuss it in this paper further.
C. Yodea Contributions High Growth: We begin with finding a linear trend. We
1) Yodea is a self-help tool which offers an automated perform a simple line fitting (linear regression) on the trend
assessment of applications for cloud compute based on component. The slope of the line indicates the trend. If the
the workload patterns. slope is negative, it indicates a downtrend. A zero-valued slope
2) Yodea offers simple and effective statistical metrics over indicates no trend.
counter time series for workload classification. More Periodic Burst: Periodic bursts are predictable. These bursts
importantly these metrics do not require any manual come from the workload patterns which repeat at regular
tuning or workload specific knowledge. intervals. Hence, periodic burst scores are extracted from the
3) These metrics are generic and are applicable to counter seasonal constituent.
data at the storage as well as the compute. The technique A-periodic Burst: A-periodic bursts are random in their
works for any granularity of the counter data. occurrence. There is no regularity in their arrival pattern. Any
dynamic, non-deterministic occurrence of a burst would be
II. A PPROACH reflected in the residual constituent.
A. Time Series Decomposition 1) Intuition Behind Burst Scoring Metrics: Figure 3 depicts
Yodea uses total number of 4KB read and write operations the characteristics of a raw time series. Prior to introducing
per unit of time (can be a second/minute or an hour) over a the metrics, we briefly discuss the various characteristics of a
period of time as a representative of a workload. This counter burst:
data gathered at the compute or the storage layer is our input Burst: indicates increase in the value of a counter by an order
time series. Counter data is interpolated to handle missing of magnitude higher than the mean value of the series. Though
values and then decomposed for metric calculations. Time the burst starts and ends gradually, only when the value of the
series decomposition is a statistical approach to deconstructing counter exceeds the threshold value, the beginning of a burst
a time series into its constituents as discussed in [5]. is assumed. Yodea configures threshold to be, one standard
Figure 2 shows an example of counter time series for a deviation from the mean.
randomly chosen storage workload. The “raw” series is the The burst inter-arrival time: denotes the time between end
actual observed time series. The y axis indicates the values of a burst and the start of the subsequent burst.

18
The peak or the amplitude: indicates the highest value of
the counter during the burst.
The width or the duration: indicates the time along the x
axis between the start and the end of the burst. Now, we define
the metrics for determining the periodic and a-periodic burst
scores. These metrics are applied independently on both the
seasonal and the residual constituents of both the counter types
(read, write) to obtain the respective scores.
2) The Metrics: Peak to mean (PtM): The amplitude of (a)
the burst can digress the burst score in the cases where a)
peak is not appearing frequently (peak inter-arrival time being
high); b) peak appears but doesn’t last longer (width of the
peak). Yodea considers the ratio of maximum observed value
of the burst to the mean of the counter series as the metric.
Coefficient of Variation (CV): This metric establishes the
notion of a relative variation. CV is the ratio of standard
deviation to the mean of the time series. This way, the two (b)
time series with distinct mean and peak values can be directly
Fig. 4: Two instances of periodic burst scores, occasional
compared.
predictive bursts ranked higher than constant low amplitude
Burst Percentage (Bp): This metric denotes the amount of
bursts
time spent in a burst. This further separates the two candidate
workloads on the basis of the overall spread of the bursts.
As the burst percentage increases, the ratio of peak to mean a − periodic burst scorei is computed similarly. The
reduces. The two metrics therefore balance the overall score. periodic and a-periodic burst scores are values between 0 and
Burst Inter-arrival time (BIaT): This metric favours work- 4 whereas the trend score is between 0 and 1.
loads where the bursts are well separated out vs. the cases
where the bursts and non-bursts are localized. In both these C. Finding the Dominant Pattern
cases the, PtM and Bp would favour both, and BIaT metric It is quite unlikely that a given raw counter time series
would bring in the differentiation. exhibits a trend, a periodic as well as an a-periodic burst
The final score is a combination of these four metrics and is pattern. For e.g. in Figure 2, the significant portion of the
calculated for both, the seasonal and the residual constituents. raw counter time series seems to be coming from the seasonal
3) The Score: Putting it All Together: Assume that there component. To obtain that knowledge, Yodea, extracts the most
are a total of N workloads which are under Yodea evaluation. dominant dimension for the raw counter time series (read,
Each of the four metrics are computed independently for the write). Using the variance (σ 2 ) calculation over the respective
seasonal and the residual constituents. Yodea also computes time series, weights of the constituents are obtained. For
the trend metrics via regression line fitting. example, periodic burst score weighti can be calculated
Yodea normalizes each of the four metric values by the max- as:
imum (max) value of the corresponding metric from workloads periodic burst variancei
under evaluation. When normalizing the two counter types are raw time series variancei
considered separately. For example, normalized PtM value for Variance for the trend and the a-periodic scores are sim-
a seasonal component (normalized P tM periodici ) for a ilarly calculated. The constituent with the highest weight is
workload i is: considered dominant for the corresponding raw time series.
P tM periodici
max(P tM periodici )f or i f rom 1 to N D. Workload Ranking
Similar transformation is applied to other metrics as well. Each workload is tagged with two scores, one for each of
The respective normalized metric values are between 0 and the counter types. Each score corresponds to its respective
1. This normalization by max is a conscious decision. There dominant component. Workloads are ranked on a counter type
is no absolute measure of fitness for the cloud. For an in the descending order of the score. The workload with
enterprise given the CIO budget, only a few applications can highest score is considered to be most suitable for the cloud
be migrated to the cloud. Relative ranking helps to break and for a particular pattern that it is best described for. We
the ties. The final periodic burst score for a workload i is presently do not have a way to combine the read and write
calculated as: scores.
periodic burst scorei = normalized P tM periodici +
normalized CV periodici + normalized Bp periodici + E. Related Work
normalized BIaT periodici Yodea approach has been inspired from [5]. From the tech-
nique perspective, the nearest work to ours is [6]. The approach

19
(a)
Fig. 6: Top trend pattern for a datalake workload

B. A-Periodic Bursts
Figures 5a and 5b, represent the respective high and low
scored instances of volumes in a-periodic burst category for
read operations. Infrequent high amplitude bursts are scored
(b) higher than not-so frequent low amplitude bursts. Further
analysis of the volume in Figure 5a, revealed that this volume
Fig. 5: Two instances of a-periodic burst scores, infrequent hosted a media application which had unpredictable active
high bursts are scored higher than not-so frequent low ampli- peaks. A number of other volume’s dominant scores (were
tude bursts also a-periodic) matched closely with this volume and were
also a part of the same media application.
C. Trend/High Growth
claims to provide an effective way to increase utilization in Figure 6 shows a volume which scored much higher on the
the data centers. Our work differs and also complements this trend metric with respect to the other volumes. While more
work in many ways, a couple being: 1) The mentioned paper than 98% of the other volumes indicated a flat trend. Drill
forecasts trends as it observes the facts for a limited period down analysis revealed that this volume was a part of the
of time whereas Yodea works on long term trends. 2) Yodea customer’s datalake cluster where the growth pattern is trivial.
comes up with workload rankings based on patterns. These In all the above three cases, Yodea analysis has to be
metrics are complementary and can be directly fed to the tool repeated on the compute pattern as well to confirm the oppor-
discussed in [6]. tunity and rule out the need for better storage configuration.
Also, the volumes under comparison in Figures 4b and 5b
III. VALIDATION were ranked much lower and were not recommended for the
cloud.
Our preliminary analysis is based on the read counter data
IV. C ONCLUSION AND F UTURE W ORK
available at the storage volume level which can be termed as
a workload. Our goal is to first verify the pattern classification Paper motivates the need for workload pattern analysis
technique of Yodea. A volume is a logical storage container. as one of the precursor to earmarking applications for the
Multiple volumes could be mapped to a single application. For cloud. Yodea defines statistical metrics over time series data
the purpose of validation, we selected about 4,565 volumes to extract workload pattern properties. It classifies workloads
belonging to a single customer. For each volume, the total into patterns, and recommends the top ranked as cloud fit. We
number of 4KB reads done per hour over 12 weeks from 1st would like to extend our present validation to compute patterns
of September 2017 was considered as a workload. This data and build an end-to-end recommender tool for self-help.
was obtained via NetApp® AutoSupport® [4]. R EFERENCES
[1] 5 key essentials of cloud workloads migration.
A. Periodic Bursts http://www.iamondemand.com/blog/5-key-essentials-of-cloud-workloads-
migration/.
[2] Cloud management blog. https://www.rightscale.com/blog/enterprise-
Volume in Figure 4a scores much higher than the one in cloud-strategies/identifying-workloads-cloud.
Figure 4b though both volumes are dominated by periodic [3] Insight in the age of digital disruption. https://451research.com/.
bursts. Reason being twofold: 1) higher burst amplitude when [4] NetApp, inc. proactive health management with AutoSupport. http:
//media.netapp.com/documents/wp-7027.pdf.
compared with the burst threshold and; 2) burst periods are [5] R. B. Cleveland, W. S. Cleveland, J. McRae, and I. Terpenning. Stl:
well spread out and distinct in Figure 4a. Manual analysis of A seasonal-trend decomposition procedure based on loess. Journal of
the volume in Figure 4a indicated that, this was an oracleTM Official Statistics, pages 3–73, 1990.
[6] Y. Zhang, G. Prekas, G. M. Fumarola, M. Fontoura, I. Goiri, and
database’s data volume which might be subjected to periodic R. Bianchini. History-based harvesting of spare cycles and storage
application bursts. The same analysis has to be repeated on in large-scale datacenters. In 12th USENIX Symposium on Operating
the compute demand to re-confirm the pattern. Systems Design and Implementation (OSDI 16).

20

View publication stats

You might also like