Professional Documents
Culture Documents
Measures of
variation,
skewness
and kurtosis
The coefficient of relative variation (CRV) is the
STANDARD DEVIATION of a variable normed by dividing It wouldn’t make any sense to compare the mean
by its mean. This measure is used to give a sense of how returns achieved by two different managers
large the standard deviation is. without explicitly considering the levels of risk that
they have incurred. A measure of relative variation
How to Use Relative Variation to Find the Uncertainty provides a number that considers both the risk
Associated with a Data Set and the return of a portfolio, so that it can be
determined which portfolio is riskier relative to
Relative variation refers to the spread of a sample or a the return.
population as a proportion of the mean. Relative
You can use several different types of measures of
variation is useful because it can be expressed as a
relative variation. One of the most popular is known
percentage, and is independent of the units in which
as the coefficient of variation (CV), which indicates
the sample or population data are measured.
how “spread out” the members of a sample or
population are relative to the mean. The coefficient
For example, you can use a measure of relative of variation is measured as a percentage, so it’s
variation to compare the uncertainty or variation independent of the units in which the mean and
associated with the temperature in two different standard deviation are measured. This enables the
countries, even if one country uses Fahrenheit relative variation of different samples or populations
temperatures and the other uses Celsius to be compared directly to each other.
temperatures. As another example, a measure of
relative variation can be useful for comparing the
returns earned by two portfolio managers.
Comparative Prices Charged by Superior
For example, the coefficient of variation can Accounting and Data Services
express the risk of an investment portfolio per unit Price Superior Accounting Data Services
of return. This means you can compare the
Mean price ($/hour) $200 $175
performance of different portfolios to see which
one offers the least amount of risk per unit of Standard deviation $80 $75
return. ($/hour)
Here’s the formula for finding the
coefficient of variation for either samples
or populations: Based on this data, the coefficient of variation for the
prices charged by each firm are
x 100%
The mean heat capacity of a certain substance is The average sales per month of MAI computer company is 59
80 degree Celsius with a standard deviation of 5 million pesos with a SD of 8 million pesos: the average sales
degree Celsius. The mean calorie per gram is volume of the company is 580 units per month, with SD of 65
0.638 and the standard deviation is 0.0125. units. Compare the variation of sales and sales volume of the
compare the variation of the two. company.
x 100%
x 100%
x 100%
%
x 100%
%
Since the coefficient of variation of the calorie Since the coefficient of variation of sales per
per grams is larger . The calorie per gram is month is larger . The sales per month is more
more variable than temperature. variable than unit per month.
Skewness
What Is Skewness?
Skewness refers to a distortion or asymmetry that deviates from
the symmetrical bell curve, or normal distribution, in a set of
data. If the curve is shifted to the left or to the right, it is said to
be skewed. Skewness can be quantified as a representation of
the extent to which a given distribution varies from a normal
distribution. A normal distribution has a skew of zero, while a
lognormal distribution, for example, would exhibit some degree
of right-skew.
As you might already know, India has more than 50% of its Why is Skewness Important?
population below the age of 25 and more than 65% below
the age of 35. If you’ll plot the distribution of the age of Now, we know that the skewness is the measure of
the population of India, you will find that there is a hump asymmetry and its types are distinguished by the side on
on the left side of distribution and the right side is which the tail of probability distribution lies. But why is
comparatively planar. In other words, we can say that knowing the skewness of the data important?
there’s a skew towards the end, right?
First, linear models work on the assumption that the
Skewness is the measure of the asymmetry of an ideally
distribution of the independent variable and the
symmetric probability distribution and is given by the third
target variable are similar. Therefore, knowing about
standardized moment. If that sounds way too complex.
the skewness of data helps us in creating better
In simple words, skewness is the measure of how much linear models.
the probability distribution of a random variable deviates Secondly, let’s take a look at the below distribution. It
from the normal distribution. is the distribution of horsepower of cars:
A positively skewed distribution is the distribution with the In this case, it was very easy to tell if the data is
tail on its right side. The value of skewness for a positively skewed or not. But what if we have something
skewed distribution is greater than zero. As you might like this:
have already understood by looking at the figure, the value
of mean is the greatest one followed by median and then Here, Q2-Q1 and Q3-Q2 are equal and yet the
by mode. distribution is positively skewed. The keen-eyed
So why is this happening? among you will have noticed the length of the
Well, the answer to that is that the skewness of the right whisker is greater than the left whisker.
distribution is on the right; it causes the mean to be From this, we can conclude that the data is
greater than the median and eventually move to the right. positively skewed.
Also, the mode occurs at the highest frequency of the So, the first step is always to check the equality
distribution which is on the left side of the median. of Q2-Q1 and Q3-Q2. If that is found equal, then
Therefore, mode < median < mean. we look for the length of whiskers.
Understanding Negatively Skewed Distribution
Number of variables, n = 2 + 3 + 5 + 6 + 4= 20
https://www.investopedia.com/terms/s/skewness.asp
https://www.analyticsvidhya.com/blog/2020/07/what-is-
skewness-statistics/