Professional Documents
Culture Documents
www.proschoolonline.com
Introduction
Statistical methods are powerful tools to analyze data and draw useful
conclusions from the same
Statistical methods are used to summarize return distributions eg. Asset
returns, earnings growth rates, stock prices, etc.
Following four properties can be explored
❑ Where are the returns centered (Central Tendency)
❑ How far are the returns dispersed from their center (Dispersion)
❑ Whether the returns distribution is symmetrically shaped or
lopsided (Skewness)
❑ Whether extreme outcomes are likely (Kurtosis)
• Sorts data and ranks the data points, however it does not differentiate between successively
Position ranked data points
• Example - Mutual fund ratings issued by MorningStar or ValueResearchonline. They provide
in a Ordinal
performance in star ratings. No clear performance differentiation is mentioned between a fund
match Scale
receiving 5 stars versus the one getting 4 stars
1st 2nd 5th
• Ranks data and the difference between the scale values is equal. It does not have a true zero
value
Thermometer • Example - Celsius scale for measurement of temperature is an interval scale. Difference between
Interval
19o and 20o is just 1 degree which is the same for any other scale values However 0o does not
Scale
mean an absence of temperature
• Ranks data and the difference between the scale values is equal. Also it has a true zero value.
Money • Example - Rate of returns is measured using a ratio scale. If our money doubles → rate of return is
Ratio 100%, if our money doesn’t grow → rate of return is 0%. A perfect zero, an actual absence of
Scale returns.
Hedge Fund
• Nominal Scale
classification types
Australia 9.00%
Belgium 4.80%
Canada 7.70%
Denmark 6.20%
France 6.30%
Germany 8.80%
Ireland 7.00%
Italy 6.80%
Japan 9.30%
Netherlands 7.70%
South Africa 9.10%
Spain 5.80%
Sweden 9.90%
Switzerland 6.90%
United Kingdom 7.60%
United States 8.70%
Histogram
5
Return Absolute Cumulative Relative Cumulative
4 4 4
Interval Frequency Absolute Frequency Frequency relative Frequency 4
4% to 5% 1 1 6% 6.25%
5% to 6% 1 2 6% 12.50% 3
2
6% to 7% 4 6 25% 37.50% 2
7% to 8% 4 10 25% 62.50% 1 1
8% to 9% 2 12 13% 75.00% 1
9% to 10% 4 16 25% 100.00%
0
16
4% to 5% to 6% to 7% to 8% to 9% to
5% 6% 7% 8% 9% 10%
Measures of Central
The 1-month returns for last 6 Measures of Location The 1-month returns for last 6
Tendency
months on the stock of Coffee months on the stock of Coffee
Day Enterprises are: Day Enterprises are:
8%, 12%, -2%, 5% and 15% 8%, 12%, -2%, 5% and 15%
Measures the
Where is the data location or
So the average or mean return is: centered? distribution of So the 25%th returns are located
7.60% between -2% and 5% (we’ll see
data the calculations in forthcoming
slides)
Definition:
❑ The arithmetic mean is the sum of the observations divided by the
number of observations.
❑ It can be calculated for both the population as well as the sample.
To Remember
Populations Mean:
❑ Always denoted by ‘µ’ (pronounced as ‘mew’)
❑ Formula
N
∑ Xi
µ = i=1
Sample Mean:
❑ Denoted by (read as X-bar)
N
❑ Formula X
n
X = i=1
∑ Xi
n
Australia 9.00%
Belgium 4.80%
Canada 7.70%
Denmark 6.20%
France 6.30%
Germany 8.80%
Ireland 7.00%
Italy 6.80%
Japan Mean = 7.60%
9.30%
Netherlands 7.70%
South Africa 9.10%
Spain 5.80%
Sweden 9.90%
Switzerland 6.90%
United Kingdom 7.60%
United States 8.70%
Definition:
❑ The median is the value of middle item of a set of items that has
been sorted into ascending or descending order.
Formula:
❑ For odd number of observations
➢ Median occupies = (n + 1)/2 position
❑ For even number of observations
➢ Median occupies the mean of the n/2 and (n + 2)/2 position
Australia 9.00%
Australia 9.00%
Belgium 4.80%
Belgium 4.80%
Canada 7.70%
Canada 7.70%
Denmark 6.20%
Denmark 6.20%
France 6.30%
France 6.30%
Germany 8.80%
Germany 8.80%
Ireland 7.00%
Ireland 7.00%
Italy 6.80%
Italy 6.80%
Japan 9.30%
Japan 9.30%
Netherlands 7.70%
Netherlands 7.70%
South Africa 9.10%
South Africa 9.10%
Spain 5.80%
Spain 5.80%
Sweden 9.90%
Sweden 9.90%
Switzerland 6.90%
Switzerland 6.90%
United Kingdom 7.60%
United Kingdom 7.60%
United States 8.70%
❑ Definition:
➢ The mode is the most frequently occurring value in a distribution
➢ Distributions are unimodal when there is a single most frequently occurring value and
multimodal if there is more than one frequently occurring value
Australia 9.00%
1 Belgium 4.80%
Belgium 4.80%
2 Spain 5.80% Unimodal
Canada 7.70%
Denmark 6.20% 3 Denmark 6.20%
France 6.30% 4 France 6.30%
Germany 8.80% 5 Italy 6.80%
Ireland 7.00% 6 Switzerland 6.90%
Italy 6.80% 7 Ireland 7.00%
Japan 9.30% 8 United Kingdom 7.60%
Netherlands 7.70% 9 Canada 7.70%
South Africa
Spain
9.10%
5.80%
10 Netherlands Mode = 7.70%
7.70%
Sweden 9.90%
11 United States 8.70% Bimodal
Switzerland 6.90% 12 Germany 8.80%
United Kingdom 7.60% 13 Australia 9.00%
United States 8.70% 14 South Africa 9.10%
15 Japan 9.30%
Source: CFA Institute course material - Level 1 16 Sweden 9.90%
Source: Dimson, Marsh and Staunton (2002)
Portfolio
Year Equities Bonds Year We Equities Wb Bonds
Return
2005 -5.0% 7% 2005 50% -5.0% 50% 7.0% 1.00%
2006 12.0% 2% 2006 50% 12.0% 50% 2.0% 7.00%
2007 22.5% -1% 2007 50% 22.5% 50% -1.0% 10.75%
2008 -24.0% 9% 2008 50% -24.0% 50% 9.0% -7.50%
2009 50% 18.5% 50% 6.0% 12.25%
2009 18.5% 6%
Solution
2005 -5.0% 7%
2006 12.0% 2%
2007 22.5% -1%
2008 -24.0% 9%
2009 18.5% 6%
AM 4.80% 4.60%
GM 3.26% 4.54%
metric
Where, Xi > 0, I = 1,2,3,…,n
2 HM finds its application in cost To Remember
HM =
1 1 averaging or value averaging ✓AM >= GM >= HM
10 14
= 11.67 km/hour such as SIP investing wherein (if all values are equal: AM = GM = HM else AM >
investments made over an GM > HM)
❑ Aisha has invested $4,000 in the debt and $6,000 in the equity market. The debt
investments gave a return of 8.5% and the equity investments gave a return of 11%.
Find the return that she realized from her total investment.
❑ Her sister, Soniya has invested equal capital in two stocks. One stock is priced at $60
and the other is priced at $40. What is the average price she paid per stock?
❑ Find the better fund based on their five year growth rate. Their growth rates per year
is given below:
▪Quantiles are used in portfolio performance evaluation as well as in investment strategy development and
research
▪Investment analysts use quantiles every day to rank performance—for example, the performance of
portfolios
▪The performance of investment managers is often characterized in terms of the quartile in which they fall
relative to the performance of their peer group of managers
▪For example Dividing data into quantiles based on some characteristic allows analysts to evaluate the impact
of that characteristic on a quantity of interest
Range MAD
Solution
Equities
Year Equities Bonds
Equities Mean = 4.8%
2005 -5.0% 7%
Between -24% and 22.5% i.e. Summation of absolute deviations:
2006 12.0% 2%
46.5%
2007 22.5% -1% |-5 – 4.8| + |12 – 4.8| + |22.5 – 4.8| + |-
2008 -24.0% 9% 24 – 4.8| + |18.5 – 4.8|
2009 18.5% 6%
Bonds = 77.2
Between -1% and 9% i.e. 10% Range 46.5% 10.0% Dividing by total number of observations:
MAD 15.4% 3.3% 77.2 / 5
= 15.44%
√
deviation: - this number will tell her that
n To Remember
on an average the calorie intake of
∑ (Xi - µ )2
σ = i=1 ✓Standard deviation
various individuals varies by what unit. N
is commonly used
I would say standard deviation is a mean
of deviations from mean! measure of risk.
Example
Compute the variance and standard deviation for the following set of observations
Real Equity Returns (1900-2000)
Australia 9.00%
Belgium 4.80%
Canada 7.70%
Denmark 6.20%
France 6.30%
Germany 8.80%
Ireland 7.00%
Italy 6.80%
Japan 9.30%
Netherlands 7.70%
South Africa 9.10%
Spain 5.80%
Sweden 9.90%
Switzerland 6.90%
United Kingdom 7.60%
United States 8.70%
Australia 9.00%
Belgium 4.80%
Canada 7.70%
Denmark 6.20%
France 6.30%
Germany 8.80%
Ireland 7.00%
Italy 6.80%
Japan 9.30%
Netherlands 7.70%
South Africa 9.10%
Spain 5.80%
Sweden 9.90%
Switzerland 6.90%
United Kingdom 7.60%
United States 8.70%
2
Variance 0.02%
Standard Deviation 1.44%
√
n for sample variance, which will
✓Once we have the sample mean,
∑ 2 provide a lower value of
(Xi - X ) there are n – 1 independent
s = i=1 denominator and higher value of
n-1 deviations from it. variance, leading to an unbiased
estimation!
❑ Standard deviation/variance concepts can be extended to compute and analyze the risk-reward
proportions. Some of the extensions are Chebyshev’s inequality and coefficient of variation.
❑ Chebyshev’s inequality:
❑ The proportion of values within k standard deviations of the arithmetic mean is at least 1 – 1/k2 ,
For all k > 1
❑ Predict the minimum number of values that will lie within 3 standard deviations of the mean from
a total of 900 values
❑ Proportions from Chebyshev’s Inequlity
Interval around
k Proportions
the sample mean
1.25 X ± 1.25s 36%
1.5 X ± 1.5s 56%
2 X ± 2s 75%
3 X ± 3s 89%
4 X ± 4s 94%
❑ Risk and Return cannot be seen in isolation, hence risk-adjusted returns are calculated
and analyzed for a proper reasoning. Few tools are Coefficient of Variation/ Sharpe Ratio
❑ Coefficient of Variation is the ratio of standard deviation of a set of observations to their
mean value
➢ CV = s / X
❑ Inverse of CV explains the returns per unit of risk
❑ Example
➢ Compute the coefficient of variation
➢ Rank the asset classes from most risky to least risky based on coefficient of variation
To Remember
Asset Class Arithmetic Mean Standard Deviation
Equities Small Cap 19% 40% ✓Lower the value of CV, lower is
Equities Large Cap 13% 20% the risk exhibited by the asset
Equities Mid Cap 16% 30%
class
❑ Solution
CV Rank
Asset Class Arithmetic Mean Standard Deviation
=40/19 = 2.105 1
Equities Small Cap 19% 40%
Equities Large Cap 13% 20% =20/13 = 1.538 3
Equities Mid Cap 16% 30% =30/16 = 1.875 2
❑ Ranking of the asset class in terms of riskiness is done based on highest CV to lowest CV
(higher the CV means more risk per unit of return)
Example
❑ Compute the CV and Sharpe ratio values for equities based on data provided for
the previous 5 years.
❑ Consider the risk free rate of 2%
Year Equities
2005 -5.0%
2006 12.0%
2007 22.5%
2008 -24.0%
2009 18.5%
Solution
❑ Risk free rate of 2%
Year Equities
2005 -5.0%
2006 12.0%
2007 22.5%
2008 -24.0%
2009 18.5%
Sharpe Ratio
= 4.80% - 2.00%
Mean 4.8% =0.1456
Std Dev 19% 0.19
CV 4.01
Sharpe Ratio 0.15
✓ Normal Curve
Formula:
Describe the relative locations of the mean, median, and mode for a
uni-modal, non-symmetrical distribution
❑ Remember that mean is pulled towards the tail whereas the mode
occurs at the peak.
Mo Md M
To Remember
Formula
❑ Sample Excess Kurtosis
1.Published ratings on stocks ranging from 1 (strong sell) to 5 (strong buy) are examples of which
measurement scale?
A)Ordinal
B)Interval
C)Nominal
2.Two portfolios have unimodal return distributions. Portfolio 1 has a skewness of 0.77, and Portfolio 2 has
a skewness of –1.11.
Which of the following is correct?
A)For Portfolio 1, the median is less than the mean.
B)For Portfolio 1, the mode is greater than the mean.
C)For Portfolio 2, the mean is greater than the median
1. A is correct. Ordinal scales sort data into categories that are ordered with respect to some characteristic
and may involve numbers to identify categories but do not assure that the differences between scale
values are equal. The buy rating scale indicates that a stock ranked 5 is expected to perform better than a
stock ranked 4, but it tells us nothing about the performance difference between stocks ranked 4 and 5
compared with the performance difference between stocks ranked 1 and 2, and so on
2.A is correct. Portfolio 1 is positively skewed, so the mean is greater than the median, which is greater
than the mode
3.A is correct. Any descriptive measure of a population characteristic is referred to as a parameter
4. B is correct. The FTSE Eurotop 100 represents a sample of all European stocks. It is a subset of the
population of all European stocks.