0% found this document useful (0 votes)

906 views10 pages

Standard Deviation Formulas Explained

The document defines and provides formulas for calculating standard deviation, a measure of how spread out numbers are in a data set. It explains that standard deviation is calculated by taking the square root of the average of the squared differences from the mean. The document provides step-by-step examples of calculating standard deviation using all data points from a population and using a sample of data to estimate the population's standard deviation. It notes that sampling provides an approximation that is easier to obtain but less accurate than using the entire population.

Uploaded by

Dhaval Shah

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

906 views10 pages

Standard Deviation Formulas Explained

Uploaded by

Dhaval Shah

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

Standard Deviation Formulas

Standard Deviation
The Standard Deviation is a measure of how spread out numbers are.
Its symbol is (the Greek letter sigma).
This is the formula for Standard Deviation:

To calculate the standard deviation:

1. Work out the Mean (the simple average of the numbers)

2. Then for each number: subtract the Mean and square the result

3. Then work out the mean of those squared differences.

4. Take the square root of that and you are done!

The Formula Explained

Example: Sam has 20 Rose Bushes.
The number of flowers on each bush is
9, 2, 5, 4, 12, 7, 8, 11, 9, 3, 7, 4, 12, 5, 4, 10, 9, 6, 9, 4
Work out the Standard Deviation.

Step 1. Work out the mean

In the formula above (the greek letter "mu") is the mean of all our values ...

Example: 9, 2, 5, 4, 12, 7, 8, 11, 9, 3, 7, 4, 12, 5, 4, 10, 9, 6, 9, 4

The mean is (9+2+5+4+12+7+8+11+9+3+7+4+12+5+4+10+9+6+9+4) / 20 = 140/20 = 7
So:
=7

Step 2. Then for each number: subtract the Mean and square the result
This is the part of the formula that says:

So what is xi ? They are the individual x values 9, 2, 5, 4, 12, 7, etc...

In other words x1 = 9, x2 = 2, x3 = 5, etc.
So it says "for each value, subtract the mean and square the result", like this

Example (continued):
(9 - 7)2 = (2)2 = 4
(2 - 7)2 = (-5)2 = 25
(5 - 7)2 = (-2)2 = 4
(4 - 7)2 = (-3)2 = 9
(12 - 7)2 = (5)2 = 25
(7 - 7)2 = (0)2 = 0
(8 - 7)2 = (1)2 = 1
... etc ...

Step 3. Then work out the mean of those squared differences.

To work out the mean, add up all the values then divide by how many.
The notation for "add up all the values" is "Sigma":
The handy Sigma Notation allows us to sum up as many terms as we want:

Sigma Notation
We want to add up all the values from 1 to N, where N=20 in our case because there are 20 values:

Example (continued):

Which means: Sum all values from (x1-7)2 to (xN-7)2

We already calculated (x1-7)2=4 etc. in the previous step, so just sum them up:
= 4+25+4+9+25+0+1+16+4+16+0+9+25+4+9+9+4+1+4+9 = 178
But that isn't the mean yet, we need to divide by how many, which is simply done by multiplying by
"1/N":

Example (continued):

Mean of squared differences = (1/20) 178 = 8.9

(This value is called the "Variance")

Step 4. Take the square root of that and you are done!
Example (concluded):

= (8.9) = 2.983...

Sample Standard Deviation

Sometimes your data is only a sample of the whole population.

Example: Sam has 20 rose bushes, but what if Sam only counted the flowers on 6 of
them?
The "population" is all 20 rose bushes,
and the "sample" is the 6 he counted. Let us say they were:
9, 2, 5, 4, 12, 7
We can still estimate the Standard Deviation.
But when you use the sample as an estimate of the whole population, the Standard Deviation
formula changes to this:
The formula for Sample Standard Deviation:

The important change is "N-1" instead of "N" (which is called "Bessel's correction").
The symbols also change to reflect that we are working on a sample instead of the whole population:

The mean is now x (for sample mean) instead of (the population mean),

And we use s (for Sample Standard Deviation) instead of .

But that does not affect the calculations. Only N-1 instead of N changes the calculations.
OK, let us now calculate the Sample Standard Deviation:

Step 1. Work out the mean

Example 2: Using sampled values 9, 2, 5, 4, 12, 7
The mean is (9+2+5+4+12+7) / 6 = 39/6 = 6.5
So:
x = 6.5

Step 2. Then for each number: subtract the Mean and square the result
Example 2 (continued):
(9 - 6.5)2 = (2.5)2 = 6.25
(2 - 6.5)2 = (-4.5)2 = 20.25
(5 - 6.5)2 = (-1.5)2 = 2.25
(4 - 6.5)2 = (-2.5)2 = 6.25
(12 - 6.5)2 = (5.5)2 = 30.25
(7 - 6.5)2 = (0.5)2 = 0.25

Step 3. Then work out the mean of those squared differences.

To work out the mean, add up all the values then divide by how many.
But hang on ... we are calculating the Sample Standard Deviation, so instead of dividing by how
many (N), we are going to divide by N-1

Example 2 (continued):
Sum = 6.25 + 20.25 + 2.25 + 6.25 + 30.25 + 0.25 = 65.5
Divide by N-1: (1/5) 65.5 = 13.1
(This value is called the "Sample Variance")

Step 4. Take the square root of that and you are done!
Example 2 (concluded):

s = (13.1) = 3.619...

Comparing
When we used the whole population we got: Mean = 7, Standard Deviation = 2.983...
When we used the sample we got: Sample Mean = 6.5, Sample Standard Deviation = 3.619...
Our Sample Mean was wrong by 7%, and our Sample Standard Deviation was wrong by 21%.

Why Would We Take a Sample?

Mostly because it is easier and cheaper.
Imagine you want to know what the whole country thinks ... you can't ask millions of people, so
instead you ask maybe 1,000 people.
There is a nice quote (supposed to be by Samuel Johnson):
"You don't have to eat the whole ox to know that the meat is tough."
This is the essential idea of sampling. To find out information about the population (such as mean and
standard deviation), we do not need to look at all members of the population; we only need a sample.
But when we take a sample, we lose some accuracy.

Mean, Median and Mode

We use statistics such as the mean, median and mode to obtain information about a
population from our sample set of observed values.
Mean

The mean (or average) of a set of data values is the sum of all of the data values divided by the
number of data values. That is:

Example 1

The marks of seven students in a mathematics test with a maximum possible mark of 20 are given
below:
15 13 18 16 14 17 12
Find the mean of this set of data values.
Solution:

So, the mean mark is 15.

Symbolically, we can set out the solution as follows:

So, the mean mark is 15.

Median
The median of a set of data values is the middle value of the data set when it has been
arranged in ascending order. That is, from the smallest value to the highest value.
Example 2

The marks of nine students in a geography test that had a maximum possible mark of 50 are given
below:
47 35 37 32 38 39 36 34 35
Find the median of this set of data values.
Solution:

Arrange the data values in order from the lowest value to the highest value:
32

The fifth data value, 36, is the middle value in this arrangement.

Note:

In general:

If the number of values in the data set is even, then the median is the average of the two middle
values.

Example 3

Find the median of the following data set:

12 18 16 21 10 13 17 19
Solution:

Arrange the data values in order from the lowest value to the highest value:
10 12 13 16 17 18 19 21
The number of values in the data set is 8, which is even. So, the median is the average of the two
middle values.

Alternative way:

There are 8 values in the data set.

The fourth and fifth scores, 16 and 17, are in the middle. That is, there is no one middle value.

Note:

Half of the values in the data set lie below the median and half lie above the
median.

The median is the most commonly quoted figure used to measure property

prices. The use of the median avoids the problem of the mean property price
which is affected by a few expensive properties that are not representative of the
general property market.

Mode
The mode of a set of data values is the value(s) that occurs most often.

The mode has applications in printing. For example, it is important to print more of the most popular
books; because printing different books in equal numbers would cause a shortage of some books and
an oversupply of others.
Likewise, the mode has applications in manufacturing. For example, it is important to manufacture
more of the most popular shoes; because manufacturing different shoes in equal numbers would cause
a shortage of some shoes and an oversupply of others.

Example 4

Find the mode of the following data set:

Solution:

The mode is 48 since it occurs most often.

Note:

It is possible for a set of data values to have more than one mode.

If there are two data values that occur most frequently, we say that the set of
data values is bimodal.

If there is no data value or data values that occur most frequently, we say that
the set of data values has no mode.

Analysing Data

The mean, median and mode of a data set are collectively known as measures of central tendency as
these three measures focus on where the data is centred or clustered. To analyse data using the mean,
median and mode, we need to use the most appropriate measure of central tendency. The following
points should be remembered:

The mean is useful for predicting future results when there are no extreme values
in the data set. However, the impact of extreme values on the mean may be
important and should be considered. E.g. The impact of a stock market crash on
average investment returns.

The median may be more useful than the mean when there are extreme values in
the data set as it is not affected by the extreme values.

The mode is useful when the most common item, characteristic or value of a data
set is required.

Understanding Standard Deviation Explained
No ratings yet
Understanding Standard Deviation Explained
3 pages
Understanding One-Way ANOVA Basics
No ratings yet
Understanding One-Way ANOVA Basics
22 pages
Understanding Central Tendency Measures
No ratings yet
Understanding Central Tendency Measures
27 pages
Skewness and Kurtosis in Normal Distributions
No ratings yet
Skewness and Kurtosis in Normal Distributions
14 pages
Chapter 6 The Normal Distribution Other Continuous Distributions
No ratings yet
Chapter 6 The Normal Distribution Other Continuous Distributions
66 pages
Lecture - 8 - Statistics - Standard Deviation - Dr. (MRS) - Neelam Yadav
No ratings yet
Lecture - 8 - Statistics - Standard Deviation - Dr. (MRS) - Neelam Yadav
31 pages
Statistics For Business and Economics: Describing Data: Numerical
No ratings yet
Statistics For Business and Economics: Describing Data: Numerical
55 pages
Central Tendency: Mean, Median, Mode
No ratings yet
Central Tendency: Mean, Median, Mode
18 pages
EC2303 Statistics Formula Sheet
No ratings yet
EC2303 Statistics Formula Sheet
8 pages
Understanding Variability in Statistics
No ratings yet
Understanding Variability in Statistics
31 pages
Anova
100% (1)
Anova
17 pages
Steps in Hypothesis Testing Explained
No ratings yet
Steps in Hypothesis Testing Explained
6 pages
Measurement Error in Psychological Testing
No ratings yet
Measurement Error in Psychological Testing
29 pages
48 Standard Deviation
No ratings yet
48 Standard Deviation
12 pages
Study Guide - Frequency Distributions and Graphs
No ratings yet
Study Guide - Frequency Distributions and Graphs
9 pages
SEM:Confirmatory Factor Analysis (CFA)
No ratings yet
SEM:Confirmatory Factor Analysis (CFA)
28 pages
One-Sample Hypothesis Testing Guide
No ratings yet
One-Sample Hypothesis Testing Guide
23 pages
Statistics in Research: Parametric vs Nonparametric
No ratings yet
Statistics in Research: Parametric vs Nonparametric
8 pages
Creating Histograms for Data Analysis
100% (1)
Creating Histograms for Data Analysis
11 pages
Lecture 9 Measures of Variability
0% (1)
Lecture 9 Measures of Variability
29 pages
Abstract - APA 7th Edition
No ratings yet
Abstract - APA 7th Edition
17 pages
Mediation Analysis: Estimation & Testing
No ratings yet
Mediation Analysis: Estimation & Testing
77 pages
Understanding Variance and Standard Deviation
No ratings yet
Understanding Variance and Standard Deviation
13 pages
Frequency Distributions & Graphs Guide
No ratings yet
Frequency Distributions & Graphs Guide
28 pages
PR2 Lesson 7 Hypothesis Testing
100% (1)
PR2 Lesson 7 Hypothesis Testing
59 pages
Understanding Measures of Variability
No ratings yet
Understanding Measures of Variability
11 pages
Module 3B - T Tests For A Population Mean With JASP Output
No ratings yet
Module 3B - T Tests For A Population Mean With JASP Output
53 pages
Reporting Statistics in APA Format
100% (1)
Reporting Statistics in APA Format
3 pages
06 - Independent Sample T Test
No ratings yet
06 - Independent Sample T Test
11 pages
Understanding Descriptive Statistics
100% (1)
Understanding Descriptive Statistics
32 pages
Statistics for Measurement Students
No ratings yet
Statistics for Measurement Students
31 pages
Introduction to Basic Statistics
No ratings yet
Introduction to Basic Statistics
53 pages
Measuring The Occurrence of Disease: Dr. Elijah Kakande MBCHB, MPH Department of Public Health
No ratings yet
Measuring The Occurrence of Disease: Dr. Elijah Kakande MBCHB, MPH Department of Public Health
25 pages
Understanding Regression Analysis Basics
No ratings yet
Understanding Regression Analysis Basics
25 pages
Levine Smume6 01
100% (1)
Levine Smume6 01
14 pages
Hypothesis Testing - Analysis of Variance (ANOVA)
No ratings yet
Hypothesis Testing - Analysis of Variance (ANOVA)
14 pages
One-Way ANOVA: What Is This Test For?
No ratings yet
One-Way ANOVA: What Is This Test For?
21 pages
Sampling Techniques and Distributions Guide
No ratings yet
Sampling Techniques and Distributions Guide
36 pages
Understanding Sampling Bias in Statistics
No ratings yet
Understanding Sampling Bias in Statistics
11 pages
3 Unit
No ratings yet
3 Unit
9 pages
Measures of Variability
No ratings yet
Measures of Variability
27 pages
3 Dispersion Skewness Kurtosis PDF
No ratings yet
3 Dispersion Skewness Kurtosis PDF
42 pages
Correlation and Regression Analysis Guide
No ratings yet
Correlation and Regression Analysis Guide
24 pages
Probability and Statistics Symbols Table
No ratings yet
Probability and Statistics Symbols Table
3 pages
Central Tendency for Stat Students
No ratings yet
Central Tendency for Stat Students
10 pages
Confidence Intervals for Unknown σ
No ratings yet
Confidence Intervals for Unknown σ
12 pages
Statistical Software for Analysts
100% (1)
Statistical Software for Analysts
5 pages
Understanding Statistics in Psychology
No ratings yet
Understanding Statistics in Psychology
31 pages
At The End of The Lesson The Students Should Be Able To
No ratings yet
At The End of The Lesson The Students Should Be Able To
19 pages
Module 1 - Descriptive Statistics PDF
No ratings yet
Module 1 - Descriptive Statistics PDF
34 pages
Understanding Central Tendency Measures
No ratings yet
Understanding Central Tendency Measures
60 pages
10.introduction To Hypothesis Testing
No ratings yet
10.introduction To Hypothesis Testing
2 pages
Sampling Basics in Statistics
No ratings yet
Sampling Basics in Statistics
16 pages
Ge 4 - Midterm Learning Material
No ratings yet
Ge 4 - Midterm Learning Material
93 pages
Independent Samples T-Test Explained
No ratings yet
Independent Samples T-Test Explained
5 pages
5.probability Distributions
No ratings yet
5.probability Distributions
63 pages
Describing Data: Centre Mean Is The Technical Term For What Most People Call An Average. in Statistics, "Average"
No ratings yet
Describing Data: Centre Mean Is The Technical Term For What Most People Call An Average. in Statistics, "Average"
4 pages
Measures of Central Tendency Explained
No ratings yet
Measures of Central Tendency Explained
18 pages
Mean, Variance, and Standard Deviation Calculations
No ratings yet
Mean, Variance, and Standard Deviation Calculations
16 pages
Intro Summary of Statistics PLTW Slide Show
No ratings yet
Intro Summary of Statistics PLTW Slide Show
47 pages
Design and Implementation of FPGA Based High Speed Data Acquisition Systems For Embedded Applications
No ratings yet
Design and Implementation of FPGA Based High Speed Data Acquisition Systems For Embedded Applications
52 pages
Propeller Owners Manual and Log Book - Hartzell PDF
No ratings yet
Propeller Owners Manual and Log Book - Hartzell PDF
88 pages
420D Backhoe Loader FDP00001-07198 (MACHINE) POWERED BY 3054 Engine BASIC ENGINE Caterpillar SIS
No ratings yet
420D Backhoe Loader FDP00001-07198 (MACHINE) POWERED BY 3054 Engine BASIC ENGINE Caterpillar SIS
1 page
jss1 Basic Sci
No ratings yet
jss1 Basic Sci
3 pages
Cartesian Coordinate System Treasure Hunt
No ratings yet
Cartesian Coordinate System Treasure Hunt
5 pages
Ni-Ti vs. Stainless Steel in Root Canal Prep
No ratings yet
Ni-Ti vs. Stainless Steel in Root Canal Prep
8 pages
Configuration Guide - Interface Management (V300R007C00 - 02)
No ratings yet
Configuration Guide - Interface Management (V300R007C00 - 02)
117 pages
Aakash JEE Advanced 2020 Mock Test 1 Answers
100% (1)
Aakash JEE Advanced 2020 Mock Test 1 Answers
34 pages
Petroleum Refining Reaction Types
100% (1)
Petroleum Refining Reaction Types
27 pages
Binary-Coded Decimal Guide
No ratings yet
Binary-Coded Decimal Guide
15 pages
T.L.E. 8 Computer Hardware Exam
No ratings yet
T.L.E. 8 Computer Hardware Exam
2 pages
Approach To: Renormalization-Group Interacting
No ratings yet
Approach To: Renormalization-Group Interacting
64 pages
MS-ASU-043 Rev03 METHOD STATEMENT FOR INSTALLATION OF ROCKWOOL PROROX GR...
100% (1)
MS-ASU-043 Rev03 METHOD STATEMENT FOR INSTALLATION OF ROCKWOOL PROROX GR...
8 pages
Quiz - Excel Assignment
No ratings yet
Quiz - Excel Assignment
7 pages
Mohammad Yan Fresen V Bull 2016253563
No ratings yet
Mohammad Yan Fresen V Bull 2016253563
8 pages
Wireless Network Security Jan 2024
No ratings yet
Wireless Network Security Jan 2024
8 pages
ATP Delivery and Tracking Overview
100% (2)
ATP Delivery and Tracking Overview
16 pages
Pin Name Alternate Functions Additional Functions
No ratings yet
Pin Name Alternate Functions Additional Functions
10 pages
Overview of FORTRAN 77 Features
100% (1)
Overview of FORTRAN 77 Features
28 pages
Interest Rates and Bond Valuation
No ratings yet
Interest Rates and Bond Valuation
24 pages
Sorting Algorithms Guide
No ratings yet
Sorting Algorithms Guide
92 pages
Mid Term Syllabus VII 24-25
No ratings yet
Mid Term Syllabus VII 24-25
5 pages
Timer (Schneider, Multi 9 IH, 24 Hours, 230V, P.n-15365)
No ratings yet
Timer (Schneider, Multi 9 IH, 24 Hours, 230V, P.n-15365)
1 page
Oxalate Intake and The Risk For Nephrolithiasis: Clinical Research
No ratings yet
Oxalate Intake and The Risk For Nephrolithiasis: Clinical Research
7 pages
Traffic Sign Recognition Guide
No ratings yet
Traffic Sign Recognition Guide
18 pages
Manual Testing Notes
No ratings yet
Manual Testing Notes
140 pages
2D Animation Course Overview
No ratings yet
2D Animation Course Overview
3 pages
Metric Spaces and Their Properties
No ratings yet
Metric Spaces and Their Properties
99 pages
Digital Computer Organization & Design
No ratings yet
Digital Computer Organization & Design
29 pages
Educational Math Research Insights
No ratings yet
Educational Math Research Insights
32 pages

Standard Deviation Formulas Explained

Uploaded by

Standard Deviation Formulas Explained

Uploaded by

Standard Deviation Formulas

To calculate the standard deviation:

1. Work out the Mean (the simple average of the numbers)

3. Then work out the mean of those squared differences.

4. Take the square root of that and you are done!

The Formula Explained

Step 1. Work out the mean

Example: 9, 2, 5, 4, 12, 7, 8, 11, 9, 3, 7, 4, 12, 5, 4, 10, 9, 6, 9, 4

So what is xi ? They are the individual x values 9, 2, 5, 4, 12, 7, etc...

Step 3. Then work out the mean of those squared differences.

Which means: Sum all values from (x1-7)2 to (xN-7)2

Mean of squared differences = (1/20) 178 = 8.9

Sample Standard Deviation

And we use s (for Sample Standard Deviation) instead of .

Step 1. Work out the mean

Step 3. Then work out the mean of those squared differences.

Why Would We Take a Sample?

Mean, Median and Mode

So, the mean mark is 15.

Symbolically, we can set out the solution as follows:

So, the mean mark is 15.

Find the median of the following data set:

There are 8 values in the data set.

Find the mode of the following data set:

The mode is 48 since it occurs most often.

You might also like