0% found this document useful (0 votes)

51 views32 pages

Statical Data 1

This document discusses central tendency and dispersion in descriptive statistics. It defines: - Central tendency as the mean, median, and mode, which describe the center of a distribution. - Dispersion as variance and standard deviation, which measure how spread out the data is around the central tendency. - Types of distributions as normal, skewed, and the characteristics of each. It provides examples and formulas for calculating measures of central tendency like the mean and measures of dispersion like variance and standard deviation.

Uploaded by

Irma Estela Marie Esteban

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

51 views32 pages

Statical Data 1

Uploaded by

Irma Estela Marie Esteban

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

Central Tendency & Dispersion

 Types of Distributions: Normal, Skewed

 Central Tendency: Mean, Median, Mode
 Dispersion: Variance, Standard Deviation

This PowerPoint has been ripped off from I don’t know where,
and improved upon by yours truly…Mrs. T  Enjoy!
DESCRIPTIVE STATISTICS
are concerned with describing the
characteristics of frequency distributions

 Where is the center?

 What is the range?
 What is the shape [of the
distribution]?
Frequency Table What is the range of test scores?
A: 30 (95 minus 65)
Test Scores When calculating mean, one
must divide by what number?
Observation Frequency
(scores) (# occurrences) A: 16 (total # occurrences)
65 1
70 2
75 3
80 4
85 3
90 2
95 1
Frequency Distributions

3
Frequency
(# occurrences) 2
1

65 70 75 80 85 90 95

Test Score
Normally Distributed Curve
Voter Turnout in 50 States - 1980
Skewed Distributions

We say the distribution is skewed We say the distribution is skewed

to the left  (when the “tail” is to the right  (when the “tail” is
to the left) to the right)
Voter Turnout in 50 States - 1940

Q: Is this distribution, positively or negatively skewed?

A: Negatively
Q: Would we say this distribution is
skewed to the left or right?
A: Left (skewed in direction of tail)
Characteristics - Normal Distribution
 It is symmetrical - half the values are to one side of the
center (mean), and half the values are on the other side.

 The distribution is single-peaked, not bimodal or multi-

modal.

 Most of the data values will be “bunched” near the center

portion of the curve. As values become more extreme
they become less frequent with the “outliers” being found
at the “tails” of the distribution and are few in number.

 The Mean, Median, and Mode are the same in a perfectly

symmetrical normal distribution.

 Percentage of values that occur in any range of the curve

can be calculated using the Empirical Rule.
Empirical Rule
Summarizing Distributions
Two key characteristics of a frequency distribution
are especially important when summarizing data
or when making a prediction:
 CENTRAL TENDENCY
 What is in the “middle”?
 What is most common?
 What would we use to predict?
 DISPERSION
 How spread out is the distribution?
 What shape is it?
The MEASURES of Central Tendency

 3 measures of central tendency are commonly

used in statistical analysis - MEAN, MEDIAN,
and MODE.
 Each measure is designed to represent a
“typical” value in the distribution.
 The choice of which measure to use depends on
the shape of the distribution (whether normal or
skewed).
Mean - Average
 Most common measure of central tendency.
 Is sensitive to the influence of a few extreme
values (outliers), thus it is not always the most
appropriate measure of central tendency.
 Best used for making predictions when a
distribution is more or less normal (or symmetrical).
 Symbolized as:
 x for the mean of a sample

 μ for the mean of a population

Finding the Mean

 Formula for Mean: X = (Σ x)

N
 Given the data set: {3, 5, 10, 4, 3}

X = (3 + 5 + 10 + 4 + 3) = 25
5 5
X =5
Find the Mean
Q: 85, 87, 89, 91, 98, 100
A: 91.67
Median: 90

Q: 5, 87, 89, 91, 98, 100

A: 78.3 (Extremely low score lowered the Mean)
Median: 90 (The median remained unchanged.)
Median
 Used to find middle value (center) of a distribution.
 Used when one must determine whether the data
values fall into either the upper 50% or lower 50%
of a distribution.
 Used when one needs to report the typical value of
a data set, ignoring the outliers (few extreme
values in a data set).
 Example: median salary, median home prices in a market
 Is a better indicator of central tendency than mean
when one has a skewed distribution.
To compute the median
 first you order the values of X from low to high:
 85, 90, 94, 94, 95, 97, 97, 97, 97, 98
 then count number of observations = 10.
 When the number of observations are even,
average the two middle numbers to calculate the
median.
 This example, 96 is the median
(middle) score.
Median
 Find the Median
4 5 6 6 7 8 9 10 12
 Find the Median
5 6 6 7 8 9 10 12
 Find the Median
5 6 6 7 8 9 10 100,000
Mode
 Used when the most typical (common) value is
desired.
 Often used with categorical data.
 The mode is not always unique. A distribution can
have no mode, one mode, or more than one mode.
When there are two modes, we say the distribution is
bimodal.
EXAMPLES:
a) {1,0,5,9,12,8} - No mode
b) {4,5,5,5,9,20,30} – mode = 5

c) {2,2,5,9,9,15} - bimodal, mode 2 and 9

Measures of Variability
 Central Tendency doesn’t tell us
everything Dispersion/Deviation/Spread
tells us a lot about how the data values
are distributed.

 We are most interested in:

 Standard Deviation (σ) and

 Variance (σ2)
Why can’t the mean tell us everything?

 Mean describes the average outcome.

 The question becomes how good a

representation of the distribution is the mean?
How good is the mean as a description of
central tendency -- or how accurate is the mean
as a predictor?

 ANSWER -- it depends on the shape of the

distribution. Is the distribution normal or
skewed?
Dispersion
 Once you determine that the data of interest is
normally distributed, ideally by producing a
histogram of the values, the next question to ask
is: How spread out are the values about the
mean?
 Dispersion is a key concept in statistical thinking.
 The basic question being asked is how much do
the values deviate from the Mean? The more
“bunched up” around the mean the better your
ability to make accurate predictions.
Means
 Consider these means for
hours worked day each day:
X = {7, 8, 6, 7, 7, 6, 8, 7} X = {12, 2, 0, 14, 10, 9, 5, 4}
X = (7+8+6+7+7+6+8+7)/8 X = (12+2+0+14+10+9+5+4)/8
X=7 X=7
Notice that all the data values The mean is the same for this data
are bunched near the mean. set, but the data values are more
Thus, 7 would be a pretty spread out.
good prediction of the average So, 7 is not a good prediction of
hrs. worked each day. hrs. worked on average each day.
Data is more spread out, meaning it has greater variability.

Below, the data is grouped closer to the center, less spread out,
or smaller variability.
 How well does the mean represent the values
in a distribution?
 The logic here is to determine how much
spread is in the values. How much do the
values "deviate" from the mean? Think of the
mean as the true value, or as your best
guess. If every X were very close to the
Mean, the Mean would be a very good
predictor.
 If the distribution is very sharply peaked then
the mean is a good measure of central
tendency and if you were to use the Mean to
make predictions you would be correct or
very close much of the time.
What if scores are widely
distributed?

The mean is still your best measure and your

best predictor, but your predictive power
would be less.

How do we describe this?

 Measures of variability
 Mean Absolute Deviation (You used in Math1)

 Variance (We use in Math 2)

 Standard Deviation (We use in Math 2)
Mean Absolute Deviation
The key concept for describing normal distributions
and making predictions from them is called
deviation from the mean.

We could just calculate the average distance between

each observation and the mean.
 We must take the absolute value of the distance,
otherwise they would just cancel out to zero!
Formula: |X X |
 n
i
Mean Absolute Deviation:
An Example
Data: X = {6, 10, 5, 4, 9, 8} X = 42 / 6 = 7

X – Xi Abs. Dev.
1. Compute X (Average)
7–6 1 2. Compute X – X and take
7 – 10 3 the Absolute Value to get
Absolute Deviations
7–5 2 3. Sum the Absolute
7–4 3 Deviations
4. Divide the sum of the
7–9 2
absolute deviations by N
7–8 1
Total: 12 12 / 6 = 2
What Does it Mean?
 On Average, each value is two units away
from the mean.

Is it Really that Easy?

 No!
 Absolute values are difficult to manipulate algebraically
 Absolute values cause enormous problems for calculus (Discontinuity)
 We need something else…
Variance and Standard Deviation
 Instead of taking the absolute value, we square
the deviations from the mean. This yields a
positive value.

 This will result in measures we call the Variance

and the Standard Deviation
Sample - Population -
s Standard Deviation σ Standard Deviation
s2 Variance σ2 Variance
Calculating the Variance and/or
Standard Deviation
Formulae:

Variance: Standard Deviation:

s 
2  (X  Xi ) 2
s
(X  X ) i
2

N N
Examples Follow . . .
Example:
Data: X = {6, 10, 5, 4, 9, 8}; N=6
Mean:
X X X (X  X ) 2

X
 X

42
7
6 -1 1 N 6
10 3 9 Variance:
5 -2 4
s 
2  ( X  X ) 2


28
 4.67
4 -3 9 N 6
9 2 4 Standard Deviation:
8 1 1 s  s 2  4.67  2.16
Total: 42 Total: 28

Univariate and Bivariate Statistical Analysis
No ratings yet
Univariate and Bivariate Statistical Analysis
63 pages
Unit Iii
No ratings yet
Unit Iii
152 pages
LECTURE 03 Measure of Central Tendency by Mahreen
No ratings yet
LECTURE 03 Measure of Central Tendency by Mahreen
35 pages
Basic Statistics
No ratings yet
Basic Statistics
24 pages
Descriptive Statistics Overview
No ratings yet
Descriptive Statistics Overview
38 pages
Introduction to Statistics Concepts
No ratings yet
Introduction to Statistics Concepts
32 pages
Biostatistics: Central Tendency & Dispersion
No ratings yet
Biostatistics: Central Tendency & Dispersion
69 pages
Topic 3
No ratings yet
Topic 3
49 pages
Central Tendency and Dispersion in Statistics
No ratings yet
Central Tendency and Dispersion in Statistics
23 pages
Biostatistics 5
No ratings yet
Biostatistics 5
28 pages
Measures of Central Tendency Explained
No ratings yet
Measures of Central Tendency Explained
33 pages
Unit 5 8614
No ratings yet
Unit 5 8614
39 pages
Understanding Measures of Central Tendency
No ratings yet
Understanding Measures of Central Tendency
54 pages
Unit 4 & 5 8614
No ratings yet
Unit 4 & 5 8614
58 pages
Chapter 22 Summarising Analysing Data Lyst1744453682255
No ratings yet
Chapter 22 Summarising Analysing Data Lyst1744453682255
18 pages
Understanding Descriptive Statistics
100% (3)
Understanding Descriptive Statistics
7 pages
Measures of Central Tendency Explained
No ratings yet
Measures of Central Tendency Explained
105 pages
Understanding Descriptive Statistics
No ratings yet
Understanding Descriptive Statistics
37 pages
Understanding Descriptive Statistics
No ratings yet
Understanding Descriptive Statistics
74 pages
Understanding Central Tendency Measures
No ratings yet
Understanding Central Tendency Measures
46 pages
Data Presentation Techniques Explained
No ratings yet
Data Presentation Techniques Explained
104 pages
Central Tendency and Dispersion Explained
No ratings yet
Central Tendency and Dispersion Explained
9 pages
Biostatistics: Averages & Dispersion
No ratings yet
Biostatistics: Averages & Dispersion
85 pages
Central Tendency and Variation Measures
No ratings yet
Central Tendency and Variation Measures
29 pages
Class3 - Central Tendency Dispersion - Post
No ratings yet
Class3 - Central Tendency Dispersion - Post
31 pages
Assessment1 FIN
No ratings yet
Assessment1 FIN
9 pages
Absences vs Final Grades Analysis
No ratings yet
Absences vs Final Grades Analysis
174 pages
Unit 1 - Business Statistics & Analytics
No ratings yet
Unit 1 - Business Statistics & Analytics
25 pages
Understanding Central Tendency Measures
No ratings yet
Understanding Central Tendency Measures
5 pages
Understanding Descriptive Statistics
No ratings yet
Understanding Descriptive Statistics
93 pages
Measures of Central Tendency Explained
No ratings yet
Measures of Central Tendency Explained
18 pages
Key Statistical Measures Explained
No ratings yet
Key Statistical Measures Explained
40 pages
Measures of Central Tendency Explained
No ratings yet
Measures of Central Tendency Explained
26 pages
Central Tendency and Dispersion Measures
No ratings yet
Central Tendency and Dispersion Measures
57 pages
Central Tendency and Variability Measures
No ratings yet
Central Tendency and Variability Measures
31 pages
Data Presentation Techniques Explained
100% (1)
Data Presentation Techniques Explained
40 pages
Central Tendency and Variability Explained
No ratings yet
Central Tendency and Variability Explained
38 pages
Central Location and Variability Measures
No ratings yet
Central Location and Variability Measures
49 pages
Understanding Frequency Distribution in Statistics
No ratings yet
Understanding Frequency Distribution in Statistics
13 pages
Descriptive Data Analysis in Education
No ratings yet
Descriptive Data Analysis in Education
34 pages
Understanding Test Scores and Statistics
No ratings yet
Understanding Test Scores and Statistics
39 pages
Statistics for Beginners
No ratings yet
Statistics for Beginners
42 pages
3 Measures of Central Tendency
No ratings yet
3 Measures of Central Tendency
30 pages
Data Presentation Techniques Explained
No ratings yet
Data Presentation Techniques Explained
40 pages
Choosing Central Tendency Measures
No ratings yet
Choosing Central Tendency Measures
5 pages
Central Tendency vs. Dispersion Explained
No ratings yet
Central Tendency vs. Dispersion Explained
28 pages
Measure of Central Tendency and Variability
No ratings yet
Measure of Central Tendency and Variability
73 pages
Understanding Central Tendency Measures
No ratings yet
Understanding Central Tendency Measures
32 pages
Statistical Measures and Their Applications
100% (1)
Statistical Measures and Their Applications
149 pages
Day 3 Educational Statistics
No ratings yet
Day 3 Educational Statistics
37 pages
Understanding Statistics: Types & Applications
No ratings yet
Understanding Statistics: Types & Applications
62 pages
Understanding Central Tendency & Variation
No ratings yet
Understanding Central Tendency & Variation
35 pages
Slides For IT SKill
No ratings yet
Slides For IT SKill
63 pages
Chapt3 Overheads
No ratings yet
Chapt3 Overheads
8 pages
Measure of Central Tendency
No ratings yet
Measure of Central Tendency
14 pages
Statistical Methods in Social Sciences
No ratings yet
Statistical Methods in Social Sciences
69 pages
Quantitative Methods Module
No ratings yet
Quantitative Methods Module
22 pages
Ordered Pairs Distance Formula Test
No ratings yet
Ordered Pairs Distance Formula Test
2 pages
Research in The Teaching of Mathematics
No ratings yet
Research in The Teaching of Mathematics
36 pages
Sequences and Series Test With TOS
No ratings yet
Sequences and Series Test With TOS
8 pages
Conic Sections Test
No ratings yet
Conic Sections Test
1 page
Mathematics Investigations and Modeling
No ratings yet
Mathematics Investigations and Modeling
32 pages
Limits and Limit Theorems Test
No ratings yet
Limits and Limit Theorems Test
1 page
Sum of Terms in Arithmetic Sequences
No ratings yet
Sum of Terms in Arithmetic Sequences
11 pages
Lesson 1.3.2
No ratings yet
Lesson 1.3.2
8 pages
Understanding Number Sequences
No ratings yet
Understanding Number Sequences
11 pages
Sequence and Series Practice Questions
No ratings yet
Sequence and Series Practice Questions
4 pages
Sum of Arithmetic Sequence Terms
No ratings yet
Sum of Arithmetic Sequence Terms
7 pages
Lesson 1.2
No ratings yet
Lesson 1.2
18 pages
Mathematics Department Annual Learning Action Cell Plan
100% (1)
Mathematics Department Annual Learning Action Cell Plan
6 pages
WW 1-3
No ratings yet
WW 1-3
2 pages
Geometric Series and Sequence Problems
No ratings yet
Geometric Series and Sequence Problems
2 pages
Synthetic Division and Theorems Lesson
No ratings yet
Synthetic Division and Theorems Lesson
24 pages
Area of Sectors in Real Life Applications
No ratings yet
Area of Sectors in Real Life Applications
4 pages
Title - The 5E Model of Lesson Planning
No ratings yet
Title - The 5E Model of Lesson Planning
7 pages
Hand-Wrist Method for Skeletal Maturation
No ratings yet
Hand-Wrist Method for Skeletal Maturation
27 pages
Machine Learning for Aviation Inventory Optimization
No ratings yet
Machine Learning for Aviation Inventory Optimization
59 pages
Math 7 Q4 Weeks6to9 MELCs5to10 MOD2
No ratings yet
Math 7 Q4 Weeks6to9 MELCs5to10 MOD2
91 pages
Mathematics 10 Summative Test Questions
No ratings yet
Mathematics 10 Summative Test Questions
2 pages
Midterm So Ls
No ratings yet
Midterm So Ls
9 pages
GLY 413 Geostatistics and Data Analysis (1) - 021606
No ratings yet
GLY 413 Geostatistics and Data Analysis (1) - 021606
22 pages
CBSE Class 10 Math Solutions 2024
No ratings yet
CBSE Class 10 Math Solutions 2024
16 pages
Data Preprocessing
No ratings yet
Data Preprocessing
60 pages
Math Project (Aniruddha)
No ratings yet
Math Project (Aniruddha)
11 pages
SS 2864b Final Exam Overview
No ratings yet
SS 2864b Final Exam Overview
13 pages
영어II 적중예상문제 2회 심화
No ratings yet
영어II 적중예상문제 2회 심화
14 pages
Formula Sheet
No ratings yet
Formula Sheet
166 pages
Unit1 P&s 1
No ratings yet
Unit1 P&s 1
32 pages
Syllabus and Sample Questions With A Model Question Paper: Kneus Scholastic Aptitude Test (Ksat - Ii)
No ratings yet
Syllabus and Sample Questions With A Model Question Paper: Kneus Scholastic Aptitude Test (Ksat - Ii)
59 pages
GE 7 Lecture Notes For CBE (Statistics and Data Interpretation)
No ratings yet
GE 7 Lecture Notes For CBE (Statistics and Data Interpretation)
11 pages
Understanding Normal Distribution
No ratings yet
Understanding Normal Distribution
21 pages
7CCMMS61 Statistics For Data Analysis: Francisco Javier Rubio Department of Mathematics
No ratings yet
7CCMMS61 Statistics For Data Analysis: Francisco Javier Rubio Department of Mathematics
13 pages
Interval Estimation and Confidence Levels
No ratings yet
Interval Estimation and Confidence Levels
20 pages
Detailed Lesson Plan: Quartiles Explained
No ratings yet
Detailed Lesson Plan: Quartiles Explained
13 pages
Exstat
No ratings yet
Exstat
132 pages
Manm526 W1
No ratings yet
Manm526 W1
38 pages
Carpenter Pro Tabulate
No ratings yet
Carpenter Pro Tabulate
18 pages
B.Sc. IT Programme Syllabus Overview
No ratings yet
B.Sc. IT Programme Syllabus Overview
34 pages
Time: 3 Hrs (Maximum Marks: 100) : Cbse Class Xi
No ratings yet
Time: 3 Hrs (Maximum Marks: 100) : Cbse Class Xi
5 pages
Books Doubtnut Question Bank
No ratings yet
Books Doubtnut Question Bank
127 pages
Best Algebra 1 Eoc
No ratings yet
Best Algebra 1 Eoc
6 pages
Resistant vs Non-Resistant Statistics
No ratings yet
Resistant vs Non-Resistant Statistics
3 pages
Grade 11 Statistics Lesson Plan
No ratings yet
Grade 11 Statistics Lesson Plan
4 pages
Chapter 3
No ratings yet
Chapter 3
4 pages
SP7 Past Papers
No ratings yet
SP7 Past Papers
100 pages

Statical Data 1

Uploaded by

Statical Data 1

Uploaded by

Central Tendency & Dispersion

 Types of Distributions: Normal, Skewed

 Where is the center?

We say the distribution is skewed We say the distribution is skewed

Q: Is this distribution, positively or negatively skewed?

 The distribution is single-peaked, not bimodal or multi-

 Most of the data values will be “bunched” near the center

 The Mean, Median, and Mode are the same in a perfectly

 Percentage of values that occur in any range of the curve

 3 measures of central tendency are commonly

 μ for the mean of a population

 Formula for Mean: X = (Σ x)

Q: 5, 87, 89, 91, 98, 100

c) {2,2,5,9,9,15} - bimodal, mode 2 and 9

 We are most interested in:

 Mean describes the average outcome.

 The question becomes how good a

 ANSWER -- it depends on the shape of the

The mean is still your best measure and your

How do we describe this?

 Variance (We use in Math 2)

We could just calculate the average distance between

Is it Really that Easy?

 This will result in measures we call the Variance

Variance: Standard Deviation:

You might also like