STANDARD DEVIATION
Standard Deviation is a measure of the amount of variation or dispersion of a set of values. A
low standard deviation indicates that the values tend to be close to the mean (also called
the expected value) of the set, while a high standard deviation indicates that the values are spread
out over a wider range.
Standard deviation may be abbreviated SD, and is most commonly represented in mathematical
texts and equations by the lower case Greek letter σ (sigma), for the population standard
deviation, or the Latin letter s, for the sample standard deviation.
The standard deviation of a random variable, sample, statistical population, data set,
or probability distribution is the square root of its variance. It is algebraically simpler, though in
practice less robust, than the average absolute deviation. A useful property of the standard
deviation is that, unlike the variance, it is expressed in the same unit as the data.
The standard deviation of a population or sample and the standard error of a statistic (e.g., of the
sample mean) are quite different, but related. The sample mean's standard error is the standard
deviation of the set of means that would be found by drawing an infinite number of repeated
samples from the population and computing a mean for each sample. The mean's standard error
turns out to equal the population standard deviation divided by the square root of the sample size
and is estimated by using the sample standard deviation divided by the square root of the sample
size. For example, a poll's standard error (what is reported as the margin of error of the poll) is
the expected standard deviation of the estimated mean if the same poll were to be conducted
multiple times. Thus, the standard error estimates the standard deviation of an estimate, which
itself measures how much the estimate depends on the particular sample that was taken from the
population.
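To make this relationship concrete, here is a minimal Python sketch (the sample values are hypothetical, chosen only for illustration) that estimates the standard error of the sample mean as the sample standard deviation divided by the square root of the sample size:

import math
import statistics

sample = [12, 15, 9, 14, 11, 13, 10, 16]     # hypothetical sample drawn from a population
n = len(sample)
sample_sd = statistics.stdev(sample)          # sample standard deviation (n - 1 in the denominator)
standard_error = sample_sd / math.sqrt(n)     # estimated standard error of the sample mean
print(standard_error)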
In science, it is common to report both the standard deviation of the data (as a summary statistic)
and the standard error of the estimate (as a measure of potential error in the findings). By
convention, only effects more than two standard errors away from a null expectation are considered "statistically significant", a safeguard against spurious conclusions that are really due to random sampling error.
When only a sample of data from a population is available, the term standard deviation of the
sample or sample standard deviation can refer to either the above-mentioned quantity as applied
to those data, or to a modified quantity that is an unbiased estimate of the population standard
deviation (the standard deviation of the entire population).
Overview of How to Calculate Standard Deviation
The formula for the population standard deviation (SD) is
σ = √( ∑(x − μ)² / N )
where ∑ means "sum of", x is a value in the data set, μ is the mean of the data set, and N is the number of data points in the population.
The standard deviation formula may look confusing, but it will make sense after we break it down. Here's a quick preview of the steps we're about to follow, with a short worked sketch after the list:
Step 1: Find the mean.
Step 2: For each data point, find the square of its distance to the mean.
Step 3: Sum the values from Step 2.
Step 4: Divide by the number of data points.
Step 5: Take the square root.
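As a sketch of these five steps in Python (the data values below are hypothetical and used only for illustration), the population standard deviation can be computed directly:

import math

data = [6, 2, 3, 1]                                   # hypothetical population of data points
mean = sum(data) / len(data)                          # Step 1: find the mean
squared_distances = [(x - mean) ** 2 for x in data]   # Step 2: square each point's distance to the mean
total = sum(squared_distances)                        # Step 3: sum the values from Step 2
variance = total / len(data)                          # Step 4: divide by the number of data points (N)
sd = math.sqrt(variance)                              # Step 5: take the square root
print(sd)                                             # population standard deviation, about 1.87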
MEASURES OF VARIABILITY
Variability in statistics means the deviation of scores in a group or series from their mean score. It refers to the spread of scores in the group in relation to the mean and is also known as dispersion. For instance, in a group of 10 participants who have scored differently on a mathematics test, each individual varies from the others in terms of the marks that he/she has scored. These variations can be measured with measures of variability, which measure the dispersion of the different values around the average value or average score. Variability or dispersion also means the scatter of the values in a group. High variability in a distribution means that the scores are widely spread and are not homogeneous. Low variability means that the scores are similar and homogeneous and are concentrated in the middle.
According to Minium, King and Bear (2001), measures of variability express quantitatively the extent to which the scores in a distribution scatter around or cluster together. They describe the spread of an entire set of scores; they do not specify how far a particular score diverges from the center of the group. These measures of variability also do not provide information about the shape of a distribution or the level of performance of a group.
Measures of variability fall under descriptive statistics and describe how similar a set of scores are to each other. The greater the similarity of the scores to each other, the lower the measure of variability or dispersion; the less similar the scores, the higher the measure of variability or dispersion. In general, the greater the spread of a distribution, the larger the measure of dispersion. To state it succinctly, the variation between the data values in a sample is called dispersion. The most commonly used measures of dispersion are the range and the standard deviation.
While measures of central tendency are indeed very valuable, their usefulness is rather limited. Although these measures allow us to compare two or more groups, a measure of central tendency alone is not sufficient for such a comparison, because it does not show how the individual scores are spread out. Let us take another example, similar to the one discussed in the introduction. A math teacher is interested in the performance of two groups (A and B) of his/her students. He/she gives them a test of 40 points. The marks obtained by the students of groups A and B in the test are as follows:
Marks of Group A: 5,4,38,38,20,36,17,19,18,5 (N = 10, Total = 200, Mean = 20)
Marks of Group B: 22,18,19,21,20,23,17,20,18,22 (N = 10, Total = 200, Mean = 20)
The mean score of both groups is 20, so as far as the mean goes there is no difference in the performance of the two groups. But there is a difference in the performance of the two groups in terms of how each individual student's marks vary from those of the others. For instance, the test scores of group A range from 4 to 38, while the test scores of group B range from 17 to 23.
It means that some of the students of group A are doing very well, some are doing very poorly, and the performance of some students falls at the average level. On the other hand, the performance of all the students of the second group falls at or near the average (mean) of 20. It is evident from this that measures of central tendency provide an incomplete picture of a set of data and an insufficient basis for the comparison of two or more sets of scores. Thus, in addition to a measure of central tendency, we need an index of how the scores are scattered around the center of the distribution. In other words, we need a measure of dispersion or variability. A measure of central tendency is a summary of scores, and a measure of dispersion is a summary of the spread of scores. Information about variability is often as important as that about the central tendency.
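The point can be checked with a short Python sketch using the marks listed above (the statistics module is assumed; pstdev gives the population standard deviation):

import statistics

group_a = [5, 4, 38, 38, 20, 36, 17, 19, 18, 5]
group_b = [22, 18, 19, 21, 20, 23, 17, 20, 18, 22]

for name, marks in (("A", group_a), ("B", group_b)):
    mean = statistics.mean(marks)          # both group means are 20
    spread = statistics.pstdev(marks)      # population standard deviation of the marks
    mark_range = max(marks) - min(marks)   # range of the marks
    print(name, mean, mark_range, spread)

Both groups print a mean of 20, but group A shows a far larger range and standard deviation than group B, which is exactly the information the mean alone conceals.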
The term variability or dispersion is also known as an average of the second degree, because here we consider the arithmetic mean of the deviations of the individual values from the mean. To describe a distribution adequately, therefore, we usually must provide both a measure of central tendency and a measure of variability. Measures of variability are important in statistical inference. With the help of measures of dispersion, we can know about fluctuation in random sampling. How much fluctuation will occur in random sampling? This question is fundamental to every problem in statistical inference; it is a question about variability.
The measures of variability are important for the following purposes:
1) Measures of variability are used to test the extent to which an average represents the characteristics of the data. If the variation is small, it indicates high uniformity of the values in the distribution and the average represents the characteristics of the data. On the other hand, if the variation is large, it indicates a lower degree of uniformity and an unreliable average.
2) Measures of variability help in identifying the nature and cause of variation. Such information can be useful in controlling the variation.
3) Measures of variability help in comparing the spread in two or more sets of data with respect to their uniformity or consistency.
4) Measures of variability facilitate the use of other statistical techniques such as correlation, regression analysis, and so on.
Functions of Variability
The major functions of dispersion or variability are as follows:
1) It is used for calculating other statistics such as analysis of variance, degree of correlation, regression, etc. It is also used for comparing the variability in data such as socio-economic status, income, education, etc.
2) It shows whether the average (mean/median/mode) worked out is reliable. If the variation is small, then the average calculated is reliable, but if the variation is too large, then the average may be erroneous.
3) Dispersion gives us an idea of whether the variability is adversely affecting the data and thus helps in controlling the variability.
THE QUARTILE DEVIATION (QD)
Since a large number of values in the data lie in the middle of the frequency distribution and the range depends on the extremes (outliers) of a distribution, we need another measure of variability. The quartile deviation is a measure that depends on the relatively stable central portion of a distribution. According to Garret (1966), the quartile deviation is half the scale distance between the 75th and 25th percentiles in a frequency distribution. The entire data set is divided into four equal parts and each part contains 25% of the values. According to Guilford (1963), the semi-interquartile range is one half the range of the middle 50 percent of the cases.
On the basis of the above definitions, it can be said that quartile deviation is half the distance between Q1 and Q3.
Inter-Quartile Range (IQR): The range computed for the middle 50% of the distribution is the interquartile range. The upper quartile (Q3) and the lower quartile (Q1) are used to compute the IQR, which is Q3 – Q1. The IQR is not affected by extreme values.
Semi-Interquartile Range (SIQR) or Quartile Deviation (QD): Half of the IQR is called the semi-interquartile range. The SIQR is also called the quartile deviation or QD. Thus, QD is computed as:
QD = (Q3 – Q1) / 2
That is, quartile deviation is obtained by dividing the IQR by 2. Quartile deviation is an absolute measure of dispersion and is expressed in the same unit as the scores.
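As a minimal sketch of these computations in Python (the scores below are hypothetical; statistics.quantiles is used to obtain the quartile cut points):

import statistics

scores = [5, 7, 8, 9, 10, 11, 12, 13, 14, 20]    # hypothetical scores
q1, q2, q3 = statistics.quantiles(scores, n=4)   # lower quartile, median, upper quartile
iqr = q3 - q1                                    # inter-quartile range, Q3 - Q1
qd = iqr / 2                                     # quartile deviation (semi-interquartile range)
print(q1, q3, iqr, qd)

Note that different quartile conventions (exclusive versus inclusive) give slightly different cut points; the quartile deviation is always half of whatever IQR results.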
Quartile deviation is closely related to the median because the median is responsive to the number of scores lying below it rather than to their exact positions, and Q1 and Q3 are defined in the same manner. The median and quartile deviation have common properties. Neither the median nor the quartile deviation is affected by extreme values. In a symmetrical distribution, the two quartiles Q1 and Q3 are at equal distances from the median, that is, Median – Q1 = Q3 – Median. Thus, like the median, quartile deviation covers exactly 50 per cent of the observed values in the data. In a normal distribution, the quartile deviation is called the probable error or PE. If the distribution has open-ended classes, then quartile deviation is the only measure of variability that is reasonable to compute.
In an asymmetric or skewed distribution, Q1 and Q3 are not equidistant from Q2 (the median). In such a distribution, the median within the IQR moves towards the skewed tail. The degree and direction of skewness can be assessed from the quartile deviation and the relative distances between Q1, Q2 and Q3.
Quartile deviation is also related to the peakedness (kurtosis) of a distribution. The smaller the quartile deviation, the greater the concentration of scores in the middle of the distribution, giving a distribution with a high peak and a narrow body. Widely dispersed scores produce a large quartile deviation and thus a long IQR; such a distribution has a low peak and a broad body.
Merits and Limitations of Quartile Deviation
From the explanation in the above section, it becomes clear that quartile deviation is easy to
understand and compute.
1) Quartile deviation is a better measure of dispersion than the range because it takes into account 50 per cent of the data, unlike the range, which is based on only two values of the data, that is, the highest value and the lowest value.
2) Quartile deviation is not affected by extreme scores, since it does not consider the lowest 25 per cent and the highest 25 per cent of the data.
3) Quartile deviation is the only measure of dispersion which can be computed from a frequency distribution with open-end classes.
Despite the major merits of quartile deviation, there are limitations to it as well.
1) The value of quartile deviation is based on the middle 50 per cent of values; it is not based on all the observations. Thus, it is not regarded as a stable measure of variability.
2) The value of quartile deviation is affected by sampling fluctuation.
3) The value of quartile deviation is not affected by the distribution of the individual values within the middle 50 per cent of observed values.
Uses of Quartile Deviation
Quartile deviation is appropriate:
1) When the distribution contains a few very extreme scores.
2) When the median is the measure of central tendency.
3) When our primary interest is to determine the concentration around the median.
WHAT IS A DECILE?
A decile is a quantitative method of splitting up a set of ranked data into 10 equally large
subsections. This type of data ranking is performed as part of many academic
and statistical studies in the finance and economics fields. The data may be ranked from largest
to smallest values, or vice versa.
A decile, which has 10 categorical buckets, may be contrasted with percentiles, which have 100; quartiles, which have four; and quintiles, which have five.
Understanding a Decile
In descriptive statistics, a decile is used to categorize large data sets from the highest to lowest
values, or vice versa. Like the quartile and the percentile, a decile is a form of a quantile that
divides a set of observations into samples that are easier to analyze and measure.
While quartiles are three data points that divide a set of observations into four equal groups or quarters, deciles are nine data points that divide a data set into 10 equal parts. When an
analyst or statistician ranks data and then splits them into deciles, they do so in an attempt to
discover the largest and smallest values by a given metric.
A decile is usually used to assign decile ranks to a data set. A decile rank arranges the data in
order from lowest to highest and is done on a scale of one to 10 where each successive number
corresponds to an increase of 10 percentage points. In other words, there are nine decile points.
The 1st decile, or D1, is the point that has 10% of the observations below it, D2 has 20% of the
observations below it, D3 has 30% of the observations falling below it, and so on.
How to Calculate a Decile
There is no single way of calculating a decile; however, it is important to be consistent with whatever formula you decide to use. One simple calculation of the kth decile is:
Dk = the [k(n + 1) / 10]th value in the ordered data set, where k = 1, 2, ..., 9 and n is the number of observations.
From this formula, it follows that the 5th decile is the median, since 5(n + 1) / 10 is the data point that represents the halfway point of the distribution.
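As a rough Python sketch of this rule (the data values are hypothetical; linear interpolation is used when k(n + 1)/10 does not land on a whole-numbered position):

def decile(sorted_data, k):
    # Return the kth decile using the k * (n + 1) / 10 position rule.
    n = len(sorted_data)
    pos = k * (n + 1) / 10                 # ordinal position of the kth decile
    lower = int(pos)                       # whole part of the position
    frac = pos - lower                     # fractional part, used for interpolation
    if lower < 1:
        return sorted_data[0]
    if lower >= n:
        return sorted_data[-1]
    return sorted_data[lower - 1] + frac * (sorted_data[lower] - sorted_data[lower - 1])

data = sorted([15, 20, 35, 40, 50, 55, 60, 70, 80, 90, 95])   # hypothetical data, n = 11
print(decile(data, 5))   # 5th decile = 55, the median (6th of the 11 ordered values)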
DEFINITION OF A PERCENTILE
In statistics, percentiles are used to understand and interpret data. The nth percentile of a set of
data is the value at which n percent of the data is below it. In everyday life, percentiles are used
to understand values such as test scores, health indicators, and other measurements. For example,
an 18-year-old male who is six and a half feet tall is in the 99th percentile for his height. This
means that of all the 18-year-old males, 99 percent have a height that is equal to or less than six
and a half feet. An 18-year-old male who is only five and a half feet tall, on the other hand, is in
the 16th percentile for his height, meaning only 16 percent of males his age are the same height
or shorter.
What Percentile Means
Percentiles should not be confused with percentages. The latter is used to express fractions of a
whole, while percentiles are the values below which a certain percentage of the data in a data set
is found. In practical terms, there is a significant difference between the two. For example, a
student taking a difficult exam might earn a score of 75 percent. This means that he correctly answered three out of every four questions. A student who scores in the 75th percentile,
however, has obtained a different result. This percentile means that the student earned a higher
score than 75 percent of the other students who took the exam. In other words, the percentage
score reflects how well the student did on the exam itself; the percentile score reflects how well
he did in comparison to other students.
Percentile Formula
Percentiles for the values in a given data set can be calculated using the formula:
n = (P/100) x N
where N = number of values in the data set, P = percentile, and n = ordinal rank of a given value
(with the values in the data set sorted from smallest to largest). For example, take a class of 20
students that earned the following scores on their most recent test: 75, 77, 78, 78, 80, 81, 81, 82,
83, 84, 84, 84, 85, 87, 87, 88, 88, 88, 89, 90. These scores can be represented as a data set with
20 values: {75, 77, 78, 78, 80, 81, 81, 82, 83, 84, 84, 84, 85, 87, 87, 88, 88, 88, 89, 90}.
We can find the score that marks the 20th percentile by plugging in known values into the
formula and solving for n:
n = (20/100) x 20
n=4
The fourth value in the data set is the score 78. This means that 78 marks the 20th percentile; of
the students in the class, 20 percent earned a score of 78 or lower.
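The same calculation can be sketched in Python (a simple illustration of the n = (P/100) x N ordinal-rank rule, rounding up when the rank is not a whole number):

import math

scores = sorted([75, 77, 78, 78, 80, 81, 81, 82, 83, 84,
                 84, 84, 85, 87, 87, 88, 88, 88, 89, 90])

def percentile_value(sorted_scores, p):
    # Value at the pth percentile using the ordinal-rank rule n = (P/100) * N.
    rank = (p / 100) * len(sorted_scores)  # ordinal rank of the percentile
    rank = max(1, math.ceil(rank))         # round up to the next whole rank
    return sorted_scores[rank - 1]         # convert the 1-based rank to a 0-based index

print(percentile_value(scores, 20))        # 78, the 20th percentile for this class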
Deciles and Common Percentiles
Given a data set that has been ordered in increasing magnitude, the median, first quartile, and third quartile can be used to split the data into four pieces. The first quartile is the point at which
one-fourth of the data lies below it. The median is located exactly in the middle of the data set,
with half of all the data below it. The third quartile is the place where three-fourths of the data
lies below it.
The median, first quartile, and third quartile can all be stated in terms of percentiles. Since half of
the data is less than the median, and one-half is equal to 50 percent, the median marks the 50th
percentile. One-fourth is equal to 25 percent, so the first quartile marks the 25th percentile. The
third quartile marks the 75th percentile.
Besides quartiles, a fairly common way to arrange a set of data is by deciles. Each decile
includes 10 percent of the data set. This means that the first decile is the 10th percentile, the
second decile is the 20th percentile, etc. Deciles provide a way to split a data set into more pieces
than quartiles without splitting the set into 100 pieces as with percentiles.
Applications of Percentiles
Percentile scores have a variety of uses. Anytime that a set of data needs to be broken into
digestible chunks, percentiles are helpful. They are often used to interpret test scores—such as
SAT scores—so that test-takers can compare their performance to that of other students. For
example, a student might earn a score of 90 percent on an exam. That sounds pretty impressive;
however, it becomes less so when a score of 90 percent corresponds to the 20th percentile,
meaning only 20 percent of the class earned a score of 90 percent or lower.
Another example of percentiles is in children's growth charts. In addition to giving a physical
height or weight measurement, pediatricians typically state this information in terms of a
percentile score. A percentile is used in order to compare the height or weight of a child to other
children of the same age. This allows for an effective means of comparison so that parents can
know if their child's growth is typical or unusual.
https://en.wikipedia.org/wiki/Standard_deviation
https://www.khanacademy.org/math/statistics-probability/summarizing-quantitative-data/variance-standard-deviation-population/a/calculating-standard-deviation-step-by-step
https://egyankosh.ac.in/bitstream/123456789/65183/3/Unit-4.pdf
Definition of a Percentile in Statistics (thoughtco.com)
BICOL COLLEGE
DARAGA, ALBAY
GRADUATE SCHOOL
1st Sem AY 2023-2024
RESEARCH PAPER
IN
APPLIED STATISTICS
Prepared by:
Keizha Mae T. Agudelo
Submitted to:
Remeline E. Lucilo - Bausa Ed.D
Professor