Professional Documents
Culture Documents
F R E Q U E N C Y DISTRIBUTION
DISTRIBUTIONS
The first step in turning data into information is to create a
distribution. The most primitive way to present a distribution is to
simply list, in one column, each value that occurs in the population
and, in the next column, the number of times it occurs. It is
customary to list the values from lowest to highest. This simple listing
is called a frequency distribution. A more elegant way to turn data
into information is to draw a graph of the distribution. Customarily,
the values that occur are put along the horizontal axis and the
frequency of the value is on the vertical axis.
Frequency tells you how often something happened. The
frequency of an observation tells you the number of times the
observation occurs in the data. For example, in the following list of
numbers, the frequency of the number 9 is 5 (because it occurs 5
times):
1, 2, 3, 4, 6, 9, 9, 8, 5, 1, 1, 9, 9, 0, 6, 9.
DESCRIPTIVE STATISTICS
STATISTICS O F LOCATION
ARITHMETIC MEAN :
The most common statistic of location is familiar to everyone. It
is the arithmetic mean, commonly called the mean or average. The
mean is calculated by summing all the individual observations or
items of a sample and dividing this s u m by the number of items in
the sample.
For instance, a s the result of a gas
analysis in a respirometer an investigator
the following four readings of oxygen
obtains
percentages and s u m s them:
14.9
10.8
12.3
23.3
S u m =61.3
The investigator calculates the mean oxygen percentage as the
s u m of the four items divided by the number of items. Thus, the
average oxygen percentage is:
COURSECODE: NCM-6308/NCM-5308 BIOSTATISTICS/PRELIMS
only the first three items, we would have written O n the other
hand, had we wished to s u m all of them except the first one, we would
have written With some exceptions it is desirable to omit
subscripts and superscripts, which generally add to the apparent
complexity of the formula and, when they are unnecessary, distract
the student's attention from the important relations expressed by the
formula.
COURSECODE: NCM-6308/NCM-5308 BIOSTATISTICS/PRELIMS
The third symbol might be interpreted as meaning, "Sum the Yi's over
all available values of i". This is a frequently used notation. The next
with “n” as a superscript, tells u s to s u m “n” items of Y; note that the
“i” subscript of the Y has been dropped as unnecessary. Finally. the
simplest notation is shown at the right. It merely says “sum the Y's”.
This will be the form we shall use most frequently: if a summation
sign precedes a variable, the summation will be understood to be over
“n” items (all the items in the sample) unless subscripts or
superscripts specifically tell u s otherwise.
MEDIAN
The median is used to calculate variables that are measured
with ordinal, interval, or ratio scales. It is obtained by arranging the
data from the lowest to the highest and then picking the number(s)
in the middle. If the total number of data points is an odd number,
the median is usually the middle number. If the numbers are even,
the median is obtained by summing the two numbers in the middle
and dividing them by two to get the mean.
Median is mostly used when there are a few data points that are
different. For example, when calculating the median of students
entering college, there may be a section of students who are older
than the rest. Using the mean may distort the values since it will
show that the average age of students entering college to be higher,
whereas using the median can give a truer reflection of the situation.
For example, let’s find the median age of students
entering
college for the first time, given the following values of ten
students:
COURSECODE: NCM-6308/NCM-5308 BIOSTATISTICS/PRELIMS
MODE
The mode refers to the value represented by the greatest
number of individuals. The mode is the most occurring number
within a data distribution. It shows what number or value is the
highest in number or most common in the data distribution. The
mode is used for any type of data.
For example, let’s take the example of a college class with about
40 students. The students are given a test exam, graded, and then
grouped on a scale of 1-5, starting with students with the lowest
number of marks.
Cluster 1: 5
Cluster 2: 7
Cluster 3: 13
Cluster 4: 12
Cluster 5: 3
M E AS U R E S O F DISPERSION
RANGE:
One simple measure of dispersion is the range, which is defined
as the difference between the largest and the smallest items in a
sample. Since the range is a measure of the span of the variates along
the scale of the variable, it is in the same units as the original
measurements. The range is clearly affected by even a single outlying
value and for this reason is only a rough estimate of the dispersion
of all the items in the sample.
QUARTILES:
Quartiles are the values that divide a list of numbers into quarters:
INTERQUARTILE RANGE
STANDARD DEVIATION :
relative to the mean. If the data points are further from the mean,
there is a higher deviation within the data set; th us, the more spread
out the data, the higher the standard deviation. Standard deviation
is a statistical measurement in finance that, when applied to the
annual rate of return of an investment, sheds light on that
investment's historical volatility. The greater the standard deviation
of securities, the greater the variance between each price and the
mean, which shows a larger price range. For example, a volatile stock
has a high standard deviation, while the deviation of a stable blue-
chip stock is usually rather low.
COE F F IC IE N T O F VARIATION
REFERENCES:
1. https://www.investopedia.com/terms/s/standarddeviation.as
p#:~:text=The%20standard%20deviation%20is%20a,square%2
0root%20of%20the%20variance.&text=If%20the%20data%20p
oints%20are,the%20higher%20the%20standard%20deviation.
2. Sokal, R ., & Rohlf. (2009). Introduction to biostatistics. Dover
publications, inc. Mineola, New York.
3. https://corporatefinanceinstitute.com/resources/knowledge/
other/parameter/
COURSECODE: NCM-6308/NCM-5308 BIOSTATISTICS/PRELIMS
4. https://opentextbc.ca/introductorybusinessstatistics/chapter
/descriptive-statistics-and-frequency-distributions-2/
5. https://www.statisticshowto.com/probability-and-
statistics/descriptive-statistics/frequency-distribution-table/