Course Outline
Descriptive Statistics:
A descriptive statistic is a summary statistic that quantitatively describes or summarizes
features from a collection of information, while descriptive statistics is the process of using and
analyzing those statistics.
Inferential Statistics:
Statistical inference is the process of using data analysis to infer properties of an
underlying distribution of probability. Inferential statistical analysis infers properties of a
population, for example by testing hypotheses and deriving estimates.
Data Definition:
Data are individual pieces of factual information recorded and used for
the purpose of analysis. They are the raw information from which statistics are created.
Statistics are the results of data analysis - its interpretation and presentation. ...
Often these types of statistics are referred to as 'statistical data'.
Graphical Representation:
Graphical representation is another way of analyzing numerical data. A graph
is a sort of chart through which statistical data are represented in the form of lines or
curves drawn across the coordinate points plotted on its surface. Graphs help us
study the cause-and-effect relationship between two variables.
Pie Chart:
A pie chart (or a circle chart) is a circular statistical graphic, which is divided
into slices to illustrate numerical proportion. In a pie chart, the arc length of each
slice (and consequently its central angle and area), is proportional to the quantity it
represents.
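The proportionality described above can be sketched in a few lines of Python. This is a minimal illustration, assuming each slice's central angle is its share of the total multiplied by 360 degrees; the function name `central_angles` and the category counts are invented for the example.

```python
def central_angles(quantities):
    """Return the central angle, in degrees, of each pie slice."""
    total = sum(quantities.values())
    # Each slice gets the same fraction of 360 degrees as its share of the total.
    return {label: 360 * value / total for label, value in quantities.items()}


# Hypothetical category counts, invented for illustration.
shares = {"A": 50, "B": 30, "C": 20}
angles = central_angles(shares)
print(angles)
```

Whatever the input quantities, the angles should add up to a full circle of 360 degrees.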
Frequency Histogram:
A frequency histogram is a graph with vertical columns that represent the
frequency of a data point or range of data points occurring in a set of data. It's a
really useful tool we can use to visualize a data set and make it easier to understand.
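As a rough sketch of how the column heights of a frequency histogram could be obtained, the Python snippet below counts how many values fall in each class. The helper `class_frequencies`, the class width of 10, and the scores are all assumptions made for illustration; each class is taken to include its lower limit and exclude its upper limit.

```python
def class_frequencies(values, lower, width, n_classes):
    """Count how many values fall in each class [start, start + width)."""
    counts = [0] * n_classes
    for v in values:
        index = int((v - lower) // width)
        if 0 <= index < n_classes:  # ignore values outside all classes
            counts[index] += 1
    return counts


# Hypothetical scores, invented for illustration.
scores = [12, 18, 21, 25, 27, 33, 35, 38, 39, 44]
freqs = class_frequencies(scores, lower=10, width=10, n_classes=4)

# Print a simple text histogram, one row per class.
for i, f in enumerate(freqs):
    start = 10 + 10 * i
    print(f"{start}-{start + 10}: {'#' * f}")
```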
Frequency Polygon:
A frequency polygon is a visual representation of a distribution. The visualization tool
is used to understand the shape of a distribution. Essentially, the frequency polygon indicates the
number of occurrences for each distinct class in the dataset.
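One common way to draw a frequency polygon is to plot each class frequency against the mid value of its class and join the points with straight lines. The sketch below only computes those points; the function name `polygon_points` and the frequencies are hypothetical, and equal class widths are assumed.

```python
def polygon_points(lower, width, frequencies):
    """Pair the mid value of each class with the frequency of that class."""
    return [
        (lower + width * i + width / 2, f)
        for i, f in enumerate(frequencies)
    ]


# Hypothetical frequencies for the classes 10-20, 20-30, 30-40 and 40-50.
points = polygon_points(lower=10, width=10, frequencies=[2, 3, 4, 1])
print(points)
```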
Ogive:
In statistics, an ogive, also known as a cumulative frequency polygon, can refer to one of
two things: any hand-drawn graphic of a cumulative distribution function, or any empirical
cumulative distribution function.
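For grouped data, the points of an ogive can be sketched by accumulating the class frequencies and pairing each running total with the upper limit of its class. The class limits and frequencies below are hypothetical values chosen for illustration.

```python
from itertools import accumulate

# Hypothetical classes 10-20, 20-30, 30-40, 40-50 and their frequencies.
upper_limits = [20, 30, 40, 50]
frequencies = [2, 3, 4, 1]

# Running total of the frequencies gives the cumulative frequencies.
cumulative = list(accumulate(frequencies))

# Each ogive point is (upper class limit, cumulative frequency).
ogive_points = list(zip(upper_limits, cumulative))
print(ogive_points)
```

The last cumulative frequency should equal the total number of observations.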
Mid Value:
The average of the upper limit and lower limit of a class is called the mid value.
It is calculated by this formula:
Mid value = (lower limit + upper limit) / 2
Magnitude Of Class-intervals:
The difference between the upper limit and lower limit of a class interval is known
as the magnitude of the class interval.
Class Limit:
The class limits are the highest and lowest values that can be included in the class.
The highest limit is known as the upper limit and the lowest limit is known as the lower limit.
e.g.: If the interval is 30-40, the lower limit is 30 and the upper limit is 40.
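The three definitions above (class limits, magnitude, and mid value) can be checked together on the 30-40 interval. This is a small Python sketch; the function name `class_summary` is invented for the example.

```python
def class_summary(lower_limit, upper_limit):
    """Return the limits, magnitude and mid value of one class interval."""
    return {
        "lower_limit": lower_limit,
        "upper_limit": upper_limit,
        # Magnitude: difference between the upper and lower limits.
        "magnitude": upper_limit - lower_limit,
        # Mid value: average of the upper and lower limits.
        "mid_value": (lower_limit + upper_limit) / 2,
    }


summary = class_summary(30, 40)
print(summary)
```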
Two-Way Classification:
When classification is done according to two variables, it is called two-way classification.
One-Way Classification:
When classification is done considering only one variable, it is called one-way
classification.
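The difference between the two kinds of classification can be illustrated by tallying the same records in two ways. The records below, pairs of a gender code and a grade, are hypothetical and exist only for this sketch.

```python
from collections import Counter

# Hypothetical records of (gender, grade), invented for illustration.
records = [("F", "A"), ("M", "B"), ("F", "B"), ("M", "A"), ("F", "A")]

# One-way classification: group by a single variable (gender only).
one_way = Counter(gender for gender, _grade in records)

# Two-way classification: group by both variables at once.
two_way = Counter(records)

print(one_way)
print(two_way)
```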
Quartiles
In statistics, quartiles are a set of three values that divide a data set into four
equal parts. Statistics ordinarily deals with large amounts of numerical data, and
several concepts and formulas are widely applied in research and
surveys. One of the best-known applications of quartiles is the box-and-whisker plot.
Quartiles are the values that divide an ordered list of numerical data into four quarters. The
middle quartile measures the central point of the distribution and shows which data lie near
that point. The lower quartile marks the middle of the half of the data that lies
below the median, and the upper quartile marks the middle of the remaining half, which lies above
the median. Taken together, the quartiles describe the distribution or dispersion of the data set.
Quartiles Definition
Quartiles divide the entire set into four equal parts. So, there are three quartiles, first, second and
third, represented by Q1, Q2 and Q3, respectively. Q2 is nothing but the median; since it indicates
the position of the item in the list, it is a positional average. To find the quartiles of a group of
data, we have to arrange the data in ascending order.
Around the median, we can measure the distribution with the help of the lower and upper quartiles. Apart
from the mean and median, there are other measures in statistics that divide the data into
specific equal parts. A median divides a series into two equal parts. We can partition the values of a
data set mainly in three different ways:
1. Quartiles
2. Deciles
3. Percentiles
Quartiles Formula
Suppose Q3 is the upper quartile, the median of the upper half of the data set, whereas Q1 is
the lower quartile, the median of the lower half of the data set. Q2 is the median. If we
have n items in a data set arranged in ascending order, then the quartiles are given by:
Q1 = [(n+1)/4]th item
Q2 = [(n+1)/2]th item
Q3 = [3(n+1)/4]th item
For grouped data, the quartile formula also uses the following symbols:
f is the frequency of the quartile class
c is the cumulative frequency of the class preceding the quartile class.
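The position formulas for ungrouped data can be sketched in Python as follows. This is a minimal illustration rather than a definitive implementation: the helper names `item_at` and `quartiles` are invented, the data set is hypothetical, and a fractional position is assumed to be resolved by linear interpolation between the two neighbouring items.

```python
def item_at(sorted_data, position):
    """Value at a 1-based, possibly fractional, position in sorted data."""
    # Keep the position inside the list for very small data sets.
    position = min(max(position, 1), len(sorted_data))
    whole = int(position)
    fraction = position - whole
    value = sorted_data[whole - 1]
    if fraction > 0:
        # Interpolate towards the next item by the fractional part.
        value += fraction * (sorted_data[whole] - value)
    return value


def quartiles(data):
    """Return (Q1, Q2, Q3) using the k(n+1)/4 position formulas."""
    ordered = sorted(data)  # quartiles require ascending order
    n = len(ordered)
    return tuple(item_at(ordered, k * (n + 1) / 4) for k in (1, 2, 3))


# Hypothetical data set with n = 7, so the positions are the 2nd, 4th and 6th items.
print(quartiles([4, 6, 7, 8, 10, 23, 34]))
```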
Quartiles in Statistics
Similar to the median, which divides the data in half so that 50% of the observations lie below
the median and 50% lie above it, the quartiles split the data into quarters so that 25% of the
observations are less than the lower quartile, 50% are less than the median, and 75%
are less than the upper quartile. The data are first ordered from smallest to largest.
Interquartile Range
The interquartile range (IQR) is the difference between the upper and lower quartiles of a given
data set and is also called the midspread. It is a measure of statistical dispersion that describes
the variation in the data when the data set is divided into quartiles.
If Q1 is the first quartile and Q3 is the third quartile, then
IQR = Q3 - Q1
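As a quick illustration, the interquartile range can be computed with Python's standard library. The sketch assumes `statistics.quantiles` with `method="exclusive"`, which places the quartiles at the k(n+1)/4 positions and interpolates linearly between neighbouring items; the data set is hypothetical.

```python
import statistics

# Hypothetical data set, invented for illustration.
data = [4, 6, 7, 8, 10, 23, 34]

# n=4 asks for the three cut points that split the data into quarters.
q1, q2, q3 = statistics.quantiles(data, n=4, method="exclusive")

# Interquartile range: upper quartile minus lower quartile.
iqr = q3 - q1
print(q1, q3, iqr)
```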
Lower Quartile, Q1 = [(n+1)/4]th item
Q1 = 16 + 0.25(23 - 16) = 17.75
Similarly,
Median, Q2 = [(n+1)/2]th item
Q2 = 26 + 0.5(26 - 26) = 26
And,
Upper Quartile, Q3 = [3(n+1)/4]th item
Q3 = 35 + 0.75(35 - 35) = 35
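The interpolation steps in this worked example can be retraced in Python. The data set below is an assumption: its 2nd to 7th items are the values that appear in the calculations, while the first and last items (13 and 37) are invented so that n = 8. The function name `quartile` is also hypothetical.

```python
# Assumed data: only the 2nd to 7th sorted items come from the worked example;
# 13 and 37 are placeholders chosen for illustration.
ages = sorted([23, 13, 37, 16, 26, 35, 26, 35])
n = len(ages)


def quartile(k):
    """k-th quartile via the k(n+1)/4 position and linear interpolation."""
    position = k * (n + 1) / 4             # 2.25, 4.5 and 6.75 when n = 8
    whole, fraction = divmod(position, 1)  # split into item number and remainder
    lower = ages[int(whole) - 1]           # item at the whole-number position
    upper = ages[int(whole)]               # next item in the ordered list
    return lower + fraction * (upper - lower)


q1, q2, q3 = quartile(1), quartile(2), quartile(3)
print(q1, q2, q3)
```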