Professional Documents
Culture Documents
The Business Statistics and Analysis Specialization is designed to equip you with a
basic understanding of business data analysis tools and techniques. You’ll master
essential spreadsheet functions, build descriptive business data measures, and develop
your aptitude for data modeling. You’ll also explore basic probability concepts, including
measuring and modeling uncertainty, and you’ll use various data distributions, along
with the Linear Regression Model, to analyze and inform business decisions. The
Specialization culminates with a Capstone Project in which you’ll apply the skills and
knowledge you’ve gained to an actual business problem.
Nominal Data
Nominal data is one of the types of qualitative information which helps to label the
variables without providing the numerical value. Nominal data is also called the nominal
scale. It cannot be ordered and measured. But sometimes, the data can be qualitative
and quantitative. Examples of nominal data are letters, symbols, words, gender etc.
The nominal data are examined using the grouping method. In this method, the data are
grouped into categories, and then the frequency or the percentage of the data can be
calculated. These data are visually represented using the pie charts.
Ordinal Data
Ordinal data/variable is a type of data which follows a natural order. The significant
feature of the nominal data is that the difference between the data values is not
determined. This variable is mostly found in surveys, finance, economics,
questionnaires, and so on.
The ordinal data is commonly represented using a bar chart. These data are
investigated and interpreted through many visualisation tools. The information may be
expressed using tables in which each row in the table shows the distinct category.
Discrete Data
Discrete data can take only discrete values. Discrete information contains only a finite
number of possible values. Those values cannot be subdivided meaningfully. Here,
things can be counted in the whole numbers.
Example: Number of students in the class
Continuous Data
Continuous data is data that can be calculated. It has an infinite number of probable
values that can be selected within a given specific range.
Example: Temperature range
Data Summarization
The term Data Summarization refers to presenting the summary of generated
data in an easily comprehensible and informative manner. Presenting the raw
data, (the data that was generated which is essentially the entire repertoire of
datasets- individual measurements) is not practical in many cases.
Tabular Presentation
A table helps to represent even a large amount of data in an engaging, easy to
read, and coordinated manner. The data is arranged in rows and columns.
This is one of the most popularly used forms of presentation of data as data
tables are simple to prepare and read.
Objectives Of Tabulation
Graphic Presentation
Graphic presentation represents a highly developed body of techniques for
elucidating, interpreting, and analyzing numerical facts by means of points,
lines, areas, and other geometric forms and symbols. Graphic techniques are
especially valuable in presenting quantitative data in a simple, clear, and
effective manner, as well as facilitating comparisons of values, trends, and
relationships. They have the additional advantages of succinctness and
popular appeal; the comprehensive pictures they provide can bring out hidden
facts and relationships and contribute to a more balanced understanding of a
problem.
Charts
Charts are a great way to visually represent all kinds of information, from the
simple to the very complex.
You can have a variety of data which can be used in presentations. Some of
these chart types include:
Time Series
Bar Charts
Combo Charts
Pie Charts
Tables
Geo Map
Scorecard
Scatter Charts
Bullet Charts
Area Chart
Text & Images
Histogram
RELATIVE FREQUENCY
The number of times an event occurs is called a frequency. Relative
frequency is an experimental one, but not a theoretical one. Since it is an
experimental one, it is possible to obtain different relative frequencies when
we repeat the experiments. To calculate the frequency, we need
The ratio of the number of times a value of the data occurs in the set of all
outcomes to the number of all outcomes gives the value of relative frequency.
Let’s look at the table below to see how the weights of the people are
distributed.
Step 1: To convert the frequencies into relative frequencies, we need to do the
following steps.
Step 2: Divide the given frequency by the total N i.e., 40 in the above case
(Total sum of all frequencies).
Step 3: Divide the frequency by total number Let’s see how: 1/ 40 = 0.25.
This is a frequency table to see how many students have got marks between
given intervals in Maths.
45 – 50 3 3 / 40 x 100 = 0.075
50 – 55 1 1 / 40 x 100 = 0.025
55 – 60 1 1 / 40 x 100 = 0.075
65 – 70 8 8 / 40 x 100 = 0.2
70 – 80 3 3 / 40 x 100 = 0.275
Even though the measures above are the most commonly used to define
central tendency, there are some other measures, including, but not limited
to, geometric mean, harmonic mean, midrange, and geometric median.
Although the mean is regarded as the best measure of central tendency for
quantitative data, that is not always the case. For example, the mean may not
work well with quantitative datasets that contain extremely large or extremely
small values. The extreme values may distort the mean. Thus, you may
consider other measures.
PROBABILITY DISTRIBUTION
The probability distribution is one of the major theories of statistical analysis.
It gives the possibility of achieving each outcome in a randomly given event.
The probabilities of all outcomes can be known through the probability
distribution. A tad bit of recalling of the probability theory can be of much help
to thoroughly understand probability distribution. Probability is one of the
phenomena that helps us measure the certainty or uncertainty of different
outcomes in a given event.
However, the probability that X is exactly equal to some value is always zero
because the area under the curve at a single point, which has no width, is zero.
For example, the probability that a man weighs exactly 190 pounds to infinite
precision is zero. You could calculate a nonzero probability that a man weighs
more than 190 pounds, or less than 190 pounds, or between 189.9 and 190.1
pounds, but the probability that he weighs exactly 190 pounds is zero.
DISCRETE FUNCTIONS
A discrete distribution is a probability distribution that depicts the occurrence
of discrete (individually countable) outcomes, such as 1, 2, 3... or zero vs. one.
The binomial distribution, for example, is a discrete distribution that evaluates
the probability of a "yes" or "no" outcome occurring over a given number of
trials, given the event's probability in each trial—such as flipping a coin one
hundred times and having the outcome be "heads".
Poisson
Bernoulli
Binomial
Multinomial
Example: