Professional Documents
Culture Documents
ANDREW MADSON
DATA TYPES
DATA TYPES
WHY IT MATTERS
1. Appropriate Analysis: Different types of data require
different statistical tests. For example, nominal data can be
analyzed using a Chi-square test, while interval data can be
analyzed using a t-test or ANOVA. Using the wrong test can
lead to incorrect conclusions.
QUALITATIVE
Non-numerical data that consists of
descriptive information, such as
colors, tastes, textures, or any other
characteristics that cannot be
counted or measured.
QUANTITATIVE
QUANTITATIVE
DATA TYPES
Distinct and separate values
DISCRETE with no intermediate values in
between.
NON-PARAMETRIC
Do not assume that the data follow
any specific distribution. They are
defined without the assumption of
underlying parameters
PARAMETRIC
PARAMETRIC
DISTRIBUTIONS
Symmetric around the mean,
showing that data near the
NORMAL mean are more frequent in
occurrence than data far from
the mean.
Continuous probability
distribution that models the
time it takes for an event to
WEIBULL occur and is commonly used in
reliability and survival
analysis.
Discrete probability
distribution that models the
POISSON number of events occurring in
a fixed interval of time or
space.
Continuous probability
distribution that models the
time between events in a
EXPONENTIAL Poisson process, where events
occur independently and at a
constant average rate.
NON-PARAMETRIC
NON-PARAMETRIC
DISTRIBUTIONS
Discrete probability
distribution representing a
random experiment with only
BERNOULLI two possible outcomes,
typically denoted as success
(1) or failure (0), each with a
fixed probability.
STATISTICAL
TESTS
T-TEST
Compares the
PURPOSE means of two
groups
DISTRIBUTION Normal
If there is a
WHAT IT significant
SHOWS differences between
group means
T-TEST OUTPUT
Assess relationship
WHEN TO USE
between categorical
IT variables
No strict
DISTRIBUTION distribution
requirement
Degrees of
The number of categories minus 1
Freedom
Normally
DISTRIBUTION distributed
Significant
WHAT IT
differences between
SHOWS group means
ANOVA OUTPUT
No strict
DISTRIBUTION distribution
requirement
Regression
Y = 12.345 + 0.987 * X_Variable
Equation
The intercept (12.345) represents the
estimated value of the dependent
variable when the independent
variable (X_Variable) is zero.
Coefficients The coefficient for X_Variable (0.987)
represents the estimated change in
the dependent variable for a one-unit
increase in X_Variable.
Compare
WHEN TO USE
distributions of two
IT independent groups
No strict
DISTRIBUTION distribution
requirement
Significant
WHAT IT
differences in rank
SHOWS order
MANN-WHITNEY
OUTPUT
Compare
WHEN TO USE distributions of
IT three or more
independent groups
No strict
DISTRIBUTION distribution
requirement
Degrees of
Number of groups minus 1
Freedom
Normally
DISTRIBUTION distributed
No strict
DISTRIBUTION distribution
requirement
Compare a sample
WHEN TO USE
mean to a known
IT value
Normally
DISTRIBUTION distributed
No strict
DISTRIBUTION distribution
requirement
HAPPY LEARNING!
🙌