Lecture 1
- A data set may contain a mixture of data types. The two broad categories are
categorical data and numerical data.
TYPES OF DATA SET
● Categorical Data (also called qualitative data) have values that are
described by words rather than numbers.
Ex: type, size, classification, ...
↬ T/F: not limited statistical use
+ Coding: a categorical variable is represented using numbers
1 = cash, 2 = check, 3 = credit/debit card, 4 = gift card
+ Binary variables: categorical variables that have only two values (coded as 1 or 0)
Ex: employment status (employed or unemployed), marital status (currently married or
not currently married)
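The coding scheme above can be sketched in Python. The variable names and the sample records are hypothetical, chosen only for illustration; only the four payment codes come from the notes.

```python
# Coding scheme from the notes: each payment category gets a numeric code.
PAYMENT_CODES = {"cash": 1, "check": 2, "credit/debit card": 3, "gift card": 4}

# Hypothetical observed payment types (made-up sample, not real data).
payments = ["cash", "gift card", "check", "cash", "credit/debit card"]
coded_payments = [PAYMENT_CODES[p] for p in payments]

# Binary variable: only two possible values, coded as 1 or 0.
# Hypothetical employment statuses (made-up sample, not real data).
statuses = ["employed", "unemployed", "employed"]
employed_flag = [1 if s == "employed" else 0 for s in statuses]

print(coded_payments)  # [1, 4, 2, 1, 3]
print(employed_flag)   # [1, 0, 1]
```

The codes are labels only: coding a category as a number does not make the data numerical, so averaging the payment codes has no meaning. For a 0/1 binary variable, however, the average equals the proportion of observations coded 1.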
● Stem-and-Leaf Plot
● Dot Plots
● Frequency Distribution
+ Usually, all the bin widths are the same and their limits cannot overlap
+ Frequencies can also be expressed as relative frequencies or percentages of
the total number of observations.
Ex:
Sort raw data in ascending order: 12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38,
41, 43, 44, 46, 53, 58
Find range: 58 - 12 = 46
Select number of classes: 5 (usually between 5 and 15)
Compute interval width: 10 (46/5 = 9.2, then round up)
Determine interval boundaries: 10 but less than 20, 20 but less than 30, ..., 50 but
less than 60
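The steps of this example can be sketched in Python as follows. Starting the first class at 10 (the minimum rounded down to a multiple of the width) is an assumption made here for convenience; the variable names are illustrative.

```python
import math

# Raw data from the example, already sorted in ascending order.
data = [12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38,
        41, 43, 44, 46, 53, 58]

num_classes = 5
data_range = max(data) - min(data)             # 58 - 12 = 46
width = math.ceil(data_range / num_classes)    # 46 / 5 = 9.2, rounded up to 10

# Assumption: the first class starts at the minimum rounded down
# to a multiple of the width (12 -> 10).
start = (min(data) // width) * width

# Count how many observations fall in each class [lower, upper).
counts = [0] * num_classes
for x in data:
    counts[(x - start) // width] += 1

for i, count in enumerate(counts):
    lower = start + i * width
    upper = lower + width
    relative = count / len(data)               # relative frequency
    print(f"{lower} but less than {upper}: {count} ({relative:.0%})")

print(counts)  # [3, 6, 5, 4, 2]
```

Because each class includes its lower limit and excludes its upper limit, the classes do not overlap and every observation is counted exactly once.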
● Histogram: a graphical representation of a frequency distribution (a type of
bar chart)
+ Y-axis: number of data values (or a percentage) within each bin of a frequency
distribution
+ X-axis ticks show the end points of each bin
+ No gaps between bars
Measures of center
Mean (average value)
● Measures of Variability
Standardized Data
+ Its main use is to gauge the position of items within a data array
Chebyshev’s Theorem
The Empirical Rule
● Z-score
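A z-score expresses how many standard deviations an observation lies from the mean: z = (x - mean) / s. The sketch below is a minimal illustration, assuming the sample standard deviation (n - 1 divisor) and reusing the 20 observations from the frequency distribution example in these notes. It also compares the share of observations within k = 2 standard deviations against the Chebyshev lower bound 1 - 1/k².

```python
import statistics

# Observations reused from the frequency distribution example.
data = [12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38,
        41, 43, 44, 46, 53, 58]

mean = statistics.mean(data)
s = statistics.stdev(data)   # sample standard deviation (assumption: n - 1 divisor)

# Standardize every observation: z = (x - mean) / s.
z_scores = [(x - mean) / s for x in data]

# Share of observations within k standard deviations of the mean.
k = 2
within_k = sum(1 for z in z_scores if abs(z) <= k) / len(data)

# Chebyshev's theorem: at least 1 - 1/k^2 of the data lie within k
# standard deviations of the mean, for any k > 1.
chebyshev_bound = 1 - 1 / k**2   # 0.75 for k = 2

print(f"mean = {mean}, s = {s:.2f}")
print(f"share within {k} standard deviations: {within_k:.0%}")
print(f"Chebyshev lower bound: {chebyshev_bound:.0%}")
```

Chebyshev's bound holds for any data set, whereas the Empirical Rule percentages (about 68%, 95%, and 99.7% within 1, 2, and 3 standard deviations) apply only when the data are approximately normal.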
Note:
● CORRELATION AND COVARIANCE
RULES OF PROBABILITY
Conditional Probability
INDEPENDENT EVENTS
CONTINGENCY TABLES
A contingency table is like a frequency distribution for a single variable, except
it has two variables (rows and columns).
● Marginal Probabilities
● Joint Probabilities
● Conditional Probabilities
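The three kinds of probability listed above can be read off a contingency table as in this sketch. The 2 x 2 table of counts is hypothetical (made-up numbers for illustration only), and the variable names are not from the notes.

```python
# Hypothetical contingency table of counts (made-up data):
# rows = employment status, columns = marital status.
table = {
    "employed":   {"married": 40, "not married": 20},
    "unemployed": {"married": 10, "not married": 30},
}

total = sum(sum(row.values()) for row in table.values())   # grand total: 100

# Joint probability: one cell divided by the grand total.
p_employed_and_married = table["employed"]["married"] / total          # 0.4

# Marginal probabilities: a row or column total divided by the grand total.
p_employed = sum(table["employed"].values()) / total                   # 0.6
p_married = sum(row["married"] for row in table.values()) / total      # 0.5

# Conditional probability: P(married | employed)
#   = P(employed and married) / P(employed).
p_married_given_employed = p_employed_and_married / p_employed         # 2/3

print(p_employed_and_married, p_employed, p_married)
print(round(p_married_given_employed, 4))
```

In this made-up table, P(married | employed) differs from P(married), so the two events would not be independent.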
TREE DIAGRAMS
+ Events and their probabilities can be displayed in a tree diagram to help
visualize all possible outcomes.
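A two-stage tree can be sketched in code by listing every path and multiplying the probabilities along its branches. The events A and B and all the probabilities below are hypothetical, chosen only to illustrate the idea.

```python
# Hypothetical two-stage tree (all probabilities are made up for illustration).
# First stage: A or not A. Second stage: B or not B, conditional on the first.
first_stage = {"A": 0.3, "not A": 0.7}
second_stage = {
    "A":     {"B": 0.8, "not B": 0.2},
    "not A": {"B": 0.1, "not B": 0.9},
}

# Each path through the tree is one possible outcome; its probability is
# the product of the branch probabilities along the path.
paths = {}
for first, p_first in first_stage.items():
    for second, p_second in second_stage[first].items():
        paths[(first, second)] = p_first * p_second

for (first, second), p in paths.items():
    print(f"{first} -> {second}: {p:.2f}")

# The path probabilities cover all possible outcomes, so they sum to 1.
total = sum(paths.values())

# P(B) is the sum over every path that ends in B.
p_B = sum(p for (_, second), p in paths.items() if second == "B")
print(f"total = {total:.2f}, P(B) = {p_B:.2f}")
```

Checking that the path probabilities sum to 1 is a quick way to confirm that no branch of the tree has been left out.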
BAYES’ THEOREM
COUNTING RULES