Professional Documents
Culture Documents
Outcomes
Know the definitions
E.g.:
Population of cars
Sample of cars N = 96
n = 12
E.g.:
Population of cars
Sample of cars N = 96
n = 12
E.g.: Variable/ Characteristic = Thing of interest Can assume many different values
E.g.: Variable/ Characteristic = Thing of interest Can assume many different values
E.g.: Variable/ Characteristic = Thing of interest Can assume many different values
E.g.: Variable/ Characteristic = Thing of interest Can assume many different values
E.g.:
# Car Colour Age Distance Risk
01 Red 5 50016.30 High
02 Yellow 9 127694.00 Medium
Sample of cars
03 Blue 3 36011.50 High
n = 12
04 Red 2 27558.20 High
05 Blue 19 240321.20 Low
06 Grey 8 93483.60 Medium
07 Green 15 211027.70 Low
08 Green 7 92577.30 Medium
09 Blue 10 125465.60 Low
10 Green 6 88640.00 Medium
11 Green 3 38401.60 High
12 Yellow 4 43605.20 High
Exploring data Types of data
E.g.:
# Car Colour Age Distance Risk
01 04 Red 5 50016.30 High
02 54 Yellow 9 127694.00 Medium
Sample of cars
03 14 Blue 3 36011.50 High
n = 12
04 18 Red 2 27558.20 High
05 09 Blue 19 240321.20 Low
06 45 Grey 8 93483.60 Medium
07 68 Green 15 211027.70 Low
08 07 Green 7 92577.30 Medium
09 43 Blue 10 125465.60 Low
10 66 Green 6 88640.00 Medium
11 27 Green 3 38401.60 High
12 08 Yellow 4 43605.20 High
Exploring data Types of data
E.g.:
# Car Colour Age Distance Risk
01 04 Red 5 50016.30 High
02 54 Yellow 9 127694.00 Medium New data collected by the
03 14 Blue 3 High
04 18 Red 2
36011.50
27558.20 High
researcher through
05 09 Blue 19 240321.20 Low experimentation/
06 45 Grey 8 93483.60 Medium observation/ survey
07
08
68
07
Green
Green
15
7
211027.70 Low
92577.30 Medium
⇒ Primary data
09 43 Blue 10 125465.60 Low
Otherwise
10 66 Green 6 88640.00 Medium
11 27 Green 3 38401.60 High
⇒ Secondary data
12 08 Yellow 4 43605.20 High
Exploring data Types of data – Notation
E.g.:
Variable names
• Always single capitals
# Car C A D R
• Usually (but not always) the last letters of the
01 04 Red 5 50016.30 High
alphabet, like X, Y, Z
02 54 Yellow 9 127694.00 Medium
03 14 Blue 3 36011.50 High
04 18 Red 2 27558.20 High
05 09 Blue 19 240321.20 Low
06 45 Grey 8 93483.60 Medium
07 68 Green 15 211027.70 Low
08 07 Green 7 92577.30 Medium
09 43 Blue 10 125465.60 Low
10 66 Green 6 88640.00 Medium
11 27 Green 3 38401.60 High
12 08 Yellow 4 43605.20 High
Exploring data Types of data – Notation
E.g.:
Variable names
• Always single capitals
# Car C A D R
• Usually (but not always) the last letters of the
01 04 c₀₁ a₀₁ d₀₁ r₀₁
alphabet, like X, Y, Z
02 54 c₀₂ a₀₂ d₀₂ r₀₂
03 14 c₀₃ a₀₃ d₀₃ r₀₃
04 18 c₀₄ a₀₄ d₀₄ r₀₄
05 09 c₀₅ a₀₅ d₀₅ r₀₅ Variable values
06 45 c₀₆ a₀₆ d₀₆ r₀₆ • Always lower case version of variable name
07 68 c₀₇ a₀₇ d₀₇ r₀₇ • Must have a subscript /index equal to the case
08 07 c₀₈ a₀₈ d₀₈ r₀₈
number
09 43 c₀₉ a₀₉ d₀₉ r₀₉
10 66 c₁₀ a₁₀ d₁₀ r₁₀
11 27 c₁₁ a₁₁ d₁₁ r₁₁
12 08 c₁₂ a₁₂ d₁₂ r₁₂
Exploring data Types of data – Summary
In general:
• Sampling with /without replacement
# X Y Z
• Raw /grouped data
• Primary /secondary data
1 x1 y1 z1
• Univariate /bivariate /multivariate data
2 x2 y2 z2 • Variables
• Names are single capitals
3 x3 y3 z3 • Values are lower case with case# index
• Qualitative
⋮ ⋮ ⋮ ⋮ • Ordinal
• Nominal
n xn yn zn • Quantitative
• Discrete
• Continuous
Data