Professional Documents
Culture Documents
Types of data
1. Quantitative Data (Interval Data)
- are numerical observations. These data are said to have an interval
data scale.
Eg. Height, weight, time (values are real numbers)
Valid descriptive measure: mean (average), standard deviation,
median, mode, quartiles, percentiles.
Example: For each of the following examples, determine the data type.
a) The starting salaries of graduates from an M.B.A program.
Interval / Quantitative
Descriptive Statistics
Definition: Methods of organising, summarising and presenting numerical data.
Eg. Summary statistics (mean, standard deviation), tables, charts, graphs.
Notation:
Sample statistic Population parameter
Mean x µ
Standard deviation s σ
proportion p̂ p
Note : the sample value (known) estimates the population value (unknown)
iii) Mode
• the value that occurs most frequently.
Eg. 31, 34, 36, 33, 28, 34, 34
Mode=34
*May be used for nominal, ordinal or interval data.
ii) Interquartile range (looks at the middle 50% of data when data are in
ascending order)
Eg. 2, 4, 4, 5, 7, 8, 10, 12, 17, 18, 18, 21, 27, 29, 30
Q1 Me Q3
Me = 12
Lower quartile = Q1 = 5
Upper quartile = Q3 = 21
Interquartile range = IQR = Q3 – Q1 = 21-5 = 16
iii) Variance: the mean of the squared deviations from the mean, is very
important in statistical inference.
Mean = x =
∑ xi = 3.4 + 2.5 + 4.1 + 1.2 + 2.8 + 3.7 = 2.95
n 6
Variance = s 2 =
∑ ( xi − x ) 2
n −1
[(3.4 − 2.95) + (2.5 − 2.95) 2 + ... + (3.7 − 2.95) 2
2
=
6 −1
=1.075
Bar chart
Column chart nominal, ordinal data
Pie chart
Interval data
Eg. The following data are the test scores of 30 students:
45, 48, 52, 54, 55, 58, 58, 59, 61, 61, 62, 64, 65, 66, 66, 67, 70, 73,
75, 77, 77, 78, 79, 80, 82, 83, 86, 86, 91, 94
Note: There is no hard and fast rule on determining the appropriate number of
classes, it may be considered that any number between 5 and say 20 may be
considered sufficient. It will be sufficient for you to use your judgement in
deciding on the number of classes or the class width. However, the class
intervals should be:
i) of equal size
ii) cover the full range of data
iii) mutually exclusive i.e. : a number can fit into only one class .
Example
• The student placement office at a university wanted to determine the
general areas of employment of last year school graduates.
• Data was collected, and the count of the occurrences was recorded for
each area.
• These counts were converted to proportions and the results were
presented as a pie chart and a bar chart.
The Pie Chart
Other
11%
Accounting
29%
Marketing
25%
Finance
General 21%
management
14%
Bar Chart
80 73
70 64
60 52
Frequency
50
40 36
28
30
20
10
0
Accounting Finance General Marketing Other
management
Area
The Bar Chart
• Use bar charts also when the order in which nominal data are presented is
meaningful.
Total number of new products introduced in
North America in the years 1989,…,1994
0
‘89 ‘90 ‘91 ‘92 ‘93 ‘94
Relationship between two variables
• Nominal / ordinal – use table
Eg. Grade and Gender
Fail Pass Credit Distinction Total
Female 18 26 19 5 68
Male 13 32 11 7 63
Total 31 58 30 12 131