Tutorial sheet 1

MTL108

Q1. The following data represent the number of crime cases reported per month in a
certain county over 24 months:

5, 8, 4, 13, 2, 9, 10, 11, 5, 7, 8, 9, 3, 12, 9, 15, 6, 13, 17, 6, 9, 13, 11, 4

(a) Construct a frequency table containing frequency and relative frequency columns
for this data.
(b) Plot it on a histogram.
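
A hand-built table for part (a) can be checked by tallying the values programmatically. The sketch below is one possible approach, not a prescribed method. It assumes each distinct value is treated as its own class (grouping into class intervals is equally valid), and the names `crime_counts` and `frequency_table` are illustrative.

```python
from collections import Counter

# Q1 data: crime cases reported in each of 24 months.
crime_counts = [5, 8, 4, 13, 2, 9, 10, 11, 5, 7, 8, 9,
                3, 12, 9, 15, 6, 13, 17, 6, 9, 13, 11, 4]


def frequency_table(data):
    """Return (value, frequency, relative frequency) rows sorted by value.

    Assumption: every distinct value forms its own class.
    """
    n = len(data)
    counts = Counter(data)
    return [(value, freq, freq / n) for value, freq in sorted(counts.items())]


if __name__ == "__main__":
    print(f"{'Value':>5} {'Freq':>5} {'Rel. freq':>10}")
    for value, freq, rel in frequency_table(crime_counts):
        print(f"{value:>5} {freq:>5} {rel:>10.3f}")
```

The frequencies should add up to 24 and the relative frequencies to 1, which gives a quick sanity check on the table.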

Q2. Label each of the following datasets as symmetric, approximately symmetric, or not
at all symmetric:

A: 7, 9, 9, 8, 7, 8, 6, 6, 7, 8
B: 0, 1, 2, 2, 1, 3, 3, 1, 2
C: 6, 8, 4, 3, 1, 1, 9, 2

Q3. Relative frequency tables and plots are particularly useful when we want to compare
different sets of data. The following two data sets relate the number of flight detours
in a month for 2 airports.

Airport 1: 15, 13, 16, 10, 8, 20, 14, 19, 9, 12, 16, 18, 20, 12, 14, 14
Airport 2: 8, 12, 10, 8, 14, 12, 13, 11, 9, 8, 9, 10, 14, 9, 10

Plot these two data sets together in a relative frequency polygon. What conclusion
can you draw about which data set tends to have larger values?
Q4. In an attempt to determine the relationship between effectiveness of employees and
the mean temperature of the day, a company records the following data over 20 days:

Temperature   Effectivity index     Temperature   Effectivity index
24.2          50                    24.4          44
22.7          62                    24.8          46
30.5          72                    20.6          40
28.6          66                    25.1          50
25.5          38                    21.4          50
32            48                    23.7          46
28.6          54                    23.9          54
26.5          50                    25.2          60
25.3          32                    27.4          66
26            28                    28.3          64

(a) Draw a scatter plot.
(b) What can you conclude about the relationship between employee effectiveness
and day temperature from the scatter plot?
Q5. The following data relate the attention score (1-6) to a score on an IQ examination
of 18 preschool-age children.

Attention score   IQ score     Attention score   IQ score     Attention score   IQ score
2                 82           6                 105          6                 118
3                 88           5                 108          6                 128
4                 86           7                 112          5                 128
5                 94           7                 116          4                 130
5                 90           6                 122          3                 140
6                 99           7                 110          2                 142

(a) Draw a scatter plot. Give a plausible inference concerning the relation of
attention score to IQ score.
(b) Draw a pie chart for attention score.
Q6. The following data represent the net annual income (in lakhs of rupees) for a sample
of servicemen:

47, 55, 18, 24, 27, 41, 50, 38, 33, 29, 15, 77, 64, 22, 19, 35, 39, 41,
67, 55, 121, 77, 80, 34, 41, 48, 60, 30, 22, 28, 84, 55, 26, 105, 62,
30, 17, 23, 31, 28, 56, 64, 88, 104, 115, 39, 25, 18, 21, 30, 57, 40,
38, 29, 19, 46, 40, 49, 72, 70, 37, 39, 18, 22, 29, 52, 94, 86, 23, 36

(a) Plot a histogram with 5 class intervals.


(b) Repeat (a) with 10 class intervals.
(c) Which of the two plots do you think is more informative? Why?

Q7. A set of 200 data points was broken up into 7 classes, each of size 3 units. A frequency
table was then constructed. However, some of the entries of this table were lost.
Suppose that the part of the frequency table that remains is as follows:

Class Interval   Frequency   Relative Frequency
___              ___         0.05
___              19          ___
___              31          ___
15-18            38          ___
___              ___         0.10
___              52          ___
___              30          ___

Fill in the missing numbers and draw a relative frequency histogram.


Q8. The following data represent the number of people affected by a virus in a sample
of cities (in thousands):

10, 11, 9, 4, 6, 8, 9, 12, 18, 4, 20, 16

(a) What is the sample mean?
(b) What is the sample median?
(c) What is the sample mode?
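
Hand calculations for Q8 can be cross-checked with Python's standard `statistics` module. This is only a sketch: the variable names are illustrative, and `multimode` is used because more than one value may be tied for the highest count.

```python
import statistics

# Q8 data: people affected (in thousands) in a sample of 12 cities.
affected = [10, 11, 9, 4, 6, 8, 9, 12, 18, 4, 20, 16]

sample_mean = statistics.mean(affected)
# With an even number of points, median() averages the two middle values.
sample_median = statistics.median(affected)
# multimode() returns every value tied for the highest frequency.
sample_modes = sorted(statistics.multimode(affected))

print("mean:", sample_mean)
print("median:", sample_median)
print("mode(s):", sample_modes)
```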
Q9. Suppose that the sample mean of a set of 15 data points is 21.
(a) If it is discovered that a data point having value 15 was incorrectly read as having
value 12, what should the revised value of the sample mean be?
(b) Suppose there is an additional data point whose value is 22. Will this increase
or decrease the sample mean?
(c) Using the original data (and not the revised data in part (a)), what is the new
value of sample mean in (b)?
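
The key step in Q9 is recovering the sum of the data from the mean, since sum = n × mean. The sketch below illustrates that bookkeeping. The variable names are illustrative, and it is meant as a check on hand working rather than a model answer.

```python
n = 15
original_mean = 21
total = n * original_mean  # sum of the 15 recorded values: 315

# (a) Swap the misread value 12 for the true value 15; n is unchanged.
corrected_mean = (total - 12 + 15) / n

# (b), (c) Append the value 22 to the ORIGINAL data; n grows by one.
extended_mean = (total + 22) / (n + 1)

print("corrected mean:", corrected_mean)
print("mean with extra point:", extended_mean)
# A new point above the current mean pulls the mean up.
print("mean increases:", extended_mean > original_mean)
```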
Q10. The following is a frequency table of the ages of a sample of members of a symphony
for young adults.
Age Value   Frequency
16          9
17          12
18          15
19          10
20          8
Find the sample mean, median and mode of the given ages.
Q11. If the median of the data set $x_i$, $i = 1, 2, \ldots, n$, is 15, then what is the
median of the data set $y_i = 4x_i + 3$?
Q12. Consider a data set of n values $1, 2, 3, \ldots, n$. Find the value of the sample 95th
percentile when
(a) n=250
(b) n=251
Q13. You are given two data sets:
A: 14, 14, 15, 17, 20
B: 40, 45, 50, 50, 57
(a) Which one appears to have a larger sample variance?
(b) Determine the sample mean and sample variance of data set A.
(c) Determine the sample mean and sample variance of data set B.
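
Answers to (b) and (c) can be verified with the standard `statistics` module. The sketch assumes that "sample variance" means the version with divisor n − 1, which is what `statistics.variance` computes. The variable names are illustrative.

```python
import statistics

# Q13 data sets.
data_a = [14, 14, 15, 17, 20]
data_b = [40, 45, 50, 50, 57]

mean_a = statistics.mean(data_a)
mean_b = statistics.mean(data_b)

# statistics.variance divides the sum of squared deviations by n - 1.
var_a = statistics.variance(data_a)
var_b = statistics.variance(data_b)

print("A: mean =", mean_a, "variance =", var_a)
print("B: mean =", mean_b, "variance =", var_b)
```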
Q14. Construct a box plot for the following data set:
15, 17, 20, 20, 21, 25, 26, 30, 32, 33, 35
Q15. Compute the sample correlation coefficient from the following table:
Subject   Age (X)   Glucose Level (Y)
1         40        98
2         25        63
3         26        80
4         41        73
5         58        86
6         56        84
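
The sample correlation coefficient can be computed directly from its definition, r = S_xy / sqrt(S_xx · S_yy), where S_xy, S_xx and S_yy are sums of products of deviations from the means. The sketch below is one way to do this. The function name `sample_correlation` is illustrative and not taken from any library.

```python
import math

# Q15 data: age (X) and glucose level (Y) for six subjects.
age = [40, 25, 26, 41, 58, 56]
glucose = [98, 63, 80, 73, 86, 84]


def sample_correlation(x, y):
    """Pearson sample correlation computed from deviations about the means."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length, at least 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    s_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    s_xx = sum((a - mean_x) ** 2 for a in x)
    s_yy = sum((b - mean_y) ** 2 for b in y)
    return s_xy / math.sqrt(s_xx * s_yy)


r = sample_correlation(age, glucose)
print(f"r = {r:.4f}")
```

The divisors n − 1 in the sample covariance and sample standard deviations cancel, so they are omitted here.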

Q16. Find the sample skewness in the following data:

Marks obtained by students   Frequency
50-60                        5
60-70                        20
70-80                        50
80-90                        30
90-100                       10

Q17. Determine the sample kurtosis and the excess kurtosis of the data in Q16.
