You are on page 1of 13

1.

Chapter 1 The number of driver attending a drivers class each day was recorded over several months, giving the data in the table below. Number 10-19 20-29 30-34 35-39 40-49 50-64 65-69 of Drivers Frequency 2 13 20 30 20 10 5 a) Represent this data by a histogram b) On your histogram, draw a frequency polygon c) Calculate the mean number of drivers attending d) Calculate the variance and the standard deviation of the number of drivers attending e) Use linear interpolation to calculate the median and the quartiles f) Represent the data with a boxplot and use the boxplot to determine the skewness of the data

2.

Chapter 1 The labourers in the peanut factory and silk factory were asked to keep a record of their working hours on May 5th of that year. Their times, to the nearest hour, are summarised in the table below.
Peanut Facotry Smallest value Largest value Q1 Q2 Q3 2 14 7 10 13 Silk Factory 3 13 6 10 12

a) On graph paper and using the same scale for both, draw box plots to represent these data. b) Compare and contrast the results for the two classes. c) Determine the Quartile Coefficient of Skewness for the Peanut Factory. d) Determine the type of Skewness for the Silk Factory

3. Chapter 1 The 50 members of the town committee each recorded the amount of hours, x hours, they did in the first week of March. The results are summarised as follows:

x = 4550,

= 414850 .

The mean and standard deviation of the number of hours of work by the members of the town committee in this week were and respectively. a) Find . b) Find . Two new people joined the committee and the number of hours of work they did in the first week of March were 3 and + 3. c) Without any further calculation, state, giving your reasons, whether the effect of including these two members was to increase, decrease or leave unchanged the mean and standard deviation.

4. Chapter 1 Every year in New York, a certain organization holds a turkey competition where the person with the heaviest turkey wins a trip to a country of his or her choice. This year, 127 contestants took part in this competition. The winner of the contest had a turkey that weighed 57 kilograms and the person that came in last has a turkey that weighed 14 kilograms. The full results are summarised in the table below. Weight (kilograms) Frequency 10-19 7 20-29 21 30-39 42 40-49 37 50-59 20

a) Use linear interpolation to estimate the median weight. b) The upper and lower quartiles of the weights are 30.3 and 38.6 respectively. On graph paper, draw a box and whisker plot for the results from this competition. c) Comment on the skewness. d) Calculate the mean of this distribution. e) Calculate the standard deviation of this distribution.

5. Chapter 1 The following table summarises the birth weights of a random sample of 100 male babies born with a particular type of disease.
B irth w e ig h t (k g ) 1 .2 1 .4 1 .6 1 .8 2 .0 2 .2 2 .4 2 .6 2 .8 3 .0 N u m b e r o f b a b ie s 3 4 8 9 1 0 1 4 2 3 1 7 1 2

a) Write down the upper class boundary of the first class. b) Represent these data by a histogram. c) Estimate the number of babies whose weight was between 2.1 kg and 2.5 kg. d) Calculate estimates of the mean and the standard deviation of this sample of birth weights. e) Calculate estimates of the median and the quartiles of these birth weights. f) Comment on the skewness of the data. g) State, giving a reason, which of (d) or (e) you would recommend using to summarise the data in the table.

6. Chapter 1 In the month of April, Amar and Akbar each recorded the time taken for them to solve a Sudoku problem. The data is summarized in the following back-to-back stem and leaf diagram. Amar 1 (6) (7) (7) (q) (4) (2) 9 8 9 8 5 7 7 3 7 5 3 8 7 4 5 3 6 4 2 3 1 2 3 5 0 1 1 1 2 2 2 3 4 5 6 7 7 2 1 1 2 0 9 3 2 2 4 1 4 3 2 4 1 4 3 3 7 4 8 4 7 9 5 7 7 9 Akbar (2) (5) (p) (7) (5) (5)

( 0 2 2 means 20 for Amar and 22 for Akbar )


a) Find the values of p and q. b) Find the values of r, s, t, u and v in the table below.

Smallest value Largest value Q1 Q2 Q3

Amar 20 s 33 u 56

Akbar r 64 t 42 v

c) On graph paper and using the same scale for the two boys, draw box plots to represent these data. d) Compare and contrast the result of the two boys. e) Find the mean and the standard deviation of the number of minutes spent by Amar f) The following table gives the times for Anthony to complete the Soduku in the month of April Anthony Smallest value 25 Largest value 65 Q1 35 Q2 45 Q3 55 Draw a third boxplot on the box plot from d) and determine the skewness of the three boys.

7.

Chapter 2 On a particular day in winter of 1862 at 1200 hours the height above sea level, x meters, and the temperature, y C, were recorded in 10 European towns. The following summary statistics were calculated from the results. x = 63, x2 = 642, Sxy = 224, Syy = 140.9, y = 23 a) Find Sxx. b) Calculate a regression line that can be used to find the temperature at a given height. c) Use your regression line from b) and use it to find the temperature at 0 meters and at 12 meters.

Chapter 2 A teacher monitored the mental ability of one of his pupils over a 10 week period. At the end of each week, the pupil was given a new test and the teacher noted the number of errors y. He also recorded the number of hours x that the pupil had studied each week. The data are shown in the table below. x y

8.

12 8

15 4

7 13

11 8

1 18

8 12

4 15

6 14

9 12

3 16

a) Plot these data on a scatter diagram. b) Find the equation of the regression line of y on x in the form y = a + bx. (You may use x2 = 746, xy = 749.) c) Give an interpretation of the slope and the intercept of your regression line. d) State whether or not you think the regression model is reasonable (i) (ii) for the range of x-values given in the table, for all possible x-values.

9.

Chapter 2 Lina was asked to guess the weight of ten people suspended above the floor. The guess g of the weight (in kilograms) of each person and the true weight t(in kilograms) of each person are summarized in the table below.

Person A B C D E F G H I J

g 70 90 60 100 56 78 60 70 65 60

t 80 110 55 100 52 82 56 80 60 60

Draw a scatter diagram for the above data. Find the equation of the regression line of t on g Draw the regression line on your scatter diagram. If possible, use your regression to calculate the true weight of a person K if Lina guesses her weight to be 92 kilograms. If not, explain why not? e) If possible, use your regression to calculate the true weight of a person L if Lina guesses his weight to be 120 kilograms. If not, explain why not? a) b) c) d)

10.

Chapter 1 Shazia and Randa meet with each other once a week to discuss business matters. Over many weeks they recorded, to the nearest minute, the number of minutes spent in a meeting on each occasion. The following table summarizes their results.

Time (to the nearest minute) 915 1624 2540 4160 61120

Number of conversations 2 7 15 10 6

a) Calculate an estimate of the mean time and standard deviation of the time spent on their conversations. On another occasion, Shazia, Randa and Laila met with each other once a week to discuss business matters. Over many weeks they recorded, to the nearest minute, the number of minutes spent in a meeting on each occasion. The following table summarizes their results. Time (to the nearest minute) 1520 2025 2530 3035 3540 Number of conversations 2 7 15 10 6

b) Using a method of coding, find estimates of the mean and standard deviation of the time taken.

11. Chapter 2 A set of data was collected that related a temperature t with a length l, where l measured the length, l mm, and the temperature was in Celsius. The data is recorded in the table below.
t l

21.3 25.6 31.7 39.6 46.3 45.0 58.6 61.4

2462.12 2462.41 2462.73 2462.88 2463.03 2463.37 2463.69 2464.05

The results were then coded such that x = t and y = l 2460.00. a) Calculate Sxy and Sxx. b) c) d) e) f) Find the equation of the regression line of y on x in the form y = a + bx. Estimate the length of the rod at 35 C. Find the equation of the regression line of l on t. Estimate the length of the rod at 85 C. Comment on the reliability of your estimate in part (e).

12.

Chapter 2 A pilot recorded the distance traveled, m miles, and the amount of fuel used, f liters, each day. Summarized below are data from the drivers records for a random sample of 8 days.

The data are coded such that x = m 250 and y = f 100.


x = 130 y = 48 xy = 8880 Sxx = 20 487.5

a) Find the equation of the regression line of y on x in the form y = a + bx. b) Hence find the equation of the regression line of f on m. c) Predict the amount of fuel used on a journey of 235 miles.

13.

Chapter 1 Consider the following set of data that represents the amount paid by 15 customers going on holiday to the same destination.

299 350 99

1050 169 2100

2315 1015 689

999 650 550

485 830 475

For these data, find a) The mean and the standard deviation, b) The median and the inter-quartile range. c) Determine if any of the payments are outliers using the quartiles d) On a graph paper, draw a box plot clearly indicating the minimum point, maximum point, quartiles and any outliers in the set.

14. Chapter 1 The following data shows the grades of students in first year statistics and in second year statistics.
Student Number

01556 01255 02356 01589 01662 03589 01451 01589 01237 012255

Year One (o) Year Two (t)

62 69

78 78

79 82

65 65

71 74

77 64

85 87

85 61

90 65

81 79

[You may use o 2 = 60, 475, t 2 = 53,122, ot = 56, 076 ]

a) Draw a scatter diagram for the grades b) Calculate a regression line t on o and use it to find the grade of a student coming from another school with a grade of 80 in first year statistics. c) Calculate a regression line o on t and use it to find the grade of a student that skipped first year statistics but ended up with a 75 in year two statistics. d) Draw the two regression lines from b) and c) on your scatter diagram. What is special about their point of intersection of these two lines?

15.

Chapter 1

Consider the following data 125 a) b) c) d) 160 169 171 175 186 210 243 250 258 390 420

Find the mean and the standard deviation of the data. Find the median, upper and lower quartile of this data. Using the quartiles, determine if there are any outliers in this set. Using the quartile coefficient of skewness, determine the type of skewness in this data.

16. Chapter 2 An old lady trains a particular type of cat in doing a particular jump. Over the past six years, she has owned 10 cats. She believes there is a link between the age of the cat, x, and the height, y, in inches, that the cat can jump. The following is a summary of the age and the heights.

x = 41, y = 406, x
a) b) Find S xx and S xy

= 188, xy = 1818.5

Find the equation of the least square regression line in the form y = a + bx . (Give the values of a and b to two decimal places.)

c) d)

Interpret your values of a and b Find the height a cat aged 5.5 years would jump if your regression line is reliable.

17. Chapter 1 A frequency distribution is shown below.

Class Frequency

110 6

1120 8

2130 10

3140 9

4150 7

For this frequency distribution calculate: a) the interquartile range b) the skewness.
Chapter 1 18. The table below is the distribution of faults in very large bales of cloth.

Faults Frequency
a)

1 16

2 36

3 70

4 86

5 90

6 68

7 35

8 22

9 5

10 2

If a, b and are the mean, mode and standard deviation, work out the a +b value of to two decimal places.

b)

If Q1, Q2 and Q3 are the respective quartiles of the distribution, show that 3Q3 3Q2 Q1 = 0

19. Chapter 1 The average weekly incomes, in , of household in 11 regions of Japan are given below: 255.8, 252.0, 270.6, 298.4, 362.3, 297.2, 266.8, 261.7, 274.5, 259.1, 220.6

Find the median and upper and lower quartiles. On graph paper, draw a box and whisker plot to represent these data. Identify a possible outlier. Calculate the mean and standard deviation for these data. Further investigations suggested that the 362.30. value could be incorrect and should in fact be 326.30 f) Without carrying out any further calculation indicate what effect this change would have on the standard deviation, (ii) (iii) the interquartile range. a) b) c) d) e) 20.
Chapter 1 The following figures show the age of sixty people on a plane.

20 34 34 15 36 64 a) b)

24 31 46 48 5 17

26 33 37 13 12 27

23 35 30 52 21 24

24 25 31 36 5 24

25 26 37 32 3 38

21 22 34 46 16 64

23 24 31 46 4 3

22 27 45 57 16 5

4 36 43 67 54 2

Construct a grouped frequency distribution for these data with class widths of 10. Construct a stem and leaf diagram for the data.