Professional Documents
Culture Documents
Juliet obtained a list of email addresses of doctors who work in her city. She
contacted them and asked them to fill in an anonymous questionnaire.
Participants were asked to state their annual income and to respond to a set of
questions. The responses were used to determine a happiness score out of 100. Of the
415 doctors on the list, 11 replied.
For the remaining ten responses in the table, Juliet calculates the mean
happiness score to be 52. 5.
Y = aX + b .
Y = cX
2
+ dX + e .
A report states that the mean annual income of doctors in the city is $80 000 .
Juliet decides to carry out a test to determine whether her sample could
realistically be taken from a population with a mean of $80 000.
(a.i) Describe one way in which Juliet could improve the reliability
of her investigation. [1]
Markscheme
[1 mark]
(a.ii) Describe one criticism that can be made about the validity of
Juliet’s investigation. [1]
Markscheme
Note: Do not condone the following common but vague responses unless
they make a clear link to validity:
Sample size is too small
Result is not generalizable
There may be other variables Juliet is ignoring
Sample might not be representative
[1 mark]
(b) Juliet classifies response K as an outlier and removes it from the
data. Suggest one possible justification for her decision to
remove it. [1]
Markscheme
[1 mark]
Markscheme
[2 marks]
Markscheme
[2 marks]
(d.i) State why the hypothesis test should be one-tailed. [1]
Markscheme
EITHER
only looking for change in one direction R1
OR
only looking for greater happiness with greater income R1
OR
only looking for evidence of positive correlation R1
[1 mark]
(d.ii) State the null and alternative hypotheses for this test. [2]
Markscheme
H0 : ρ = 0; H1 : ρ > 0 A1A1
Note: Award A1 for ρ seen (do not accept r), A1 for both correct hypotheses,
using their ρ or r. Accept an equivalent statement in words, however
reference to “correlation for the population” or “association for the
population” must be explicit for the first A1 to be awarded.
Watch out for a null hypothesis in words similar to “Annual income is not
associated with greater happiness”. This is effectively saying ρ ≤ 0 and
should not be condoned.
[2 marks]
(d.iii) The critical value for this test, at the 5% significance level, is
0. 549 . Juliet assumes that the population is bivariate normal.
Determine whether there is significant evidence of a positive
correlation between annual income and happiness. Justify your
answer.
[2]
Markscheme
[2 marks]
Markscheme
Markscheme
EITHER
the amount the happiness score increases for every $1 increase in (annual)
income A1
OR
rate of change of happiness with respect to (annual) income A1
[1 mark]
Markscheme
c = −2. 06 × 10
−9
(−2. 06191 … × 10
−9
) ,
d = 7. 05 × 10
−4
(7. 05272 … × 10
−4
) ,
[1 mark]
(e.iv) Find the coefficient of determination for each of the two
models she considers. [2]
Markscheme
[2 marks]
Markscheme
EITHER
quadratic model is a better fit to the data / more accurate A1
OR
quadratic model explains a higher proportion of the variance A1
[1 mark]
Markscheme
EITHER
not valid, R not a useful measure to compare models with different
2
numbers of parameters A1
OR
not valid, quadratic model will always have a better fit than a linear model
A1
Note: Accept any other sensible critique of the validity of the method. Do
not accept any answers which focus on the conclusion rather than the
method of model selection.
[1 mark]
(f.i) State the name of the test which Juliet should use. [1]
Markscheme
[1 mark]
(f.ii) State the null and alternative hypotheses for this test. [1]
Markscheme
EITHER
H0 : μ = 80 000; H1 : μ ≠ 80 000 A1
OR
H0 : (sample is drawn from a population where) the population mean is
$80 000
Note: Do not allow FT from an incorrect test in part (f )(i) other than a z-test.
[1 mark]
(f.iii) Perform the test, using a 5% significance level, and state your
conclusion in context. [3]
Markscheme
Note: For a z-test follow through from part (f )(i), either 0. 578 (from biased
estimate of variance) or 0. 598 (from unbiased estimate of variance).
0. 610 > 0. 05 R1
EITHER
OR
the sample could plausibly have been drawn from the quoted population
A1
Note: Allow R1FTA1FT from an incorrect p-value, but the final A1 must still be
in the context of the original research question.
[3 marks]
2. [Maximum mark: 6] 20N.1.SL.TZ0.T_10
On 90 journeys to his office, Isaac noted whether or not it rained. He also
recorded his journey time to the office, and classified each journey as short,
medium or long.
Of the 90 journeys to the office, there were 3 short journeys when it rained, 22
medium journeys when it rained, and 15 long journeys when it rained. There
were also 14 short journeys when it did not rain.
Isaac carried out a χ test at the 5% level of significance on these data, looking
2
Markscheme
* This question is from an exam for a previous syllabus, and may contain
minor differences in marking or structure.
[1 mark]
(b) Find the expected number of short trips when it rained. [3]
Markscheme
17
90
×
40
90
× 90 OR 17×40
90
(A1)(M1)
90
×
40
90
× 90 OR
17×40
90
seen.
7. 56 (7. 55555 … ,
68
9
) (A1) (C3)
[3 marks]
Markscheme
OR
Note: Follow through from part (a) for their phrasing of the null hypothesis.
[2 marks]
3. [Maximum mark: 19] 20N.2.SL.TZ0.T_1
Don took part in a project investigating wind speed, x km h −1
, and the time, y
minutes, to fully charge a solar powered robot.
The investigation was carried out six times. The results are recorded in the table.
Markscheme
* This question is from an exam for a previous syllabus, and may contain
minor differences in marking or structure.
(A4)
Note: Award (A1) for correct scales and labels.
Award (A3) for all six points correctly plotted.
Award (A2) for four or five points correctly plotted.
Award (A1) for two or three points correctly plotted.
Award at most (A0)(A3) if axes reversed.
If graph paper is not used, award at most (A1)(A0)(A0)(A0).
[4 marks]
Markscheme
19 (km h
−1
) (A1)
[1 mark]
(b.ii) Calculate y , the mean time to fully charge the robot. [1]
Markscheme
32 (minutes) (A1)
[1 mark]
(c) Plot and label the point M on your scatter diagram. [2]
Markscheme
[2 marks]
Markscheme
[2 marks]
(d.ii) Describe the correlation between the wind speed and the time
to fully charge the robot. [2]
Markscheme
Note: Award (A1)(ft) for (very) strong. Award (A1)(ft) for positive. Follow though
from their part (d)(i). If there is no answer to part (d)(i), award at most (A0)(A1)
for a correct direction.
[2 marks]
Markscheme
y = 0. 465x + 23. 2 (y = 0. 465020 … x + 23. 1646 …) (A1)(A1)(G2)
Note: Award (A1) for 0. 465x. Award (A1) for 23. 2. If the answer is not an
equation, award at most (A1)(A0).
[2 marks]
Markscheme
Note: Award a maximum of (A1)(A0) if the line is not straight/ruler not used.
Award (A0)(A0) if the points are connected.
Follow through from their point M in part (b) and their y -intercept in part
(e)(i).
If M is not plotted or labelled, then follow through from part (b).
[2 marks]
(e.iii) Hence or otherwise estimate the charging time when the wind
speed is 27 km h
−1
. [2]
Markscheme
Note: Award (M1) for correct substitution into their regression equation.
35. 7 (minutes) (35. 7201 …) (A1)(ft)(G2)
OR
Note: Award (M1) for an indication of using their regression line. This must
be illustrated by vertical and horizontal lines or marks at the correct place(s)
on their scatter diagram.
[2 marks]
Markscheme
wind speed does not cause a change in the time to charge (the robot)
(A1)
Note: Award (A1) for a statement that communicates the meaning of a non-
causal relationship between the two variables.
[1 mark]
4. [Maximum mark: 15] 19N.2.SL.TZ0.T_1
Casanova restaurant offers a set menu where a customer chooses one of the
following meals: pasta, fish or shrimp.
The manager surveyed 150 customers and recorded the customer’s age and
chosen meal. The data is shown in the following table.
A χ test was performed at the 10% significance level. The critical value for this
2
test is 4. 605.
Write down
Markscheme
[1 mark]
2 (A1)
[1 mark]
(c) Show that the expected number of children who chose shrimp
is 31, correct to two significant figures. [2]
Markscheme
69
150
×
67
150
× 150 OR 69×67
150
(M1)
Note: Award (M1) for correct substitution into expected frequency formula.
31 (AG)
Note: Both an unrounded answer that rounds to the given answer and
rounded answer must be seen for the (A1) to be awarded.
[2 marks]
Markscheme
(χ
2
calc
=) 2. 66 (2. 657537 …) (G2)
[2 marks]
Markscheme
(p-value =) 0. 265 (0. 264803 …) (G1)
Note: Award (G0)(G2) if the χ statistic is missing or incorrect and the p-value
2
is correct.
[1 mark]
(e) State the conclusion for this test. Give a reason for your answer. [2]
Markscheme
OR
[2 marks]
Markscheme
81
150
(
27
50
, 0. 54, 54%) (A1)(A1)(G2)
Markscheme
116
150
(
58
75
, 0. 773, 0. 773333 … , 77. 3%) (A1)(A1)(G2)
[2 marks]
(f.iii) Given that the customer is a child, calculate the probability that
they chose pasta or fish. [2]
Markscheme
34
69
(0. 493, 0. 492753 … , 49. 3%) (A1)(A1)(G2)
[2 marks]
5. [Maximum mark: 6] 19M.2.SL.TZ2.S_1
A group of 7 adult men wanted to see if there was a relationship between their
Body Mass Index (BMI) and their waist size. Their waist sizes, in centimetres, were
recorded and their BMI calculated. The following table shows the results.
Markscheme
0.141120, 11.1424
[3 marks]
Markscheme
0.977563
r = 0.978 A1 N1
[1 mark]
(b) Use the regression equation to estimate the BMI of an adult
man whose waist size is 95 cm. [2]
Markscheme
eg 0.141(95) + 11.1
24.5488
24.5 A1 N2
[2 marks]
6. [Maximum mark: 13] 19M.2.SL.TZ2.T_1
Sila High School has 110 students. They each take exactly one language class
from a choice of English, Spanish or Chinese. The following table shows the
number of female and male students in the three different language classes.
A χ test was carried out at the 5 % significance level to analyse the relationship
2
The critical value at the 5 % significance level for this test is 5.99.
(a) Write down the null hypothesis, H0 , for this test. [1]
Markscheme
* This question is from an exam for a previous syllabus, and may contain
minor differences in marking or structure.
Markscheme
2 (AG)
[1 mark]
Markscheme
[1 mark]
Markscheme
Note: Follow through from part (c)(ii). Accept “do not accept” in place of
“reject.” Do not award (A1)(ft)(R0).
OR
[2 marks]
(e.i) Find the probability that the student does not take the Spanish
class. [2]
Markscheme
88
110
(
4
5
, 0.8, 80% ) (A1)(A1)(G2)
Note: Award (A1) for correct numerator, (A1) for correct denominator.
[2 marks]
(e.ii) Find the probability that neither of the two students take the
Spanish class. [3]
Markscheme
88
110
×
87
109
(M1)(M1)
Note: Award (M1) for multiplying two fractions. Award (M1) for multiplying
their correct fractions.
OR
(
46
110
)(
45
109
)+ 2(
46
110
)(
42
109
)+ (
42
110
)(
41
109
) (M1)(M1)
Note: Award (M1) for correct products; (M1) for adding 4 products.
0.639 (0.638532 … ,
348
545
, 63.9% ) (A1)(ft)(G2)
(e.iii) Find the probability that at least one of the two students is
female. [3]
Markscheme
1 −
67
110
×
66
109
(M1)(M1)
Note: Award (M1) for multiplying two correct fractions. Award (M1) for
subtracting their product of two fractions from 1.
OR
43
110
×
42
109
+
43
110
×
67
109
+
67
110
×
43
109
(M1)(M1)
Note: Award (M1) for correct products; (M1) for adding three products.
545
) (A1)(G2)
[3 marks]
7. [Maximum mark: 9] 18N.2.SL.TZ0.T_1
The marks obtained by nine Mathematical Studies SL students in their projects
(x) and their final IB examination scores (y) were recorded. These data were used
to determine whether the project mark is a good predictor of the examination
score. The results are shown in the table.
(a.ii) Use your graphic display calculator to write down ȳ , the mean
examination score. [1]
Markscheme
54 (G1)
[1 mark]
Markscheme
0.5 (G2)
[2 marks]
(b.i) Find the exact value of m and of c for these data. [2]
Markscheme
m = 0.875, c = 41.75 (m =
7
8
, c =
167
4
) (A1)(A1)
Note: Award (A1) for 0.875 seen. Award (A1) for 41.75 seen. If 41.75 is rounded
to 41.8 do not award (A1).
[2 marks]
Markscheme
Note: Award (M1) for correct substitution into their regression line.
[2 marks]
Markscheme
the estimate is valid (A1)
OR
Note: Do not award (A1)(R0). The (R1) may be awarded for reasoning based
on strength of correlation, but do not accept “correlation coefficient is not
strong enough” or “correlation is not large enough”.
Award (A0)(R0) for this method if no numerical answer to part (a)(iii) is seen.
[2 marks]
8. [Maximum mark: 21] 18N.3.AHL.TZ0.Hsp_3
Mr Sailor owns a fish farm and he claims that the weights of the fish in one of his
lakes have a mean of 550 grams and standard deviation of 8 grams.
Assume that the weights of the fish are normally distributed and that Mr Sailor’s
claim is true.
Kathy is suspicious of Mr Sailor’s claim about the mean and standard deviation of
the weights of the fish. She collects a random sample of fish from this lake whose
weights are shown in the following table.
Using these data, test at the 5% significance level the null hypothesis
H : μ = 550 against the alternative hypothesis H : μ < 550 , where μ grams
0 1
Kathy decides to use the same fish sample to test at the 5% significance level
whether or not there is a positive association between the weights and the
lengths of the fish in the lake. The following table shows the lengths of the fish in
the sample. The lengths of the fish can be assumed to be normally distributed.
(a.i) Find the probability that a fish from this lake will have a weight
of more than 560 grams. [2]
Markscheme
Note: Accept all answers that round to the correct 2sf answer in (a), (b) and
(c) but not in (d).
[2 marks]
(a.ii) The maximum weight a hand net can hold is 6 kg. Find the
probability that a catch of 11 fish can be carried in the hand net. [4]
Markscheme
Note: Accept all answers that round to the correct 2sf answer in (a), (b) and
(c) but not in (d).
11
let Y = ∑ Xi
i=1
Var (Y) = 11 × 8
2
(704) (M1)A1
[4 marks]
Markscheme
Note: Accept all answers that round to the correct 2sf answer in (a), (b) and
(c) but not in (d).
[2 marks]
Markscheme
Note: Accept all answers that round to the correct 2sf answer in (a), (b) and
(c) but not in (d).
p = 0.25779…= 0.258 A2
[2 marks]
(b.iii) State the conclusion of the test, justifying your answer. [2]
Markscheme
Note: Accept all answers that round to the correct 2sf answer in (a), (b) and
(c) but not in (d).
p > 0.05 R1
[2 marks]
Markscheme
Note: Accept all answers that round to the correct 2sf answer in (a), (b) and
(c) but not in (d).
H0 : ρ = 0 , H1 : ρ > 0 A1
[1 mark]
Markscheme
Note: Accept all answers that round to the correct 2sf answer in (a), (b) and
(c) but not in (d).
r = 0.782 A2
[2 marks]
Markscheme
Note: Accept all answers that round to the correct 2sf answer in (a), (b) and
(c) but not in (d).
0.01095… = 0.0110 A1
[3 marks]
Markscheme
Note: Accept all answers that round to the correct 2sf answer in (a), (b) and
(c) but not in (d).
Note: Award M1A0A0 for the wrong regression line, that is y = 0.7393…x –
51.62….
[3 marks]
9. [Maximum mark: 6] 18M.1.SL.TZ1.T_4
A scientist measures the concentration of dissolved oxygen, in milligrams per
litre (y) , in a river. She takes 10 readings at different temperatures, measured in
degrees Celsius (x).
Markscheme
Note: Award (A1) for an answer of 0.974 (minus sign omitted). Award (A1) for
an answer of −0.973 (incorrect rounding).
[2 marks]
(a.ii) For these data, find the equation of the regression line y on x. [2]
Markscheme
Note: Award (A1) for −0.365x, (A1) for 17.9. Award at most (A1)(A0) if not an
equation or if the values are reversed (eg y = 17.9x −0.365).
[2 marks]
(b) Using the equation of the regression line, estimate the
concentration of dissolved oxygen in the river when the
temperature is 18 °C. [2]
Markscheme
Note: Award (M1) for correctly substituting 18 into their part (a)(ii).
[2 marks]
10. [Maximum mark: 6] 18M.1.SL.TZ2.T_1
The following scatter diagram shows the scores obtained by seven students in
their mathematics test, m, and their physics test, p.
(a) Plot and label the point M(m̄, p̄ ) on the scatter diagram. [2]
Markscheme
* This question is from an exam for a previous syllabus, and may contain
minor differences in marking or structure.
(A1)(A1)
(C2)
Note: Award (A1) for mean point plotted and (A1) for labelled M.
[2 marks]
(b) Draw the line of best fit, by eye, on the scatter diagram. [2]
Markscheme
straight line through their mean point crossing the p-axis at 5±2 (A1)(ft)(A1)
(ft) (C2)
Note: Award (A1)(ft) for a straight line through their mean point. Award (A1)
(ft) for a correct p-intercept if line is extended.
[2 marks]
(c) Using your line of best fit, estimate the physics test score for a
student with a score of 20 in their mathematics test. [2]
Markscheme
point on line where m = 20 identified and an attempt to identify y-
coordinate (M1)
[2 marks]
11. [Maximum mark: 13] 18M.2.SL.TZ1.S_8
The following table shows values of ln x and ln y.
Markscheme
* This question is from an exam for a previous syllabus, and may contain
minor differences in marking or structure.
−0.453620, 6.14210
[3 marks]
Markscheme
[3 marks]
Markscheme
METHOD 1
eg ln y = ln (kx ) ,
n n
ln (kx ) = a ln x + b
eg ln k + ln n
(x )
eg ln k + n ln x
eg n = a, b = ln k
eg ln k = 6.14210, k = e
6.14210
465.030
METHOD 2
eg e
ln y
= e
a ln x+b
eg e
a ln x
× e
b
eg ln x a
correct equation in y A1
eg y = x
a
× e
b
eg k = e ,
b
n = a
465.030
METHOD 3
eg ln y = ln (kx ) ,
n n
ln (kx ) = a ln x + b
eg b = ln (e )
b
eg ln b
(e x )
a
eg k = e ,
b
n = a
465.030
[7 marks]
12. [Maximum mark: 6] 18M.2.SL.TZ2.S_1
The following table shows the mean weight, y kg , of children who are x years old.
The relationship between the variables is modelled by the regression line with
equation y = ax + b.
Markscheme
a = 1.91966 b = 7.97717
[3 marks]
Markscheme
0.984674
r = 0.985 A1 N1
[1 mark]
(b) Use your equation to estimate the mean weight of a child that is
1.95 years old. [2]
Markscheme
11.7205
11.7 (kg) A1 N2
[2 marks]
13. [Maximum mark: 14] 17N.2.SL.TZ0.S_8
Adam is a beekeeper who collected data about monthly honey production in his
bee hives. The data for six of his hives is shown in the following table.
The relationship between the variables is modelled by the regression line with
equation P = aN + b.
Adam has 200 hives in total. He collects data on the monthly honey production
of all the hives. This data is shown in the following cumulative frequency graph.
Adam’s hives are labelled as low, regular or high production, as defined in the
following table.
Adam knows that 128 of his hives have a regular production.
Markscheme
* This question is from an exam for a previous syllabus, and may contain
minor differences in marking or structure.
a = 6.96103, b = −454.805
[3 marks]
Markscheme
eg6.96(270) − 455
1424.67
P = 1420 (g) A1 N2
[2 marks]
(c) Write down the number of low production hives. [1]
Markscheme
40 (hives) A1 N1
[1 mark]
Markscheme
eg128 + 40
k = 1640 A1 N3
[3 marks]
(d.ii) Find the number of hives that have a high production. [2]
Markscheme
eg200 − 168
32 (hives) A1 N2
[2 marks]
Markscheme
n
egX ∼ B(n, p), (
r
) p (1 − p)
n−r
0.144364
0.144 A1 N2
[3 marks]