Professional Documents
Culture Documents
R (p) = cpe
dp
, c, d ∈ R
There are two competing models, A and B with different values for the
parameters c and d.
The company will choose the model with the smallest value for the
sum of square residuals.
The Commissioner for the event would like to find the Spearman’s rank
correlation coefficient.
(c.ii) State whether this estimate is reliable. Justify your answer. [2]
[2]
(e.i) Find the value of the Spearman’s rank correlation coefficient, r .
s [2]
A sample of apples are taken from 2 trees, A and B, in different parts of the
orchard.
The owner of the orchard wants to know whether the mean weight of the apples
from tree A(μ ) is greater than the mean weight of the apples from tree B(μ )
A B
H0 : μ A = μ B and H 1 : μA > μB
(a) Find the probability that an apple from the tree has a weight
greater than 90 grams. [2]
State the conclusion of the test, giving a reason for your answer. [2]
5. [Maximum mark: 9] EXM.1.SL.TZ0.11
A calculator generates a random sequence of digits. A sample of 200 digits is
randomly selected from the first 100 000 digits of the sequence. The following
table gives the number of times each digit occurs in this sample.
It is claimed that all digits have the same probability of appearing in the
sequence.
(a) Calculate Spearman’s rank correlation coefficient for this data. [5]
(b) State what conclusion Kayla can make from the answer in part
(a). [1]
7. [Maximum mark: 9] EXM.1.SL.TZ0.3
Charles wants to measure the strength of the relationship between the price of a
house and its distance from the city centre where he lives. He chooses houses of
a similar size and plots a graph of price, P (in thousands of dollars) against
distance from the city centre, d (km).
(d) State what conclusion Charles can make from the answer in
part (c).
[1]
8. [Maximum mark: 11] EXM.1.SL.TZ0.8
In an effort to study the level of intelligence of students entering college, a
psychologist collected data from 4000 students who were given a standard test.
The predictive norms for this particular test were computed from a very large
population of scores having a normal distribution with mean 100 and standard
deviation of 10. The psychologist wishes to determine whether the 4000 test
scores he obtained also came from a normal distribution with mean 100 and
standard deviation 10. He prepared the following table (expected frequencies
are rounded to the nearest integer):
(a) Copy and complete the table, showing how you arrived at your
answers. [5]
(b) Test the hypothesis at the 5% level of significance. [6]
9. [Maximum mark: 6] EXM.1.AHL.TZ0.19
A company sends a group of employees on a training course. Afterwards, they
survey these employees to gather data on the effectiveness of the training. In
order to test the reliability of the survey, they design two sets of similar
questions, which are given to the employees one week apart.
The questions in the survey were grouped in different sections. The mean scores
of the employees on the first section of each survey are given in the table.
(b) State a possible disadvantage of using this test for reliability. [1]
(a) Calculate the mean number of eggs laid by these birds. [2]
(a) Find the exact value of the mean of this distribution. [2]
(b) Test, at the 5% level of significance, whether or not the data can
be modelled by a Poisson distribution. [12]
14. [Maximum mark: 13] EXM.2.SL.TZ0.5
A pharmaceutical company has developed a new drug to decrease cholesterol.
The final stage of testing the new drug is to compare it to their current drug. They
have 150 volunteers, all recently diagnosed with high cholesterol, from which
they want to select a sample of size 18. They require as close as possible 20% of
the sample to be below the age of 30, 30% to be between the ages of 30 and 50
and 50% to be over the age of 50.
Half of the 18 volunteers are given the current drug and half are given the new
drug. After six months each volunteer has their cholesterol level measured and
the decrease during the six months is shown in the table.
The company uses a t-test, at the 1% significance level, to determine if the new
drug is more effective at decreasing cholesterol.
(a) State the name for this type of sampling technique. [1]
(b) Calculate the number of volunteers in the sample under the age
of 30. [3]
(c.i) The new drug. [1]
(g) State the conclusion of this test, in context, giving a reason. [2]
15. [Maximum mark: 16] EXM.2.SL.TZ0.6
Jim writes a computer program to generate 500 values of a variable Z. He obtains
the following table from his results.
It is required to find the area bounded by the curve, the x-axis, the y -axis and the
line x = 10.
(a) Use the trapezoidal rule to find an estimate for the area. [3]
(b.i) Use all the coordinates in the table to find the equation of the
least squares cubic regression curve. [3]
(c.i) Write down an expression for the area enclosed by the cubic
regression curve, the x-axis, the y -axis and the line x = 10.
[1]
(a) Show that this data leads to an estimated value of p = 0.4 . [1]
The critical region is defined to be (ȳ < 495) ∪ (ȳ > 505) .
(a.ii) Calculate the mean of these data and hence estimate the value
of p. [5]
(a.iii) Calculate an appropriate value of χ and state your conclusion,
2
A random sample of 149 scores for a university exam are given in the table.
The university wants to know if the scores follow a normal distribution, with the
mean and variance found in part (a).
The university assigns a pass grade to students whose scores are in the top 80%.
The university also wants to know if the exam is gender neutral. They obtain
random samples of scores for male and female students. The mean, sample
variance and sample size are shown in the table.
The university awards a distinction to students who achieve high scores in the
exam. Typically, 15% of students achieve a distinction. A new exam is trialed with
a random selection of students on the course. 5 out of 20 students achieve a
distinction.
(b) Show that the expected frequency for 20 < x ≤ 4 is 31.5 correct
to 1 decimal place. [3]
(c) Perform a suitable test, at the 5% significance level, to
determine if the scores follow a normal distribution, with the
mean and variance found in part (a). You should clearly state
your hypotheses, the degrees of freedom, the p-value and your
conclusion. [8]
(d) Use the normal distribution model to find the score required to
pass.
[2]
She recorded the weights of eggs, in grams, from a random selection of geese.
The data is shown in the table.
In order to test her claim, Arriane performs a t-test at a 10% level of significance.
It is assumed that the weights of eggs are normally distributed and the samples
have equal variances.
(c) State whether the result of the test supports Arriane’s claim.
Justify your reasoning. [2]
21. [Maximum mark: 6] 21M.1.SL.TZ2.11
A newspaper vendor in Singapore is trying to predict how many copies of The
Straits Times they will sell. The vendor forms a model to predict the number of
copies sold each weekday. According to this model, they expect the same
number of copies will be sold each day.
To test the model, they record the number of copies sold each weekday during a
particular week. This data is shown in the table.
The critical value for the test is 9. 49 and the hypotheses are
(a) Find an estimate for how many copies the vendor expects to
sell each day. [1]
(b.i) Write down the degrees of freedom for this test. [1]
(b.ii) Write down the conclusion to the test. Give a reason for your
answer. [4]
© International Baccalaureate Organization, 2023