Professional Documents
Culture Documents
Statistics Question Paper
Statistics Question Paper
1. Why do count data need to be modeled differently from the standard linear
regression model? {Answer not to exceed 3 sentences.]
[10 marks]
2. The following data refer to the number of prescriptions (Pres) written in a
year by 54 physicians on one of two drugs (A and B) for diabetes and the physician's
Age (Young (Y) Middle-Aged (M) or Old (O)).
(a) Analyze the data to study the effects of Drug and physician Age on the number of
prescriptions.Also,study the effect of interaction between Drug and Age.
Apply Poisson Regression and Negative Binomial regression Model.Compare them
Give annotated versions of any software output and summarize the output in a
few sentences.
[75 marks]
ID
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Pres
26
30
54
25
70
52
51
26
67
18
21
29
17
12
18
35
30
36
36
21
24
18
10
43
28
15
26
27
14
29
19
Drug Age
A
Y
A
Y
A
Y
A
Y
A
Y
A
Y
A
Y
A
Y
A
Y
A
M
A
M
A
M
A
M
A
M
A
M
A
M
A
M
A
M
A
O
A
O
A
O
A
O
A
O
A
O
A
O
A
O
A
O
B
Y
B
Y
B
Y
B
Y
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
29
31
41
20
44
42
26
19
16
39
28
21
39
29
20
21
24
17
13
15
15
16
28
B
B
B
B
B
B
B
B
B
B
B
B
B
B
B
B
B
B
B
B
B
B
B
Y
Y
Y
Y
Y
M
M
M
M
M
M
M
M
M
O
O
O
O
O
O
O
O
O
(b) Suppose that there are two missing values in the above data in the following way:
one value missing in second coumn - 5th row and
other value missing in third coumn - 8th row
Apply a suitable regression technique to impute the missing values.(15 marks)