You are on page 1of 7

S#E# Half Killings IMDB G

1 4
S1E1 9 2.250 10.000
2 0
1 1
S1E2 8.7 4.350 9.000 8.700
2 1
1 0
S1E3 8.6 0.000 8.000
2 0
1 0
S1E4 8.7 8.700 7.000
2 1
1 2 6.000
S1E5 9 1.800
2 3
1 2 5.000
S1E6 9.1 2.275
2 2 4.350
1 0 4.000
S2E1 8.7 4.350
2 2
1 1 3.000
S2E2 8.5 8.500
2 0 2.250
1 0 2.000 1.800
S2E3 8.8 4.400
2 2
1 2 1.000
S2E4 8.7 2.900
2 1 0.000
0.000
1 3 S1E1 S1E2 S1E3 S1E4 S1E5
S2E5 8.7 2.175
2 1
1 2
S2E6 9 1.800
2 3 Hypothesis Test
1 0 H0: μ
S3E1 8.7 0.000
2 0 H1: μ
1 1 μG G bar
S3E2 8.5 8.500
2 0 3 4.192
1 0 The test score does not lie in the rejec
S3E3 8.8 8.800
2 1
1 0 The variable G is a more mathematical way of sh
S3E4 9.5 2.375 down. Meanwhile, if we increase the ratings, w
2 4
The graph shows that the highest of this ratio is
1 3
S3E5 9 3.000
2 0 The hypothesis tests show unexpected results fo
1 0 the population mean is even greater than 4. The
S3E6 8.8 8.800 conduct the hypothesis testing for the populatio
2 1 the values are below 3, it is unlikely that the pop
1 0 In conclusion, we cannot say anything in certain
S4E1 9 4.500 allow us to understand the nature of the variabl
2 2
1 2
S4E2 9.7 3.233
2 1
1 1
S4E3 8.8 2.200
2 3
1 1
S4E4 8.8 8.800
2 0
1 0
S4E5 8.7 2.900
2 3
1 0
S4E6 9.7 0.000
2 0
1 1
S5E1 8.4 4.200
2 1
1 0
S5E2 8.5 4.250
2 2
1 0
S5E3 8.5 8.500
2 1
1 0
S5E4 8.6 2.867
2 3
1 1
S5E5 8.6 8.600
2 0
1 1
S5E6 8 8.000
2 0
1 1
S6E1 8.5 1.700
2 4
1 1
S6E2 9.3 1.860
2 4
1 2
S6E3 8.7 1.243
2 5
1 2
S6E4 9.1 2.275
2 2
1 0
S6E5 9.7 2.425
2 4
1 1
S6E6 8.4 8.400
2 0
G for eve

8.700 8.800 8.800


8.500 8.500

4.350 4.350 4.400

2.900 3.000

2.275 2.375
2.175
1.800 1.800

0.000 0.000
S1E2 S1E3 S1E4 S1E5 S1E6 S2E1 S2E2 S2E3 S2E4 S2E5 S2E6 S3E1 S3E2 S3E3 S3E4 S3E5 S3E6

Hypothesis Test for μG at α=0.05 Hypothesis Test fo


H0: μG >=3 H0: σ2G
H1: μG <3 H1: σ2G
sG nG Test Score tc σ2G
2.972 36 2.408 1.690 8
test score does not lie in the rejection region, so we reject H0 at α=0.05 The test score lies in the rejection r

is a more mathematical way of showing the relationship between the IMDB ratings of each episode of the show, and the number of killing
while, if we increase the ratings, we can see that each killing contributes to a higher level of rating by the user.
ws that the highest of this ratio is observed during season 3, although the number of killings is relatively lower. This means a higher numbe
s tests show unexpected results for the behavior of this variable. As we can see through the graph, and the sample mean of the G variable,
mean is even greater than 4. The sample variance, on the other hands, shows relatively expectable results, that is the population variance
pothesis testing for the population proportion, i.e., the proportion of the G variables less than 3 out of the total sample. The results show
below 3, it is unlikely that the population mean will be below 3 as well, which leads to a contradiction between the two hypotheses.
we cannot say anything in certain about the population of G due to the fact that hypotheses testing only provides limited evidence, and do
derstand the nature of the variable.
G for every episode

8.800 8.800 8.800


8.500 8.500 8.600
8.00

4.500
4.200 4.250

3.233
3.000 2.900 2.867
2.375
2.200

0.000
S3E2 S3E3 S3E4 S3E5 S3E6 S4E1 S4E2 S4E3 S4E4 S4E5 S4E6 S5E1 S5E2 S5E3 S5E4 S5E5 S5E

Hypothesis Test for σ2G at α=0.05 Hypothesis Test for pG (G


H0: σ2G <= 8 H0: p
H1: σ2G > 8 H1:
s2G DOF Test Score χc pG
8.830 35 38.63 49.80 0.55
The test score lies in the rejection region, so we reject H0 at α=0.05 The test score does not lie in the rejec

de of the show, and the number of killings in each episode. In the table, we can see that as the number of killings goes up, the weightage of
by the user.
tively lower. This means a higher number of killings doesn’t necessarily mean higher IMDB ratings, rather, it means the opposite.
and the sample mean of the G variable, that the population mean should be greater than 4. However, the hypothesis test shows that at 95
le results, that is the population variance is greater than or equal to 8, which is close to, or maybe even greater than the sample variance. L
ut of the total sample. The results show that it is likely that less than 55% of the values are below 3. These numbesr provide us uncertain r
tion between the two hypotheses.
g only provides limited evidence, and does not allow us to make any certain conclusions. However, we have been able to draw certain from
8.500 8.600
8.400
8.000

4.250

2.867
2.275 2.425
1.860
1.700
1.243

S5E2 S5E3 S5E4 S5E5 S5E6 S6E1 S6E2 S6E3 S6E4 S6E5 S6E6

Hypothesis Test for pG (G being less than 3) at α=0.05


H0: pG <= 0.55
H1: pG > 0.55
p cap n Test Score zc
0.5 36 -0.60 -1.96
he test score does not lie in the rejection region, so we fail to reject H0 at α=0.05

r of killings goes up, the weightage of each killing in the rating of the show goes

her, it means the opposite.


, the hypothesis test shows that at 95% confidence, we reject the hypothesis that
n greater than the sample variance. Lastly, similar results are observed when we
These numbesr provide us uncertain results because, if, for instance, only 30% of

e have been able to draw certain from the data, the graph and the tests which
ECON 222, Spring 2022, Eray Yucel
Microsoft Excel functions are required: visit https://dar.vin/nDAmO for Excel assistance
Hw4: week of 18 April 2022

Grading: No submission=0 points, Late submission=30 points deduction. Other deductions will be announced by the TA team. Note
sharing your work in any form with other students may harm your individual letter grade assessment. Unacceptably high similarity b
disciplinary investigation. Once you are finished with Homework, save your file and send a copy of your workbook to your TA’s emai
for future reference and keep that e-mail message safe at all times.

Copy&paste the data you've used in Hw3 into this sheet. From Hw3, you now know the number of killings for every episode o
- Calculate for every episode (not half-episode) in your dataset 'the ratio of IMDB score to number of killings'. Name this va
- What is an economic interpretation of G?
- Plot G across seasons and episodes in a single chart. What do you observe?
- Using G itself or its proper mathematical transformations, conduct&conclude a sequence of hypothesis tests (for Mu, for
behavior.

At the very end, insert a text box and report all your findings in your own 300 to 500 words, by paying a special attention to
Submission deadline is 24 April 2022, Sunday, 15:00. Your TA will send you a short message of “Ok” upon her/his receipt of yo
nnounced by the TA team. Note that Homework performance makes up 12.5% of your semester total; so,
t. Unacceptably high similarity between submissions will yield a grade of 0 points as well as possible
our workbook to your TA’s email as an attachment. Send it also to your own Bilkent-registered e-mail address

r of killings for every episode of GoT as well as IMDB scores. Your tasks in Hw4 are as follows:
mber of killings'. Name this variable G. Note that you're supposed to study G only.

f hypothesis tests (for Mu, for P and for Sigmasq), in a way to provide a good technical description of G's

by paying a special attention to the key technical terms.


Ok” upon her/his receipt of your email.

You might also like