Murky data
Attendance at a game is a function of both competing teams
Attendance attributable to just one team could not be obtained
When the data was first restricted to observations where the playoff ranking
was available, only a small number of usable observations remained (274).
When the data is balanced, the number of usable observations shrinks even
further (100)
Small sample size (~1000)
Data Concatenation
Data was collected from two sources.
The first source (Open Source Sports) had data on hockey
teams from 1906 to 2010
The second source (Rodney Fort's Sports Economics) had
data on attendance from 1952 to 2012
Initially all observations occurring outside the interval of
1952-2010 were dropped. This was done so that a 1:1
merge could be performed.
Then the command xtset name year was run to
declare the data as panel data.
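The steps above can be sketched in Stata roughly as follows. This is a minimal illustration, not the original do-file: the toy observations, the variable names pctwl and attendance as used here, and the team_id variable are assumptions made for the example. The encode step is only needed if name is stored as a string, since xtset expects a numeric panel variable.

```stata
* Toy stand-in for the first source (team data); all values are invented.
clear
input str3 name int year float pctwl
"AAA" 1951 0.50
"AAA" 1952 0.55
"AAA" 1953 0.60
"BBB" 1952 0.45
"CCC" 1960 0.52
end
* Keep only the years covered by both sources.
keep if inrange(year, 1952, 2010)
tempfile teams
save `teams'

* Toy stand-in for the second source (attendance data); all values are invented.
clear
input str3 name int year float attendance
"AAA" 1952 15000
"AAA" 1953 15500
"BBB" 1952 14000
"BBB" 2011 16000
end
keep if inrange(year, 1952, 2010)

* 1:1 merge on team and year; keep only team-years found in both sources.
merge 1:1 name year using `teams'
keep if _merge == 3
drop _merge

* xtset needs a numeric panel identifier, so encode the string name first.
encode name, generate(team_id)
xtset team_id year
```

With the toy data, three team-years survive the merge, and xtset reports an unbalanced panel because the two teams are observed for different numbers of years.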
       rank |      Freq.     Percent        Cum.
------------+-----------------------------------
          1 |        177       18.40       18.40
          2 |        180       18.71       37.11
          3 |        180       18.71       55.82
          4 |        181       18.81       74.64
          5 |        168       17.46       92.10
          6 |         54        5.61       97.71
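As a quick consistency check on the tabulation, the total number of observations with a rank can be backed out from any row. This assumes each percentage is the row frequency divided by the total number of non-missing rank values:

```latex
N \approx \frac{177}{0.1840} \approx 962,
\qquad
\frac{177 + 180 + 180 + 181 + 168 + 54}{962} = \frac{940}{962} \approx 97.71\%
```

This agrees with the sample size of roughly 1000 noted earlier. The six ranks shown account for 940 observations, so the remaining 22 or so fall in ranks not shown in the table.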
The Regression
xtreg teamaverageattendancepergame pctwl L1.(rank) L(1/3).(playoffc) ,r
Number of obs      =       264
Number of groups   =        29
Obs per group: min =         1
               avg =       9.1
               max =        28
Wald chi2(5)       =    119.48
Prob > chi2        =    0.0000
------------------------------------------------------------------------------
teamaverag~e |      Coef.   Std. Err.      z    P>|z|     [95% Conf. Interval]
-------------+----------------------------------------------------------------
       pctwl |   689.3315   134.8733     5.11   0.000     424.9848    953.6783
             |
        rank |
         L1. |  -192.1163   116.5086    -1.65   0.099     -420.469    36.23642
             |
    playoffc |
         L1. |  -168.2337   47.78129    -3.52   0.000    -261.8833   -74.58409
         L2. |  -85.85899   51.21385    -1.68   0.094    -186.2363    14.51832
         L3. |  -226.2054   49.08902    -4.61   0.000    -322.4182   -129.9927
             |
       _cons |   17553.48   531.8035    33.01   0.000     16511.17     18595.8
-------------+----------------------------------------------------------------
     sigma_u |  1474.9191
     sigma_e |  1595.7286
         rho |  .46071747   (fraction of variance due to u_i)
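The summary quantities in the output can be reproduced by hand from the reported numbers. The worked check below uses the usual definitions (z as coefficient over standard error, a normal-based 95% interval with critical value 1.96, and rho as the share of total variance due to the team-level effect u_i). Small differences in the last digits come from rounding.

```latex
z_{\text{pctwl}} = \frac{689.3315}{134.8733} \approx 5.11

689.3315 \pm 1.96 \times 134.8733 \approx [\,424.98,\ 953.68\,]

\rho = \frac{\sigma_u^2}{\sigma_u^2 + \sigma_e^2}
     = \frac{1474.9191^2}{1474.9191^2 + 1595.7286^2}
     \approx \frac{2{,}175{,}386}{4{,}721{,}736}
     \approx 0.4607
```

Read this way, about 46% of the unexplained variation in average attendance is attributed to persistent differences between teams rather than to year-to-year noise within a team.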
Conclusion
Team average attendance is correlated with the current win/loss percentage, the
previous season's rank, and the team's playoff ranking over the previous three seasons.
The lagged rank and the second playoff lag are significant only at the 10% level.
Average attendance is affected by many variables, so there is a strong possibility
of omitted variable bias.
More data is needed for a more balanced analysis.