Professional Documents
Culture Documents
INF 397C
Version 3.5
9 11 13 3 7 2 8 9 6 10
Compute:
(a) n
(b) ∑ x
(c) x
(d) s
57 51 58 52 50 59 57 51 59 56
50 53 54 50 57 51 53 55 52 54
(a) n
(b) ∑ x
(c) x
(d) s
(e) s 2
(f) median
(g) mode
(h) range
(i) CV
skewness
ordinate
abscissa
central tendency
bimodal
ordered pair
Cartesian plane
stem-and-leaf plot
outlier
reliability
dispersion or variability
negatively skewed
(a) sample/population
(b) x /µ
(c) s/ σ
(d) variance/standard deviation
(e) n/N
(f) statistic/parameter
(g) mean, median, mode of the normal curve
(h) coefficient of variation/IQR
25 4 1 25 2 8
26 10 2 31 4 20
27 6 3 40 6 25
28 3 4 44 8 35
29 3 5 51 10 40
30 1 6 19 12 45
53 1 7 10 13 24
85 1 14 20
10. For each of the distributions in problem 9, answer the following questions.
x freq (x)
17. Define:
α
sampling distribution
Central Limit Theorem
Standard Error (SE)
ND(µ, ) σ
decile
descriptive statistics
Copyright Philip Doty, University of Texas at Austin, August 2004 10
inferential statistics
effect size
confidence interval (C.I.) on µ
Student's t
degrees of freedom (df)
random sampling
stratified random sample
E( x )/µ
α /df/t
Central Limit Theorem/C.I. on µ
α /C.I. on µ when σ
is known
α /C.I. on µ when σ
is not known
z/confidence interval on µ
t/C.I. on µ
t/z
2, 5, 9, 5, 3, 6, 6, 3, 1, 13
For the population from which the sample was drawn, µ = 4.1 and
σ = 2.93.
Calculate:
20. From previous research, we know that the standard deviation of the ages
of public library users is 3.9 years. If the "average" age of a sample of 90
public library users is 20.3 years, construct:
21. The sample size in problem 20 was increased to 145, while the "average"
age of the sample remained at 20.3 years. Construct three confidence
intervals around µ with the same levels of confidence as in Question 20.
Copyright Philip Doty, University of Texas at Austin, August 2004 12
22. Determine t for a C.I. of 95% around µ when n equals 20, 9, ∞ , and 1.
24. The following frequency distribution gives the values for variable x in a
sample drawn from a larger population.
x freq(x)
43 6
42 11
39 3
36 7
35 7
34 15
Calculate:
(a) x
(b) s
(c) SEµ = σ x
(d) E( x )
(e) Q 1 , Q 2 , and Q 3
(f) mode
(g) CV
(h) IQR
(i) a 95% C.I. on µ
(j) the width of the confidence interval in part (i)
(k) a 99% C.I. on µ
(l) the width of the confidence interval in part (k)
(m) PR (percentile rank) of x = 42
(n) our best estimates of µ and σ
from the data.
TIME (MINS)
Novice 14 20 19
Intermediate 15 16 9
Expert 22 11 2
29. Define:
statistical hypothesis
Ho
H1
p
Type I error
Type II error
χ2
nonparametric
contingency table
statistically significant
2
E (expected value) in χ
2
O (observed value) in χ
α /p 2
df/R/C [in χ situation]
H o / H1
α
χ 2 / /df
α /Type I error
Copyright Philip Doty, University of Texas at Austin, August 2004 15
2
E/O/ χ
Type I/Type II error
2. (a) n = 10
(b) ∑ x = 78
(c) x =
∑x 78
= = 7.8
n 10
(d) s =
∑x 2
− nx 2 714 − 608.4
= = 11.73 = 3.43
n −1 9
2
(e) s = 11.73
n +1 10 + 1
(f) P(med) = th score = th score = 5.5th score
2 2
5thscore + 6thscore 8 + 9
med = 5.5th score = = = 8.5
2 2
(g) mode = 9
s 3.43
(i) coefficient of variation (CV) = = = 0.44
x 7.8
4. (a) n = 20
(b) ∑ x = 1079
(c) x =
∑ x = 1079 = 53.95
n 20
(d) s =
∑x 2
− nx 2
=
58395 − 58212
= 9.63 = 3.1
n −1 19
2
(e) s = 9.63
53 + 54
median = = 53.5
2
(h) range = 59 - 50 = 9
s 3.1
(i) CV = = = 0.06
x 53.95
(d) mean = x =
∑ x = 776 ; 907 ; 2058 = 27.7,4.1,9.5
n 28 221 217
(e) s
2
=
∑x 2
− nx 2
=
22218 − (28)(27.7)2 733.88
= = 27.18
1
n −1 28 −1 27
9 3 25 0.12 1.00
8 9 22 0.36 0.88
6 5 13 0.20 0.52
3 2 8 0.08 0.32
2 6 6 0.24 0.24
25 1.00
(b) N = 25
(c) Range = H - L = 9 - 2 = 7
(e) mode = 8
(f) μ =
∑ x = 147 = 5.88
N 25
(g) σ =
∑x 2
− Nμ 2 1041− (25)(5.88)2 174.64
= = = 7.07 = 2.66
N 25 25
1+ ↓ P(Q2 ) 1+ 13
P(Q1 ) = = = 7th observation from the beginning; Q1= 3
2 2
1+ ↓ P(Q2 ) 1+ 13
P(Q 3 ) = = = 7th observation from the end; Q3= 8
2 2
σ 2.66
(i) CV = μ = 5.88 = 0.45
(j) IQR = Q3 - Q1 = 8 - 3 = 5
3− 5.88 9 − 5.88
z3 = = −1.08;z 9 = = 1.17
2.66 2.66
x − μ 45 − 50
(a) PR(45); z45 = = = −2.00 ⇒ −0.4772
σ 2.5
52.6 − 50
(b) z52.6 = = 1.04
2.5
58 − 50
(c) PR(58); z58 = = 3.2 ⇒ 0.4993
2.5
x − 50
−0.55 =
2.5
x − 50 = −1.375
x = 48.63
x − 50
z = 1.27 =
2.5
x = 53.175
49 − 50 −1
(g) PR(49); z49 = = = −0.4 ⇒ −0.1554
2.5 2.5
19. 2, 5, 9, 5, 3, 6, 6, 3, 1, 13
σ 2.93
(b) SE μ = σ x = = = 0.93
n 10
x − zSE μ ≤ μ ≤ x + zSEμ
x − zSE μ ≤ μ ≤ x + zSEμ
x − zSE μ ≤ μ ≤ x + zSEμ
df = n - 1
∞
n = 20, 9, , 1
t = 2.093, 2.306, 1.960, ?
df = n - 1
t = 2.861, 3.355, 2.576, ?
24. (a) x =
∑ x = 1844 = 37.63
n 49
(b) s =
∑x 2
− nx 2 70048 − 69385
= = 3.72
n −1 48
s 3.72
(c) SE μ = σ x = = = 0.537
n − 1 6.93
(d) E(x ) = μ
n +1
(e) P(Q2 ) = th position = 25th observation; Q2 = 36
2
1+ ↓ P(Q2 ) 1+ 25
P(Q1 ) = = = 13th observation from the beginning; Q1= 34
2 2
1+ ↓ P(Q2 ) 1+ 25
P(Q 3 ) = = = 13th observation from the end; Q3= 42
2 2
(f) mode = 34
(h) IQR = Q3 – Q1 = 42 - 34 = 8
x − zSE μ ≤ μ ≤ x + zSEμ
36.5 ≤ µ ≤ 38.7
TIME (MINS)
(O − E )2 RxC
χ2 = E=
E n
(14 − 21.12) 2 (15 − 15.94) 2 (22 − 13.95)2 (20 −19.46) 2 (16 −14.69) 2
χ2 = + + + +
21.12 15.94 13.95 19.46 14.69
χ 2 = 2.40 + 0.06 + 4.65 + 0.01 + 0.12 + 0.27 + 3.49 + 0.02 + 4.69 = 15.71
df = (R - 1)(C - 1) = 2 x 2 = 4
2
28. χ (4, 0.05) = 9.49