Cheat Sheet
SAMPLING
Sampling frame
The sampling frame is the actual list of individuals that the sample will be drawn from.

Polygon
x: midpoints
y: frequencies

Ogive
x: upper class boundaries
y: cumulative frequencies
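The plotting coordinates for a frequency polygon and an ogive can be sketched as follows. This is a minimal illustration, assuming each class is given by its lower and upper boundary and that the midpoint is the average of the two; the data and the function names are hypothetical.

```python
def polygon_points(boundaries, freqs):
    """Frequency polygon: (class midpoint, frequency) pairs."""
    mids = [(lo + hi) / 2 for lo, hi in boundaries]
    return list(zip(mids, freqs))


def ogive_points(boundaries, freqs):
    """Ogive: (upper class boundary, cumulative frequency) pairs."""
    points, running = [], 0
    for (_, hi), f in zip(boundaries, freqs):
        running += f  # cumulative frequency up to this class
        points.append((hi, running))
    return points


# Hypothetical grouped data: (lower boundary, upper boundary) per class.
boundaries = [(0, 10), (10, 20), (20, 30)]
freqs = [4, 9, 7]

polygon = polygon_points(boundaries, freqs)
ogive = ogive_points(boundaries, freqs)
```

The last ogive point always carries the total frequency, which is a quick check on the tally.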
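Class Width = Upper Class Boundary − Lower Class Boundary
Midpoint = $\frac{\text{UCB} + \text{LCB}}{2}$

Mean
Ungrouped: $\bar{x} = \frac{\sum x}{n}$
Grouped: $\bar{x} = \frac{\sum xf}{\sum f}$

Mode:
$l + \left(\frac{f_1 - f_0}{2f_1 - f_0 - f_2}\right) \times c$

Median

Upper Quartile
$Q_3 = l + \left(\frac{\frac{3n + 1}{4} - m}{f}\right) \times c$

MEASURES OF SPREAD
Variance
Raw: $\frac{\sum x^2}{n} - \left(\frac{\sum x}{n}\right)^2$
Grouped: $\frac{\sum x^2 f}{\sum f} - \left(\frac{\sum xf}{\sum f}\right)^2$

Sample Standard Deviation
$s = \sqrt{\text{sample variance}} = \sqrt{s^2}$

Quartile Deviation:
$QD = \frac{Q_3 - Q_1}{2}$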
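A small sketch of the grouped mean, grouped variance, standard deviation and mode formulas. The sheet does not define the symbols in the mode formula, so the usual reading is assumed here: `l` is the lower boundary of the modal class, `f1` its frequency, `f0` and `f2` the frequencies of the classes before and after it, and `c` the class width. The data and function names are hypothetical.

```python
import math


def grouped_stats(midpoints, freqs):
    """Mean, variance and standard deviation from class midpoints and frequencies."""
    n = sum(freqs)                                            # sum of f
    sum_xf = sum(x * f for x, f in zip(midpoints, freqs))     # sum of x f
    sum_x2f = sum(x * x * f for x, f in zip(midpoints, freqs))  # sum of x^2 f
    mean = sum_xf / n
    variance = sum_x2f / n - mean ** 2
    return mean, variance, math.sqrt(variance)


def grouped_mode(l, f0, f1, f2, c):
    """Mode = l + (f1 - f0) / (2 f1 - f0 - f2) * c (symbol meanings assumed)."""
    return l + (f1 - f0) / (2 * f1 - f0 - f2) * c


# Hypothetical classes 0-10, 10-20, 20-30 with midpoints 5, 15, 25.
midpoints = [5, 15, 25]
freqs = [4, 9, 7]

mean, variance, sd = grouped_stats(midpoints, freqs)
mode = grouped_mode(l=10, f0=4, f1=9, f2=7, c=10)  # modal class is 10-20
```

For this data the mean is 16.5 and the variance is 52.75; the mode lands inside the modal class, as it should.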
PROBABILITY
Independent Events
$P(A|B) = P(A)$

Normal Distribution
THE NORMAL APPROXIMATION TO THE BINOMIAL DISTRIBUTION
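The sheet names the normal approximation to the binomial but its formula did not survive. As a hedged sketch, the code below assumes the standard statement, $X \sim B(n, p)$ approximated by $N(np, npq)$ with a continuity correction of 0.5, and compares it with the exact binomial probability. The function names and the numbers are illustrative only.

```python
from math import comb, sqrt
from statistics import NormalDist


def binomial_cdf(k, n, p):
    """Exact P(X <= k) for X ~ B(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k + 1))


def normal_approx_cdf(k, n, p):
    """P(X <= k) approximated by N(np, npq), with continuity correction (assumed form)."""
    q = 1 - p
    return NormalDist(mu=n * p, sigma=sqrt(n * p * q)).cdf(k + 0.5)


# Hypothetical example: 100 trials with p = 0.5, probability of at most 45 successes.
exact = binomial_cdf(45, 100, 0.5)
approx = normal_approx_cdf(45, 100, 0.5)
```

The two values agree closely here because both np and nq are large.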
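THE DISTRIBUTION OF THE SAMPLE MEAN
If $X$ has mean $\mu$ and variance $\sigma^2$, and either the population is normal or the population is not normal but $n$ is large, then
$\bar{X} \sim N\left(\mu, \frac{\sigma^2}{n}\right)$ and $z = \frac{\bar{x} - \mu}{\sigma/\sqrt{n}}$

To find $z_{\alpha/2}$, we look for $\frac{1 + \text{confidence level}}{2}$ in the standard normal table.

Confidence Level    z
99%                 2.576
98%                 2.326
97%                 2.17
95%                 1.96
90%                 1.645

Population normal; population standard deviation known
$\bar{x} - z\frac{\sigma}{\sqrt{n}} < \mu < \bar{x} + z\frac{\sigma}{\sqrt{n}}$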
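The lookup rule for z and the known-sigma interval can be sketched with Python's standard library. This is a minimal illustration, assuming `statistics.NormalDist` (Python 3.8+) stands in for the printed normal table; the sample figures and function names are hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def z_critical(confidence):
    """z with cumulative probability (1 + confidence) / 2 under N(0, 1)."""
    return NormalDist().inv_cdf((1 + confidence) / 2)


def mean_ci_known_sigma(xbar, sigma, n, confidence):
    """Interval xbar -/+ z * sigma / sqrt(n) for a known population sigma."""
    half_width = z_critical(confidence) * sigma / sqrt(n)
    return xbar - half_width, xbar + half_width


# Hypothetical sample: mean 50 from n = 25 observations, sigma = 10 known.
low, high = mean_ci_known_sigma(xbar=50, sigma=10, n=25, confidence=0.95)
```

Calling `z_critical` with 0.99, 0.98, 0.97, 0.95 and 0.90 reproduces the tabulated z values to the precision shown.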
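THE DISTRIBUTION OF THE SAMPLE PROPORTION
$P_s \sim N\left(p, \frac{pq}{n}\right)$
$Z = \frac{\left(P_s \pm \frac{1}{2n}\right) - p}{\sqrt{\frac{pq}{n}}}$

ESTIMATION
Unbiased estimate for the Mean
$\bar{x} = \frac{\sum x}{n}$
Unbiased Estimator of the Variance

Population not normal; population standard deviation not known; n ≥ 30
$\bar{x} - z\frac{\hat{\sigma}}{\sqrt{n}} < \mu < \bar{x} + z\frac{\hat{\sigma}}{\sqrt{n}}$

Population normal; population standard deviation not known; n < 30
$\bar{x} - t\frac{\hat{\sigma}}{\sqrt{n}} < \mu < \bar{x} + t\frac{\hat{\sigma}}{\sqrt{n}}$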
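A sketch of the estimation step when the population standard deviation is not known. The sheet does not show its variance estimator, so the code assumes the standard unbiased form, the sum of squared deviations divided by n − 1. The critical value (z or t) is passed in from a table rather than computed; the sample and function names are hypothetical.

```python
from math import sqrt


def unbiased_estimates(sample):
    """Return (x-bar, sigma-hat squared), using the n - 1 divisor (assumed form)."""
    n = len(sample)
    xbar = sum(sample) / n
    var_hat = sum((x - xbar) ** 2 for x in sample) / (n - 1)
    return xbar, var_hat


def mean_ci_unknown_sigma(sample, critical):
    """Interval xbar -/+ critical * sigma-hat / sqrt(n); critical is z or t from a table."""
    n = len(sample)
    xbar, var_hat = unbiased_estimates(sample)
    half_width = critical * sqrt(var_hat) / sqrt(n)
    return xbar - half_width, xbar + half_width


# Hypothetical small sample, n = 5, so t with n - 1 = 4 degrees of freedom applies.
sample = [12, 15, 11, 14, 13]
xbar, var_hat = unbiased_estimates(sample)
low, high = mean_ci_unknown_sigma(sample, critical=2.776)  # t table, 95%, df = 4
```

For n ≥ 30 the same function can be called with a z value such as 1.96 in place of t.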
HYPOTHESIS TESTING FOR THE MEAN
Null Hypothesis
$H_0: \mu = \mu_0$
Alternative hypothesis $H_1$
$z = \frac{\bar{x} - \mu_0}{\sigma/\sqrt{n}}$

Population not normal; population standard deviation not known; n large
$z = \frac{\bar{x} - \mu_0}{\hat{\sigma}/\sqrt{n}}$

Population normal; population standard deviation not known; n small
$t = \frac{\bar{x} - \mu_0}{\hat{\sigma}/\sqrt{n}}$

HYPOTHESIS TESTING FOR THE PROPORTION
Null Hypothesis
$H_0: p = p_0$
Alternative hypothesis $H_1$
$z = \frac{p_s - p_0}{\sqrt{\frac{p_0 q_0}{n}}}$

REGRESSION
$y = a + bx$
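The z statistics for the mean and proportion tests can be sketched as below. Two assumptions are made that the sheet does not state: $q_0 = 1 - p_0$, and a two-tailed alternative for the decision rule. The figures and function names are hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def z_for_mean(xbar, mu0, sigma, n):
    """z = (xbar - mu0) / (sigma / sqrt(n)); pass sigma-hat when sigma is unknown and n is large."""
    return (xbar - mu0) / (sigma / sqrt(n))


def z_for_proportion(p_s, p0, n):
    """z = (p_s - p0) / sqrt(p0 * q0 / n), with q0 = 1 - p0 (assumed)."""
    q0 = 1 - p0
    return (p_s - p0) / sqrt(p0 * q0 / n)


def reject_two_tailed(z, alpha):
    """Reject H0 when |z| exceeds the two-tailed critical value (assumed alternative)."""
    critical = NormalDist().inv_cdf(1 - alpha / 2)
    return abs(z) > critical


# Hypothetical tests: H0 mu = 50 with xbar = 52, sigma = 10, n = 100,
# and H0 p = 0.5 with sample proportion 0.56, n = 100.
z_mean = z_for_mean(xbar=52, mu0=50, sigma=10, n=100)
z_prop = z_for_proportion(p_s=0.56, p0=0.5, n=100)
```

With these numbers `z_mean` is 2.0, which is rejected at the 5% level but not at 1%, while `z_prop` is about 1.2 and is not rejected at 5%.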
Coefficient of correlation
Null Hypothesis
H0: Variables are independent.
Alternative Hypothesis
H1: Variables are dependent
Test Statistic
$\chi^2 = \sum \frac{(O - E)^2}{E}$
where $E = \frac{(\text{row total})(\text{column total})}{\text{sample size}}$
Rejection Region
We reject $H_0$ if $\chi^2_{\text{test}} > \chi^2_{\alpha}$
df = (R − 1)(C − 1)
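A sketch of the chi-square test of independence for a contingency table, using expected counts of (row total)(column total)/(sample size) and df = (R − 1)(C − 1). The critical value is taken from a chi-square table rather than computed; the observed counts and the function name are hypothetical.

```python
def chi_square_independence(observed):
    """Return (chi-square statistic, degrees of freedom) for a table of observed counts."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    n = sum(row_totals)  # sample size

    chi2 = 0.0
    for i, row in enumerate(observed):
        for j, o in enumerate(row):
            e = row_totals[i] * col_totals[j] / n  # expected count E
            chi2 += (o - e) ** 2 / e

    df = (len(observed) - 1) * (len(observed[0]) - 1)
    return chi2, df


# Hypothetical 2 x 2 table of observed counts.
observed = [[20, 30],
            [30, 20]]
chi2, df = chi_square_independence(observed)

critical = 3.841  # chi-square table value for df = 1 at the 5% level
reject_h0 = chi2 > critical
```

Every expected count here is 25, so the statistic is 4.0 on 1 degree of freedom and H0 is rejected at the 5% level.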