Professional Documents
Culture Documents
Let’s do a survey :
Which picture do you prefer ?
Desert Bridge
Answer on blackboard
2
James Bond : 4
Definitions
Representative sample ?
POPULATION
Sample
5
Quota’s method
Other 20
6
Estimation of a mean (quantitative)
What’s the difference between the
estimation and the value I want
to estimate ?
POPULATION Sample
Size N n
N n
1 1
n∑
x̄ = xi
N∑
Mean μ= xi
i=1 i=1
Difference between x̄ and μ ?
7
Estimation of a proportion (qualitative)
What’s the difference between the
estimation and the value I want
to estimate ?
POPULATION Sample
Size N n
Proportion p p̂
̂
Difference between p and p ?
8
What is an (n) estimator ?
- It’s a function
- It gives the result (estimation) for each sample of size n
Bias of an estimator
10
Central limit theorem
The estimator (for sample of size = n) is a variable (function).
When n is big enough, this variable is a normal distribution.
σ
Those parameters are mean = μ and standard deviation =
n
μ is the mean for all the population
n is the sample's size
σ is the standard-deviation for the population.
11
x̄ is one value of the estimator
which is a normal distribution,
σ
with parameters μ and
n
So, there is a 95 % chance that :
σ
d(x̄; μ) ≤ 1.96
n
σ σ
μ − 1.96 μ μ + 1.96
n n
σ σ
1.96 1.96
12 n n
Confidence interval for the mean
So, there is a 95 % chance that μ is in
[ ]
σ σ
x̄ − 1.96 ; x̄ + 1.96
n n
μ is the mean for all the population
x̄ is the mean for the sample
n is the sample's size
σ is the standard-deviation for the population.
We have to estimate σ by using the sample.
n
1 2
n∑
We could imagine using (xi − x̄) as estimator.
i=1
[ n]
s s N n
1 1
x̄ − 1.96 ; x̄ + 1.96
n∑
x̄ = xi
∑
Mean μ= xi
n N i=1 i=1
n
Std. 1 N 1 2
n−1∑
2 s= (xi − x̄)
N∑
σ= (xi − μ)
deviation i=1
i=1
16
Exercise : Cinema
18
Estimation of a proportion (qualitative)
What’s the difference between the
estimation and the value I want
to estimate ?
POPULATION Sample
Size N n
Proportion p p̂
̂
Difference between p and p ?
19
Confidence interval for the proportion
So, there is a 95 % chance that p is in
[ ]
̂ − p)̂
p(1 ̂ − p)̂
p(1
p̂ − 1.96 ; p̂ + 1.96
n n
p is the proportion for all the population
̂
p is the proportion for the sample
n is the sample's size
21
Chi-square test
2
χ
Relation between 2 qualitative variables ?
22
Repartition by gender and category in a company
Men 20 40 40
Women 30 60 10
Men 20 40 40 100
Women 30 60 10 100
Men 20 25 40 50 40 25 100
Women 30 25 60 50 10 25 100
2 2 2 2 2 2
2 (20 − 25) (30 − 25) (40 − 50) (60 − 50) (40 − 25) (10 − 25)
χ = + + + + +
25 25 50 50 25 25
2
If χ is significantly > 0 ( i . e ≠ 0)
we reject the hypothesis « Gender »25 and « Category » are independent.
2
χ distribution
If significance is less
than .05,
We reject the
hypothesis where the 2
variables are
independent.
So there is a relation
between « Gender »
and « Category ».
26
27
Questions
[ ]
̂ − p)̂
p(1 ̂ − p)̂
p(1
p̂ − 1.96 ; p̂ + 1.96
n n
30
31
Compare the mean of FEV between smokers and non
smokers for all the population
32