Professional Documents
Culture Documents
StockWatson Econ CH 2
StockWatson Econ CH 2
Chapters 2
Review of Probability
• Probabilities : The probability of an outcome is the proportion of the time that the outcome occurs
• The gender of the next new person you meet, your grade on an exam, and the number of times your
computer will crash while you are writing a term paper all have an element of chance or
randomness.
• Probability distribution: all possible values (M) of the variable and the probability (Pr) that each
• Probabilities of events: The probability of an event can be computed from the probability
distribution.
• For example : the probability of the event of one or two crashes is the sum of the probabilities of
• For example, the probability of at most one crash, Pr(M … 1), is 90%, which
is the sum of the probabilities of no crashes (80%) and of one crash (10%).
2.2 Expected Values, Mean, and Variance
occurrences.
Example of Expected Values:
Assume that you assign the following subjective probabilities for your final grade in your
A) 3.0 B
C
0.50
0.20
B) 3.5 D 0.08
C) 2.78 F 0.02
D) 3.25
• The Variance :
• The variance of a random variable Y is the expected value of the square of the
deviation of Y from its mean.
• The variance is a measure of the spread of a probability distribution.
• The variance of Y equals
The Skewness :
probability distribution of X
• Independence :Two random variables X and Y are
the other.
• Covariance: One measure of the extent to which two
• Being independent
• E (Y | X) =0
The Normal Distribution :
• A normal distribution with mean and standard deviation is denoted
• as N ( ; )
• to Standardize the variable by first subtracting the mean, then by
dividing the result by the standard deviation.
The Student t distribution
• with m degrees of freedom is the distribution the random variable
distribution.
• The mean of Y is
Review of Statistics
1-2
• A statistical test uses the data obtained from a sample to make a decision about
• whether the null hypothesis should be rejected.
• The numerical value obtained from a statistical test is called the test value.
• The level of significance is the maximum probability of committing a type I error. This
• probability is symbolized by a (Greek letter alpha). That is, P(type I error) a.
• The critical value separates the critical region from the noncritical region. The symbol
• for critical value is C.V.
• The critical or rejection region is the range of values of the test value that indicates
• that there is a significant difference and that the null hypothesis should be rejected.
• The noncritical or nonrejection region is the range of values of the test value that
• indicates that the difference was probably due to chance and that the null hypothesis
• should not be rejected.
• A one-tailed test indicates that the null hypothesis should be rejected when the test value
is in the critical region on one side of the mean.
• In a two-tailed test, the null hypothesis should be rejected when the test value is in either
of the two critical regions.
1-3
1. If s is known, use the z test. The variable must be normally distributed if n < 30.
2. If s is unknown but n >=30, use the t test.
3. If s is unknown and n<30, use the t test.
1-4
Find critical value in z-statistics
1-6
Finding critical value for t-statistics
(d.f. = n 1)
1-7
Comments on Student t distribution, ctd.
1-8
Running a z test on your data requires five steps:
1-10
The Traditional Method
A researcher wishes to see if the mean number of days that a basic, low-price, A medical investigation claims that the average number of infections per
small automobile sits on a dealer’s lot is 29. A sample of 30 automobile dealers week at a hospital in southwestern Pennsylvania is 16.3. A random sample
has a mean of 30.1 days for basic, low-price, small automobiles. At a 0.05, test of 10 weeks had a mean number of 17.7 infections. The sample standard
the claim that the mean time is greater than 29 days. The standard deviation of deviation is 1.8. Is there enough evidence to reject the investigator’s claim
the population is 3.8 days. at a 0.05?
1-11
p-value Method
A researcher wishes to test the claim that the average cost of tuition and fees
at a four year public college is greater than $5700. She selects a random
sample of 36 four-year public colleges and finds the mean to be $5950. The
population standard deviation is $659. Is there evidence to support the claim at
a 0.05? Use the P-value method.
1-12
Comments on Student t
distribution, ctd.
4. You might not know this. Consider the t-statistic testing the hypothesis
that two means (groups s, l) are equal:
Ys - Yl Ys - Yl
t= =
ss2 sl2 SE(Ys - Yl )
+n
ns l
Even if the population distribution of Y in the two groups is normal, this
statistic doesn’t have a Student t distribution!
There is a statistic testing this hypothesis that has a normal distribution,
the “pooled variance” t-statistic – see SW (Section 3.6) – however the
pooled variance t-statistic is only valid if the variances of the normal
distributions are the same in the two groups. Would you expect this to
be true, say, for men’s v. women’s wages?
1-13
Confidence Interval
• A 95% confidence interval for μY is an interval that contains the true
value of μY in 95% of repeated samples.
• Digression: What is random here? The values of Y1,...,Yn and thus any
functions of them – including the confidence interval. The confidence
interval will differ from one sample to the next. The population
parameter, μY, is not random; we just don’t know it.
Confidence Intervals for the Mean
When s Y2 Is Known
1-15
Ten randomly selected people were asked how long they slept at night. The mean time was
7.1 hours, and the standard deviation was 0.78 hour. Find the 95% confidence interval of the
mean time. Assume the variable is normally distributed.
Testing the Difference Between Two Means of Independent Samples:
Using the t Test