Stats Test #3 Word Cheat Sheet

=significance level 0=null hypothesis value of mean vs.
p0=null hypothesis value of proportion
Confidence Interval Sample Size for
Proportions Problems: parameterproportion p Point Estimate se

estimating population proportion with MoE=m
MoE
(if do not know
use 0.50, which will guarantee the sample size is large enough)
MoE for C.I. for population mean Confidence
*USE 1-PropZInt for (single) Population Proportion Confidence Interval USE 1-PropZTest for (single) proportions hypothesis test*
Means Problems: parameterMean Point Estimate
Sample Size for estimating a Population
se
MoE
Interval
or (
Mean, margin of error mn=(2z2)/m2
For a single hypothesis population Mean test statistic t=
0)/
where 0=the null value
When asked for critical value use Solver: tcdf and enter the df and C.I.
=population standard deviation
*USE TInterval for (one sample) Confidence Interval & TTest for (one sample) or (matched-pairs) population mean*
Point Estimate(for a population proportion is the sample proportion symbolized by ) a single # that is our best guess for the parameter, does not tell how close the estimate is likely to be to the parameter (an interval estimate is more useful, it incorporates a margin of error, which helps to gauge accuracy of the Point Estimate). EXof the possible responses, 627 picked definitely or probably should be, and 546 picked probably or definitely should not be. 627+546=1173 for the population size and 627 answered definitely or probably so for the point estimate. A POINT ESTIMATE alone may be highly inaccurate, especially with a small sample. Interval Estimateis an interval of numbers within which the parameter value is believed to fall (indicates precision by giving an interval of #s around the point estimate). Confidence Interval(are a function of 3 things: data in the sample, the confidence level & sample size) An interval of values (range) that contains the most believable (plausible) value for the population parameter. Is constructed by taking a point estimate and adding & subtracting a MoE. The MoE is based on the SE of the sampling distribution of that point estimate. The CI tells us the likelihood the most informative estimation method constructs an interval of #s called the confidence interval, within the unknown parameter value is believed to fall. Ex. 95% confidence interval says the we have a 95% confidence (refers to long-run interpretation) in the long run about 95% of those intervals would give correct results, containing the population proportion. Has standard deviation called standard error, also has mean equal to population proportion. Is approximately normal distribution, for large random samples, because of the central limit theorem. That the interval contains the parameter (must compromise between the desired margin of error and the desired confidence of a correct inference) to achieve greater confidence; we make the sacrifice of a larger margin of error and wider confidence interval. MoE for C.I. found by multiplying the critical value by the se C.I. for pop proportion is CI for population mean (1.96 Critical Value for 95%, 2.576 for 99%, 1.645 for 90% Confidence Intervals) A GOOD Estimator of a parameter has 2 desirable properties1) A good estimator has a sampling distribution that is centered at the parameter, in the sense that the parameter is the mean of that sampling distribution, an estimator with this property is said to be unbiased. 2) A good estimator has a small standard error compared to the other estimators. MoE(depends on the standard error of the sampling distribution of the point estimate) also how close the sample proportion should be to the population proportion. MoE also measures how accurate the point estimate is likely to be in estimating the parameter. (Increases as the confidence level increases; Decreases as the sample size increases) z-scoremeasures the number of standard errors between the sample proportion p and the null hypothesis p0 In a test of hypothesis, the null hypothesis is that the population mean is less than or equal to 45 and the alternative hypothesis is that the population mean is greater than 45. The test is to be made at the 2.5% significance level. A sample of 81 elements selected from this population produced a mean of 47.3 and a standard deviation of 4.5. What is the critical value of z? 1.96 *Both Z & T statistics have the form test Completely Randomized Designsubjects are randomly assigned to one of the treatments. Null Hypothesisis a statement that the parameter takes a particular value (means no effect or of no consequence).(in hypothesis testing, the null hypothesis is a claim about a population parameter that is assumed to be false until it is declared true) whenever the null is not rejected the alt. is rejected. A null hypothesis can only be rejected at the 5% significance level if and only if: a 95% confidence interval does not include the hypothesized value of the parameter. The null & alternative divide all possibilities into two non-overlapping sets. A two-tailed test is one where results in either of two directions can lead to rejection of the null hypothesis Alternative hypothesis states that the parameter falls in some alternative range of values (where the burden of proof lies). Is a claim about a population parameter that will be true if the null hypothesis is false. Statistical inferenceuses sample statistics to make decisions and predictions about population parameters. There are, generally speaking, two types of statistical inference: sample estimation and population estimation. Ordinal Variableis a categorical variable for which the categories are ordered from low to high in some sense. Case-Control Studya retrospective study. Subjects who have a response outcome of interest(ex cancer serves as
cases) other subjects not having that outcome serve as (controls). The cases and controls are compared on an explanatory variable, like whether they were smokers.
Standard Errorestimated value. Is an estimated standard deviation of a sampling distribution (Depends on the sample size) Error Probabilitythe probability that the method results in an incorrect inference, that the data generates a confidence interval that does NOT contain the population proportion. Degrees of freedom(df) , one less than the sample size. Statisticdescribes a sample Parameterdescribes a population (i.e. & ) Meanone way to summarize the center of the observations for a quantitative variable. Bayesian statisticsStatistical inference based on the subjective definition of probability. Proportionequals the # of items in a category divided by the sample size, it summarizes the relative frequency of observations in a category for a categorical variable. Properties of of t distributiont score is slightly larger than a z score 1) the t distribution is bell shaped and symmetric about 0 2) The probabilities depend on the degrees of freedom, df. The t distribution has a slightly different shape for each distinct value of df, and different t-scores apply for each df value. 3) The t distribution has thicker tails and is more spread out than the standard normal distribution. The larger df value the closer it gets to the standard normal. When df is about 30 or more, the t-score & z-score distributions are nearly identical. 4) A t-score multiplied by the standard error gives the margin of error for a confidence interval for the mean. T confidence interval does NOT work well when the data contains extreme outliers. However, the t distribution when estimating the mean accounts for the extra variability due to using the sample SD s to estimate the population SD in finding a standard error. As the sample size increases, the t distribution becomes more similar to the normal distribution. Cohort Study Designat the beginning none have disease, group of subjects is studied over time. Matched Pairseach subject in one sample is matched with a subject on the other sample. (i.e. a set of married couples with the men in one sample and the women in the other) two observations for a particular subject, because they both
come from the same person.
Crossover Design (Really Good Design) a matched pairs design in which subjects crossover during the experiment from using one treatment to using another treatment.
Dependent sampleswhen the two samples have the same subjects, they are also dependent if each subject in one sample is naturally paired with a subject in the other(i.e. husband in one sample and wife in the other) Robusta statistical method is said to be robust with respect to a particular assumption if it performs adequately even when that assumption is violated. Confidence intervals for a mean using the t distribution are robust against most violations of the normal population assumption. (t confidence interval score is not robust to violations of the random sampling assumption. Factors in determining a studys sample size1) desired precision (measured by MoE) 2) Confidence level, which determines the z-score or t-score in the sample size formulas 3) Variability in the data 4) financial Significance Levelis a number such that we reject H0 if the P-value is less or equal to that number (most common significance level is 0.05). 0.05 reject the H0 if >0.05 Do not reject. To avoid bias we select the significance level before looking at the data. REMEMBERthe smaller the p-value the stronger the evidence against the null hypotheses. Test Statisticis calculated by taking the difference between the sample proportion and the null proportion and dividing it by the standard error. Use the value of the test statistic to find the two-tail probability from the standard normal distribution to the left and right of the test statistic value. The p-value tells use that if the null hypotheses were true, a proportion of ## samples would fall at least this far from the null hypotheses. Hypothesisa statement about the population, usually of the form that a parameter takes a particular numerical value or falls in a certain range of values. Significance test (test for short) is a method for using data to summarize the evidence about a hypothesis. A significance test merely indicates whether the particular parameter values in H0 (such as =0) is plausible. A confidence interval is more informative, because it displays the entire set of believable values. Exif a high proportion of the astrologers predictions are correct, the data might provide strong evidence against the hypothesis that p=1/3 in favor of an alternative hypothesis representing the astrologers claim the p>1/3. P-valueis a tail probability beyond the observed test statistic value. A small P-value usually below .05 provides strong evidence against the null hypothesis or the result is statistically significant, the probability that the test statistic equals the observed value or a value even more extreme. It is calculated by presuming that the null hypothesis H 0 is true. Considered convincing if less than 0.01 Assumptions for a 95 & 99% population proportion C.I. 1) Data obtained by randomization (such as a random sample or a randomized experiment) and a large enough sample size of at least 15 successes and failures. Assumptions for a C.I. for a population mean 1) Data obtained by randomization 2) An approximately normal population distribution. Sample Size Needed for Large-Sample C.I. for a Proportion- For a 95% C.I. for a proportion p to be valid, you should have at least 15 successes & 15 failures. and if it is less than 15 successes and failures add 2 to the success and 4 to the total count. Find the needed sample size to estimate a pop. Proportion EXstudents at a school were surveyed, and it was estimated that 26% of students abstain from drinking alcohol. To estimate this proportion in your school, how large a random sample would you need to estimate it to within 0.04 with probability 0.99, if before conducting the study (a) you are unwilling to predict the proportion value at your school and (b) you use the results from the surveyed school as a guideline. (a)=.50(1-.50)*2.582/.042=1040 (b)=.26(1-.26)*2.582/.042=800. So strategy (a) would be inappropriate because it overestimates the sample size. EX estimating pop. meanan estimate is needed of the mean acreage of farms in a city. A 95% C.I. should have a MoE of 35 acres. A study 10 years ago in this city had a sample SD of 190 acres for farm size. About how large of a sample of farms is needed? 190 2(1.96)2/352=113 acres. Part (b) the sample size of 113 acres abovehowever, the sample in fact has a SD 290 acres instead of 190. What is the MoE for a 95% C.I. for the mean acreage of farms? (2902(1.96)2/113=53.5 so when the SD increases from 190 to 270 the MoE increases from 35 to 53.5 EXHow large a sample size is needed to estimate the mean annual income of all people in a certain county, correct to within $1,200 with probability of 0.99? No information is available about the SD of their income. It is estimated that nearly all of the incomes fall between $0 & $180,000 and that this distribution is approximately bell-shapeds=30,000 because from -3 to 3 SDs is 6 so 180k/6=30k n=z2*s2/m22.582*30,0002/1,2002=4160 round to nearest whole #. EXA survey asked, During the last year, did anyone take something from you by using force - such as a stickup, mugging, or threat? Of 955 subjects, 14 answered yes and 941 answered no. FIND the Point Estimate or 1-PropZInt(x:14 n:955 c-level: .95)=(.00704, .02228) for the C.I. =.014659 FIND the standard error of this estimate =.00389 FIND the MoE for a 95% confidence interval CONSTRUCT the 95% confidence interval for the population proportion1-PropZInt(x:14 n:955 c-level: .95)=(.00704, .02228) for the C.I. =.014659 Can you conclude that fewer than 10% of all adults were victims? Yes, fewer than 10% of all adults were victims. EX590 believe in carnation, 2201 do not. The sample p is 0.268060 and the 95% CI is (0.24955, 0.28656) Explain how to interpret the Sample p and the 95% CIThe sample p is the proportion of all respondents who believe in reincarnation, 590/2201=0.27. The 95% CI is the 95% confidence interval and it means that we can be 95% confident that the population proportion falls between .25 and .29 Using the CI shown, find the 95% CI for the population proportion who do not believe in reincarnation1.29=.71 for the lower bound and 1.25=.75 for the upper bound. The 99% CI would be wider than a 95% CI because the 99% CI z-score is higher than the 95% CI z-score, thus making the margin of error for a 99% CI larger than the margin of error for a 95% CI. The MORE CONFIDENT you want to be about the results, the wider the CI will be. MoE for POP MEAN EXFind the MoE for a 95% CI for estimating the population mean when the sample SD=104, with a sample size of 400 then 1600. What is the effect of the sample size? MoE for a 95% CI with 400 is 104/ =5.2 then 5.2*(1.96-Critical Value for 95%, 2.576 for 99%, 1.645 for 90% C.I.) =10.2, for 1600 its = 5.1 so as Sample size increases MoE becomes smaller, MoE becomes bigger when going from 95% to 99% CI Factors that affect Sample Size: 1stprecision-as measured by MoE. 2ndconfidence level-which determines the z-score or t-score in sample size formula. 3rdvariability in the data 4thFinancial EX less the 15 successes or failuresSuppose a random sample does not have at least 15 successes and 15 failures. The C.I. formula z still is valid if we use it after adding 2 to the original number
of successes and 2 to the original number of failures. This results in adding 4 to the sample size n. EXTo test Ho: p=0.40 that a population proportion equals 0.40, the test statistic is a z-score that measures the number of standard errors between the sample proportion and the Ho value of 0.40 If z=0.6, do the data support the null hypothesis, or do they give strong evidence against it? The data supports the null hypothesis because the sample proportion falls within three standard errors from the null hypothesis value.
Significance Test about Proportions USE 1-PropZTest for (single) hypothesis test1)Assumptionsi)the variable is categorical ii) The data are obtained by randomization (most important assumption for any significance test) iii) n is large enough to expect at least 15 successes and 15 failures 2)HypothesesNull hypothesis, H0 (a single parameter value, usually no effect, Always an (=) The
alternative refers to an alternative parameter value from the # in the null hypothesis the alternative has either a,< (left-tail probability),> (right-tail probability) or a which is a two-sided test) 3)Calculator Method used and values entered 4) List the t-statistic & p-value, The test statistic is where se0 is && P-valuepresume H0 to be true normal distribution. The P-value is the probability the test statistic takes the observed value or a value more extreme. Smaller P-values represent stronger evidence against H0 5) Conclusionreport and interpret the P-value in the context of the study. Based on the P-value, make a decision about H0 if one is needed, Reject H0 if p-value is .
Single Proportions EXIn the general US population, it has been reported that 25% of adults smoke. In our class survey, there were 65 out of the 505 subjects that said they smoked. a. Construct a 99% confidence interval for the true proportion in our population who smoke. Provide the correct interpretation of the interval. Use 1-PropZInt on the TI-83. Enter x=65 and n=505. Answer: The interval is (.09033, .1671). We are 99% confident that the true proportion of smokers in our population is in this interval. b. Does the interval provide sufficient evidence to conclude the proportion in our population who smoke is different than .25? Explain why or why not. Answer: Yes, it does, because the interval excludes .25, so we can be 99% confident the population proportion is NOT .25. c. What was the MoE for the interval in part a.? MoE is the upper end of the C.I. minus , so .1671 .1287 = .0384 Or you could use the formula for SE and MoE. se=
=.0149
d. Suppose some researchers want to repeat this study in a similar population at another university, but they require a margin of error of only 4%. What size sample would they need
to have? =465.07 so we need 465 subjects. EXA study considered whether daily consumption of garlic could reduce tick bites. The study used a crossover design where half of the subjects used place 1 st and garlic 2nd and half the reverse. The authors described garlic being more
effective with 48 subjects and placebo being more effective with 33 subjects. Does this suggest a real difference between garlic and placebo, or are the results consistent with random variation? State hypotheses for a large-sample two sided test. H0: p=0.5 Ha: p0.5 Check that sample size guidelines are satisfied for that testYes, the sample size was large enough to make that inference. Find the test statistic value z USE 1-PropZTest enter p0: 0.50
x:48 n:81 because of 48+33=81z=1.67 FIND the p-value.1 The p-value is greater then the S.L. of .05 so we do not reject the null. There is not sufficient evidence garlic is more effective than placebo. EX find p-value from ZFor a test of H0: p=0.50, the z test statistic equals 1.66 FIND the P-value for Ha: p>0.50use
if been Ha: <0.50 y
d or
(the sum of P-values for two possible 1-sided tests must=1) also, if it is a not equal sign just multiply the answer by 2 for a two tail ().
EX single prop.For a test of H0: p=0.50, the sample proportion is 0.49 based on a sample size of 100. FIND the test statistic zeither manually take .
use 1-PropZTestp0:.5 x:49 n:100
FIND the p-value for Ha: p<0.50use 1-PropZTtestsimply change prop to < p0calculate0.421
Significance test about means1. Assumptions 1a) Quantitative Variable, with population mean defined in context. 1b)data are obtained by randomization(most important assumption for any significance test), such as s.r.s. or a
randomized experiment. 1c)Population distribution is approximately normal (mainly needed for one-sided tests with small n) 2. Null: H0: 0 wh
0 is the hypothesized value (such as H0: =0) Alternative: Ha: 0 (two 4. P-valuepresume H0
sided) or Ha: < 0 (one-sided) or > 0 (one-sided) 3. Test statistic (Estimate of parameterH0 value of parameter/se of estimate)t=( 0)/se where remember p-value<0.05 gives strong evidence against the null H0 and supporting Ha Reject H0 if p-value is
to be true. The P-value is the probability the test statistic takes the observed value or a value more extreme. Smaller P-values represent stronger evidence against Use t-distribution with df=n-1 5. Conclusion,
Exwhen 898 male workers were asked about how many hours they worked in the previous week, the mean was 45.4 with a SD of 14.8 Does this suggest that the population mean work week exceeds 39 hours? The relevant variable is the # of hours worked in the previous week by male workers. The parameter of interest is the population mean work week (in hrs.) for men. USE TTesttest statistic is t=12.96 & p=1.09X1e-38 which rounds to 0. Interpret the p-valuethe p-value is the probability of obtaining a sample with a mean of 45.4 or more hours assuming that the null hypothesis were true. Since the p-value is less the S.L. of 0.01, there is sufficient evidence to reject the null hypothesis and to conclude that the population mean work week for men exceeds 39 hours. POP MEAN EXWhen a survey asked, "About how many hours per week do you spend sending and answering e-mail?" the 7 females in the survey sample of age at least 80 had the
responses 0,0,2,3,9,11,17.
4.86, SD=5.15, SE=5.15/ =1.95, Find the 90% CITInterval(1.1, 8.6) Even though the distribution is likely skewed to the right the CI is still valid because the t distribution is a robust method.
Type I & type II test errors (as P(Type I error) goes down, P(type II Error) goes up)type I (the null hypothesis is incorrectly rejected when it is true) occurs when H0 is rejected (type I is the more serious of the two errors).
Type II (the null hypothesis is incorrectly accepted when it is false) occurs when H0 is not rejected, type II usually occurs when we do not reject H0 when it is actually false. The probability of committing a Type I error is called the significance level. Confidence Interval for the difference between two population proportions USE 2-PropZInt (p1-p2)z critical value*(se),where se=
To use this method, you need 1)categorical response variable for two groups 2) Independent random samples for two groups, either from random sampling or a randomized experiment. 3) Large enough sample sizes n1 and n2 so that, in each sample, there are at least 10 successes and 10 failures.
Two-Sided Significance Test for Comparing two population proportions 2-PropZTest 1) Assumptions1a) Categorical response variable for two groups. 1b) Independent random samples, either from random sampling or a randomized experiment. 1c) n1 and n2 are large enough that there are at least 5 successes and 5 failures in each group. 2) HypothesesNull H0: p1 =p2 (that is, p1-p2=0) Alternative Ha: p1p2 (onesided Ha also possible) 3) Test Statistic which is the 3rd then use se=
(p1p2)0/se0 with se0=
where
h POOLED ESTIMATE
when using 2-PropZTest. If you dont use the pooled
4) P-value = two-tail probability from standard normal distribution of values even more extreme than
observed z test statistic. 5)conclusioncheck whether 0 falls in the C.I. If so, it is plausible (but not necessary) that the population proportions are equal. If all values in (p1-p2) are positive, you can infer that (p1-p2)>0, that is, p1>p2. The interval level shows just how much larger p1 might be. If all values in the C.I. are negative, you can infer that (p1-
p2)<0, that is, p1<p2. The magnitude of values in the C.I. tells you how large any true difference is. If all values are near 0, the true difference may be relatively small in practical terms.
EXRandom samples of students at 115 4-year colleges were interviewed several times since 1992. Of the students who reported using intravenous drugs, the % who reported injecting daily was 39.6% of 13,450 students in
92 and 47.2% of 8749 students in 2000. ESTIMATE the difference between the proportions in 2000 and 1992. 1- 2=.472.396=.076 so the proportions appear to have increased from 1992 to 2k. FIND the standard error for this differenceUse 2-PropZtest to find the pooled then use )= =.0068
Confidence Interval for the difference between two population means USE 2-SampTInt
violations of this assumption.
( 1- 2)
where
This method assumesIndependent random samples from the two groups & an approximately normal population distribution for each group, however this method is robust to
Two-Sided Significance Test for Comparing two INDEPENDENT population means 2-SampTTest 1) Assumptions for Two-sided Significance test comparing two population means 1a)Quantitative response variable for two
groups 1b)Independent random samples 1c)approximately normal distribution for each group. 2) Hypotheses H0:
1=2 Ha: 12 3) function used & test statistic
( 1 2)
where
4) List the P-value, degrees of freedom from the 2-SampTTest function (NOT SOLVER) and test statistic. P-value=two-tail probability from t-distribution of values even more extreme than observed t test statistic, with df 5) Conclusion
|**IF samples are INDEPENDENT DISREGARD the DIFFERENCES Use the actual data from the two SAMPLES**|
One of the first steps in comparing the means of two groups is determining whether the two samples are INDEPENDENT or DEPENDENT. A major benefit in using PAIRED samples instead of INDEPENDENT samples is that many sources of potential bias are controlled so we can make a more accurate comparison. For categorical variables, the inferences compare proportions. For quantitative variables inferences compare means.
Independent pop mean exCaptopril is a drug designed to lower systolic blood pressure. When the drug was tested, 10 subjects were randomly chosen to receive placebo drug, and another 10 subjects were randomly chosen to receive the active drug. All subjects had their blood pressures recorded (in mm of mercury) at the end of the study, with the summary statistics of the results given in the table below. Blood pressure is reasonably normally distributed. Placebo n=10 mean=179.5 sd=16.064 Active Drug n=10 mean=167.8 sd=14.467 difference n=10 mean=11.7 sd=9.776 Is there sufficient evidence to support the claim that captopril is effective in lowering systolic blood pressure? Use an =.01 level of significance. a. Give the assumptions necessary for the test, and check whether each is met. Independent so IGNORE the differences. Assumptions: 1. Random and INDEPENDENT samples: Were told they were randomly assigned, and they used different subjects so they are independent.2. Since samples are smaller than 30, blood pressure needs to be normally distributed, and were told this is reasonable. B)Give the null and alternative hypotheses to be tested.H0: (since I'm calling group 1 the placebo group, it should be greater)
C)Determine the correct test procedure to use. This is a means problem, with two samples. Since the two samples are different subjects, these are independent samples. Therefore, we will
use 2-SampTTest on the calculator.
D)Give the test statistic, the degrees of freedom for the test statistic, and the p-value of the test. The test statistic is t=1.711, df = 17.806, and the p-value is .0522. E)Give the conclusion of the test. Since the p-value is greater than alpha=.01, we fail to reject the null. Therefore we conclude there is not sufficient evidence to support that captopril lowers blood pressure. F)Provide a 95%
confidence interval for the population difference in mean blood pressure between Captopril and Placebo. Interpret the interval. From 2-SampTInt, the interval is (-2.674, 26.074). We are 95% confident the true difference in blood pressure is in this
interval. (Note 0 is included.) EXOf people who had tried tobacco, the mean of a measure of nicotine dependence (HONC) was 4.3 (s=4.8) for the 223 females and 3.7 (s=4.5) for the 80 males. Find the se
=.597 What
does the se indicate? The se describes the spread of the sampling distribution of
1-
2. FIND the t-statistic and p-value: Use 2-SampTTest:
1: 4.3, sx1: 4.8, n1: 223,
2: 3.7, sx2: 4.5, n2: 80t=1.00, p=0.317 What
conclusions about the means can be made? The means can not be said to be different because the p-value is greater than 0.05, InterpretationThere is not enough evidence to conclude that gender has an effect on the mean HONC score.
EXSuppose the following data show a comparison of females and males on the numbers of hours a day that the subject watched TV. Females: n=502 mean=3.13 SD=2.05 se Mean=.0915, 95%CI= (2.95,3.31) Males:
n=404 mean=2.73 SD=2.14 se mean=.1065 95%CI=(2.52,2.94) Set up a hypothesesH0:
1=2 and Ha: 12 What is the p-valueuse 2SampTTestp=.004 which means the H0 should be rejected.
Dependent Samples use T-Test & TInterval because they are dependent so we USE THE DIFFERENCE of the TWO samples. For dependent samples, Mean of Differences = Difference of MeansFor dependent
samples, the differences ( 1- 2) between the means of the two samples equals the mean d of the difference scores for the matched pairs. When the data are matched pairs, the samples that result are DEPENDENT When the samples we want to compare are paired in some natural way, such as pretest/posttest for each person or husband/wife pairs, a more appropriate form of analysis is to not compare two separate variables, but their difference.
DEPENDENT samples EXTwelve subjects are asked to test their grip strength on each of two consecutive weeks, where strength was measured by squeezing pressure. During one of the tests, the subjects wear
headphones playing classical music, and during the other listen to rock and roll. Which order they hear the different music types was determined at random. The results (in pounds/square inch) are given below. DATAClassicaln=12 mean=31.244 SD=1.196 Rockn=12 mean=31.479 SD=1.292 Differencen=12 mean=.235 SD=.362 Because their dependent samples disregard Classical and Rock and ONLY USE THE DIFFERENCE. A) Construct a 95% confidence interval for the true difference in the mean grip strength while listening to the two music types. Interpret the interval. The Classical music and Rock music samples are dependent (paired), since they contain the same subjects, measured twice. Therefore, we use "TInterval" on the calculator, using the summary statistics for "Difference." The means for each music type are not needed here. The interval is (0.005, 0.465), so we are 95% confident the true population difference in mean
strength during the two music types is in this interval. B) Give the critical value and margin of error used in the interval. There are 121=11 df, so the T critical value from the table is 2.201. The
MoE is 0.465 - 0.235 = 0.23. C) Does this interval provide evidence that there is a difference in the mean strength for the two music types? Why or Why Not? Yes, since it excludes 0, we can be 95% confident there is a difference. D) What is the design of this study called? This is a cross-over experiment. E) Conduct a test of the hypothesis that there is no difference in the mean strength for the two music types. Show all five step s of the test. 1. Assumptions: We aren't told if the 12 were a random sample, but they did randomize treatment order. The strength differences need to be normally distributed, since n < 30. We aren't told this either, but since the differences are a biological measurement and can be either positive or negative, it is a reasonable assumption. 2. H0: D = 0 Ha: D 0 3. Using "TTest" on the calculator, the test statistic is T = 2.249 4. The P-value is 0.0460. Since this less than alpha=.05, we reject the null hypothesis. 5. We conclude there is sufficient evidence to support that there is a difference between the strengths measured during the two types of music. McNemars test for DEPENDENT samples EXThe results are 420 said yes each time, 420 said no each time, 90 said yes on the first survey and no on the second survey, and 70 said no on the first survey and yes on the second
survey. Second
Y N
First Y N
Estimate the probability of favorable rating for (i) last month, (ii) last month(i)= The test statistic is
(ii)=
FIND the P-value
420 70 90 420
Do not reject the claim that the population proportion was the same each month because the p-value is greater than the significance level. If p-value < a reject the null hypothesis.

Stats Test #3 Word Cheat Sheet

Uploaded by

Document Information

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Stats Test #3 Word Cheat Sheet

Uploaded by

Copyright:

Available Formats

=significance level 0=null hypothesis value of mean vs.

p0=null hypothesis value of proportion

Confidence Interval Sample Size for

Proportions Problems: parameterproportion p Point Estimate se

(if do not know

Mean, margin of error mn=(2z2)/m2

For a single hypothesis population Mean test statistic t=

where 0=the null value

=population standard deviation

if been Ha: <0.50 y

use 1-PropZTestp0:.5 x:49 n:100

(p1p2)0/se0 with se0=

when using 2-PropZTest. If you dont use the pooled

1=2 Ha: 12 3) function used & test statistic

2. FIND the t-statistic and p-value: Use 2-SampTTest:

1: 4.3, sx1: 4.8, n1: 223,

2: 3.7, sx2: 4.5, n2: 80t=1.00, p=0.317 What

n=404 mean=2.73 SD=2.14 se mean=.1065 95%CI=(2.52,2.94) Set up a hypothesesH0:

FIND the P-value

You might also like