Professional Documents
Culture Documents
Hypothesis Testing
Florencia / 201650281
Mario Alexander Setiawan / 2016505..
Purpose of hypothesis testing -> determine accurately if the null hypothesis can be
rejected in favor of the alternate hypotesis
Type I Errors, Type II Errors , and
Statistical Power
Two kinds of errors
Example:
Ho : All honest people are good
Ha : Not all honest people are good
Type I Error : not all honest people are good, when in fact all honest people are
good. We reject the null hypothesis we can say that not all honest people are
good (rejecting Ho incorrectly)
Type II Error : all honest people are good, when in fact not all honest people are
good. We will accept the null hypothesis and say that all honest people are good.
(rejecting Ha incorrectly)
Statistical Power
Paired sample t-test : differences in the same group before and after a
treatment.
Wilcoxon signed-rank : nonparametic test for examining significant
differences between two samples or repeated measurement on a single
sample.
McNemar’s test : significance of the difference between two dependent
samples when the variable of interest is dichotomous.
3. Testing Hypoteses about two unrelated
means
Independent sample t-test : significant difference in the means for two
groups in the variables of interest
4. Testing hypotheses about several means
Analysis of Variance (ANOVA) : significant mean differences among more than
two groups on an interval or rato\io scaled dependent variable.
Multivariate Techniques
Regresion Analysis
For example, let’s say your model involved how income of parents and
their education level affected their offspring’s lifetime earnings. Income
of parents is measured in dollars and education level is measured in years
of school. Standardizing these variables means that they can be compared
to each other in the model. Let’s say income has a standardized beta
coefficient with a value of .2 and education level has a beta of .34. The
model shows that with every increase of one standard deviation in parent’s
income, an offspring’s income rises by .2 standard deviations. This assumes
the other variable (education level) is held constant. With an increase of
one standard deviation in education level, earnings rise .34 standard
deviations — assuming parent’s income is held constant.
Regression with dummy variables
A dummy variables is a variable that has two or more distinct level, which are coded 0 or 1.
Extra- Score
course
transform
Qualitative Quantitative
5. MANOVA
Is similar to ANOVA, with the difference that ANOVA
tests the mean differences of more than two groups on
one dependent variable, whereas MANOVA tests mean
differences among groups across several dependent
variables simultaneously, by using sums of square and
cross-product matrices.
6. Canonical Correlation
Examines the relationship between two or more
dependent variables and several independent variables.
“
data. Using these techniques allow us to generalize the results obtained from the sample
to the population at large.
“
Several univariate, bivariate, and multivariate techniques are available to analyze sample
Have been explained in this chapter that the choice of statistical technique depends on
the number of variables you are examining, on the scale of measurement of your
variables, on whether the assumptions of parametric test are met, and on the size of
your sample.
DATA WAREHOUSING, DATA MINING, AND
OPERATIONS RESEARCH
LISREL MATLAB
Is a computer program that was originally
Is designed to estimate and test structural designed to simplify the implementation of
equation models. numerical linear algebra routine.
Qualtrics
Mplus
Is a statistical modeling program that Allows users to do many kinds of online
offers researchers a wide choice of data collection and data analysis,
models, estimators, and alogrithms. employee evaluations, website feedback,
marketing research, and customer
satisfcation, and loyalty research.
SOME SOFTWARE PACKAGES USEFUL FOR DATA ANALYSIS
SAS SPSS
Is a data management and analysis
Is an integrated system of software products, program designed to do statistical data
capable of performing a broad range of analysis, including descriptive statistics
statistical analyses such as descriptive such as plots, frequencies, charts, and
statistics, multivariate techniques, and time lists, as well as sophisticated statistical
series analyses. procedure like ANOVA, factor analysis,
cluster analysis, and categorical data
analysis.