Professional Documents
Culture Documents
Outline
• Graphical examination of data
• Types and potential impacts of missing data
• Identify outliers
• Test the assumptions underlying multivariate
techniques
What is data analysis?
What is data analysis?
• Data analysis is the process of organizing and
arranging data so that the results of the study
can be interpreted.
Graphical examination of data
• Univariate profile: shape of distribution
Graphical examination of data
• Bivariate profile: examination of relationship
Graphical examination of data
• Bivariate profile: examination of group differences
Identify Missing Data
• What is missing data?
– Valid values on one or more variables are not
available for analysis.
• What are the impacts of missing data?
– Practical impacts: the reduction of sample size for
the analysis.
– Substantive impacts: results of statistical analysis
on data with a nonrandom missing data process
could be biased.
Identify Missing Data
Identify Missing Data
Identify Missing Data
Identify Missing Data
Identify Missing Data
Imputation Techniques for Missing Data
Imputation Techniques for Missing Data
Identify Outliers
Identify Outliers
Test the assumptions underlying multivariate
techniques
Test the assumptions underlying multivariate
techniques
Test the assumptions underlying multivariate
techniques
Test the assumptions underlying multivariate
techniques
Test the assumptions underlying multivariate
techniques
Test the assumptions underlying multivariate
techniques
Test the assumptions underlying multivariate
techniques
Incorporating Non-metric Data with Dummy
Variables
Data Preparation (1)
• The first step of data preparation is coding. Coding is the process
of transferring verbal answers or observational categories into
numbers that can be recognized by the computers.
• Coding is time consuming, expensive, and source of error.
Precoded responses to a question are nescessary. For example,
instead of asking “What is your occupation?” we ask “What is
your occupation? Are you:
1) A professional
2) A government official
3) An employee
4) A self-employed
5) Temporarily un-employed
Data Preparation (2)
• Technological advances have reduced the
potential for error in time-consuming data
preparation. Still, data should be checked for
double codes, illegible numerals, and simple
mistakes.
• History of data entry:
1) Punch card
2) Direct data entry (DDE)
3) Optical scanning
Data Handling (1)
• Data handling prior to analysis may include:
Handling missing values
Recoding variables
Creating new variables