Professional Documents
Culture Documents
Sbs1e PPT Chapter02
Sbs1e PPT Chapter02
Data
1
© 2010 Pearson Education
2.1 What Are Data?
• Data values or observations are information collected
regarding some subject.
• Data can be numbers, names, etc., and tells us the “Who
and What”.
• Data are useless without their context.
• Data are often organized into a data table like that below.
2
© 2010 Pearson Education
2.1 What Are Data?
The rows of a data table correspond to individual cases
about Whom we record some characteristics.
These characteristics may be collected on or about …
• respondents – individuals who answer a survey
• subjects or participants – people in an experiment
• experimental units – animals, plants, websites, or other
inanimate objects
Cases
3
© 2010 Pearson Education
2.1 What Are Data?
The characteristics recorded about each individual or case
are called variables.
These are usually shown as the columns of a data table
and identify What has been measured.
Variables
4
© 2010 Pearson Education
2.1 What Are Data?
Data tables are cumbersome for complex data sets, so
often two or more separate data tables are linked together
in a relational database.
5
© 2010 Pearson Education
2.1 What Are Data?
Example: A typical relational database is provided
consisting of three relations: customer data, item data, and
transaction data.
For example, we
can look up a
customer to see
what items they
purchased, or we
may look up an
item to see who
purchased it.
6
© 2010 Pearson Education
2.2 Variable Types
When a variable names categories and answers questions
about how cases fall into those categories, it is called a
categorical variable.
7
© 2010 Pearson Education
2.2 Variable Types
Categorical variables …
8
© 2010 Pearson Education
2.2 Variable Types
Quantitative variables must have units. The units indicate …
9
© 2010 Pearson Education
2.2 Variable Types
Some variables can be both categorical and quantitative.
How data are classified depends on Why we are collecting
the data.
10
© 2010 Pearson Education
2.2 Variable Types
Counts
11
© 2010 Pearson Education
2.2 Variable Types
Counts
12
© 2010 Pearson Education
2.2 Variable Types
Counts
13
© 2010 Pearson Education
2.2 Variable Types
Identifiers
14
© 2010 Pearson Education
2.2 Variable Types
Identifiers
Identifier variables …
• do not have units.
• are a special kind of categorical variable.
• are useful in combining data from different sources to
avoid duplication.
• are not variables to be analyzed.
15
© 2010 Pearson Education
2.2 Variable Types
Other Data Types
16
© 2010 Pearson Education
2.2 Variable Types
Other Data Types
17
© 2010 Pearson Education
2.2 Variable Types
Cross-Sectional and Time Series Data
18
© 2010 Pearson Education
2.3 Where, How, and When
When data are collected can be important. Data that are
decades old may mean something different than similar
values recorded last year.
19
© 2010 Pearson Education
2.3 Where, How, and When
How data are collected can make the difference between
insight and nonsense.
20
© 2010 Pearson Education
2.3 Where, How, and When
Data can be found …
• on internet sites.
21
© 2010 Pearson Education
What Can Go Wrong?
• Don’t label variables as categorical or quantitative without
thinking about what they represent.
• Always be skeptical and look for bias in how the data was
collected.
22
© 2010 Pearson Education
What Have We Learned?
• The five W’s (Who, What, Why, Where, and When) help
nail down the context of the data.
23
© 2010 Pearson Education
What Have We Learned?
• Categorical variables identify a category for each case.
24
© 2010 Pearson Education