Professional Documents
Culture Documents
1.2 Data
1.4 Ethics and Statistics − It’s a Dangerous World of Data Out There
Statistics
the mathematical science that deals with the
collection, analysis, and presentation of data,
which can then be used as a basis for inference
and induction
Data
values assigned to observations or measurements
Information
data that are transformed into useful facts that
can be used for a specific purpose, such as making
a decision
Data set
a collection of data points
Database
a collection of data points that contains
multiple rows (records) and columns
(fields)
Primary data
data that you have collected for your own use
Secondary data
data collected by someone else
Advantages: Advantages:
• collected by the person or • Readily available
organization who uses the • Less expensive to collect
data
Disadvantages: Disadvantages:
• Can be expensive and time- • No control over how the data
consuming to gather was collected
• Less reliable unless collected
and recorded accurately
To encourage
respondents to
participate, an
effective survey
will state its
purpose in the
beginning.
Personal
demographic
questions are
often last, when
respondents feel
more comfortable
with the process.
Qualitative Quantitative
Data Data
Classified by descriptive terms
Counted Measured
Examples:
• Marital Status Described by numerical values
• Political Party
Examples: Examples:
• Eye Color
(Defined categories) • Number of • Weight
Children • Voltage
• Defects per hour (Measured
(Counted items) characteristics)
Types of Data
Qualitative Quantitative
Cross-Sectional Data
• values collected from a number of subjects
during a single time period
Unemployment Rate
Year USA % CA % MI % TX % FL %
Cross-
2013 7.4 8.3 8.4 6.0 6.2 Sectional
Data
2014 6.2 7.5 7.3 5.1 6.3
2015 5.3 6.2 5.4 4.5 5.4
2016 4.9 5.4 4.9 4.6 4.9
2017 4.4 4.3 4.7 3.9 3.7
Descriptive statistics
• collecting, summarizing, and displaying data
Inferential statistics
• making claims or conclusions about the data
based on a sample
Predictive statistics
• analyzing past data to predict future values and
make decisions
Population
• represents all possible subjects that are of
interest in a particular study
Sample
• refers to a portion of the
population that is
representative of the
population from which
it was selected
Population Sample
• Example:
Observed Estimated population
sample parameter (unknown, but
statistic Inference can be estimated from
(known) sample evidence)
vs.