Professional Documents
Culture Documents
2. Presenting Data
◦ Charts & Tables,
◦ Plot, etc.
3. Characterizing Data
◦ Descriptive methods (Average, quartile, percentile, decile, etc)
◦ Clustering, etc.
4. Data Analysis
◦ Create hypothesis and chose the right method for inference
Statistical Methods
Statistical
Methods
Descriptive Inferential
Statistics Statistics
Why Collect Data?
1.Obtain Input to a Research Study
2.Measure Performance
3.Assist in Formulating Decision Alternatives
4.Satisfy Curiosity
◦ Knowledge for the Sake of Knowledge
Informasi didahului
dengan adanya
PENGUKURAN
Fact/data/practice
Design
Problem experiment
formulation
Testing
Conjecture/hypothesis/theory/model
Network Of
Distribution
Data Sources
Data
Sources
Primary Secondary
Data
Numerical Categorical
(Quantitative) (Qualitative)
Discrete Continuous
Data Type Examples
1. Numerical
◦ Discrete
How Many Massages has come to the server in the last five minutes? ___
(Number)
◦ Continuous
How Tall Are You? ___ (Inches)
How long the mean time between massages comes to the server?
___(minutes)
2. Categorical
◦ Do You Own Savings Bonds? __ Yes __ No
◦ Do you agree if our internet connection bandwidth will be extended
with additional contribution monthly from user?
___certainly agree __agree __win-win __don’t agree __ certainly disagree
How Are Data Measured?
1. Nominal Scale 3. Interval Scale
◦ Categories
e.g., Male-Female ◦ Equal Intervals
◦ Count ◦ No True 0
◦ Big number doesn’t represent e.g., Degrees Celsius
Big difference (Coding only) ◦ Measurement
probabilistic deterministic
Valid
Reliable
Consisten
Valid vs Reliable vs Consistent
reliable
consisten
not valid
GIGO
(Garbage In – Garbage Out)
Input Output
Proses
GI GI
GI-FM-GO
(Good In – False Method – Garbage Out)
Input Output
Proses
GI GI
Correlation
Mathematical ex-
pression of the
nature of a relation-
ship between 2
variables
Expressed in terms
of a coefficient
Relationship may
be strong or weak,
direct or inverse
Common Statistical
Procedures
Relationship between
Correlation
2 variables
Relationship among Factorial analysis
more than 2 variables Clustering
Prediction Regression
Description of pheno- Descriptive statistics
mena or variables (frequency, mean, etc.)
Statistical control Standardization
Target
Capaian
Error
Penyimpangan nilai pengamatan dengan dugaan
Penyimpangan nilai pengamatan dengan model
Penyimpangan hasil proses terhadap nilai target
Tergantung pada persiapan, pemilihan metode,
pelaksanaan, dan pelaku pengukuran dalam
mendapatkan data
Sources of Error
Human error
Machine error
Measurement error
Experimental design error
Sampling error
Many, many others of errors …
Errors Due to Sampling
Sampling Error
Nonresponse &
Measurement Error
Sample Frame
Total Population Planned Sample Actual
(Students in
(Students) (Selected Students) Sample
Phone Book)
Types of Error
Precision Error (reading – average)
◦ “Random error”
◦ Leads to poor precision
◦ Example: resolution error
Bias error (average – true)
◦ “Systematic error”
◦ Leads to poor accuracy
◦ Examples: zero offset error, non-linearity
error
Definitions
Stability
◦ Total Variation over Time
Repeatability Time 2
Stability
Accuracy (Bias)
Repeatability ◦ Difference Between the Average of the Observed and
the “True” Value
Bias
Reproducibility
◦ Variation Under Different Conditions True Value
Condition A Average
Linearity
Condition B
◦ Difference in Bias in the Operating Range
Condition C
Reproducibility
Precision vs. Accuracy
Good Precision
Good Accuracy
Precision vs. Accuracy
Poor Precision
Poor Accuracy
Accuracy vs Precision Precision (RR)
RR Average ◦ Describes Variation and Spread
Bias
◦ The Extent to Which the Instrument
Repeats its results when Making
True Value
Repeat Measurements on the Same Unit
of Output