Professional Documents
Culture Documents
Statistics
• Variables which assume non-numerical • Level of measurement which classifies data into mutually
exclusive, all inclusive categories in which no order or
values. ranking can be imposed on the data.
• Quantitative Variables • Ordinal Level
• Variables which assume numerical values. • Level of measurement which classifies data into categories
that can be ranked. Differences between the ranks do not
• Discrete Variables exist.
• Variables which assume a finite or countable • Interval Level
number of possible values. Usually obtained • Level of measurement which classifies data that can be
by counting. ranked and differences are meaningful. However, there is no
meaningful zero, so ratios are meaningless.
• Continuous Variables
• Ratio Level
• Variables which assume an infinite number
of possible values. Usually obtained by • Level of measurement which classifies data that can be
ranked, differences are meaningful, and there is a true zero.
measurement. True ratios exist between the different units of measure.
INTRODUCTION TO STATISTICS
• Random Sampling • Stratified Sampling
• Sampling in which the data is • Sampling in which the population is
collected using chance methods or divided into groups (called strata)
according to some characteristic. Each of
random numbers. these strata is then sampled using one of
• Systematic Sampling the other sampling techniques.
• Sampling in which data is obtained • Cluster Sampling
by selecting every nth object. • Sampling in which the population is
divided into groups (usually
• Convenience Sampling geographically). Some of these groups are
• Sampling in which data, which is randomly selected, and then all of the
readily available is used. elements in those groups are selected.
INTRODUCTION TO STATISTICS
• Population vs Sample • Discrete vs Continuous
• The population includes all objects of interest whereas • Discrete variables are usually obtained by counting.
the sample is only a portion of the population. There are a finite or countable number of choices
Parameters are associated with populations and
statistics with samples. Parameters are usually denoted available with discrete data. You can't have 2.63
using Greek letters (μ; σ,ς) while statistics are usually people in the room.
denoted using Roman letters (x, s). • Continuous variables are usually obtained by
• There are several reasons why we don't work with measuring. Length, weight, and time are all
populations. They are usually large, and it is often examples of continuous variables. Since continuous
impossible to get data for every object we're studying. variables are real numbers, we usually round them.
Sampling does not usually occur without cost, and the This implies a boundary depending on the number
more items surveyed, the larger the cost.
of decimal places. For example: 64 is really
• We compute statistics, and use them to estimate anything 63.5 <= x < 64.5. Likewise, if there are two
parameters. The computation is the first part of the decimal places, then 64.03 is really anything 63.025
statistics course (Descriptive Statistics) and the
estimation is the second part (Inferential Statistics). <= x < 63.035. Boundaries always have one more
decimal place than the data and end in a 5.
INTRODUCTION TO STATISTICS
INTRODUCTION TO STATISTICS
• If the probability of having a bomb on a plane is one in a million, the
probability of having two bombs is one in a trillion. Then the safest is
to always get on the plane with a bomb.
• They took a statistician, put one hand in boiling water and one hand
in ice cold water, they said ... how do you feel? “On the average, i feel
fine”