Professional Documents
Culture Documents
Sampling: the process of drawing out a number of individuals from a larger population. Two types of
Sampling exists: Probability Sampling (Random) and Non probability Sampling.
A statistical population is the pool of individuals sharing something in common from which a statistical
sample is drawn for study. We have a variable of interest in statistical population.
Random and Non-Random Sample: In Random Sample, each individual has an equal chance/probability
of getting selected. Whereas in Non-Random Sample, Sample is selected on the basis of convenience,
judgement and experience.
Statistics:
In plural form Statistics is the Aggregate of Numerical Facts.
In singular sense, a branch of mathematics dealing with the collection, analysis, interpretation
and presentation of Numerical Facts and Data
And any quantity calculated from sample is called statistic
Sampling Units/Units: members of the population; an object that can be selected with known
probability from sampling frame.
Sample Size (n): the number of observations used for determining the estimates of population. Size of
the sample that has been drawn from population.
Variable of Interest (Random Variable X): variable is a characteristic that varies from person to person,
place to place and system to system. A variable whose values are determined by the outcomes of a
random experiment is called a Random Variable. Every random variable has its own probability
distribution.
Probability Distribution (Distribution of random variable X): A function that gives the probabilities of
occurrences of possible outcomes of an experiment.
Basic Purpose of sampling: to study a small group of people from the large group to derive
inferences that are likely to be applicable to all the people of the large group.
Reliability (standard error of statistic & its exact sampling distribution) : The standard error of a
statistic is the standard deviation of its sampling distribution. Standard error of statistics is
directly related to a test’s reliability. The lower the standard error, the greater the reliability.
Advantages of Sampling:
Sample Design and Sample Survey: Sampling design is a mathematical function that gives you the
probability of any given sample being drawn.
A sample survey is a survey which is carried out using a sampling method, i.e. in which the
selected sample of elements from a target population is surveyed only, and not the whole
population is surveyed.
Sampling Frame: is the list of all units of the population of interest from which the sample is drawn.
Probability & non-probability Sampling: Probability sampling involves random selection, allowing
you to make strong statistical inferences about the whole group in which the subjects of the
population get an equal opportunity to be selected as a representative sample. Non-probability
sampling involves non-random selection based on convenience or other criteria, allowing you to
easily collect data.
Sampling with and without replacement: with replacement: selected subjects are put back into the
population before another subject are sampled. Subject can possibly be selected more than
once.it generally produces independent events because the outcome of one random draw is not
affected by the previous draw.
Without replacement: you don’t replace the first item you choose before you choose a second.
All selected subjects are unique that is not selected more than once. the items in the sample
are dependent because the outcome of one random draw is affected by the previous draw.
Sampling and non-sampling errors: A sampling error is a statistical error that occurs when an
analyst does not select a sample that represents the entire population of data
A non-sampling error is a term used in statistics that refers to an error that occurs during data
collection, causing the data to differ from the true values.
Sampling Bias: sampling bias is a bias in which a sample is collected in such a way that some
members of the intended population have a lower or higher sampling probability than others.
Random Number Table: A random number table is a set of numbers usually with five digits
arranged in rows and columns. A random number table is used to select a sample of the
population randomly and fairly.
Probability sampling (SRS, Stratified Sampling, Systematic sampling, Cluster Sampling, Multi Stage,
Multi-Phase Sampling, Sequential sampling)
A simple random sample (SRS) of size n is produced by a scheme which ensures that each
subgroup of the population of size n has an equal probability of being chosen as the sample.
Stratified Sampling refers to the random sampling techniques that groups items of whole
population into different groups called strata, based on their similar characteristics.then samples
from each stratum is taken proportionately or disproportionately.
Systematic sampling is similar to simple random sampling, but it is usually slightly easier to
conduct. Every member of the population is listed with a number, but instead of randomly
generating numbers, individuals are chosen at regular intervals.
Cluster sampling is a probability sampling method in which you divide a population into clusters,
such as districts or schools, and then randomly select some of these clusters as your sample.
In sequential sampling, a sequence of one or more samples is taken from a group. Once the
group has been sampled, a hypothesis test is performed to see if you can reach a conclusion. If
you can't, the whole procedure is repeated.
Quota sampling: selecting participants in numbers proportionate to their numbers in the larger
population, no randomization.
Sampling distribution of Mean: The Sampling Distribution of the Mean is the mean of the
population from where the items are sampled.
Central Limit Theorem: The central limit theorem in statistics states that, given a sufficiently
large sample size, the sampling distribution of the mean for a variable will approximate a normal
distribution regardless of that variable’s distribution in the population.