Professional Documents
Culture Documents
3rd Quarter
Classifications of Variables:
1. Qualitative - can be placed into distinct categories, according to some characteristic or
attribute.
2. Quantitative - are numerical and can be ordered or ranked.
EXPERIMENT - Any activity, which can be done repeatedly under similar conditions.
OUTCOME - A result of an experiment.
EVENT/SAMPLE POINT - A subset/element of a sample space.
VARIABLE - A characteristic or attribute that can assume different values.
RANDOM VARIABLE
A function that assigns a real number to each element of the sample space. It is a variable
whose values are determined by chance. Random variables are denoted by uppercase letters
such as X, Y, or Z and the realizations or the possible values of these random variables are
denoted by the corresponding lowercase letters like x ,y, or z.
Mean – the sum of a collection of numerical data divided by the number of data items.
– the most reliable, but the most sensitive measure of average.
Variance – a measure based on the deviations of individual scores about the mean. It is
simply standard deviation squared.
Standard deviation – a measure of variability which is most commonly used as guide for the
degree of dispersion or spread. It is the square root of variance.
Mean of a Random Variable – the measure of the central location of a random variable. It is
also called the expected value. It is calculated as the weighted mean of the values where the
weights are their corresponding probabilities.
When all the values are constant, there are no variations in the possible values. Thus, all the
measures of variability will then be 0. A small value of standard deviation (or variance)
means that the dispersion of the random variable is narrowly concentrated around the mean.
A large value of standard deviation (or variance) means that the distribution is spread out,
with some possibility of observing values at some distance from the mean.
Formulas:
𝝁𝒙 = 𝑿𝟏 ∙ 𝑷(𝑿𝟏)+ 𝑿𝟐 ∙ 𝐏 (𝑿𝟐)+…+ 𝑿𝒏 ∙ 𝐏 (𝑿𝒏)
𝝈𝟐 = ∑[𝑿2 ∙ 𝑷(𝑿)] − 𝝁𝟐
𝝈 = 𝝈𝟐
The concept of expected value is very useful in games of chance, where the participants are
interested in expected gains or losses. They look at some strategies or options that will help in
their decision-making so that they can maximize their gains and minimize their losses. Others
look at the price of their bets in games, and check whether it is fair or not.
A fair game occurs if the expected value of the gain (or loss) equals 0. This means that the
expected value of the total winnings is equal to the bet.
Problem:
A school organized a fund-raising activity, it involved selling of 500 raffle tickets. One ticket
costs P100. The lone winner of this raffle will receive P10 000. What is your expected net
gain if you buy one raffle ticket?
Solution:
Let X be the amount of the net gain for this fund-raising activity. The net gain is the
difference between the amount to be on and the ticket price. Thus, the possible values of X
are P9 900 (if you win) and - P100 (if you lose). Since there are 500 tickets, the probability of
1 499
winning is 500
and the probability of losing is 500
The expected value of X is,
1 499
E[X] = 9 900 ( 500 ) + (−100) ( 500 ) = -80
Note that the expected value is negative. This means that you expect a loss of P80 for each
raffle ticket you buy.
Normal random variable is a set of numerical value of continuous random variable which is
normally distributed.
Normal distribution is a continuous probability distribution for a random variable X. It also
represents the probability, or the proportion, or the percentage associated with specific sets of
measurement values.
It is also known as the Gaussian distribution in honor of the German mathematician Johann
Carl Friedrich Gauss who derived its equation.The graph of a normal distribution is called
the normal curve.
Formula:
The mathematical equation for a normal distribution is:
(𝑋−μ)2
𝑒−
y= 2 σ2
2 2π
● e ≈ 2.718
● π ≈ 3.14
● μ = the population mean
● σ = population standard deviation
● σ2 = variance
The shape and position of a normal distribution curve depend on two parameters, the mean
and the standard deviation. Each normally distributed variable has its own normal distribution
curve, which depends on the values of the variable’s mean and standard deviation.
The area under the normal curve are given in terms of z-values or scores. The z-value leads to
the area under the curve found in the normal curve table, which is a probability, and that
probability gives the desired percentage for X.
The Table of Areas under the normal curve is known as the z-table.
𝑋−𝜇
𝑧= 𝜎
X- given measurement
𝜇 − 𝑝𝑜𝑝𝑢𝑙𝑎𝑡𝑖𝑜𝑛 𝑚𝑒𝑎𝑛
𝜎 − 𝑝𝑜𝑝𝑢𝑙𝑎𝑡𝑖𝑜𝑛 𝑠𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑑𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛
Problem:
Find the area under the standard normal curve between X = 40 and X = 55 with a mean of 50
and standard deviation of 10.
Solution:
𝑋−𝜇
𝑧= 𝜎
For x = 40 z = -1
For x = 55 z = 0.5
Area = Z0.5-Z-1
Area = .6915 - .1587
Area = .5328
Z-table
Is a table of areas under the normal curve. The z-table provides the proportion of the area
between any two specific values under the curve, regions under the curve can be described in
terms of area.
When the z-value is to the left or any related terms just write the value.
When the z-value is to the right or any related terms, subtract 1 by the value.
When the shaded region is in between of the two z-value, subtract the bigger by the smallest
value.
Lesson: Identifying the Different Random Sampling
Techniques
POPULATION
It includes all of its elements from a set of data. The size of the population is the number of
observations in the population.
SAMPLE
It consists of one or more data drawn from the population.
RANDOM SAMPLING
is a sampling method of choosing representatives from the population wherein every sample
has an equal chance of being selected. Accurate data can be collected using random sampling
techniques.
PROBABILITY SAMPLING
Sampling techniques that involve random selection.
NON-PROBABILITY SAMPLING
Sampling techniques that do not involve random selection of the data.
PARAMETER
is a measure that is used to describe the population.
STATISTIC
is a measure that is used to describe the sample.
Formulas:
POPULATION
Σ𝑋
𝝁 (mean) = 𝑁
Σ𝑋−𝝁)2
𝝈𝟐 (variance) = 𝑁
Σ𝑋−𝝁)2
𝝈 (standard deviation) = 𝑁
SAMPLE
Σ𝑋
X̅ (mean) = 𝑁
Σ𝑋−𝑥̅̅)2
S2 (variance) = 𝑁
Σ𝑋−𝑥̅)2
S (standard deviation) = 𝑁
Sampling distribution of the sample means is a frequency distribution using the computed
sample mean from all the possible random samples of a particular sample size taken from the
given population.
1. Determine the number of sets of all possible random samples that can be drawn from the
given population by using the formula, NCn, where N is the population size and n is the
sample size.
2. List all the possible random samples and solve for the sample mean of each set of samples.
3. Construct a frequency and probability distribution table of the sample means indicating its
number of occurrence or the frequency and probability.