Bio-Stat Class 2 and 3
Probability
• Proportion of favorable outcomes out of the total outcomes
• Relative frequency of an outcome
• Chance that an outcome of an experiment occurs
• Denoted by p
• So probability p = favorable outcomes / total outcomes = m / n
Example
• In coin tossing, total outcomes = {head, tail} = 2
– Probability of head = P(H) = p = 1/2
– Probability of not head = P(T) = q = 1 - 1/2 = 1/2
• In dice tossing, total outcomes = {1, 2, 3, 4, 5, 6} = 6
– Probability of 5 = P(5) = 1/6
– Probability of 1 = P(1) = 1/6
• Probability of an event not happening = q
• So p + q = 1
• Probability of a sure event = 1
• Probability of an impossible event = 0
• Therefore probability lies between 0 and 1
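The rules above can be checked with a short Python sketch. The helper name `probability` is an assumption introduced for illustration only; it divides favorable outcomes by total outcomes using exact fractions.

```python
from fractions import Fraction

def probability(favorable, total):
    """Classical probability p = m / n for equally likely outcomes."""
    if total <= 0 or not 0 <= favorable <= total:
        raise ValueError("need total > 0 and 0 <= favorable <= total")
    return Fraction(favorable, total)

p_head = probability(1, 2)        # coin: one favorable outcome out of two
q_head = 1 - p_head               # probability of "not head"
p_five = probability(1, 6)        # die: one face out of six
p_sure = probability(6, 6)        # sure event: any face from 1 to 6
p_impossible = probability(0, 6)  # impossible event: e.g. a face showing 7
```

Because p and q are complements, `p_head + q_head` always equals 1, and every value returned lies between 0 and 1.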
Terms used in probability
Screening test
MCQ
1. A bag contains 3 red and 5 green marbles. A
marble is drawn at random. The probability of
drawing a blue marble is
a. 5/8 b. 3/8 c. 0/8 d. 1/8
2. The sum of the probabilities of an event and its non-event
(complement) is:
a. 2 b. 1 c. 0 d. 0.5
3. If three coins are tossed simultaneously, then the
total number of outcomes will be:
a. 6 b. 3 c. 8 d. 1
• { H,T} =2
• {H,T} =2
• {H,T} =2
• 2X2X2 =8 outcomes
• {HHH, HHT, HTH, THH, ………TTT} =8
• P(all head) =P(HHH) = 1/8
• P(TTT) = 1/8
• P ( two head and one tail) = 3/8
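The three-coin counts above can be reproduced by listing every outcome. This is a minimal sketch; the helper name `prob` is hypothetical and assumes all eight outcomes are equally likely.

```python
from fractions import Fraction
from itertools import product

# Every ordered outcome of tossing three coins: HHH, HHT, ..., TTT
outcomes = ["".join(toss) for toss in product("HT", repeat=3)]
total = len(outcomes)  # 2 x 2 x 2

def prob(event):
    """Probability of an event given as a true/false test on an outcome."""
    favorable = sum(1 for outcome in outcomes if event(outcome))
    return Fraction(favorable, total)

p_all_heads = prob(lambda o: o == "HHH")
p_all_tails = prob(lambda o: o == "TTT")
p_two_heads_one_tail = prob(lambda o: o.count("H") == 2)
```

The three outcomes counted for two heads and one tail are HHT, HTH and THH.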
• When 2 dice are tossed at once, how many outcomes are there?
• 6 × 6 = 36
• First die (columns): 1 2 3 4 5 6
• Second die (rows): 1 2 3 4 5 6
• Row 1 begins: (1,1) (1,2) (1,3), and so on for the remaining cells
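The two-dice grid can be generated rather than written out by hand. This sketch assumes each cell is labeled (row, column), matching the entries (1,1), (1,2), (1,3) shown in the first row.

```python
from itertools import product

faces = range(1, 7)

# All ordered pairs of faces for two dice
pairs = list(product(faces, repeat=2))

# Grid with one row per face of the second die and one column per face
# of the first die; each cell is labeled (row, column)
grid = [[(row, col) for col in faces] for row in faces]

for row in grid:
    print(" ".join(f"({r},{c})" for r, c in row))
```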
Frequency distribution
X (value)    F (frequency)
20           3
30           5
40           10
50           5
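Since probability was described earlier as a relative frequency, the table can be converted into relative frequencies by dividing each F by the total. A minimal sketch using the values from the table (variable names are illustrative):

```python
from fractions import Fraction

values = [20, 30, 40, 50]   # X column
freqs = [3, 5, 10, 5]       # F column

n = sum(freqs)  # total number of observations

# Relative frequency of each value = F / total
relative = [Fraction(f, n) for f in freqs]

# Mean of the frequency distribution = sum(X * F) / sum(F)
mean_x = sum(x * f for x, f in zip(values, freqs)) / n

for x, f, r in zip(values, freqs, relative):
    print(f"X={x}  F={f}  relative frequency={r}")
```

The relative frequencies sum to 1, just as the probabilities of all possible outcomes do.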
Probability distribution
• Discrete probability distribution: based on
discrete random variables, e.g. no. of deaths, no.
of births, household size
– Binomial probability distribution
– Poisson probability distribution
• Continuous probability distribution: based on
continuous random variables, e.g. height, weight,
marks, income, expenditure, etc.
– Normal probability distribution
Binomial probability distribution
• Also known as Bernoulli trials or a Bernoulli process
• Deals with
– experiments having two mutually exclusive
outcomes (binary events),
– independent trials,
– constant probability of success (p); the outcome not
happening is called failure (q) [p + q = 1],
– a finite number of trials (n < 20)
• Coin tossing(head/tail), birth(male/female),
test(positive/negative), result(pass/fail) etc.
Binomial probability
• The outcome happening is called success, with
probability p; the outcome not happening has
probability q
• So p + q = 1
• The parameters of the binomial distribution are n and p
• Mean = np
• Variance = npq
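The mean and variance formulas can be verified numerically. This sketch uses the standard binomial formula P(X = k) = C(n, k) p^k q^(n-k), which is not written on the slide; the values n = 10 and p = 0.5 are hypothetical.

```python
from math import comb

def binomial_pmf(k, n, p):
    """P(X = k) for n independent trials with success probability p."""
    q = 1 - p
    return comb(n, k) * p**k * q**(n - k)

n, p = 10, 0.5  # hypothetical: 10 coin tosses, success = head
q = 1 - p

pmf = [binomial_pmf(k, n, p) for k in range(n + 1)]

# Mean and variance computed directly from the distribution
mean = sum(k * pk for k, pk in enumerate(pmf))
variance = sum((k - mean) ** 2 * pk for k, pk in enumerate(pmf))

print(mean, n * p)          # both should agree with np
print(variance, n * p * q)  # both should agree with npq
```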
Poisson probability distribution
MCQ
• A variable that can assume any value between two given points is called
___________
a) Continuous random variable
b) Discrete random variable
c) Irregular random variable
d) Uncertain random variable
• Which of the following probability distributions is a continuous probability
distribution?
a) Gaussian probability Distribution
b) Poisson probability Distribution
c) Binomial probability Distribution
d) none of them
• A variable which can assume a finite or countably infinite number of values is known
as:
a. Continuous b. Discrete c. Qualitative d. None of them
• Total area under the curve of a continuous probability density function is always
equal to:
a. Zero b. One c. -1 d. None of them
MCQ
• Which of the following is not a characteristic of the
normal distribution?
a. the mean is always zero
b. the area under the curve equals one
c. the mean, median and mode are equal
d. it is a symmetrical distribution
• The parameters of the normal distribution are:
a. Mean and median
b. Mean and standard deviation
c. Mean and mode
d. Mean, median and mode
Estimation
• Point estimation: estimation of a population
parameter by a single value
Example: estimation of mean and estimation of
prevalence
• Interval estimation: estimation of a population
parameter within a certain range
• Confidence interval: interval estimate with a
stated level of confidence, i.e. a certain level of
probability
• 90% CI (Z = 1.645), 95% CI (Z = 1.96) and 99%
CI (Z = 2.58)
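The Z values listed above are rounded quantiles of the standard normal distribution. A minimal sketch, using Python's standard-library `statistics.NormalDist`, shows where they come from; the function name `z_for_confidence` is an assumption for illustration.

```python
from statistics import NormalDist

def z_for_confidence(level):
    """Two-sided critical Z value for a confidence level such as 0.95."""
    alpha = 1 - level
    # Leave alpha/2 in each tail of the standard normal curve
    return NormalDist().inv_cdf(1 - alpha / 2)

for level in (0.90, 0.95, 0.99):
    print(f"{level:.0%} CI: Z = {z_for_confidence(level):.3f}")
```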
Confidence Interval (CI)
95% CI = Mean ± 1.96 SE
where SE for the sample mean = SD/√n
Standard error: variability of sample means,
calculated as SD/√n
Standard deviation: variability of individual observations
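The interval Mean ± 1.96 SE can be computed as follows. This is a sketch under stated assumptions: the blood pressure readings are made-up values, and the sample SD is used in place of a known population SD. For small samples the multiplier 1.96 would normally be replaced by a t value.

```python
from math import sqrt
from statistics import mean, stdev

def confidence_interval_95(sample):
    """Return (lower, upper) limits of Mean +/- 1.96 * SE."""
    n = len(sample)
    m = mean(sample)
    se = stdev(sample) / sqrt(n)  # SE = SD / sqrt(n)
    return m - 1.96 * se, m + 1.96 * se

# Hypothetical systolic blood pressure readings (mmHg)
readings = [118, 122, 125, 115, 120, 119, 121, 124, 117, 119]

lower, upper = confidence_interval_95(readings)
print(f"mean = {mean(readings):.1f}, 95% CI = ({lower:.1f}, {upper:.1f})")
```

A larger sample size n shrinks SE and therefore narrows the interval.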
Z value for α and β
Which one is reliable??
• We are 95% confident that the mean systolic blood
pressure (point estimate 120) lies between 110 and 130
Hypothesis testing
• Null hypothesis(H0):
– Two means are equal ( not significantly different)
– Two proportions are equal
– There is no correlation between two variables
– There is no association between two variables
• Alternative hypothesis(H1)
– Two means are not equal (significantly different)
– Two proportions are significantly different
– There is significant correlation between two variables
– There is significant association between two variables
Types of alternative hypothesis
Two tailed test
[Figure: normal curve with central area 0.95 between Z = -1.96 and Z = 1.96]
Reject H0 if the sample mean falls in either of the two tail regions
One tailed test
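[Figure: normal curve for a one-tailed test; area 0.45 + 0.5 = 0.95 on one side of the critical value Z = -1.645, with H0 rejected in the remaining 0.05 tail]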
Errors in hypothesis
False positive and false negative
P value
• In technical terms, a P value is the probability of
obtaining an effect at least as extreme as the
one in our sample data, assuming the truth of
the null hypothesis.
• High P values: Our data are likely with a true null
hypothesis
• Low P values: Our data are unlikely with a true
null hypothesis
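For a test statistic that follows the standard normal distribution under the null hypothesis, the P value can be computed directly. This is a minimal sketch for the two-sided case; the function name `two_sided_p_value` is an assumption for illustration.

```python
from statistics import NormalDist

def two_sided_p_value(z):
    """Probability of a Z at least as extreme as the observed one,
    in either direction, assuming the null hypothesis is true."""
    return 2 * (1 - NormalDist().cdf(abs(z)))

for z in (0.5, 1.96, 3.0):
    print(f"Z = {z}: P = {two_sided_p_value(z):.4f}")
```

An observed Z of 1.96 gives a P value of about 0.05, which is why 1.96 serves as the two-tailed cut-off at the 5% level.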
Interpretation of p value
Quantitative Variable
H0: Distribution of sample is normal
Selection of test
ANOVA test
More than two groups or samples
Samples          Comparison of    Parametric test          Non-parametric test
                                  (follows normality)      (does not follow normality)
More than two    Averages
Measures of Association
Samples Variable 1 Variable 2 Statistics
Z test
• Randomness
• Known variance
• Sample size > 30
• Normality
Interpretation
• If |calculated Z| > tabulated Z value (Z = 1.96 at the
5% level of significance, two-tailed), reject the null
hypothesis; otherwise do not reject the null
hypothesis
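The decision rule above can be sketched for the case of a sample mean compared with a population mean. The formula Z = (sample mean - population mean) / (SD/√n) is the standard one-sample Z statistic and is not written on the slide; the function names and all numbers are hypothetical.

```python
from math import sqrt

def one_sample_z(sample_mean, pop_mean, pop_sd, n):
    """Z statistic for a sample mean against a known population mean and SD."""
    return (sample_mean - pop_mean) / (pop_sd / sqrt(n))

def reject_null(z, critical=1.96):
    """Two-tailed decision at the 5% level: reject H0 if |Z| > critical."""
    return abs(z) > critical

# Hypothetical data: n = 100, sample mean 124, population mean 120, SD 15
z = one_sample_z(sample_mean=124, pop_mean=120, pop_sd=15, n=100)
print(f"Z = {z:.2f}, reject H0: {reject_null(z)}")
```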
Types of Z test
• Comparison between
– Sample mean and population mean
– Two sample means
– Sample proportion and population proportion
– Two sample proportions
T test and types
• Randomness
• Normality
• Sample size less than 30
• Unknown variance
Degrees of freedom
= n - 1 (for the one-sample mean test and for paired data)
= (n1 - 1) + (n2 - 1) = n1 + n2 - 2 (for the two-sample mean
test)
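The degrees-of-freedom rules can be sketched alongside the one-sample t statistic. The formula t = (sample mean - hypothesized mean) / (SD/√n) is the standard one and is not written on the slide; the function names and data are hypothetical. The resulting t is compared with the tabulated t value for the given degrees of freedom.

```python
from math import sqrt
from statistics import mean, stdev

def one_sample_t(sample, mu0):
    """Return (t statistic, degrees of freedom) for a one-sample mean test."""
    n = len(sample)
    t = (mean(sample) - mu0) / (stdev(sample) / sqrt(n))
    return t, n - 1

def df_two_sample(n1, n2):
    """Degrees of freedom for the two-sample mean test."""
    return (n1 - 1) + (n2 - 1)

# Hypothetical small sample (n < 30) tested against a mean of 2
t, df = one_sample_t([1, 2, 3, 4, 5], mu0=2)
print(f"t = {t:.3f}, df = {df}")
print("two-sample df:", df_two_sample(12, 15))
```

For paired data, the same one-sample calculation is applied to the within-pair differences, so n is the number of pairs.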
Chi square test
Assumptions: use when
i. Row and column values in the contingency
table are categorical or qualitative data
ii. None of the cells has an expected frequency of zero
iii. Expected cell frequency is at least five
iv. Sample size is adequate (n ≥ 50)
Types
i) Test of association between two categorical variables
ii) Test of goodness of fit
Contingency table (2 x 2 table)
                        Variable 2 (column)
Variable 1 (row)        Yes        No         Total
Yes                     a          b          a+b
No                      c          d          c+d
Total                   a+c        b+d        N = a+b+c+d
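The chi-square statistic for a 2 x 2 table can be computed from the cell counts a, b, c, d. This sketch uses the standard formula, sum of (observed - expected)² / expected, where expected = row total × column total / N; the formula is not written on the slide and the counts are hypothetical. With 1 degree of freedom, the usual tabulated critical value at the 5% level is 3.84.

```python
def chi_square_2x2(a, b, c, d):
    """Return (chi-square statistic, expected frequencies) for a 2 x 2 table."""
    n = a + b + c + d
    observed = [[a, b], [c, d]]
    row_totals = [a + b, c + d]
    col_totals = [a + c, b + d]
    # Expected frequency of each cell under no association
    expected = [[row_totals[i] * col_totals[j] / n for j in range(2)]
                for i in range(2)]
    chi2 = sum((observed[i][j] - expected[i][j]) ** 2 / expected[i][j]
               for i in range(2) for j in range(2))
    return chi2, expected

# Hypothetical counts: a=20, b=30, c=30, d=20
chi2, expected = chi_square_2x2(20, 30, 30, 20)
smallest_expected = min(min(row) for row in expected)
print(f"chi-square = {chi2:.2f}, smallest expected frequency = {smallest_expected}")
print("reject H0 at 5% level:", chi2 > 3.84)
```

Checking the smallest expected frequency before interpreting the result mirrors the assumption that every expected cell frequency should be at least five.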
MCQ
1. A statement made about a population for testing purpose is
called?
a) Statistic b) Hypothesis c) Level of Significance d) Test-Statistic
2. If the null hypothesis is false then which of the following is
accepted?
a) Null Hypothesis b) Positive Hypothesis
c) Negative Hypothesis d) Alternative Hypothesis
3. The point where the null hypothesis gets rejected is called the?
a) Significant Value b) Rejection Value
c) Acceptance Value d) Critical Value
4. The alternative hypothesis is also called:
a) Statistical hypothesis b) research hypothesis
c) Simple hypothesis d) null hypothesis
MCQ