Professional Documents
Culture Documents
BIOSTATISTICS-
–Is the branch of statistics applied to biological or medical
sciences (biometry).
OR
- Is that branch of statistics concerned with mathematical
facts and data relating to biological events.
Functions Of Biostatistics -
eg. Following are the patient age group who attended a dental
college.
Age, - 10,11,24,25,35,40,45,41,54,15,13,33,45,42, 34,43,51,52
Disadvantages -
-Brief statistical analysis not possible
-It gives only the rough picture of data.
-Sometime misleading.
BAR CHARTS/diagrams -
The bar charts are popular media of presentation statistical data and
enables values to compare usually.
Length of the bars, drawn vertical or horizontal, indicates the frequency of
a character.
Bar chart or diagram is a popular and easy method adopted for visual
comparison of the magnitude of different frequencies in discrete data, such
as of morbidity, mortality, immunization status of population in different
ages, sexes.
Bars may be drawn in ascending or descending order of magnitude or in
the serial order of events.
Spacing between any two bars should be nearly equal to half of the width
of the bar.
There are three types of bar diagrams:
Simple
multiple and
proportional bar diagram for comparison of data.
HISTOGRAM
It's a pictorial diagram of
frequency distribution.
It consists of series of
blocks.
The class intervals are
given along the horizontal
axis & the frequency along
the vertical axis.
The area of each block or
rectangle is proportional
to the frequency.
FREQUENCY POLYGON
A frequency distribution
may also be represented
diagrammatically by the
frequency polygon.
Disadvantages of Mean
1) Since it is based on all the values of observation even
if one of the observation is missing it can't be
calculated.
2) Affected by extreme values.
MEDIAN -
83 81 2
It is the average of
75 81 6
deviation from the
arithmetic mean 81 81 0
MD= ( x-x̄) 79 81 2
n 71 81 10
x=each values 95 81 14
x̄=mean 75 81 6
77 81 4
n=number of
values 84 81 3
MD=56/10=5.6 90 81 9
Total 56
STANDARD DEVIATION
Most important and widely used measure of studying
dispersion.
Also know as root mean square deviation.
Greater the standard deviation, greater will be the
magnitude of dispersion from the mean.
A small SD means a higher degree of uniformity of the
observations.
STANDARD DEVIATION
• SD = ( x-x̄ )2
n
If sample size is more than 30
Total=65.5
Test of significance -
A statistical hypothesis test (also known as test of
significance) is a method of statistical inference using
data from a scientific study. Test of significance helps to
decide on the basis of a sample data, whether a hypothesis
about the population is likely to be true or false.
This is test to estimate or compare the significant
difference between 2 or more sample.
The test verifies whether the difference are real or
made due to variation in the sampling (by chance).
Hence the test of significance, distinguishes between
the real difference & chance differences.
Test of Significance of Difference in Means
Tests of significance of difference in means are discussed
under two heads.
1. Z-test for large samples.
2. t-test for small samples applied as:
i. Unpaired t-test (two independent samples); and
ii. Paired t-test (single sample correlated observations).
The two essential conditions for application of these tests are:
i. Samples are selected randomly from the corresponding
populations.
ii. There should be homogeneity of variances in the two
samples.
To test the homogeneity of variances-Fisher’s F-test
also called variance ratio test is applied.
If the difference is found to be insignificant, there is
homogeneity of variances and the Z-test and or t-test
can be carried out.
STANDARD ERROR OF MEAN (SE X̄ )
Gives the standard deviation of the means of several
samples from the same population.
SE X̄=SD/√n
SD=standard deviation
n=no of observation
Standard error of proportion
Standard error of proportion may be defined as a unit
that measures variation which occurs by chance in the
proportions of a character from sample to sample or
from sample to population or vice versa in a qualitative
data.
Standard error of proportion(SE p)=√pq/n
p=population proportion
q=1-p
n=sample size.
Standard error of difference between two mean
PAIRED t-TEST
(If observations UNPAIRED t
made on the same TEST(if
unit of study observations can
before and after be made from 2
interventation) different groups)
T-TEST FOR COMPARING PAIRED
OBSERVATIONS
When each individual gives a pair of observations, to
test for the difference in the pair of values, paired t
test is utilized.
The test procedure for testing the significance of
difference is as followes-
1. Find the difference in each set of paired
observations before and after (X1 – X2 = x).
2. Calculate the mean of the difference (x̄).
3. Work out the SD of differences and then the SE of
mean from the same.
4. Determine ‘t’ value by substituting the above values
in the formula.
t=x̄ ∕ (SD/√n)
As per null hypothesis, there should be no real
difference in means of two sets of observations
5.Find the degrees of freedom-degree of freedom is n –
1.
6.Compair the calculated t value with the table value
for (n-1) d.f. to find the p value.
7.If the probability (P) is more than 0.05, the difference
observed has no significance. But if P is less than 0.05,
the difference observed is significant.
ANALYSIS OF VARIANCE
While the t-test can be used only to test two group means, ANOVA
can test two or more than two group means.
In ANOVA, we use an F-distribution. We draw a
sample from one population and draw a second
sample from another population, then we calculate
the F-ratio of two variances by the formula
F = S12 / S22
ASSUMPTIONS for ANOVA
χ2=Σ (O-E)2
E
O= observed frequency .
E= expected frequency.
Steps Involved In Applying Chi-square Test
EXAMPLE
To study the gender-wise prevalence of oral cancer
Gender Oral cancer No oral
cancer
Males 38 7 45
Females 29 17 46
67 24 91
Exposure rates
Cases : 𝑎/ 𝑎+𝑐
Controls : 𝑏/ 𝑏+𝑐
Case control studies of smoking and lung cancer
Exposure rate: Cases Controls
(with lung (Without
cases=a/(a+c)=33/35 cancer) lung
cancer)
=94.2%
control=b/(b+d)=55/82 Smokers 33(a) 55(b)
=67%
Non 2(c) 27(d)
this shows frequency rate smokers
of lung cancer is Total 35(a+c) 82(b+d)
definitely higher among
smokers than among
non- smokers.
ESTIMATION OF RELATIVE RISK(Odds
ratio/relative odds)
The second analytical step is estimation of disease risk
associated with exposure.
Estimation of risk: it is obtained by an index termed as
“relative risk” or “risk ratio”, which is defined as the
probability of an event(developing a disease) occurring in
exposed people compared to the probability of the event in
non-exposed people, or the as the ratio of the two probabilities.
Relative risk = risk in exposed / risk in non-exposed.
or odd ratio=odds that case was exposed/odds that control was
exposed.
Odds ratio is a Key Parameter in the analysis of case control
studies.
INTERPRETING ODDS RATIO
OR = 1
-Odds of exposure among cases
and controls are same
-Exposure is not associated with
disease
OR > 1
-Odds of exposure among cases
are higher than controls
-Exposure is positively associated
with disease
OR < 1
- Odds of exposure among cases
are lower than controls
- Exposure is negatively associated
with disease
ADVANTAGES DISADVANTAGES
Relatively easy to carry out Problems of bias
Rapid and inexpensive Selection of an appropriate
compared to cohort study control group may be difficult
Requires comparatively few
subjects Incidence cannot be
No risk to the subject measured, only relative risk
Allows the study of several
can be estimated
Not suited for evaluation of
different etiological factors
like effect of smoking, therapy or prophylaxis of
physical activity, diet disease
COHORT STUDY:
Definition: it is a type of study usually undertaken to
obtain additional evidence to refute or support the
existence of an association between suspected cause
and disease.
Features of cohort studies
1. Cohorts are identified prior the appearance of the
disease under investigation .
2. The study groups are observed over a period of time
to determine the frequency of the disease among them
3. The study proceeds forward from cost to effect
COHORT SELECTION :
Cohort is a group of people who share a common
characteristic or experience within a definite time
period.
For selecting a cohort the following facts should be
considered
1. Cohorts must be free form the disease under study. 2.
Both the study and control cohorts should be equally
susceptible to the disease under study.
3. Both the cohort must be comparable in respect to all
possible variable which may Influence the frequency of
the disease.
STEPS OF COHORT STUDY
Selection of study subjects
Obtaining data on exposure
Selection of comparison groups
Follow up
Analysis
SELECTION OF STUDY SUBJECT
Subjects of a cohort study are usually assembled from
general population or special groups .