Professional Documents
Culture Documents
Course Purpose
NST 3.2
1. Introduction to statistics
At the end of this course, students should be 2. Presentation of data: bar charts, pie chats, histograms and
able to: frequency, distribution curves,
3. Measures of central tendency: mean and weighted mean,
1. Develop tools for biological data collection, geometric, mean, median and mode,
4. Measures of dispersion and position: range, standard deviation,
2. Describe inferential statistics used in analysis of IQR, SIQR, outliers,
biological data, 5. Probability and probability distributions
6. Probability testing: test of significance (z-test, t-test, f-test),
3. Interpret Biostatistical data confidence intervals, difference between means, univariate and
multivariate analysis,
7. Analysis of variance and covariance (ANOVA & ANCOVA)
8. Parametric and non-parametric tests
– Statistics , data , Biostatistics, • The word ‘statistics’ means one or more measures
– Variable ,Population ,Sample describing the characteristics of a population,
– Survey • The main question that we attempt to answer using
statistics is that there is a relationship between
variables,
• To demonstrate this relationship we (the
researcher/you) must show that when one variable
changes the other variable changes and that the
amount of change is not by mere chance.
1
2/14/2023
2
2/14/2023
• Analysis of data: This involves processing of the observed • There is a general perception that statistical knowledge is all too
frequently misused either deliberately or often due to lack of
data and transforming it to a form most suitable for decision knowledge.
making.
• On the other hand if various studies appear to contradict one
another, then the public may come to distrust such studies.
• Interpretation of data: This is the process of attaching
physical meaning and giving interpretation to the numerical • Most of the time, samples are used to draw conclusions about the
results useful in real life. This interpretation must be true in its population. If an experiment or study was done cautiously and
meaning and sense. The quality of interpretation depends results were interpreted without bias, then the conclusions would
be accurate. However, occasionally the conclusions are inaccurate
more and more on the experience and insight of the person or inaccurately portrayed for various reasons.
carrying out the study or research.
Sources of Data:
3- Surveys: Population and sample
The source may be a survey, if the data needed is about answering certain
questions. • In statistics, the collection of all individuals or items under
consideration is called population.
For example:
• It is the largest collection of values of a random variable for which
If the administrator of a clinic wishes to obtain information regarding the mode
we have an interest at a particular time.
of transportation used by patients to visit the clinic, then a survey may be
For example:
conducted among patients to obtain this information.
The weights of all the children enrolled in a certain elementary school.
4- Experiments.
Populations may be finite or infinite.
Frequently the data needed to answer a question are available only as the
• Sample is that part of the population from which information is
result of an experiment. collected. (Weiss, 1999)
For example: For example:
If a nurse wishes to know which of several strategies is best for maximizing
The weights of only a fraction of these students in a class.
patient compliance, she might conduct an experiment in which the different
strategies of motivating compliance are tried with different patients.
3
2/14/2023
Sources of Bias
Henry Athiany
23 Henry Athiany 24
4
2/14/2023
When to choose the statistical test? The three main aspects of statistics:
When to contact a Biostatistician?
Design: designing the process of data collection (Identify population,
what kind and how much data needed, how to collect a sample)
BEFORE data collection, at DESIGN stage Description: the methods of summarizing/describing data.
5
2/14/2023
• Measures of Frequency: Count, Percent, Frequency. • A data set in its original form is usually very large.
• Measures of Central Tendency : Mean, Median, and Mode, Consequently, such a data set is not very helpful in drawing
conclusions or making decisions. It is easier to draw
SD.
conclusions from summary tables and diagrams than from the
• Measures of Dispersion or Variation: Range, Variance, original version of a data set. So, we reduce data to a
Standard Deviation. manageable size by constructing tables, drawing graphs, or
• Measures of Position : Percentile ,Quartile, Deciles calculating summary measures such as averages. The portion
of statistics that helps us do this type of statistical analysis is
called descriptive statistics.
Descriptive statistics do not generalize beyond the available data
Inferential statistics
Statistics
• Generalize from the sample.
• Deductive Statistics • Hypothesis testing, confidence intervals
– Deduces properties of samples from a complete
• – t-test, Fisher’s Exact, ANOVA, survival
knowledge about population characteristics
analysis, bayesian approaches
• Inductive Statistics
• Making decisions in the face of uncertainty
– Concerned with using known sample information
to draw conclusions, or make inferences regarding
the unknown population
• Same as inferential statistics