Professional Documents
Culture Documents
Craig Jackson Prof. Occupational Health Psychology Faculty of Education, Law & Social Sciences BCU
craig.jackson@bcu.ac.uk
Keep it simple
Some people hate the very name of statistics but.....their power of dealing with complicated phenomena is extraordinary. They are the only tools by which an opening can be cut through the formidable thicket of difficulties that bars the path of those who pursue the science of man.
Sir Francis Galton, 1889
Ordinal Grade of condition Positions 1st 2nd 3rd Better- Same-Worse Height groups
Allows Continuous variables to be changed into Nominal variables BP BP > 90mmHg = = Hypertensive Normotensive =< 90mmHg
Easier clinical decisions Categorisation reduces quality of data Statistical tests may be more sensational Good for summaries
BMI
Obese vs Underweight
INFERENTIAL phenomena
STATISTICS
Inferences Proving or
about disproving
Multiple Measurement or. 25 cells why statisticians and love dont mix
22 cells
26
25
24
24 cells
23
22
21
21 cells 92 cells
20
Total Mean
= =
Central Tendency
Mode Median
Patient comfort rating 10 9 8 7 6 5
Mean
4 3 2 1
31
27
70
121
140
129
128
90
80
62 Frequency
Dispersion
Range Spread of data
82mmHg
Mode
Dispersion
An individual score therefore possess a standard deviation (away from the mean), which can be positive or negative Depending on which side of the mean the score is
If add the positive and negative deviations together, it equals zero (the positives and negatives cancel out)
negative deviation positive deviation
Dispersion
Range The interval between the highest and lowest measures Limited value as it involves the two most extreme (likely faulty) measures Percentile The value below / above which a particular percentage of values fall (median is the 50th percentile) e.g 5th percentile - 5% of values fall below it, 95% of values fall above 5th 1st it. 25th 50th 75th 95th 99th A series of percentiles (1st, 5th, 25th, 50th, 75th, 95, 99th) Range gives a good general idea of the scatter and shape of the data
56
57
58
59
510
511
61
62
63
64
Standard Deviation
To get around observations this, we square each of the
Makes all the values positive (a minus times a minus.) Then sum all those squared observations to calculate the mean This gives the variance - where every observation is squared Need to take the square root of the variance, to get the standard deviation
Grouped Data
Normal Distribution SD is useful because of the shape distributions of data. Symmetrical, bell-shaped / normal / distribution Non Normal Distribution Some distributions fail to be symmetrical If the tail on the left is longer than the right, the distribution is negatively skewed (to the left) of many
Gaussian
If the tail on the right is longer than the left, the distribution is positively skewed (to the right)
Normal Distributions
Standard Normal Distribution has a mean of 0 and a standard deviation of 1 The total area under the curve amounts to 100% / central value (mean) unity of the observations
3 SD
2 SD
1 SD
0 SD
1 SD
2 SD
3 SD
Proportions of observations within any given range can be obtained from the distribution by using statistical tables of the standard normal distribution
Distributions
Sir Francis Galton (1822-1911) Alumni of Birmingham University 9 books and > 200 papers Fingerprints, correlation of calculus, twins, neuropsychology, blood transfusions, travel in undeveloped countries, criminality and meteorology)
% of population
with
improving
standards
of
56 57 58 59 61 62 63 64
510
511
Obvious implications for the size of samples of populations used The more lead shot runs through the quincunx machine, the smoother the distribution
Presentation of data
Table of means Exposed n=197 Age 45.5 (yrs) ( 9.4) I.Q 105 ( 10.8) 115.1 Controls T n=178 48.9 99 ( 7.3) ( 8.7) 94.7 ( 12.4) P 2.19 0.07 1.78 0.12 3.76
0.04
Speed
(ms) ( 13.4)
Presentation of data
Category tables
Exposed Healthy 50 Controls 150 200
Unwell
147 197
28 178
175 375
Bar Charts
A set of measurements can be presented either as a table or as a figure Graphs are not always as accurate as tables, but Title of graph portray trends more easily y-axis
Legend key
scale
Bar Charts
Some Real Data A combination of distributions facilitate comparisons
7000 Vacation 6000 5000 Votes 4000 3000 2000 1000 0 1 2 3 4 5 6 7 User rating 8 9 10 Empire
is
acceptable
to
ordinate
abscissa Two perpendicular lines are drawn through the medians - dividing the plot into quadrants Each quadrant should outlie 25% of all observations
Correlations between 0 and 0.3 are weak Correlations between 0.4 and 0.7 are moderate Correlations between 0.8 and 1 are strong
Sampling Keywords
POPULATIONS Can be mundane or extraordinary SAMPLE Must be representative INTERNALY VALIDITY OF SAMPLE Sometimes validity is more important than generalizability SELECTION PROCEDURES Random Opportunistic Conscriptive Quota
Sampling Keywords
THEORETICAL Developing, exploring, and testing ideas EMPIRICAL Based on observations and measurements of reality NOMOTHETIC Rules pertaining to the general case (nomos - Greek) PROBABILISTIC Based on probabilities CAUSAL How causes (treatments) effect the outcomes
Clinical Research
Types of clinical research Experimental vs. Observational Longitudinal vs. Cross-sectional Prospective vs. Retrospective Experimental Longitudinal Prospective Observational Longitudinal Prospective
Cross-sectional
Retrospective Survey
Experimental Designs
Between subjects studies
Treatment group Outcome measured patients Control group Outcome measured
Observational studies
Cohort (prospective)
coho rt prospectively measure risk factors
Case-Control (retrospective)
start point measured aetiology odds ratios prevalence developmen
P 7.5
0.04
Randomisation Bias
RCT Deficiencies
Trials too small Trials too short Poor quality Poorly presented Address wrong question Methodological inadequacies Inadequate measures of quality of life (changing) Cost-data poorly presented Ethical neglect Patients given limited understanding Poor trial management Politics Marketeering Why still the dominant model?
If you or anyone you know has been affected by any of the issues covered in this lecture, you may need a statisticians help:
www.statistics.gov.uk
Further Reading
Abbott, P., & Sapsford, R.J. (1988). Research methods for nurses and the caring professions. Buckingham: Open University Press. Altman, D.G. (1991). Designing Research. In D.G. Altman (ed.), Practical Statistics For Medical Research (pp. 74-106). London: Chapman and Hall. Bland, M. (1995). The design of experiments. In M. Bland (ed.), An introduction to medical statistics (pp525). Oxford: Oxford Medical Publications. Bowling, A. (1994). Measuring Health. Milton Keynes: Open University Press. Daly, L.E., & Bourke, G.J. (2000). Epidemiological and clinical research methods. In L.E. Daly & G.J. Bourke (eds.), Interpretation and uses of medical statistics
Further Reading
Jackson, C.A. (2002). Planning Health and Safety Research Projects. Health and Safety at Work Special Report 62, (pp 1-16). Jackson, C.A. (2003). Analyzing Statistical Data in Occupational Health Research. Management of Health Risks Special Report 81, (pp. 2-8). Kumar, R. (1999). Research Methodology: step guide for beginners. London: Sage. a step by
Polit, D., & Hungler, B. (2003). Nursing research: Principles and methods (7th ed.). Philadelphia: Lippincott, Williams & Wilkins.