Error Bars
What they tell you and what they don't
Jody Culham fMRI Journal Club May 29, 2006
Why Graphs and Error Bars?
"The picturing of data allows us to be sensitive not only to the multiple hypotheses that we hold, but to the many more we have not yet thought of, regard as unlikely, or think impossible." -- Tukey, 1974 (as quoted in Loftus & Masson, 1994)
Popularity of Error Bars
growing use
often expected by reviewers, readers, and listeners at talks
APA recommends them (but doesn't seem to get them)
stat and graphics packages make them easy to add
What Do Scientists Think Error Bars Tell Them?
Add a second data point with equally-sized error bars such that the difference between the two groups is just barely statistically significant (by a two-tailed t-test, p < .05)
Most scientists follow the "just abutting" rule regardless of what the error bars represent
[Figure: bar graph of activation (% BSC) for Group 1 and Group 2, with error bars]
Example: Activation in FFA to faces (with fixation as a baseline) for males vs. females
What Do Scientists Think Error Bars Tell Them?
Belia et al., 2005, Psychological Methods
Recruited 3,944 authors of papers in high-impact journals in Psychology, Behavioral Neuroscience, and Medicine and gave them that test online (15% return rate)
Also quantified use of error bars in publications in those fields
Psychologists don't use error bars much
Neuroscientists usually use SE error bars in figures
Medical researchers report 95% CIs in text
What Scientists Think SE Bars Tell Them
[Figure: respondents' placements for SE bars in Psychology, Behavioral Neuroscience, and Medicine, marked relative to correct responses, p < .025, p < .10, and the abutting rule of thumb (Belia et al., 2005)]
What Scientists Think 95%CI Bars Tell Them
[Figure: respondents' placements for 95% CI bars in Psychology, Behavioral Neuroscience, and Medicine, marked relative to correct responses and the abutting rule of thumb (Belia et al., 2005)]
What Do Error Bars Tell You?
It depends on what type of error bars you use
standard deviation (SD)
standard error (SE)
95% confidence interval (95% CI)
It depends on what your questions are
What is the likely mean of a population?
Error bars, particularly 95% CIs, are valuable
Are two groups significantly different?
Error bars can be informative if you know how to interpret them
Are two conditions tested in the same subjects significantly different?
Conventional error bars are completely uninformative
Computation of Standard Deviation
variance (s2) = spread around the mean = SUM((x - mean)^2) / (n - 1)
standard deviation = s = SQRT(s2)
~average distance of points from the mean (with outliers weighted heavily because of squaring)
adding more subjects doesn't change the SD, just the height of the distribution!
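To make the arithmetic concrete, here is a minimal sketch (not part of the original slides; the IQ values are invented purely for illustration) of computing the sample variance and SD:

```python
import numpy as np

# Invented sample of IQ scores, purely for illustration
iq = np.array([85, 92, 110, 121, 99, 104, 88, 130, 95, 116])

mean = iq.mean()
# sample variance: sum of squared deviations from the mean, divided by n - 1
variance = np.sum((iq - mean) ** 2) / (len(iq) - 1)
sd = np.sqrt(variance)                  # standard deviation = SQRT(variance)

print(mean, variance, sd)
print(np.isclose(sd, iq.std(ddof=1)))   # matches numpy's built-in sample SD
```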
What Does Standard Deviation Tell You?
[Figure: bar graph of IQ scores for blondes and brunettes, error bars = ±1 SD (SD = ±15)]
Disclaimer: This completely made-up and facetious example was chosen simply as an illustration, using IQ as the dependent measure because most students with Psychology training understand IQ scores. The example is not meant to disparage blondes.
SDs estimate the variability around the mean
Based on the z distribution, 95% of scores lie within ±1.96 SDs
Most blondes have an IQ between ~70 and 130
Most brunettes have an IQ between ~80 and 140
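A quick worked check of the ±1.96 SD claim, assuming the made-up group means implied by the slides (blondes ≈ 100, brunettes ≈ 110) and SD = 15:

```latex
\begin{aligned}
\text{Blondes:}  \quad & 100 \pm 1.96 \times 15 \approx [70.6,\ 129.4] \\
\text{Brunettes:}\quad & 110 \pm 1.96 \times 15 \approx [80.6,\ 139.4]
\end{aligned}
```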
Computation of Standard Error
Standard Error of the Mean: estimate of the certainty of your mean
if you were to take a large number of samples from your population, all of sample size n, how variable would your estimate of the population mean be?
SE is the SD of your estimate of the mean
SE = SD/SQRT(N)
the more subjects you have, the more certain your mean will be
the less variable your population, the more certain your mean will be
If we have 10 per group and SD = 15, SE = 15/SQRT(10) = 4.7
If we have 100 per group and SD = 15, SE = 15/SQRT(100) = 1.5
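A minimal sketch (added here, not from the slides) that reproduces those two SE values:

```python
import numpy as np

sd = 15.0
for n in (10, 100):
    se = sd / np.sqrt(n)                 # SE = SD / SQRT(N)
    print(f"n = {n:3d}: SE = {se:.1f}")  # 4.7 for n = 10, 1.5 for n = 100
```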
What Does Standard Error Tell You?
[Figure: same bar graph of IQ scores for blondes and brunettes, error bars = ±1 SE; SD = 15, n = 10, so SE = ±4.7]
SEs are not wildly useful on their own, but they are useful in calculating confidence intervals
Why do people use them then? They're the smallest!
Computation of 95% Confidence Intervals
There's a 95% probability that the interval contains the true mean
95% CI = SE * t(df)
If n = 10, t(9) = 2.26, so 95% CI = 4.7 * 2.26 = 10.6
If n = 100, t(99) ≈ 1.96, so 95% CI = 1.5 * 1.96 = 2.94
[Figure: distribution with cutoffs at ±1.96: one-tailed p < .025, two-tailed p < .05]
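A hedged check of that arithmetic using scipy's t quantiles (my own sketch; the results differ slightly from the slide's 10.6 and 2.94 because the slide rounds SE and t before multiplying):

```python
from scipy import stats

sd = 15.0
for n in (10, 100):
    se = sd / n ** 0.5
    t_crit = stats.t.ppf(0.975, df=n - 1)  # two-tailed 95% critical t value
    half_width = t_crit * se               # 95% CI = SE * t(df)
    print(f"n = {n:3d}: t = {t_crit:.2f}, 95% CI = +/- {half_width:.1f}")
# n =  10: t = 2.26, 95% CI = +/- 10.7
# n = 100: t = 1.98, 95% CI = +/- 3.0
```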
What Do 95% Confidence Intervals Tell You?
[Figure: same bar graph of IQ scores for blondes and brunettes, error bars = ±95% CI; SD = 15, n = 10, SE = 4.7, 95% CI = ±10.6]
95%CIs tell you that the true IQ of brunettes is probably between 99.4 and 120.6. Because this interval includes 100, you cannot say with sufficient certainty that brunettes are smarter than the population average of 100.
SE --> 95% CI
Rule of Thumb: Given the t distribution, 95% CIs are typically 2X to 3X the SE, depending on sample size
Group Differences: 95%CI Rule of Thumb
Error bars can be informative about group differences, but you have to know what to look for
Rule of thumb for 95% CIs:
if the overlap is about half of one one-sided error bar, the difference is significant at ~p < .05
if the error bars just abut, the difference is significant at ~p < .01
works if n >= 10 and the error bars don't differ by more than a factor of 2
Cumming & Finch, 2005
Group Differences: SE Rule of Thumb
Rule of thumb for SEs:
when the gap is about the size of one one-sided error bar, the difference is significant at ~p < .05
when the gap is about the size of two one-sided error bars, the difference is significant at ~p < .01
works if n >= 10 and the error bars don't differ by more than a factor of 2 (a numerical check of both rules follows below)
Cumming & Finch, 2005
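A rough numerical check of both rules of thumb (my own construction, not from Cumming & Finch, assuming two independent groups of n = 10 with equal error bars):

```python
from scipy import stats

n = 10                                   # per group; the rules assume n >= 10
df = 2 * n - 2
se = 1.0                                 # length of a one-sided SE bar (arbitrary units)
ci = stats.t.ppf(0.975, n - 1) * se      # length of a one-sided 95% CI bar

def p_for_difference(d):
    """Two-tailed p for a mean difference d, assuming equal SEs in both groups."""
    t_stat = d / (se * 2 ** 0.5)         # SE of the difference = sqrt(se^2 + se^2)
    return 2 * stats.t.sf(t_stat, df)

# SE rule: gap of one bar -> ~p < .05; gap of two bars -> ~p < .01
print(p_for_difference(2 * se + 1 * se))    # ~0.048
print(p_for_difference(2 * se + 2 * se))    # ~0.011

# 95% CI rule: overlap of half a bar -> ~p < .05; bars just abutting -> ~p < .01
print(p_for_difference(2 * ci - 0.5 * ci))  # ~0.027
print(p_for_difference(2 * ci))             # ~0.005
```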
What About Within-Subjects Designs?
error bars can tell you about the likely means but not about whether differences between conditions are significant
[Figure: two scenarios (blue and pink): IQ scores for individual brunettes before and after Sun-in, plotted subject by subject]
Differences will be significant if the trend is similar between subjects, even if their initial values are variable
Differences will not be significant if the trend is highly variable
Conventional error bars will look the same for the blue and pink scenarios
Error bars reflect variability between subjects, not consistency of the trend
What About Within-Subjects Designs?
difference scores and their variability do tell you something
[Figure: drop in IQ score after Sun-in, error bars = ±95% CI; if the CI doesn't include zero --> significant, if it includes zero --> non-significant]
A paired t-test essentially evaluates such difference scores
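A minimal sketch of that equivalence (the before/after IQ values are invented for illustration):

```python
import numpy as np
from scipy import stats

# Made-up within-subject IQ scores before and after Sun-in
before = np.array([108, 115, 102, 120, 111, 98, 105, 117, 109, 113])
after  = np.array([ 96, 106,  87, 112, 100, 84,  95, 110,  96, 104])

diff = before - after
se = diff.std(ddof=1) / np.sqrt(len(diff))
t_crit = stats.t.ppf(0.975, len(diff) - 1)
ci = (diff.mean() - t_crit * se, diff.mean() + t_crit * se)
print("95% CI of the difference:", ci)        # excludes zero -> significant

t_stat, p = stats.ttest_rel(before, after)    # paired t-test reaches the same verdict
print(t_stat, p)
```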
How Many Scientists Understand This?
When Belia et al., 2005, asked their scientist subjects to do the error bar adjustment on a within-subjects design, only 11% expressed any doubt that it was a valid thing to do
Error Bars in fMRI Time Courses
Give you a flavour for the noise in the data, but tell you nothing about significance
Even if single points are not different, the averages for two conditions may still be
better to measure peak %BSC (or beta weights) across several points, then average and do a paired t-test on those values
better to do a ROI-GLM contrast in BV
Depends critically on what type of event-related average you do
raw MR signal --> humungous error bars (incl. run-to-run variability)
%BSC file-based --> smaller (incl. all variability in starting point)
%BSC epoch-based --> small
%BSC condition-based --> small (but may be slightly different from epoch-based)
[Figure: event-related average time course (% BSC vs. time) with error bars at each timepoint]
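One hedged way to follow that advice in code (all data and the peak window here are invented; "faces" and "houses" are just placeholder condition names):

```python
import numpy as np
from scipy import stats

# Invented event-related averages: subjects x timepoints, already in %BSC
rng = np.random.default_rng(0)
n_subj, n_tp = 10, 12
faces  = rng.normal(1.0, 0.3, (n_subj, n_tp))
houses = rng.normal(0.6, 0.3, (n_subj, n_tp))

peak_window = slice(4, 8)                          # a few timepoints around the expected peak
faces_peak  = faces[:, peak_window].mean(axis=1)   # one average peak value per subject
houses_peak = houses[:, peak_window].mean(axis=1)

# paired t-test on the per-subject peak %BSC, as the slide suggests
t_stat, p = stats.ttest_rel(faces_peak, houses_peak)
print(t_stat, p)
```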
So What Should You Do?: Between Designs
Don't use error bars?
reviewers may demand them
they can be informative
Use SE but put stat comparisons on graphs?
can make graphs look dense
Use 95% CIs?
reviewers are used to SE and may think that 95% CI bars look too large
can emphasize the 95% CI and its interpretation
e.g., "Error bars are 95% confidence intervals (2.26 x SE for df = 9)."
e.g., "Because the 95% confidence intervals for the beta weights do not include zero, the activation was significantly higher than baseline."
So What Should You Do?: Within Designs
You can show the pattern in single subjects. If it's consistent, the pattern should jump out
If the emphasis is on differences between conditions, you can factor out subject differences
e.g., subtract subject means (a minimal sketch follows below)
[Figure: error bars on raw scores reflect all variability; error bars after subtracting subject means reflect variability due to the condition manipulation only]
Loftus & Masson, 1994
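A minimal sketch of the subject-mean subtraction (my own illustration with invented data, following the spirit of Loftus & Masson, 1994):

```python
import numpy as np

# Invented data: 10 subjects x 3 conditions in a within-subjects design
rng = np.random.default_rng(1)
subject_offsets  = rng.normal(0, 15, size=(10, 1))    # big between-subject differences
condition_effect = np.array([0.0, 3.0, 6.0])          # consistent within-subject trend
data = 100 + subject_offsets + condition_effect + rng.normal(0, 2, size=(10, 3))

# Subtract each subject's mean, then add back the grand mean to keep the original scale
normalized = data - data.mean(axis=1, keepdims=True) + data.mean()

raw_se  = data.std(axis=0, ddof=1) / np.sqrt(data.shape[0])
norm_se = normalized.std(axis=0, ddof=1) / np.sqrt(data.shape[0])
print(raw_se)    # large: includes between-subject variability
print(norm_se)   # much smaller: variability due to the condition manipulation only
```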
So What Should You Do?: Within Designs
You can compute more meaningful error bars based on error terms from the ANOVA
See Loftus & Masson, 1994, Psychonomic Bulletin & Review
Loftus & Masson, 1994
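For a single-factor within-subjects design, my understanding of the Loftus & Masson interval (worth checking against the paper) is that it uses the subject-by-condition interaction mean square as the error term:

```latex
\text{95\% CI} \;=\; \bar{Y}_j \;\pm\; t_{(n-1)(k-1)} \sqrt{\frac{MS_{S \times C}}{n}}
```

where n is the number of subjects, k the number of conditions, and MS_{S×C} the ANOVA error term.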
Caveats
assumes homogeneous variance
if variance is not homogeneous, different comparisons may require different error bars
Masson & Loftus, 2003
Options for Multifactorial Within Designs
show difference scores and their errors
can use slopes for parametric designs with ~linear trends
Loftus & Masson, 1994
Options for Multifactorial Within Designs
You can plot effect sizes and their error bars
See Masson & Loftus, 2003, Canadian Journal of Experimental Psychology
Masson & Loftus, 2003
Bibliography
Belia, S., Fidler, F., Williams, J., & Cumming, G. (2005). Researchers misunderstand confidence intervals and standard error bars. Psychological Methods, 10(4), 389-396. [summarizes scientists' (usually incorrect) intuitions]
Cumming, G., & Finch, S. (2005). Inference by eye: Confidence intervals and how to read pictures of data. American Psychologist, 60(2), 170-180. [good summary of rules of thumb and caveats]
Loftus, G. R., & Masson, M. E. (1994). Using confidence intervals in within-subject designs. Psychonomic Bulletin & Review, 1(4), 476-490. [original proposal for simple within-subjects error bars]
Masson, M. E. J., & Loftus, G. R. (2003). Using confidence intervals for graphically based data interpretation. Canadian Journal of Experimental Psychology, 57(3), 203-220. [reiterates Loftus & Masson, 1994, and expands on error terms for complex within-subjects designs]