You are on page 1of 44

#ACRLDataShop

THE DATA ‘SHOP
Gangway for a Crash Course
in Data Visualization!
About Us
Peace Ossom Williamson
Director, Research Data Services

Heather Scalf
Director of Assessment

Leni Matthews
User Experience Librarian
3
Main Goal

Organize, clean, and visualize

Image by Patch, J. https://www.behance.net/gallery/10407529/Watermen-Photojournalism
Agenda
Introduction
Best Practices (Excel)
Data cleaning (OpenRefine)
Interactive Vizzes (Tableau)
Text Analysis (Voyant)
Closing / Q&A
6
#ACRLDataShop

How to #dataviz
1. Determine what question you’re
trying to answer.
2. Find/collect the data you need.
3. Clean!
4. Wrangle!
5. Do it many more times.
Preattentive Attributes
Quick Color Convo
Preattentive Processing
6970425934749
5872832949565
5928104523712
J reads and researches extensively, staying
on top of current developments that might
impact the field. J ingeniously puts the
resources and tools available to maximum
use and demonstrates a high level of
competency in the skills and knowledge
competency in the skills and knowledge
required. J learns and applies new skills
quickly.
Using Color

Color used sparingly grabs
attention
Using Color

Color can carry quantitative
value
Gestalt Principles
Gestalt Principles
of Visual Perception
Proximity
Gestalt Principles
of Visual Perception
Gestalt Principles
of Visual Perception
Gestalt Principles
of Visual Perception
Choosing Charts
Column Charts
Show comparison, using a
nominal or ordinal variable
and an interval or
ratio variable.

Best Practice

• One color per variable
• Arrange by pattern
or chronologically
Bar Charts
Show comparison, using a
How Nursing Questions are Received
nominal/ordinal variable & Chat is the primary method, followed by emails. Together, they
make up 70% of incoming nursing questions.
an interval/ratio variable.
Chat 409

Method of Questions
Email 202
Drop-In 102
Best Practice Appointment 61
Phone 58
Blank 36

• Avoid clutter Ask a Librarian
In-Person
11
5

• Can use to show negative 0 100 200 300 400 500
Number of questions per year?
numbers
Color used sparingly grabs attention
Line Charts
Show trends using a variable
of any type with an interval or
ratio variable.

Best Practice

• Only use solid lines
• Don’t use more than 4 lines
in one chart
• Make height so that lines
take up roughly 2/3 of chart Gestalt principle continuation
height
Scatter/Dot Plots
Scatter Plot

Show comparison, using a
nominal or ordinal variable
and an interval or
ratio variable.

Gestalt principle
common movement
Scatter/Dot Plots
Best Practice
• Use of distinct points
and call-outs
• Use of color to guide
where to look
Pie Charts Answering Nursing Questions
Show categories’ relationship Most nursing questions are not reaching the nursing team,
as we are answering fewer than 50% recorded.
to a whole.

Kaeli
28%

Best Practice
Others
52%
• Use to display one very large or very
Peace
small category. 13%

• Don’t use if too many Lydia 6%
categories. Heather 1%

• Order slices according to size.
Choosing Charts
Hourly Question Frequency
120 Chat Questions
100
80
60
40

•Stacked, clustered, 20
0
12 2 4 6 8 10 12 2 4 6 8 10

fragmented. Oh my! am pm

•Area charts Daily Question Frequency

•Heat maps Questions come most frequently in the middle of the
week, but there are many questions that come
weekends.

•Tree maps 200

2016 Questions
150
100

•Donuts & pies 50
0
Sun Mon Tue Wed Thu Fri Sat
Total Questions 35 138 174 185 158 145 50
Chat Questions 11 68 80 73 81 75 20
28
Google Fusion Tables
• For beginners
• For mapping
• Online sharing and
collaborating
Tableau #ACRLDataShop

• Point & Click
• Online interactivity
• Popular
Others
• iCharts
• CartoDB
• Cytoscape
• Gephi
• Google Insights
• Open Heatmap
• Plotly
• Infogram
• Piktochart
• Wordle/Voyant
• Canva
32
Excel Activity 1
Being Arrested is Deadlier for African-Americans
Deaths per 100,000 arrests by race in the U.S., 2003-2009
AFRICAN-AMERICAN 3.4
Homicide
WHITE 1.8
0.8
Intoxication
0.3
0.4
Unknown
0.1
0.4
Accident
0.1
0.3
Suicide
0.5
Natural 0.3 #ACRLDataShop
Causes 0.2
Excel Activity 2
Workshop Attendance
100
90
80
70
60
50
40
30
20
10
0
2012 2013 2014 2015 2016 2017

Data Management Data for Humanities
Intro to SPSS Tableau and ArcGIS
Data Viz for Diff. Media
WORKSHOP ATTENDANCE 2012 2013 2014 2015 2016 2017

Data Management 45 28

92
74
Data for Humanities

Introduction to SPSS 70 61

78
60
Tableau and ArcGIS

72 61
Data Viz Diff. Media
WORKSHOP ATTENDANCE

ALL WORKSHOPS DATA MANAGEMENT DATA FOR HUMANITIES
100
90 78
80
70
60
50
40
30 28
20
10
0
2012 2013 2014 2015 2016 2017

INTRODUCTION TO SPSS TABLEAU AND ARCGIS DATA VIZ FOR DIFF MEDIA

78

61
61
Resources
• Presentations for librarians: A complete guide to making
effective learner-centered presentations
book by Lee Hilyer
• Storytelling with data
book and blog by Cole Nussbaumer Knaflic
• Data Visualisation: A Handbook for Data Driven Design
book by Andy Kirk
• Effective Data Visualization: The Right Chart for the Right Data
book by Stephanie D.H. Evergreen
#ACRLDataShop
Agenda
Introduction
Best Practices (Excel)
Data cleaning (OpenRefine)
Interactive Vizzes (Tableau)
Text Analysis (Voyant)
Closing / Q&A
Agenda
Introduction
Best Practices (Excel)
Data cleaning (OpenRefine)
Interactive Vizzes (Tableau)
Text Analysis (Voyant)
Closing / Q&A
41
End

Image by Patch, J. https://www.behance.net/gallery/10407529/Watermen-Photojournalism
Resources
Twitter Accounts to Follow
• @sxywu
• @WSJGraphics and @PostGraphics
• @datachloe

Blogs to Visit
• flowingdata.com
• informationisbeautiful.net
• thedailyviz.com
#ACRLDataShop