Professional Documents
Culture Documents
Visualization
Topics
• Introduction / Definition
• History
• Examples
• Workflow / Pipeline
• Software overview
• Hands-on exercises
• Resources
“Sci vis” versus “Info vis”
• Selected figures
– William Playfair (1821) – line, bar charts, etc.
– Charles Joseph Minard (1869) – Napoleon’s march, etc.
– Jacques Bertin (1967) – “semiology of graphics”
– John Tukey (1977) – “exploratory data analysis”
– Edward Tufte (1983) – statistical graphics standards/practices
• 1985 NSF Workshop on Scientific Visualization
• 1990: S.K.Card, et al. Readings in Information
Visualization: Using Vision to Think
⚫ Network visualization
(vizster)
⚫ Demo
• Demo
• Demo
“Trendalyzer”
⚫ Demo
• Acquire
• Parse
• Filter
• Mine
• Represent
• Refine
• Interact
• Parse
Provide some structure for the data’s meaning and order it into categories.
• Filter/Mine
Filter
Remove all but the data of interest.
Mine
Apply methods from statistics or data
mining as a way to discern patterns or
place the data in mathematical context.
• Demo
[p. 12, Fry, Visualizing Data]
• data visualization techniques can also be illustrated with the Iris Plant
data set (more later).
– Can be obtained from the UCI Machine Learning Repository
http://www.ics.uci.edu/~mlearn/MLRepository.html
– Three flower types (classes):
• Setosa
• Virginica
• Versicolour
– Four (non-class) attributes
• Sepal width and length
• Petal width and length
**currently
df = sns.load_dataset("iris")
df=df.groupby('sepal_length')['sepal_width'].sum().to_frame().reset_index()
#Creating the line chart
plt.plot(df['sepal_length'], df['sepal_width'])
**currently
**currently
**currently
**currently
**currently
**currently
**currently
**currently
**currently
**currently
**currently
**currently
**currently
**currently
**currently
**currently
**currently
**currently
**currently
rnd_data
min 3.57
max 16.97
mean 10.00
std 1.97
**currently
**currently
**currently
**currently
**currently
rnd_data
min 3.57
max 16.97
mean 10.00
std 1.97
**currently
**currently
**currently
**currently
**currently
**currently
**currently
**currently
**currently
**currently
Hands-on using Python
• Venn Diagram;
Venn diagrams are used to see the relationships between two or more sets of
items. It highlights the similarities and differences.
**currently
**currently
**currently
Hands-on using Python
• Tree Maps;
Tree Maps are primarily used to display data that is grouped and nested in a
hierarchical structure and observe the contribution of each component.
**currently
**currently
**currently
⚫ Books
– Visual Complexity, Mapping Patterns of Information , Manuel Lima
– The Visual Display of Quantitative Information, Edward Tufte
– Information Visualization: Beyond the Horizon, Chaomei Chen
– JavaScript: The Definitive Guide, David Flanagan
– Getting Started with D3, Mike Dewar
– Visualizing Data, Ben Fry
– Interactive Data Visualization for the Web, Scott Murray