Professional Documents
Culture Documents
All plots in Mondrian are fully linked, and offer various interactions and queries. Any case
selected in a plot in Mondrian is highlighted in all other plots.
Currently implemented plots comprise Mosaic Plot, Scatterplots and SPLOM, Maps,
Barcharts, Histograms, Missing Value Plot, Parallel Coordinates/Boxplots and Boxplots
y by x.
Mondrian works with data in standard tab-delimited ASCII files. There is basic support for
working directly on data in Databases (please contact me for further info).
Mondrian is written in JAVA and is distributed as native application (wrapper) for MacOS X and
Windows. Linux users need to start the jar-file. The latest version can be downloaded here.
For questions please report to mondrian@theusRus.de. Bugs may be submitted to the bug-
tracker.
News:
(09/12/09) MacOSX 10.6 aka "Snow Leopard" is shipped with JAVA 6 only, which is still
a bit stubborn and refuses to use anti aliasing as it was found in previous versions.
Until we found a fix and released the next version of Mondrian, please take a look a
this guide on how to install JAVA 5 under Snow Leopard.
(09/10/09) The slides which go with the book "Interactive Graphics for Data Analysis –
Principles and Examples" can now be found on the web.
They should be a fruitful resource for everybody who wants to learn how to use
Mondrian.
(04/07/09) The JSM program is online now. Simon and I are giving a full day, hands on
CE-course titled "Comprehensive Data Analysis Using Interactive Statistical Graphics" at
this years JSM in Washington, DC. The course will be interesting for everyone who wants
to get deeper insights into data by using (interactive) graphics or wants to use Mondrian
most efficiently.
(05/24/07) we have a Bugzilla based bug-tracker online. Please submit all bug reports
and requests there - feel free to e-mail after submission as well ...
Plots
Mosaic Plot Mosaic plots in Mondrian are fully interactive. Interactivity includes the standard operation as
described in the convention section (except zooming), plus the reordering of the variables in
the plot by using the 4 arrow keys. Use <meta>-r to rotate the direction along which the last
variable in the plot is split.
Use <meta>-+ and <meta>-- to add and delete interactions during the modeling process using
the ModelNavigator. During the model process one may want to use the plot option to show
the expected values of the model and not the observed.
The picture below shows an example of the Titanic dataset, which includes information on the
class (1,2,3 crew) age (child, adult) and gender (male, female) of the passengers. Surviving
passengers are highlight.
Although there are no labels to decode the cells, the order of the variables is given in the title
of the window. Using the interactive interrogation it is very easy to query the cells. In this static
representation the knowledge of the fact that there are no children and hardly any women in
the crew, should be sufficient to decode the plot.
Example of a rotated Mosaic plot, i.e. first variable is split along y not x!
Additionally Mondrian features 4 variations of mosaic plots. The Figures below show the same
data from the cars data set, in all five possible variations. Use the pop-up menu for the plot
options:
Observed values Expected values
(according to current model)
Barcharts Barcharts in Mondrian follow a vertical layout, not a horizontal layout. Thus the level-names
can be printed in full length.
Besides standard selection and interrogation techniques, interactivity in a barchart comprises:
Since barcharts are scrollable, they may hold any arbitrary number of categories
A barchart for the dataset on the Titanic passengers. First class passenger are highlighted.
When a barchart has very many categories, it can be pretty painful to search particular items,
especially if the barchart is sorted to something else than lexicographical order. As used from
lists in many applications, you may now type a prefix of the item you are looking for, and
Mondrian will automatically scroll to that item.
(The same behavior can be found in the variables window - plus additional selection of the
hits)
Missing If the dataset has missing values, a missing value plot can be used to analyze the structure of
the missing values (monotone missingness etc.).
Value Plot
The options of the missing value plot are similar to those of a barchart (sorting etc.)
Maps Whenever a dataset provides information on polygons, Mondrian can draw interactive maps of
this geographical reference.
A corresponding data record must be provided for each polygon defined in the dataset.
Different polygons might point to the same data record, but multiple records to a set of
polygons are ignored.
Maps offer the standard selection and interrogation techniques. Additionally the standard
zooming function of Mondrian is enabled.
All maps have a popup-menu at the top to create a choropleth map of any of the variables;
including alphanumerical variables.
Further options include:
Further options include:
Color schemes
"white to black"
"red"
"green"
"blue"
"blue to red"
"blue to white to red"
if R and Rserve are installed
"heat"
"terrain"
"topo"
Invert color scheme
Assign color linearly, normalized, or by rank
Limit minimum or maximum to a specific value
Six choropleth maps of the five Midwest states, shaded according to educational status
The saturation (more precise the alpha) of boundaries can now be changed with the right
arrow key (more saturation) and left arror key (less saturation).
This can change the perception of the map drastically, so make sure to test it out!
Parallel Mondrian implements parallel coordinate plots for any arbitrary number of variables.
Alphanumerical categorical variables are displayed equally spaced according to the currently
Coordinates defined order. Numerical variables are scaled according to their actual numbers.
HotSelection only shows the currently selected points in PCPs and PBPs
Crop Selection removes the currently selected lines from the PCP. Subsequent crop
operations allow to peel a data set. (does only work for lines)
Selections can only be performed on the axis. If a plot with many variables extends the screen,
use meta-0 to fit to screen size.
A parallel coordinate plot for the olive oil data.
The "Sort Axes by" menu offers various ways to sort axes automatically:
The align menu offers means to
align/center the axes according to:
Boxplots (Parallel) Boxplots y by x only include a single variable, split by a second categorical variable.
To invoke a boxplot y by x select the continuous variable to plot and the categorical variable to
split by and select 'boxplots y by x' in the 'Plots' menu.
Manual reordering of the classes is only possible by reordering the levels in a corresponding
barchart.
The context menu offers to automatically sort the levels by either median or IQ-range (and to
reverse a current ordering.)
Parallel boxplots for the cars data set - heavy cars selected.
Scatterplots Scatterplots offer the basic interactions. Data can be selected and highlighted.
In contrast to most other plots in Mondrian, scatterplots offer axis, showing the maximum and
minimum as orientation.
Interrogation methods inside scatterplots operate on three levels. The first level is a simple
overview of the position of the cursor, which is displayed as projection onto the x- and y-axis.
This interrogation is invoked by simply pressing the altenate key. A pressing control invokes
the second level of interrogation. A tooltip is presented with the data of the variables in the plot
closest to the curser. If more than one point is found at the same distance, a list of the cases is
presented in the pop-up. Holding shift and control shows the information of the selected
variables in the variables list.
orientation objects extended
Example:
!o"!#%%%%%%%$/!%tt&$//www$!&&#!'%!sto !$co!/!!!g!s/!o"!#s/)I!!g!$%%$*!u$c%+"!t!+$$$
,u!"