Professional Documents
Culture Documents
Programming Tools: Adventures With R: Toolbox
Programming Tools: Adventures With R: Toolbox
PROGRAMMING TOOLS:
ADVENTURES WITH R
A guide to the popular, free statistics and visualization software
that gives scientists control of their own data analysis.
ILLUSTRATION BY THE PROJECT TWINS
B Y S Y LV I A T I P P M A N N with her data-processing demands. Besides being free, R is popular partly because
F
With the results of her first genomic sequenc- it presents different faces to different users. It is,
or years, geneticist Helene Royo used ing experiments in hand at the start of a new first and foremost, a programming language —
commercial software to analyse her work. postdoc, Royo had a choice: pass the sequences requiring input through a command line, which
She would extract DNA from the devel- over to the experts or learn to analyse the data may seem forbidding to non-coders. But begin-
oping sperm cells of mice, send it for analysis herself. She took the plunge, and began learning ners can surf over the complexities and call up
and then fire up a package called GeneSpring how to parse data in the free, open-source soft- preset software packages, which come ready-
to study the results. “As a scientist, I wanted to ware package R. It helped that the centre she had made with commands for statistical analysis
understand everything I was doing,” she says. joined — the Friedrich Miescher Institute for and data visualization. These packages create a
“But this kind of analysis didn’t allow that: I just Biomedical Research in Basel, Switzerland — welcoming middle ground between the com-
pressed buttons and got answers.” And as Royo’s ran regular courses on the software. But she was fort of commercial ‘black-box’ solutions and the
studies comparing genetic activity on different also following a wider trend: for many academ- expert world of code. “R made it very easy,” says
chromosomes became more involved, she real- ics seeking to wean themselves off commercial Rojo. “It did everything for me.”
ized that the commercial tool could not keep up software, R is the data-analysis tool of choice. That, indeed, is what R’s developers
1 JA N UA RY 2 0 1 5 | VO L 5 1 7 | N AT U R E | 1 0 9
© 2015 Macmillan Publishers Limited. All rights reserved
TOOLBOX
intended when they designed it in the 1990s. bioinformatics group, she took about half a year
1 1 0 | N AT U R E | VO L 5 1 7 | 1 JA N UA RY 2 0 1 5
© 2015 Macmillan Publishers Limited. All rights reserved