
Technometrics

ISSN: (Print) (Online) Journal homepage: https://amstat.tandfonline.com/loi/utch20

R for Political Data Science: A Practical Guide


edited by Francisco Urdinez and Andrés Cruz, Chapman and Hall/CRC, Taylor
& Francis Group, Boca Raton, FL, 2021, ISBN 978-0-367-81889-0, xix + 440 pp.,
209 b/w illustrations, $99.95 (Hardback).

Stan Lipovetsky

To cite this article: Stan Lipovetsky (2021) R for Political Data Science: A Practical Guide,
Technometrics, 63:2, 277-278, DOI: 10.1080/00401706.2021.1904743

To link to this article: https://doi.org/10.1080/00401706.2021.1904743

Published online: 30 Apr 2021.

BOOK REVIEWS 277

mimicking near-concert-hall acoustics in your home, and to a crisp broadcast at the town bandstand. Chapter 11 recalls the first transatlantic cables, installed after several tenacious attempts thanks to the efforts of the American businessman Cyrus Field and the Atlantic Telegraph Company, with the first cable laid from Newfoundland to Ireland. The French laid another cable in 1869–79, and it remained in operation until 1959. Before the cables, all messages across the ocean had to be sent by boat, which took a minimum of 10 days. The telegraph cut this to 1.5 h when working optimally. The expense of this new infrastructure was $20 million; to put this into perspective, the United States purchased Alaska the following year for $7.2 million. Transmitting one word on this cable cost $10, more than a week’s earnings for a typical laborer. Present-day transatlantic calls and web traffic are routed through fiber-optic cables. The first transatlantic fiber-optic cable, TAT-8, was commissioned in 1988, and today the fiber-optic cables that cross the Atlantic deliver 10,000 times more information in one second than the first cables could deliver in a century of operation. Chapter 12 describes the development of beacons and lighthouses from the ancient Greeks to the invention by Augustin Fresnel, a French engineer, who created a compound lens that efficiently focused a large fraction of light from a single source into a concentrated beam. Chapter 13 is devoted to Charles H. Townes, the American inventor of the maser and laser. He won the 1964 Nobel Prize in physics (shared with the Russian scientists N. Basov and A. Prokhorov, although the earlier inventor of the maser and laser on the Soviet side was the physicist Michail Martynovich Vudynsky – S.L.). Chapter 14 presents Prof. Chen Jia’er, the head of Peking University’s Institute of High Energy Physics and of the National Natural Science Foundation of China, and Chapter 15 presents Prof. Jean Tran Thanh Van, from the International Center for Interdisciplinary Science and Education in Vietnam.

Part II, “Mentors and Milestones,” contains topics and personalities from the author’s career in lasers and magnetic fusion research, particle accelerators and materials science, and new energy sources. Chapters 16–27 describe building a ruby laser, a molecular microscope, fusion experiments and devices, wind turbines, research on plasma, experiments on graphene sheets and nanotubes, the building of the LHC in Geneva, Switzerland, the Superconducting Super Collider (SSC) project and its cancelation, the Continuous Electron Beam Accelerator Facility (CEBAF) and Superconducting RF (SRF) technology, free electron laser technologies and the Laser Interferometer Gravitational-Wave Observatory (LIGO), and the detection of black holes and their collisions. In 2015 gravitational waves were first detected, and in 2017 a collision of two neutron stars was observed with the help of new “multi-messenger” technologies in astronomy. One unique observation was the detection of a gamma ray burst nearly coincident with the detection of gravitational waves emanating from the collision, which confirmed that gravitational waves travel at the speed of light, as Einstein predicted. Part III, “Science Policy Matters,” explains in Chapters 28–33 the importance of science to wide audiences, including children and students, the general public, informed decision makers and legislators. The history of the American industrial revolution, science and engineering growth, economic development and return on scientific investments, experiments with massive open online courses (MOOCs), and the role of the U.S. National Science Foundation (NSF) and science museums are discussed. Part IV, “Communicating Science,” in Chapters 34–41 explores the roles of professional scientific journals. Scholars publish nearly three million journal articles a year in English in over 33,000 scholarly journals, and the number of journal titles has grown roughly 3% a year since the first scholarly journals appeared over three and a half centuries ago. The effectiveness of WWW search tools in text and data mining is growing, and it greatly enhances the possibilities of serendipitous discoveries through the accumulation and transfer of new information in contemporary civilization. The problems of unreliable or unethical papers, of the National Digital Public Library (NDPL), and of disseminating scholarly research are also discussed. Part V, “Art and Science,” in Chapters 42–45 is devoted to the presentation of science via natural and artistic artifacts, describing mineral and fossil exhibitions, planetarium and astronomical demonstrations, archeology and art museums, and galleries with wood carving and watercolor painting (see also Lipovetsky and Mandel 2009). Acknowledgments to dozens of people and an Epilogue conclude the book with the author’s thoughts on science and humanity in their fight with the virus pandemic of 2020, the quickly developed tests and vaccines, and the continuing trials and research, with already more than 50,000 journal articles published on the associated topics (see, e.g., Special Issue on Pandemics 2021).

Multiple rare historical archive pictures illustrate the themes, and references to the original sources are given in each chapter. The book presents a fascinating, vivid history of science which can be highly appreciated by any reader interested in learning more about our civilization and its progress, mostly during the last century.

Stan Lipovetsky
Minneapolis, MN

References

Lipovetsky S., and Mandel I. (2009), “How Art Helps to Understand Statistics,” Model Assisted Statistics and Applications, 4, 313–324. [277]
Special Issue on Pandemics. (2021), Model Assisted Statistics and Applications, vol. 16, Amsterdam, The Netherlands: IOS Press. [277]

R for Political Data Science: A Practical Guide, edited by Francisco Urdinez and Andrés Cruz, Chapman and Hall/CRC, Taylor & Francis Group, Boca Raton, FL, 2021, ISBN 978-0-367-81889-0, xix + 440 pp., 209 b/w illustrations, $99.95 (Hardback).

The monograph belongs to The R Series, and presents a reference textbook on the R language together with a semester course on statistics applied to estimation on real political data. The book consists of three parts covering sixteen chapters, written by nearly a dozen authors – mostly specialists on the politics and economics of South American countries.

Part I, “Introduction to R”, contains four chapters. Chapter 1 describes R basics, including installation, RStudio and the console, scripts and objects, vectors and functions, packages and libraries, with multiple examples of code and screenshots.
Chapter 2 presents data management tools with various operations and commands, data transformation and pivoting – all in numerous practical examples. Chapter 3 deals with data visualization with the help of the tidyverse and ggplot2 packages, with multiple examples presented in different kinds of graphs. Chapter 4 focuses on data loading, explaining various dataset formats and importing from other statistical software.

Part II, “Models”, consists of six chapters. Chapter 5 discusses descriptive statistics and scatterplots, distributions and the correlation matrix, simple linear and multiple regressions in ordinary least squares (OLS) modeling, inference and diagnostics – in various examples and R implementations. Chapter 6 considers typical outliers and influential cases. Chapter 7 continues with panel data, lags and leads, time-series models, fixed and random effects, testing for unit roots and robust errors. Chapter 8 presents logistic regression for a binary outcome built as a generalized linear model (GLM), with visual representation of results, Akaike and Bayesian information criteria (AIC and BIC, respectively) as measures of fit, Receiver Operating Characteristic (ROC) curves and the Area Under the Curve (AUC) score. Chapter 9 presents methods of survival analysis, explaining Cox’s proportional hazards models and their implementation via R packages. Chapter 10, written by Andrew Heiss, a known expert on causal inference, covers directed acyclic graphs (DAGs) and statistical associations, measures of causal effects and do-calculus, drawing DAGs and finding paths and adjustment sets with R, propensity scores, matching, and inverse probability weighting.

Part III, “Applications”, shows how the techniques and R tools given above can be implemented for political data in various problems. Chapter 11 uses official datasets from Brazil and Chile to demonstrate advanced data management, with standardizing codes and merging datasets, missing data imputation and regression modeling performed by R functions. Chapter 12 presents Web mining and Web scraping in R, describing an example with the Organization of American States (OAS) press releases, loading and extracting information from html texts from Google, recipes for custom functions and Application Programming Interfaces (APIs), and searching for specific tweets and downloading data from Twitter. Chapter 13 suggests quantitative analysis of political texts, including Twitter data exploration, most-used hashtags and wordclouds, preprocessing, and the Wordfish algorithm and Structural Topic Models (STM) in diagnostics and analysis of texts. Chapter 14 is devoted to the connections important in politics that are revealed via networks, describing their nodes and links, the adjacency matrix and datasets in graph presentation, measures of centrality and closeness, eigenproblem analysis and applications in R. Chapter 15 continues with Principal Component Analysis (PCA) and dimensionality reduction, applied to an index of trust in democracy in Latin American countries. Chapter 16 concludes with maps and spatial data, demonstrating how to map countries and regions by different variables, and how to apply an index such as Moran’s I for spatial correlation, estimated with R facilities. The book ends with a Bibliography of about 150 sources and a comprehensive Index. Each chapter offers references to recent sources, exercises, and links to numerous websites with data, packages and other R facilities. The book is convenient as a textbook for students, and is equally helpful for researchers and practitioners. The main material in the book consists of R code, which supplies the readers with amazingly useful tools for modeling not only in political science but in the wider area of applied social and other sciences, wherever statistical analysis is required.

Stan Lipovetsky
Minneapolis, MN

Understanding the Analytic Hierarchy Process, by Konrad Kulakowski, Boca Raton, FL: Chapman and Hall/CRC, Taylor & Francis Group, 2021, 262 pp., $130.00 (Hardback), ISBN 978-1-1380-3232-3.

The monograph belongs to the Series in Operations Research, and presents the method and methodology of the Analytic Hierarchy Process (AHP), one of the most popular tools of practical multiple-criteria decision making (MCDM). AHP was proposed by Thomas Saaty in 1977, and since that time it has been developed and applied in numerous works. The book consists of eleven chapters, each divided into subsections; let us describe them in more detail.

Chapter 1, “AHP as a Decision-Making Method,” starts with a general introduction to the AHP, explaining why and for which aims decision-making approaches are needed, and describing how the alternatives are compared under each of several possible criteria and sub-criteria constituting a hierarchical structure. It considers the pairwise comparisons (PC) matrix of the priority ratios elicited from an expert for each pair within a set of alternatives related to each criterion, and the PC among the criteria themselves. The PC quotients are elicited on a ratio scale using values from 9, for the maximum prevalence of one item over another, down in steps of one to 1, for items of equal value. When the local priorities for each PC matrix are found, the averaging of these vectors weighted by the elements of the criteria priority vector defines the global priorities among the alternatives combined over all criteria (with possible weighted averaging over sub-criteria as well). The problem of intransitivity between the matrix elements and of matrix inconsistency is also described, and two classical AHP problems, buying a house and selecting a car, are presented.

Chapter 2, “PC Matrices,” describes the cardinal values of AHP matrices, where the ijth element defines the elicited pairwise value of how many times the ith item is preferred to the jth one, and the jith element, symmetric to it across the matrix diagonal, corresponds to the reciprocal value of how many times the jth item is preferred to the ith one. The diagonal values of such a matrix equal one, since there an item is compared with itself, although they can occasionally differ from 1, for instance, in a blind wine tasting. Sometimes the ordinal values of +1 and -1 are used to indicate the prevalence between the items instead of their cardinal pairwise quotients. Additive PC matrices, matrices with missing elements, and the presentation of PC matrices as directed and undirected graphs are given too. Fuzzy PC matrices for the representation of uncertain preferences, with elements described by triangular and trapezoidal, Gaussian and flatten-bell distributions, are discussed as well.
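
The AHP aggregation summarized above for Chapters 1 and 2 can be illustrated with a minimal Python sketch. It is only an illustration under stated assumptions, not the book's implementation: local priorities are computed here by the row geometric-mean method (one common choice; the review does not say which method the book uses), and all function names and the numeric judgments for two criteria and three alternatives are hypothetical.

```python
from math import prod


def reciprocal_matrix(upper):
    """Build a full pairwise comparison (PC) matrix from upper-triangle judgments.

    upper[(i, j)] with i < j says how many times item i is preferred to item j.
    The diagonal is 1 and element (j, i) is the reciprocal of element (i, j).
    """
    n = 1 + max(j for _, j in upper)
    a = [[1.0] * n for _ in range(n)]
    for (i, j), value in upper.items():
        a[i][j] = float(value)
        a[j][i] = 1.0 / value
    return a


def local_priorities(a):
    """Local priority vector: row geometric means normalized to sum to 1.

    Assumption: the geometric-mean method is used instead of any
    particular method from the book.
    """
    means = [prod(row) ** (1.0 / len(row)) for row in a]
    total = sum(means)
    return [m / total for m in means]


def global_priorities(criteria_weights, local_by_criterion):
    """Average the local priority vectors, weighted by the criteria priorities."""
    n_alternatives = len(local_by_criterion[0])
    return [
        sum(w * local[k] for w, local in zip(criteria_weights, local_by_criterion))
        for k in range(n_alternatives)
    ]


# Hypothetical judgments: criterion 0 is preferred 3 times over criterion 1.
criteria_weights = local_priorities(reciprocal_matrix({(0, 1): 3}))

# Hypothetical judgments for three alternatives under each criterion.
under_criterion_0 = local_priorities(
    reciprocal_matrix({(0, 1): 2, (0, 2): 4, (1, 2): 2})
)
under_criterion_1 = local_priorities(
    reciprocal_matrix({(0, 1): 0.5, (0, 2): 0.5, (1, 2): 1})
)

overall = global_priorities(criteria_weights, [under_criterion_0, under_criterion_1])
print("criteria weights:", criteria_weights)
print("global priorities:", overall)
```

The example judgments are chosen to be fully consistent, so the geometric-mean priorities reproduce the underlying ratios exactly; with inconsistent judgments, different prioritization methods can give somewhat different vectors.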
