You are on page 1of 5

11/14/22, 11:28 AM The big data debate | Science

ADVERTISEMENT

HOME  SCIENCE  VOL. 346, NO. 6211  THE BIG DATA DEBATE

 BOOK REVIEW BIG DATA


 󡀆 
The big data debate
FRED H. CATE Authors Info & Affiliations

SCIENCE 14 Nov 2014 Vol 346, Issue 6211 p. 818 DOI: 10.1126/science.1261092

137
   

“Big data”—the collection, aggregation or federation, and analysis of vast amounts of increasingly granu-
lar data—present serious challenges not only to personal privacy but also to the tools we use to protect it.
Privacy, Big Data, and the Public Good focuses valuable attention on two of these tools: notice and con-
sent, and de-identification—the process of preventing a person's identity from being linked to specific
data. The book presents a collection of essays from a variety of perspectives, in chapters by some of the
heavy hitters in the privacy debate, who make a convincing case that the current framework for dealing
with consumer privacy does not adequately address issues posed by big data.

https://www.science.org/doi/10.1126/science.1261092 1/5
11/14/22, 11:28 AM The big data debate | Science

PHOTO: CHAD BAKER AND RYAN MCVAY/THINKSTOCK

In 1998, the U.S. Federal Trade Commission identified notice and consent as the “most fundamental”
principles for protecting privacy (1). In 2012, the U.S. Consumer Privacy Bill of Rights included as its first
principle: “Consumers have a right to exercise control over what personal data companies collect from
them and how they use it” (2). The pending draft of the European Union Data Protection Regulation
refers to “consent” more than 100 times (3). Regulators, it seems, are entranced by notice and consent. It
is perhaps not surprising, then, that we are inundated with privacy notices and demands that we click “I
agree” when we visit banks, doctors, Web sites, app stores, and the like. Yet, as Solon Barocas and Helen

Nissenbaum point out in Chapter 2, there is “overwhelming evidence that most of us neither read nor un-
derstand them.”

The challenge presented by big data lies not just in the size of the data sets or the ubiquity with which
data are collected but also, as Paul Ohm points out in Chapter 4, in the fact that big data “thrives on sur-
prising correlations and produces inferences and predictions that defy human understanding.” One can
readily agree when he asks, “How can you provide notice about the unpredictable and unexplainable?”

In a May 2014 report (4), the U.S. President's Council of Advisors on Science and Technology described
the framework of notice and consent as “unworkable as a useful foundation for policy.” The report goes
on to add that “only in some fantasy world do users actually read these notices and understand their im-
plications before clicking to indicate their consent.”

The authors of Privacy, Big Data, and the Public Good are generally unwilling to go so far. Despite pages of
criticism, they advocate for tweaking the current system so that notice and choice are used more effec-
tively. In Chapter 2, for example, Barocas and Nissenbaum propose that notice and consent be required in
cases when the proposed use of the data departs from the context in which the data were collected.
Alternatively, in Chapter 4, Ohm proposes that groups similar to institutional review boards might de-
velop ethical norms that would determine when consent was necessary.

The book's critique of de-identification is equally compelling. Currently, de-identified data are exempt
from most privacy laws. Yet throughout the book, the authors point to a number of well-publicized exam-
ples in which de-identified data sets have been released, only to have the data re-identified within days.
In short, the more data are available, the harder it is to de-identify them effectively. Or, as Cynthia Dwork
succinctly sums up the situation in Chapter 14: “‘Deidentified data’ isn't.”

On this subject, the authors offer an array of responses, ranging from Dwork's “differential privacy” ap-
proach, which uses statistical analyses to assess privacy risks presented by repeated queries of the same
data set, to Ohm's recommendation that deidentified data should no longer be exempt from privacy laws.

There is a third aspect of the book that warrants special mention—a series of chapters addressing the
uses of big data in the context of cities, urban planning, and research. These areas are not often discussed
in the context of big data, and although they make the book feel a bit eclectic, they address new and in-
teresting issues that clearly merit attention.

As society becomes more “datafied” (5)—a term coined to describe the digital quantification of our exis-
tence—our privacy is ever more at risk, especially if we continue to rely on the tools that we employ today
to protect it. Privacy, Big Data, and the Public Good represents a useful and approachable introduction to
these important issues.

References and Notes


1 U.S. Federal Trade Commission, Privacy Online: A Report to Congress (FTC, Washington, DC, 1998).

GOOGLE SCHOLAR
https://www.science.org/doi/10.1126/science.1261092 2/5
11/14/22, 11:28 AM The big data debate | Science

2 The White House, Consumer Data Privacy in a Networked World: A Framework for Protecting Privacy and Promoting Innovation
(White House, Washington, DC, 2012).

GOOGLE SCHOLAR

3 Regulation of the European Parliament and of the Council on the Protection of Individuals with Regard to the Processing of Personal
Data and on the Free Movement of Such Data (General Data Protection Regulation), Unofficial Consolidated Draft Text, 22 October
2013.

4 President's Council of Advisors on Science and Technology, Big Data and Privacy: A Technological Perspective (PCAST,
Washington, DC, 2014).

GOOGLE SCHOLAR

5 Mayer-Schönberger V., Cukier K., Big Data: A Revolution That Will Transform How We Live, Work, and Think (Houghton Mifflin,
Boston, 2013).

GOOGLE SCHOLAR

Books
1 Privacy, Big Data, and the Public Good Frameworks for EngagementJulia Lane, Victoria Stodden, Stefan Bender, and Helen
Nissenbaum, Eds. Cambridge University Press, 2014. 342 pp.

eLetters (0)

eLetters is an online forum for ongoing peer review. Submission of eLetters are open to all. eLetters are not edited,
proofread, or indexed. Please read our Terms of Service before submitting your own eLetter.

LOG IN TO SUBMIT A RESPONSE

No eLetters have been published for this article yet.

ADVERTISEMENT

CURRENT ISSUE
Subnational implications from climate and air pollution poli-
cies in India’s electricity sector
BY
SHAYAK SENGUPTA,
PETER J. ADAMS,
ET AL.

Seventy years of tunas, billfishes, and sharks as sentinels of


global ocean health
BY
MARIA JOSÉ JUAN-JORDÁ,
HILARIO MURUA,
ET AL.

The discovery and scientific potential of fast radio bursts


BY
MATTHEW BAILES

TABLE OF CONTENTS

ADVERTISEMENT

LATEST NEWS
SCIENCEINSIDER 11 NOV 2022 SCIENCEINSIDER 11 NOV 2022
Research spending could be lone bright Booming trade in mammoth ivory may
spot for U.S. science after election sets be bad news for elephants
up divided government

NEWS 11 NOV 2022 SCIENCEINSIDER 10 NOV 2022

How racism skewed estimates of heart ‘I’m nauseated’: Alzheimer’s whistle-


disease in women blower finds possible misconduct by his
mentor in their papers

https://www.science.org/doi/10.1126/science.1261092 3/5
11/14/22, 11:28 AM The big data debate | Science

NEWS 10 NOV 2022 NEWS 10 NOV 2022


News at a glance: A new antibiotic, 100 years after his birth, Kurt Vonnegut
COVID-19 in Antarctica, and a Venus is more relevant than ever to science
mission deferred

ADVERTISEMENT

View full text • Download PDF

FOLLOW US

     󡀆

NEWS CAREERS COMMENTARY JOURNALS AUTHORS & REVIEWERS

All News Careers Articles Opinion Science Information for Authors

ScienceInsider Find Jobs Analysis Science Advances Information for Reviewers

News Features Employer Profiles Blogs Science Immunology

Subscribe to News from Science Science Robotics

News from Science FAQ Science Signaling

About News from Science Science Translational Medicine

Science Partner Journals

LIBRARIANS ADVERTISERS RELATED SITES ABOUT US HELP

Manage Your Institutional Advertising Kits AAAS.org Leadership FAQs


Subscription Custom Publishing Info AAAS Communities Work at AAAS Access and Subscriptions
Library Admin Portal Post a Job EurekAlert! Prizes and Awards Order a Single Issue
Request a Quote Science in the Classroom Reprints and Permissions
Librarian FAQs TOC Alerts and RSS Feeds

Contact Us

© 2022 American Association for the Advancement of Science. All rights reserved. AAAS is a partner of HINARI, AGORA, OARE, CHORUS, CLOCKSS, CrossRef and COUNTER. Science ISSN
0036-8075
https://www.science.org/doi/10.1126/science.1261092 4/5
11/14/22, 11:28 AM The big data debate | Science
0036 8075.

Terms of Service
|
Privacy Policy
|
Accessibility

https://www.science.org/doi/10.1126/science.1261092 5/5

You might also like