Professional Documents
Culture Documents
Peter Fleck
December 12, 2009
There is no doubt as to how much the 2008 Obama campaign and Obama’s
presidential administration are embracing the Internet. More than a million people
signed up for campaign text messages on their cell phones; two million joined the
volunteer work; and another 13 million signed up for regular email reports (Melber,
2008). By Election Day, twenty-five percent of Obama voters were linked to him and
each other through one of these networks (Melber, 2008).1 After the election in
1
Not all the network efforts were successful. The idea of announcing his vice-president choice via cell
phone text messaging was a failure in itself (sending ten million cell phone text messages at one time is
just not possible with today’s infrastructure) but a smart marketing move in harvesting all the cell
This is a draft version. It will change and may be removed from ScribD. Please do not
cite without permission.
November, the Obama tech team immediately set up Change.gov to document the
transition to power and to solicit ideas from the public (Jardin, 2008).2
These efforts were savvy organizing moves that utilized the power of the
were a new generation marketing tool that allows citizens to engage in the
But President Obama foresees a much greater potential for the Internet and its
Illinois—he along with Oklahoma Senator Tom Coburn, sponsored the Federal
rapidly in forms that the public can readily find and use” (Obama, 2009). This echoes
his campaign technology and innovation plan, which proposes creating “a transparent
formats to allow citizens to make use of that data to comment, derive value, and take
action in their own communities” (Obama for America, ND). In a move in this
direction, on March 5, 2009 President Obama chose open data proponent Vivek
Fleck: Opening Democracy: Sharing the Raw Ingredients of Policy ©2009 Peter 2
Fleck
This is a draft version. It will change and may be removed from ScribD. Please do not
cite without permission.
Kundra as his Chief Information Officer (CIO) (White House Office of the Press
Secretary, 2009). Kundra had been instrumental in opening up data when he was
Chief Technology Officer (CTO) for the District of Columbia. In Kundra’s first
conference call with the press, he said, “How do we make sure that the government is
about ‘We the People’ and that we engage citizens in terms of how their government
know where money is going throughout the public sector, ensuring that we have the
(O’Brien, 2009).
The push to make federal government data available easily to the public—
open access—is quickening under the Obama administration but it didn’t start there.
The federal level has been grappling with this issue since before 1946 when the
Administrative Procedure Act became law. Before the Internet, disseminating public
and delivering them to the interested parties. This could involve visiting an office and
making a request or using the mail system. The Internet has given us new protocols to
provide almost direct access to data at a lower cost than providing the data through a
“brick-and-mortar” office. It has enabled us to be the clerks that gather the data. The
advantage for us in this scenario is the ability to “timeshift” our tasks to our schedule.
We don’t have to collect the data between nine and five and show up at an office; we
can now do it from 9 p.m. to 5 a.m via the Internet from our homes.
Brito (2007) noted that much publically available data is not available online
and what is usually is not in a “useful format.” He called for the government to
Fleck: Opening Democracy: Sharing the Raw Ingredients of Policy ©2009 Peter 3
Fleck
This is a draft version. It will change and may be removed from ScribD. Please do not
cite without permission.
(Brito, 2007). That premise is at the heart of this paper. Government at all levels must
begin the process of migrating the data that anchors policy to the Internet and provide
open access.
This paper presents the case for providing government data in “universally
accessible” or “structured” formats as the best way to integrate the most citizens in
allows a computer to easily parse and manipulate it.” In most cases this will mean a
CSV4 or XML5 format—directly from a database. Besides being the best way to
deployment, informs the public to a higher degree, and provides a path to deep
open access to government data. I’ll also describe the “ancient” methods of scraping
The twentieth century idea of building more web pages to inform the
electorate will continue to be a necessary and important function but it is not the same
as providing the raw data behind government policy-making in a format that third
parties (researchers, students, journalists, small businesses, citizens) can slice, dice,
and analyze in a thousand ways. Web pages share with printed documents a static
4
CSV stands for “comma separated values.” It is a standard method of exporting and importing
database information in text files that does not rely on any particular software application or operating
system.
5
XML stands for “extensible markup language” and it is a global standard. It is meant to be read by a
machine (computer) and not a human.
Fleck: Opening Democracy: Sharing the Raw Ingredients of Policy ©2009 Peter 4
Fleck
This is a draft version. It will change and may be removed from ScribD. Please do not
cite without permission.
Vivek Kundra, states that “If you look at government, what they've done historically
is they've just put up a website and they'll say this is Agency X. Unfortunately, if you
look at the traffic on those websites and you compare that traffic to a Facebook or a
Craigslist, it just pales in comparison and one of the things we need to start thinking
about is how do we put information in the right context” (O’Brien, 2009). Brito
(2007) says that the government should not try to offer “one best way” to use the data
but instead “allow myriad third parties develop innovative tools that utilize the data.”
According to Clift (2007), using open technologies like XML will “make it
easy to re-use public government data from many sources to create views and
searches that provide insight, understanding, and accountability.” This will give us
government” (Clift, 2007). That last part about building “outside of government” is
important. Open-access activists have already proven that they will build the
interfaces to read the XML data. Brito (2007) states that as long as “information is
available in an open and structured format” developers can “mash” the data in
Scraping Screens
activists, many informative web sites have been constructed using a method known as
web page and find the data in the page that you are looking for. Since web pages for
Fleck: Opening Democracy: Sharing the Raw Ingredients of Policy ©2009 Peter 5
Fleck
This is a draft version. It will change and may be removed from ScribD. Please do not
cite without permission.
the most part are unstructured documents to be read by humans with no underlying
standardized structure that can be easily parsed for data sharing, this is an immense
challenge. Yet incredible sites have been created using this method.6
Holovaty’s scripts must grab data from the police site and determine what is relevant
for his “mashup” web application.8 He can only hope that the Chicago Police web
department doesn’t decide to change something on their site that could require
reprogramming of Holovaty’s scripts. The data on the web pages has no standard
structure of any kind. Imagine trying to blindly extract information from a particular
article in a magazine. How would you know when to turn the page? How would you
know which paragraph has the important information? Then repeat your process on
6
Lincoln Stein (2002) compares screen-scraping to “mediaeval torture.” Brito (2007) calls it “much
like the hand copying of texts by medieval monks” and provides this more detailed description:
“In essence, “screen-scraping” involves calling up the web page that displays the type of data the user
wishes to gather (for example, a senate roll call vote page), identifying the patterns apparent on the
page (such as where the bill title and number are displayed and which boxes correspond to the yeas and
nays), and then writing a computer script that will transfer data found in designated display positions to
the appropriate fields in a database. In many ways this is the digital equivalent of having to scan paper
copies of documents because, while the original may well be electronic in this case, it is the final user
display that is accessed and parsed into meaningful groupings. In short, it is an inefficient and often
inexact method.”
7
The web URL ChicagoCrime.org now redirects to EveryBlock.com, Holovaty’s new web project that
grew out of the crime site. EveryBlock is tracking fifteen cities. In addition to crime statistics, it tracks
building permits, restaurant inspections, news and blog entries about the city or block, and local photos
and restaurant reviews. Funding for EveryBlock is from the Knight News Foundation via the Knight
News Challenge (Holovaty, 2009). On June 30, 2009, Holovaty released the EveryBlock programming
source code as open source allowing any city to establish its own EveryBlock site (Brady, 2009).
8
A mashup is “a web page or application that combines data or functionality from two or more
external sources to create a new service. The term mashup implies easy, fast integration, frequently
using open APIs and data sources to produce results that were not the original reason for producing the
raw source data. An example of a mashup is the use of cartographic data from Google Maps to add
location information to real estate data, thereby creating a new and distinct Web service that was not
originally provided by either source” (Wikipedia, 2009).
Fleck: Opening Democracy: Sharing the Raw Ingredients of Policy ©2009 Peter 6
Fleck
This is a draft version. It will change and may be removed from ScribD. Please do not
cite without permission.
every article in the magazine. It is an almost impossible task but somehow people like
Holovaty have figured out how to do it to web pages via an automated process.
wanted to create a web site for gathering data about the U.S. Congress and putting
that data together in new ways—creating mashups (Tauberer, 2006). The government
does post the data that he wants online but in “scattered locations” (Tauberer, 2006).
For example, “legislation is posted in one place and votes on the very same
various locations and then “normalizes” the data and creates XML files (Tauberer,
2006). All the work that he does including the programming are available publically;
website” (Tauberer, 2006). The site has been referenced in the footnotes for federal
court cases.
their hacked data is offered” back to the community and to the government site “in a
structured and open format” (Brito, 2007). In essence they are giving us what the
government should be providing in the first place. Third parties can now access this
data and create new applications. MAPLight.org, a site that explores the connection
between money and politics, does exactly that by mashing together congressional
Fleck: Opening Democracy: Sharing the Raw Ingredients of Policy ©2009 Peter 7
Fleck
This is a draft version. It will change and may be removed from ScribD. Please do not
cite without permission.
Government data has value and can be sold to private firms. This has been a
long-standing practice within government, which has used this argument as a reason
to restrict access.9 But there are good arguments to distribute the data freely and as
widely as possible. Third party companies can still repackage the data and sell it in
innovative ways. The more widespread the data, the more experimentation and the
Selling data for profit has been a major issue in the domain of geographic
information systems (GIS) on state and local levels. GIS departments are branding
datasets created for public purposes with intellectual property and ownership rights
and “attempting to generate revenue streams from secondary uses of the data being
made by citizens and businesses” (Onsrud, 1998). This negates the idea of this data as
a public good. The bureaucrats often state that they are protecting the “public trust”
by making sure private firms don’t benefit at the expense of the taxpayer (Onsrud,
1998). But treating the information as a public good and distributing it as widely as
possible serves the same purpose by creating a level playing field where anyone can
use the data as he or she sees fit. The government’s loss of a revenue stream can be
made up by third parties freely sharing innovative tools to mine the data. These tools
will also be shared with government. Governments that support open access to
and growth of the tax base through an open government information commons results
in greater economic benefits for the community in the long run than restrictive
9
I discuss some of this later in talking about Carl Malamud’s campaigns to bring the Security and
Exchange Commission’s EDGAR database and the Patent and Trade Office’s patent listings to the
Internet. Both agencies had lucrative deals to sell the information to private companies.
Fleck: Opening Democracy: Sharing the Raw Ingredients of Policy ©2009 Peter 8
Fleck
This is a draft version. It will change and may be removed from ScribD. Please do not
cite without permission.
In looking at three distribution models for GIS data, Lopez (1996) makes a
convincing argument for non-proprietary information policies and free and open
access of GIS data.10 First, it keeps government out of the business of creating value-
added products and services demanded by end-users and thus minimize costs as
agencies simply “…disseminate the raw data sets in more or less the same condition
that they use it internally” (Lopez, 1996). Second is that the incremental cost of
networked (Lopez, 1996). Finally, government agencies should be aware of the tax
contributions from the private sector and economic growth. Releasing GIS data to the
and this is consistent “with the near-universal goals of encouraging local economic
Legislature, Malamud (2008) looked at the issue of copyrighting and selling public
data. He points to analysis by William Patry, an expert on copyright and the author of
the 7-volume treatise Patry on Copyright, that the law does not support copyrighting
public information although it is still common at the state and local level. Oregon
publishes the Oregon Revised Statutes and sells the license for $30,000. Malamud
argues that Oregon can continue to maintain the revenue stream from selling the
$30,000 version of the statutes but can also distribute the data to the public.
Corporations and law firms will purchase the $30,000 version while students and
10
Besides the open access model, Lopez (1996) looks at cost-recovery scenarios where agencies
charge user fees to public and private parties that want to use the data; and public-private partnerships
where a government agency negotiates an agreement with a single commercial data vendor.
Fleck: Opening Democracy: Sharing the Raw Ingredients of Policy ©2009 Peter 9
Fleck
This is a draft version. It will change and may be removed from ScribD. Please do not
cite without permission.
infrastructure:
Works of the government are in the public domain because we understand that
business builds on top of these works of government as infrastructure, just as
they build business on top of our other infrastructures such as water, roads, and
electricity. One of the least-recognized but certainly most important roles of
government is as an information provider, and one of the little heralded things
that make our United States different from other countries is the remarkably
effective role our government has played in providing that information
infrastructure.
(Malamud, 2008)
Open Sourcing
free rides, FOSS has shown that talented programmers are willing to
research and view data. These applications are then available to the
Fleck: Opening Democracy: Sharing the Raw Ingredients of Policy ©2009 Peter 10
Fleck
This is a draft version. It will change and may be removed from ScribD. Please do not
cite without permission.
source software.
build custom interfaces to the data and charge for access. If they
are able to design a compelling interface for working with the data
and find customers willing to pay for access to that interface, then
not have any data access beyond what the FOSS community is
Fleck: Opening Democracy: Sharing the Raw Ingredients of Policy ©2009 Peter 11
Fleck
This is a draft version. It will change and may be removed from ScribD. Please do not
cite without permission.
government data.
President Bill Clinton signed the Paperwork Reduction Act into law on May
22, 1995. Besides reducing paper usage, the act requires the Federal government to
The act endorses the idea of sharing government data and its
provide the data and circumvents the need to build web sites that
data. The closer we can go to the source of the info, the more we
The act served as a base for the first major movement of U.S.
Fleck: Opening Democracy: Sharing the Raw Ingredients of Policy ©2009 Peter 12
Fleck
This is a draft version. It will change and may be removed from ScribD. Please do not
cite without permission.
The 1966 Freedom of Information Act (FOIA) was amended in 1996 under
President Clinton hoped that there would be less need to invoke the
The legislation I sign today brings FOIA into the information and
electronic age by clarifying that it applies to records maintained
in electronic format. This law also broadens public access to
government information by placing more material on-line and
expanding the role of the agency reading room. As the
Government actively disseminates more information, I hope that
there will be less need to use FOIA to obtain government
information.
(U.S. Department of Justice, 1996)
Government web presence was the E-Government Act of 2002 signed into law under
President George W. Bush (Library of Congress Thomas, 2002). This act established
barriers between federal agencies to give the public easier online access to data and
Fleck: Opening Democracy: Sharing the Raw Ingredients of Policy ©2009 Peter 13
Fleck
This is a draft version. It will change and may be removed from ScribD. Please do not
cite without permission.
services” (Raney, 2002). The law calls for Federal agencies to provide ways for the
with processes that implement these technologies, to: (1) enhance the access to and
On one level, this is a step backward from the Paperwork Reduction Act. It
resulted in a more top-down approach, serving the data that the government wants
shared or that government employees have the time to share. It also focuses on
providing web-based services like renewing automobile license tags or paying for
city-delivered utility services like water and trash. These service functions of
government web sites are necessary but should not be confused with engaging
for Public Policy at Brown University, as quoted in the Raney (2002) article. He says
that this law pushes the government to a more corporate style and might even include
On the U.S. GSA site documents discuss this “corporate” style and state that the
Federal government should use the tools developed in the private sector since
corporations have solved many of the problems inherent in creating and maintaining
web sites (Patch, 2003). I see a concern here, as corporate environments are not
Fleck: Opening Democracy: Sharing the Raw Ingredients of Policy ©2009 Peter 14
Fleck
This is a draft version. It will change and may be removed from ScribD. Please do not
cite without permission.
regulatory process by improving the public's ability to find, view, and comment on
Research Service and the American Bar Association have criticized the poor design
of the site and lack of funding for improvements (Brito, 2007; Lindeman, 2008
10/20). Brito (2007) has gone so far as to say that “While efficient in theory,
obscure data than to make it easily accessible.” Noveck (2004) called the initiative
“perhaps the most far-reaching and important such governmental transformation ever
effected” but noted that “this radical overhaul of the administrative process is
ignoring “how individuals and groups communicate and work together to solve
problems.”
The site had a major redesign in 2008 and added an RSS feed and revised
searching features.11 This also allowed Jerry Brito to build a companion site called
offers features not available at the government site and indicates that third parties will
Fleck: Opening Democracy: Sharing the Raw Ingredients of Policy ©2009 Peter 15
Fleck
This is a draft version. It will change and may be removed from ScribD. Please do not
cite without permission.
Accountability and Transparency Act (FFATA) (S 2590) into law (Hatch, 2006).12
This law requires the Office of Management and Budget to create a new public
database that will share information about entities that receive federal grants, loans,
and contracts (Hatch, 2006). The government awards over one trillion dollars each
year in this type of financial transaction, a figure that represents nearly one-third of
searchable and free to the public. It provides names of entities, amounts of awards,
the entity location, and other information including transaction type and funding
agency (Miller, 2008). It also delivers the data in an accessible format13 via an API14
for pulling the raw data out and repurposing it and/or mixing it with other data to
This site goes a long way towards satisfying President Obama’s transparency
initiatives and giving citizens new ways to view government data. The data is still
somewhat filtered via the API15 but it is “raw” enough to make it very useful for
mashups.
12
As previously mentioned, this law was sponsored by then Senator Barack Obama (IL) and Senator
Tom Coburn (OK).
13
The data is delivered in an XML format. See Footnote 5.
14
API stands for “application programmer interface.” An API makes it easier for a programmer to
work with an application or an operating system by providing a documented interface.
15
Robinson (2008) notes that although an API delivers search results “…having to work through an
interface sometimes limits developers from making innovative, unforeseen uses of the data.” He also
notes that the Strengthening Transparency and Accountability in Federal Spending Act of 2008 (S.
3077), mandates use of an API for no good reason. (This bill has not been reintroduced in the current
legislative session as of this writing.)
Fleck: Opening Democracy: Sharing the Raw Ingredients of Policy ©2009 Peter 16
Fleck
This is a draft version. It will change and may be removed from ScribD. Please do not
cite without permission.
include:
Fleck: Opening Democracy: Sharing the Raw Ingredients of Policy ©2009 Peter 17
Fleck
This is a draft version. It will change and may be removed from ScribD. Please do not
cite without permission.
applications.
5. A KML file of geospatial data for use with Google Earth,
Google maps, or similar applications.
owner, fees paid, latitude and longitude of the project, zip codes,
in all the formats previously listed. With the KML data format a link
brings you to a page with the data overlaid on a Google map. Each
Fleck: Opening Democracy: Sharing the Raw Ingredients of Policy ©2009 Peter 18
Fleck
This is a draft version. It will change and may be removed from ScribD. Please do not
cite without permission.
2.
interview with Jon Udell (2006), she listed three reasons that the
Fleck: Opening Democracy: Sharing the Raw Ingredients of Policy ©2009 Peter 19
Fleck
This is a draft version. It will change and may be removed from ScribD. Please do not
cite without permission.
devices that would make the D.C. data feeds more accessible and
useful for the public (District of Columbia, 2008). The idea was to
Applications for Democracy took six days to launch and cost very
little. An RFP process might take years and cost over $1 million
(Corbett, 2008).
Fleck: Opening Democracy: Sharing the Raw Ingredients of Policy ©2009 Peter 20
Fleck
This is a draft version. It will change and may be removed from ScribD. Please do not
cite without permission.
lists bike routes, Metro stations, creates custom maps, and even
maps out bike theft sites. All of these applications utilize D.C.
CityDW data in some way. All were built at no cost to the District.
The various feeds that the District provides made creating the
(Malamud, 1998). He also provided audio feeds from the floor of the
Fleck: Opening Democracy: Sharing the Raw Ingredients of Policy ©2009 Peter 21
Fleck
This is a draft version. It will change and may be removed from ScribD. Please do not
cite without permission.
in using the SEC data and gaining access through the 1,400
Mead, the SEC contractor, stated that it would cost $18 million to
provide the access and the effort was defeated (Love, 1993).
circulated via the Internet in support of the initiative and groups like
Fleck: Opening Democracy: Sharing the Raw Ingredients of Policy ©2009 Peter 22
Fleck
This is a draft version. It will change and may be removed from ScribD. Please do not
cite without permission.
a lawsuit if the SEC did provide the access (Love, 1993). The SEC also
claimed that providing the data on the internet was “technically impossible” and even
if it were possible, “the only people interested in SEC fillings were Wall Street
Fatcats and they didn't really need subsidized access to data they were willing to pay
Rep. Markey spoke with Malamud about the issue in 1993. Malamud was
(Malamud, 1999).
Rep. Markey’s staff arranged a meeting with Malamud and the SEC to
“discuss the idea of giving us the data and letting us put together an Internet site.
There was a bit of pushback, to say the least” (Malamud, 1999). Malamud continues:
The problem was the 70's era data processing system that the SEC had put in
place in the late 80's. The deal was that EDGAR was way too rough for
consumers to digest. It needed, to speak the MIS lingo of the time, “value-add.”
Who would add value? Well, the SEC had cut a contract with a data wholesaler
who would add value. The wholesaler, in turn, would sell to information
retailers who would add even more value. Then, the information would be sold
on the retail information market to the Wall Street crowd who had an interest in
the data. Obviously, if we gave away all this information on the Internet, it
would subvert our entire Free Enterprise System.
In that meeting with the SEC and the Chairman's staff, my favorite moment was
when we got to the question of why in the world people would want to see
EDGAR data. I maintained that the Internet was full of lots of people—students,
journalists, senior citizen investors—who were dying for access to this data. The
SEC felt that only a few people would want to see EDGAR documents, and
besides the Internet (or “the ARPANET” as they kept referring to it) “didn't
have the right kind of people.”
Now, this was a cheap shot, and I understood that what they meant was “there
weren't a lot of people, just a few researchers,” but I couldn't resist.
“The right kind of people?” I said, rising up in my chair. “I think the American
Fleck: Opening Democracy: Sharing the Raw Ingredients of Policy ©2009 Peter 23
Fleck
This is a draft version. It will change and may be removed from ScribD. Please do not
cite without permission.
from Sun, Malamud launched a free EDGAR system on the Internet in January 1994.
This effort undercut the retail information industry of the time that was making
several hundred million dollars a year selling the information. He ran the site with
Brad Burdick for eighteen months. At its peak, 50,000 people visited the site each day
(Malamud, 1999).
Malamud’s goal however was to get the SEC to maintain the site. He did not
want to be in the online database business (Malamud, 1999). To that end, Malamud
forced the issue by putting a notice on the site in August 1995 stating that access to
EDGAR would shut down in 60 days and users should contact the SEC for continued
access (Malamud, 1999). Users of the site sent 15,000 messages to SEC Chairman
Arthur Levitt and the Wall Street Journal, New York Times and Associated Press ran
stories (Malamud, 1999; Malamud, 1998). This put pressure on the SEC and they
finally decided it was time to take the service under their roof and support it. By
October 1, Malamud’s cutoff date, the SEC EDGAR site was fully operational. Since
then, they have made substantial enhancements to the site (Malamud, 1999). The
EDGAR site now gets up to 500,000 users each day (Macavinta, 1998).
Malamud had added access to Patents and Trade Office (PTO) data in addition
to SEC data and planned on ending that with the end of SEC publication. He was not
so successful convincing the PTO to maintain the servers. PTO Commissioner Bruce
Lehman felt the fees generated by retailers repackaging the data—estimated at $20
Fleck: Opening Democracy: Sharing the Raw Ingredients of Policy ©2009 Peter 24
Fleck
This is a draft version. It will change and may be removed from ScribD. Please do not
cite without permission.
(Malamud, 1998; Macavinta, 1998). In an April 27, 1998 letter to then Vice President
Al Gore, Malamud argued that open access to the patent database would attract new
users who could never afford the fees and that current customers would still prefer
and pay for the repackaging (Malamud, 1998). Malamud (1998) stressed that opening
up the patent information would give the U.S. a much more robust intellectual
property market: “Markets are based on information and we cannot have an efficient
market for intellectual property on a global basis if we hide the documents that define
that property.”
In June of 1998, Lehman and the PTO changed course and announced that the
over 2 million patent documents would become available via the Internet in a
Malamud continues his open access activism today from the nonprofit site
download including nonprofit tax returns, state and local building codes and
regulations, and Smithsonian Institution images (Schwartz & Mackey, 2009). Also
available is a growing film archive (915 films as of June 2009) called FedFlix. These
are films originally produced for the federal government (Schwartz & Mackey, 2009).
Malamud is currently working on “freeing” data out of the U.S. Courts Public Access
to Court Electronic Records (PACER) search engine (Singel, 2008) and running for
Conclusion
Fleck: Opening Democracy: Sharing the Raw Ingredients of Policy ©2009 Peter 25
Fleck
This is a draft version. It will change and may be removed from ScribD. Please do not
cite without permission.
Over the course of writing this paper, open access at the Federal level has
grown exponentially. Data.gov was built to “increase public access to high value,
Foundation Blog points out that the Office of Management and Budget’s final
“guidance” does not include providing Recovery Act data in a raw format and calls
this a “Significant failure.” Madrigal (2009) points out that Data.gov had only fifty
data sets available when launched in May.18 But given President Obama’s
Memorandum for Transparency and Open Government (Obama, 2009) and his choice
of D.C.’s Vivek Kundra as CIO, the Federal government is showing that it believes
open access to data that fuels policy decisions will nurture an engaged citizenry and
promote democracy. Now these efforts need to cascade downward to state and local
levels where government entities should emulate the work of Washington, D.C. and
This paper should leave no doubt that open access to raw data is in the best
interests of the public good. It should also prove that third parties (including average
18
From personal experience, I have found problems with the data formats at Data.gov. One dataset that
I downloaded was only available as a Windows executable file and not accessible with the Apple
operating system. The other dataset was in a dBase format. This is an “ancient” database format that is
not easily readable today.
Fleck: Opening Democracy: Sharing the Raw Ingredients of Policy ©2009 Peter 26
Fleck
This is a draft version. It will change and may be removed from ScribD. Please do not
cite without permission.
citizens) will gather the data for analysis and reuse and that should be seen as a
benefit for the government as it can share in the process. This truly moves us closer to
a participatory democracy.
Fleck: Opening Democracy: Sharing the Raw Ingredients of Policy ©2009 Peter 27
Fleck
This is a draft version. It will change and may be removed from ScribD. Please do not
cite without permission.
Bibliography
Apps for Democracy. (2008a). DC Historic Tours | Apps for Democracy. Retrieved
June 3, 2009, from http://www.appsfordemocracy.org/dc-historic-tours/.
Apps for Democracy. (2008b). iLive.at | Apps for Democracy. Retrieved June 3,
2009, from http://www.appsfordemocracy.org/iliveat/.
Clift, S. (2007). Ten Practical Online Steps for Government Support of Democracy.
USA Services Intergovernmental Newsletter, How E-Government is Changing
Society and Strengthening Democracy, (20).
Corbett, P. (2008). Government 2.0: The Rise of Citizen Innovation Through Open
Data | iStrategyLabs. Retrieved June 2, 2009, from
http://www.istrategylabs.com/government-20-the-rise-of-citizen-innovation-
through-open-data/.
Fleck: Opening Democracy: Sharing the Raw Ingredients of Policy ©2009 Peter 28
Fleck
This is a draft version. It will change and may be removed from ScribD. Please do not
cite without permission.
%7C#summ.
Love, J. (1993). SEC'S EDGAR on Net, What Happened and Why. Retrieved June 5,
2009, from http://w2.eff.org/Activism/edgar_grant.announce.
Macavinta, Courtney. (1998). Patent office slammed for not posting data - CNET
News. Retrieved June 6, 2009, from http://news.cnet.com/Patent-office-
slammed-for-not-posting-data/2100-1023_3-210894.html.
Markoff, J. (, 1998b). Internet Gadfly Wants U.S. To Put More Data Online.
Retrieved June 5, 2009, from
http://www.nytimes.com/library/tech/98/05/biztech/articles/04database.html#
1.
Melber, A. (n.d.). Obama for America 2.0? Obama for America 2.0? Retrieved
January 26, 2009, from http://www.thenation.com/doc/20090112/melber?
rel=hp_currently.
Fleck: Opening Democracy: Sharing the Raw Ingredients of Policy ©2009 Peter 29
Fleck
This is a draft version. It will change and may be removed from ScribD. Please do not
cite without permission.
http://blog.sunlightfoundation.com/2009/06/25/no-raw-data-on-recoverygov-
significant-failure/.
O'Brien, T. M. (2009). Vivek Kundra: Federal CIO in His Own Words - O'Reilly
Radar. Retrieved June 11, 2009, from http://radar.oreilly.com/2009/03/vivek-
kundra-federal-cio-in-hi.html.
Obama for America. (ND). Barack Obama: Connecting and Empowering All
Americans Through Technology and Innovation. Retrieved January 20, 2009,
from
http://cairns.typepad.com/blog/files/fact_sheet_innovation_and_technology_pl
an_final.pdf.
Obama, B. (2009). The White House - Press Office - Transparency and Open
Government. Retrieved July 3, 2009, from
http://www.whitehouse.gov/the_press_office/Transparency_and_Open_Gover
nment/.
Raney, F.R. (2002). TECHNOLOGY; New Economy In the next year, the federal
government will move to give the public easier online access to data and
services. . New York Times (1857-Current file), C4. Retrieved December 7,
2008, from http://proquest.umi.com.floyd.lib.umn.edu/pqdweb?
did=731021792&Fmt=7&clientId=2256&RQT=309&VName=HNP.
Robinson, D., Yu, H., Zeller, W. P., & Felten, E. W. (2008). Government Data and
the Invisible Hand. Yale Journal of Law & Technology, 11. Retrieved from
http://papers.ssrn.com/sol3/papers.cfm?abstract_id=1138083.
Schwartz, J., & Mackey, R. (2009). Steal These Federal Records — Okay, Not
Literally - The Lede Blog - NYTimes.com. Retrieved June 6, 2009, from
Fleck: Opening Democracy: Sharing the Raw Ingredients of Policy ©2009 Peter 30
Fleck
This is a draft version. It will change and may be removed from ScribD. Please do not
cite without permission.
http://thelede.blogs.nytimes.com/2009/02/13/steal-these-federal-records-okay-
not-literally/?scp=4&sq=%22carl%20malamud%22&st=cse.
Singel, R. (2008). Online Rebel Publishes Millions of Dollars in U.S. Court Records
for Free. Retrieved July 3, 2009, from
http://www.wired.com/politics/onlinerights/news/2008/12/open_pacer?
currentPage=all.
Tauberer, J. (2006). XML.com: GovTrack.us, Public Data, and the Semantic Web.
Retrieved June 20, 2009, from
http://www.xml.com/pub/a/2006/02/08/govtrack-us-public-data-semantic-
web.html?page=1.
U.S. Department of Justice. (1996). FOIA Update: The Freedom of Information Act,
5 U.S.C. sect. 552, As Amended By Public Law No. 104-231, 110 Stat. 3048.
Retrieved June 14, 2009, from
http://www.usdoj.gov/oip/foia_updates/Vol_XVII_4/page2.htm.
U.S. National Archives. (1995). Paperwork Reduction Act (44 U.S.C. 3501.
Retrieved June 5, 2009, from http://www.archives.gov/federal-
register/laws/paperwork-reduction/.
Udell, J. (2006). Jon Udell, Dan Thomas, Suzanne Peck - A conversation with Dan
Thomas and Suzanne Peck about open government: Free MP3 Download.
Retrieved April 30, 2009, from http://beemp3.com/download.php?
file=3288516&song=A+conversation+with+Dan+Thomas+and+Suzanne+Pec
k+about+open+government.
Weber, S. (2005). The Success of Open Source (p. 320). Harvard University Press.
White House Office of the Press Secretary. (2009). The White House - Press Office -
President Obama Names Vivek Kundra Chief Information Officer. Retrieved
June 21, 2009, from http://www.whitehouse.gov/the_press_office/President-
Obama-Names-Vivek-Kundra-Chief-Information-Officer/.
Fleck: Opening Democracy: Sharing the Raw Ingredients of Policy ©2009 Peter 31
Fleck