reader’ which could display articles from blogs (which were appearing all over the place), and fromnews sites, listservs and newsgroups, and on and on. All these sources (‘publishers’) were deliveringinformation of roughly the same kind: articles with titles, with an author and a publication date. They weren’t all identical, for example some academic sources might have had a standard ‘citations’element, and so on.The insight was this: if we could get all the publishers to offer their articles with a standardformat, if it was cheap or free for them to do so, then we could have a ‘news reader’ that wouldallow a person to see all the new articles published on CNN side by side with the articlespublished on Pito’s Blog and on the Sunlight Foundations news group. And any new publisher on the scene had an easy way to get their content delivered to that same news reader.Data RSS is not a perfect analogy to this, but it serves as inspiration.
Data RSS
(another disclaimer: this isn’t anything like a specification. I will try to describe this to the levelof detail that I understand, hopefully sufficient to be understood. I will follow with a set of challenges, objections and questions.)In the description below I reference two ‘participants’:1.The
publisher
. This is who owns the data and wants to make it available broadly toothers. Technically the publisher can be found at a url, for example: opengovernment.org/datarss. So the publisher is also represented by a piece of software on the publisher’s web site that responds to that url.2.The
accessor.
This is who wants to use the data. It could be a web widget, it could be areporting tool, it could be another web site or application. Technically the accessor is a piece of software that is accessing a url, say for example opengovernment.org/datarss.Data RSS consists of the following major elements:•An XML format that describes the information that is available. The names of datasets. for each dataset. names of fields, datatypes, and other discovery information.•An XML format used to actually deliver data•A REST protocol with which to query for discovery information and for data.That’s all there is. At least at the heart. With these three formats publishers know what to do topublish their information as Data RSS and accessors have what they need to know to access any publisher’s information.
Advantages
•An accessor has a shot at accessing information from any publisher, even those that don’texist yet, or accessing new datasets that become available from existing publishers.
Data RSS - A modest proposal
Pito Salas -rps@salas.com- February 26 2009
Add a Comment