You are on page 1of 3

WEB COMPUTING

Editor: Geoffrey Fox, fox@csit.fsu.edu

PEER-TO-PEER NETWORKS
By Geoffrey Fox

W ILL PEER-TO-PEER COMPUTING BE THE NEXT KILLER

INTERNET APPLICATION? LIKE MOST OVER-HYPED

CONCEPTS, P2P IS LOOSELY DEFINED AND COVERS A SET OF


cause usually all you have is a filename.
The added value of metadata for files
lies at the heart of the Semantic Web,
a vision from the W3C Web Consor-
tium related to P2P (www.w3.org/
2001/sw).
RATHER DISPARATE IDEAS. PERHAPS THE ONLY COMMON THEME IS Although the two concepts of Web-
based NFS and metadata-enhanced
a client-oriented view of the world; proximately 10,000 music files per sec- files are fundamental and broadly ap-
you might think of P2P as “Power ond using Napster, more than 100 users plicable, Napster is currently contro-
to the People.” The servers are sub- tried to connect to the system every sec- versial and under siege because the
servient to the clients, which do most ond, and there would be 75 million audio files are typically copyrighted.
of the work. Napster users by the end of 2000 The company’s legal problems are a
P2P has several important technol- (http://news.cnet.com/News/Pages/ feature of the particular content, but
ogy challenges and applications, vary- Special/Napster/napster_patel.html). the technology is long-lasting and in
ing from the sublime to the ridiculous. Napster has other typical P2P ser- my opinion uncontroversial—it’s an es-
This column presents a quick overview vices—instant messaging, chat rooms, sential P2P capability.
and suggests some emerging research buddy lists, and information about to- Some 200 Napster clones are avail-
areas and opportunities. day’s popular music—but the key fea- able to support this area (www.
ture is the ability to share files between ultimateresourcesite.com/napster/
Napster any Internet-connected consenting main.htm). Currently the most popu-
Napster (www.napster.com) is the clients. This is roughly the Web ver- lar is Imesh (www.imesh.com), which
most well-known and popular P2P sys- sion of NFS (Network File System) fa- has some two million users and can
tem. Shawn Fanning developed the miliar from traditional computing en- share any type of file. Some of the
original application and service in Jan- vironments. best-known file-sharing systems are
uary 1999 while a freshman at North- Another feature is its handling of MojoNation (www.mojonation.net),
eastern University. Napster lets any MP3 files, which are important as a Freenet (http://freenet.sourceforge.
client advertise the MP3 files stored on popular digital encoding format for au- net), and Gnutella (http://gnutella.
its disk and download MP3 files from dio files. It is straightforward to “rip” wego.com). These are not server-
other clients connected to the Napster (copy) files off an audio CD and look based like Napster but rather support
server network. It is said that Fanning up key metadata (artist, title, and so on) waves of software agents expressing
was taking a computer programming in the CDDB database on the Web resource availability and interest prop-
course at Northeastern but had to buy (www.gracenote.com). You can store agating among an informal, dynamic
a programming book to build Napster. and access the audio and metadata as a network of peers.
Like most good ideas, Napster was de- single unit. Although the system uses a
signed to solve a real need—in this case server to establish the initial connec- Other P2P features
to enable Fanning, a musician, to share tion, it transfers files efficiently, di- So far, we’ve examined some basic
his music with his friends on campus. rectly from client to client. This is an P2P services—file registration, access,
The system has become staggeringly improvement over most NFSs where and search. We can also categorize
popular. A legal opinion from last sum- using distributed files is not easy (ex- P2P systems in other ways, including
mer stated that users were sharing ap- cept possibly for the originator), be- distributed computing, collaboration,

MAY/JUNE 2001 75
WEB COMPUTING

and core technologies. Let’s review given a more detailed review of these A central server manages digital audio
these three areas. activities at www.nature.com/nature/ conferences with a general mix of phone
webmatters/grid/grid.html and relates and pure Internet audio. As their tech-
Distributing computing. Distrib- them to computational grids (www. nology advances, they should move to
uted-computing P2P applications are gridforum.org). the Groove and Garnet models with dy-
well illustrated in the CiSE article namic relay servers positioned through-
“Distributed Projects Tackle Protein Collaboration. Collaborative sys- out the Web. All forms of collaboration
Mystery” by Keri Schreiner in the first tems form a rather different type of P2P use some type of messaging, with the
issue of 2001. Schreiner discusses network: a community of clients work- message (typically called an event) car-
the use of millions of Internet clients ing together and sharing different In- rying a variety of content including the
to analyze data looking for extra- ternet resources. Instant-messenger IM text, pixel changes to record a
terrestrial life (SETI@home, http:// (IM) communication and chat rooms, in changed shared display (frame buffer),
setiathome.ssl.berkeley.edu) and the all their various forms, are the most or digital audio packets. XML is the
newer project examining protein fold- common examples in this arena. Here, natural way of encoding such messages,
ing (Folding@home, www.stanford. participating clients exchange messages and the open source Instant Messenger
edu/group/pandegroup/Cosm). This with each other. Unlike the file-sharing Jabber (www.jabber.org) provides a
kind of distributed-computing solution case, a user would typically multicast the clean framework of this kind. Several
divides an application into a huge num- same message to multiple clients at the Napster-like systems have based their
ber of essentially independent compu- same time. The best architecture for service on IM technology; Aimster
tations plus a central server system that this is still under active research. In fact, (www.aimster.com) is one of the best
doles out separate work chunks to each I work in this area (see the Garnet Col- known. OpenCola (www.opencola.com)
participating client. In the parallel- laboration System at http://aspen.csit. has a general XML framework to sup-
computing community, these problems fsu.edu/collabtools). Groove Networks port P2P systems.
are called pleasingly or embarrassingly (www.groove.net), founded by the cre-
parallel. I include this approach in the ator of Lotus Notes, is the best-known Core technologies. Core tech-
P2P category because the computing is P2P collaboration project; it uses relay nologies or services include P2P man-
peer-based, even though it transfers servers to implement the P2P multi- agement, messaging, security, and client
files through peer–server communica- cast. Collaboration systems form a P2P grouping, as well as the file (or more
tion (unlike the largely pure client– “illusion” using a static or dynamic generally object) registration, discovery,
client model of Gnutella and Napster). suite of servers to optimally route mes- and access capabilities mentioned ear-
SETI@home and Folding@home are sages. When the clients are scattered lier for Napster. We must develop these
elegantly implemented as screen savers around the world, the relaying servers core capabilities and then define com-
that you download. would perhaps be in the “middle” of munity standards. Then they can inter-
Other projects of this type include the Web; when a group of clients are operate, resulting in bigger and better
United Devices (www.ud.com/home. clustered, their relay would be “on the P2P systems. Sun Microsystems has
htm based on SETI@home), Ap- edge” and perhaps dynamically created two important technology projects. Jini
pliedMeta (based on the University on a peer machine of this cluster. Typ- (www.sun.com/jini) deserves a column
of Virginia Legion project, www. ically you also need some sort of server of its own; it has a beautiful, simple
appliedmeta.com), Parabon Compu- to establish the initial session and man- model for dynamic self-defining ob-
tation (www.parabon.com), Condor age the permanent state. So this type of jects. Like Napster peers, these objects
(from Wisconsin www.cs.wisc.edu/ P2P application gives a rich mix of true register with distributed servers so that
condor), and Entropia (www.entropia. peers and servers. other peers can discover and access
com). Other applications for this type In addition to IM, chat, and email, them. JXTA (from juxtaposition, www.
of system include financial modeling, collaboration systems offer shared re- openp2p.com/pub/a/p2p/2001/02/15/
bioinformatics, Web server perfor- sources such as white boards, shared doc- joy_keynote.html) is a new project from
mance testing, and the scheduling of uments, and audio-videoconferencing. Sun’s chief scientist Bill Joy that aims at
different jobs to use idle time on a net- The HearMe system is a nice example core P2P capabilities, including peer
work of workstations. Ian Foster has of the P2P illusion (www.hearme.com). grouping and security.

76 COMPUTING IN SCIENCE & ENGINEERING


How to
Reach CiSE
Writers
For detailed information on
Besides the messaging services needed shared files into many parts could submitting articles, write to
to implement collaborative P2P systems, increase bandwidth (parallel I/O) and cise@computer.org or visit
there is also a Java message service called security (no one site could access files computer.org/cise/edguide.htm.
JMS (http://java.sun.com/products/jms) without cooperation from its peers).
that provides the core publish–subscribe This type of technology is contro- Letters to the Editors
mechanism on which most P2P services versial because it makes censorship Send letters to
are built. This needs some upgrading to very hard. MojoNation has a load-
join the P2P revolution; Sun should add balancing and scheduling algorithm in Jenny Ferrero, Contact Editor
XML, a more dynamic matching para- the form of micro payments to reward jferrero@computer.org
digm (of collaborating peers), and sup- those who contribute most to the Please provide an email address or
port for relay servers. community. Gnutella, a family of re- daytime phone number with your
I expect research and commercial ex- lated products, is usually described as letter.
perience to identify more base services a P2P search engine, because its inter-
as we better understand the common face is more like a search engine than On the Web
needs of P2P systems. Resource man- a Web file system. Access computer.org/cise or
agement in such a network must be There is one more important char- ojps.aip.org/cise for information
an important challenge. It is our acteristic of P2P networks: client het- about CiSE.
Nirvana—the Web operating system. erogeneity. Hand-held devices, mobile
Maybe society can live in a hodge- phones, special interfaces for those Subscription Change of Address
podge of unstructured knowledge having physical handicaps, as well as (IEEE/CS)
swept back and forth by armies of basic desktops can participate in a sin- Send change-of-address requests for
Gnutella agents, but this will not do gle P2P session. This requires that magazine subscriptions to address.
for the Enterprise P2P (named by each peer be able to render copied files change@ieee.org. Be sure to spec-
O’Reilly and the Gartner Group, and shared objects differently. Careful ify CiSE.
www.oreillynet.com/pub/d/547) design of the XML metadata for both
needed by Fortune 500 organizations. clients and display devices makes this Subscription Change of Address
Here we will need to manage structured quite possible and ripe for research. (AIP)
information within a dynamic P2P There are lots of good research top- Send general subscription and
grouping. ics and obviously lots of business op- refund inquiries to subs@aip.org.
The right approach is to generalize portunities. P2P networks, as part of
Napster and Jini, ensuring that all ob- the next wave of the Web, are intellec- Subscribe
jects are tied to metadata (possibly in a tually challenging to design and so- Visit ojps.aip.org/cise/subscrib.html
separate record) that define their dis- cially and intellectually rewarding to or computer.org/subscribe.
covery, rendering, access, and sharing use. They can and will unite us all.
characteristics. One homely example is Missing or Damaged Copies
family photos; usually these are indeed If you are missing an issue or you
a melange of folders haphazardly
stuffed in shoeboxes. This gets much
worse for a community event recorded
Y ou can find general discussions
on P2P technology at two good
Web sites: www.openp2p.com from the
received a damaged copy, contact
membership@computer.org.

in shoeboxes across the nation. With O’Reilly group and www.peer-to- Reprints of Articles
the proper metadata and Enterprise peerwg.org from an industry working For price information or to order
P2P support, such photos could be group originally initiated by Intel. I reprints, send e-mail to cise@
nicely organized and presumably of also recommend a remarkable book, computer.org or fax +1 714 821
greater value. Peer-to-Peer: Harnessing the Power of 4010.
Disruptive Technologies by Andrew
The future Oram, Nelson Minar, Clay Shirky, and Reprint Permission
Many interesting ideas are being Tim O’Reilly (O’Reilly & Associates, To obtain permission to reprint an
explored. As an example, breaking 2001). article, contact William Hagen,
IEEE Copyrights and Trademarks
Manager, at whagen@ieee.org.
MAY/JUNE 2001

You might also like