A Hybrid Publish Subscribe Protocol
Mark Linderman, Norman Ahmed,James Metzler
Air Force Research LaboratoryRome, NY 13441
{
Mark.Linderman, Norman.Ahmed,James.Metzler}@rl.af.mil
Jason Bryant
ITT CoorporationRome, NY 13441
Jason.Bryant.ctr@rl.af.mil
ABSTRACT
Content-based publish/subscribe system performancedepends upon the e
ffi
cient subscription matching and eventdissemination to interested subscribers. We propose a hy-brid content-based publish/subscribe protocol for large sizeevents wherein a centralized brokering system is coupledwith a decentralized BitTorrent-like peer-to-peer (P2P) pro-tocol for scalable event distribution among publishers andsubscribers. Events are mapped to a torrent that grows asnew events are published. Subscribers self-broker on eventmetadata and request content only if interested. Subscriberinterests determine event popularity that the broker esti-mates with sampling. Popular events are disseminated P2P;unpopular events, directly from the broker; and somewhatpopular ones, with P2P and broker-directed pre-seeding.The challenge is the dissemination of popular events withoutoverwhelming centralized resources while e
ffi
ciently dissem-inating unpopular events that lack su
ffi
cient interest to sus-tain gossip-based dissemination. The key advances includenew means of handling variable event popularity inherent incontent-based pub/sub and an adaptive anti-entropy mech-anism for undelivered events.
Categories and Subject Descriptors
C.2.2 [
Computer - Communication Networks
]: Net-work Protocols; C.2.4 [
Computer - Communication Net-works
]: Client/Server, Distributed Applications
General Terms
Performance
Keywords
BitTorrent, Content Distribution, Event-based Systems, Mid-dleware, Peer-to-Peer, Publish and Subscribe, Scalability
1. INTRODUCTION
c
2008 Association for Computing Machinery. ACM acknowledges thatthis contribution was authored or co-authored by a contractor or affiliateof the U.S. Government. As such, the Government retains a nonexclusive,royalty-free right to publish or reproduce this article, or to allow others todo so, for Government purposes only.
Middleware
’08 Companion, December 1-5, 2008 Leuven, Belgium.Copyright 2008 ACM 978-1-60558-369-3/08/12 ...$5.00.
The publish and subscribe (pub/sub) disseminationparadigm has emerged as a popular means of disseminatingtime-sensitive or filtered information. Through the use of anevent service, or broker, published information is dissemi-nated only to the subscribers interested in that information.One of the major disadvantages of a centralized broker isthat predicate processing and event dissemination can over-whelm the processing, memory, and bandwidth resources of the broker. Therefore, for large numbers of subscribers (say1K-100K) or for large events (e.g., video or imagery) cen-tralized brokers may lack scalability.The system described in this paper reduces the predicateprocessing burden on the broker with a combination of pop-ularity estimation and filtering-upon-receipt. It reduces thedissemination burden by building upon decentralized peer-to-peer (P2P) file sharing protocols, such as BitTorrent [5],to handle the bulk of the dissemination load for content-based pub/sub.The fundamental challenge addressed by this paper is theproblem of simultaneously disseminating popular events forwhich gossip-based P2P protocols work well and unpop-ular events for which they fail. In this paper, we com-bine a centralized pub/sub broker with a BitTorrent-likede-centralized P2P data dissemination protocol to build ahybrid pub/sub system that is highly scalable, and yet stillhandles unpopular events e
ffi
ciently.We have implemented a full-featured information-spacebased information management system [6] on top of the pub-lish and subscribe protocol described in this paper. Eventsof a given type are treated as a single torrent. Each eventcomprises XML metadata and a binary payload. The meta-data tends to be
∼
1Kb in size. Payloads range from 0 bytesto several megabytes.Principal attributes of the hybrid publish and subscribesystem presented are:
•
The use of torrents that start out empty and grow overtime as new events are published,
•
Peer-based self-evaluation of predicates,
•
Broker-based popularity estimation,
•
Aggressive propagation technique for dealing with mid-popular objects that reduces to hub-and-spoke dissem-ination for very unpopular objects
•
A broker-based fallback mechanism for events that faildelivery by one of the other means.
24
Leave a Comment