General Course Information:
Instructor: Lakshmish Ramaswamy (laks[AT]cs[dot]uga[dot]edu, 706-542-2737)
Time and Venue(Tentative): Wednesdays - 3:35 PM to 4:25 PM; Tuesdays & Thursdays - 03:30 PM to 04:45 PM
Office Hours: To be announced
Course Description:
Distributed
systems have
become widely pervasive and are having a tremendous impact on various
domains of human activity. Today's distributed systems range from
ad-hoc networks
comprising of tiny sensor devices, to overlay networks such as
peer-to-peer
systems, to massive web farms of powerful servers. The research in the
area of distributed systems has focused on achieving better
performance, reliability, security, and privacy of various kinds of
distributed data processing applications.
In this course we will study the design, implementation and evaluation
of a wide class of distributed systems including edge cache networks,
multimedia content distribution networks, peer-to-peer systems,
publish-subscribe systems, stream processing systems, mobile systems
and sensor networks, with the objective of gaining an in-depth
understanding of the requirements and the design options.
Grading Policy (Tentative)
- Class participation - 15%
- Paper summaries - 20%
- Project proposal - 10%
- Final project - 40%
- Project Presentation - 15%
Course Materials (Tentative)
Web content generation and delivery
(Preliminary study: AFS and NFS in Silbershatz, Galvin and Gagne "OS Concepts [Chapter 11])
-
Nelson, M.N., Wlech, B.B., Ousterhout, J.K., "Caching
in the Sprite Network File System", ACM Transactions on Computer Systems,
6, 1, pgs. 134-154, February 1988. (self-study)
-
Anderson, T. et. al, "Serverless
Network File System", ACM Transpaction on Computer Systems, February
1996.
- C. Thekkath, T. Mann and E. Lee "Frangipani: A Scalable Distributed File System", (SOSP), October 1997.
-
C. Amza, A. Cox, S Dwarkadas, P Keleher, H Lu, R. Rajamony, W. Yu and W.
Zwaenepoel, "TreadMarks:
Shared Memory Computing on Networks of Workstations" IEEE Computer,
February, 1996.
Web content generation and delivery
Web servers and application servers
- N. J. Yeager, R. E. McGrath "Web Server Technology", Morgan Kaufmann
- A. Iyengar, J. Challenger, D. Dias, and P. Dantzig. High-Performance Web Site Design Techniques. IEEE Internet Computing, 4(2):17--26, March 2000.
- J. Challenger, P. Dantzig, A. Iyengar "A Scalable and Highly Available System for Serving Dynamic Data at Frequently Accessed Web Sites", Supercomputing - 98
- G. Banga and P.Druschel " Measuring the Capacity of a Web Server", USENIX Symposium on
Internet Technologies and Systems - 1997
- Arlitt and Williamson " Internet Web Servers: Workload Characterization and Performance
Implications", ACM/IEEE Transactions on
Networking, October 1997
- V. N. Padmanabhan and L. Qiu, "The Content and Access Dynamics of a Busy Web Site: Findings
and Implications", ACM-SIGCOMM 2000
- A. Iyengar, M. S. Squillante, L. Zhang "Analysis and Characterization of Large-Scale Web Server Access Patterns and Performance" World Wide Web Journal-1999
Web Content Caching and Delivery
- A. Chankhunthod, P. Danzig, C. Neerdaels, M. F. Schwartz, K. J. Worrell. A Hierarchical Internet Object Cache USENIX Annual Technical Conference, 1995
- R. Tewari, M. Dahlin, H. M. Vin, J. S. Kay. Beyond Hierarchies: Design Considerations for Distributed Caching on the Internet, ICDCS 1999
- L. Fan, P. Cao, J. Almeida, A. Z. Broder. Summary Cache: A Scalable Wide-Area Web Cache Sharing Protocol, IEEE/ACM Transactions on Networking, 2000
- J. Yin, L. Alvisi, M. Dahlin, A. Iyengar. Engineering Server-Driven Consistency for Large Scale Dynamic Web Services, WWW conference, 2001
- K. Amiri, S. Park, R. Tewari and S. Padmanabhan. DBProxy: A dynamic data cache for Web applications, ICDE 2003
- WS Li, O Po, WP Hsiung, KS Candan, D Agrawal. Engineering and hosting adaptive freshness-sensitive web applications on data centers, WWW Conference 2003
- S Shah, K Ramamritham, P Shenoy. Resilient and Coherence Preserving Dissemination of Dynamic Data Using Cooperating Peers IEEE TKDE 2004.
- L. Ramaswamy, A.
Iyenagar, L. Liu and F. Douglis, Automatic Detection of Fragments
in Dynamic Web Pages and its Impact on Caching, IEEE TKDE 2005
- L. Ramaswamy, L. Liu and
Arun Iyengar Cooperative EC Grid: Caching Dynamic Documents using Cache Clouds (Hard copies will be provided in the class)
- C. Yuan,
Y. Chen and Z. Zhang,
Evaluation of edge caching/offloading for dynamic content delivery.
WWW 2003
Peer -to-Peer Computing and Overlay Networks
- S. Androutsellis-Theotokis and D. Spinellis A Survey of Peer-to-Peer File Sharing Technologies, ACM Computing Surveys 2002.
- Jordan Ritter, Why Gnutella Can't Scale. No, Really. 2001
- B. Yang,
H. Garcia-Molina,
Improving Search in Peer-to-Peer Networks,
ICDCS 2002
- LV, C., CAO, P., COHEN, E., LI, K., AND SHENKER, S. Search and replication in unstructured peer-to-peer networks, Supercomputing 2002
- M. Ripeanu,
I. T. Foster,
A. Iamnitchi,
Mapping the Gnutella Network: Properties of Large-Scale Peer-to-Peer Systems and Implications for System Design, IEEE Internet Computing Journal, 2002
- Q. Lv, S. Ratnasamy and S. Shenker Can Heterogeneity Make Gnutella Scalable, IPTPS 2002
- K. Gummadi,
R. J. Dunn,
S. Saroiu,
S. D. Gribble,
H. M. Levy and Zahorjan,
Measurement, modeling, and analysis of a peer-to-peer file-sharing workload.
SOSP 2003
- CHAWATHE, Y., RATNASAMY, S., BRESLAU, L., AND SHENKER, S. Making Gnutella-like P2P Systems Scalable, SIGCOMM-2003
- E. Cohen and S. Shenker,
Replication strategies in unstructured peer-to-peer networks.
SIGCOMM 2002
- I. Stoica, R. Morris, D. Liben-Nowell, D. Karger, M. Kaashoek, F. Dabek and H. Balakrishnan, Chord: A Scalable Peer-to-peer Lookup Protocol for Internet Applications,
IEEE/ACM Transactions on Networking, February 2003
- A. Rowstron and P. Druschel, Pastry:
Scalable, distributed object location and routing for large-scale peer-to-peer
systems, Middleware 2001
- Sylvia Ratnasamy, Paul Francis, Mark Handley, Richard Karp and Scott Shenker, A Scalable Content-Addressable Network, SIGCOMM 2001
- Karl Aberer, Philippe Cudré-Mauroux, Anwitaman Datta,
Zoran Despotovic, Manfred Hauswirth, Magdalena Punceva, Roman Schmidt P-Grid: A Self-organizing Structured P2P System, SIGMOD RECORD 2003
- S. C. Rhea, B. Godfrey, B. Karp, J. Kubiatowicz, S. Ratnasamy, S. Shenker, I. Stoica, and H. Yu. OpenDHT: A public DHT service and its uses, SIGCOMM 2005
- Y. Chawathe, S. Ramabhadran, S. Ratnasamy, A. LaMarca, S. Shenker, J. Hellerstein, A Case Study in Building Layered DHT Applications, SIGCOMM 2005
- A. Rao K. Lakshminarayanan S. Surana R. Karp I. Stoica Load Balancing in Structured P2P Systems, INFOCOM 2003
- A. R. Bharambe, M. Agrawal, and S. Seshan, Mercury: supporting scalable multi-attribute range queries, SIGCOMM 2004
- Dongsheng Li, Jiannong Cao, Xicheng Lu, Keith C. C. Chan, Jinshu Su,
H.V. Leong, Alvin T. S. Chan , Delay-Bounded Range Queries in DHT-based
Peer-to-Peer Systems, ICDCS 2006 (Copies provided in class)
- P. B. Godfrey, S. Shenker, and I. Stoica, Minimizing Churn in Distributed Systems, SIGCOMM 2006.
- J. Kannan, B. Yang, S. Shenker, P. Sharma, S. Banerjee, S. Basu, S.J. Lee, SmartSeer: Using a DHT to Process Continuous Queries Over Peer-to-Peer Networks, INFOCOM 2006
- V. Pappas, D. Massey, A. Terzis and L. Zhang, A Comparative Study of Current DNS with DHT-Based Alternatives, INFOCOM 2006
Event Processing and Publish/Subscribe Systems
- P. Th. Eugster, P. A. Felber, R. Guerraoui and A.-M. Kermarrec, The Many Faces of Publish-Subscribe, ACM Computing Surveys, 2003.
- G. Banavar, T. Chandra, B. Mukherjee, J. Nagarajarao, R. Strom and D. Sturman, An Efficient Multicast Protocol for Content-Based Publish-Subscribe Systems, ICDCS 1999.
- L. Ramaswamy, J. Chen and P. Parate, COQUOS: Lightweight Support to Continuous Queries on Unstructured Overlays, IPDPS 2007
- A. Carzaniga, D. S. Rosenblum and A. Wolf, Design and Evaluation of a Wide-Area Event Notification Service ACM TOCS 2001
- A. Gupta, O. Sahin, D. Agrawal and A. El Abbadi Meghdoot: Content-based Pub/Sub over P2P Networks Middleware 2004
- P.A. Chirita, S. Idreos, M. Kaubarakis and W. Nejdl Designing Semantics Pub/Sub Using Super Peers, Middleware 2005
- Y. Huang and H. Garcia-Molina Publish-Subscribe in a Mobile Environment, Wireless Networks, 2004.
- S. Banerjee, S. Basu, J. Kannan, S. J. Lee, P. Sharma, S. Shenker, and B. Yang, SmartSeer: Using a DHT to Process Continuous Queries Over Peer-To-Peer Networks, INFOCOM 2006
Stream Processing,
- S. Babu and J. Widom, "Continuous Queries over Data Streams", SIGMOD Record 2001
- S. Chandrasekaran,
M. J. Franklin:
Streaming Queries over Streaming Data.
VLDB 2002
- S. Madden,
M. J. Franklin:
Fjording the Stream: An Architecture for Queries Over Streaming Sensor Data.
ICDE 2002
- D.
Carney, U. Cetintemel, M. Cherniack, C. Convey, S. Lee, G. Seidman, M.
Stonebraker, N. Tatbul, and S. Zdonik. Monitoring Streams: A New
Class of Data Management Applications, VLDB 2002
- S. Chandrasekaran, et al. TelegraphCQ: Continuous Dataflow Processing for an Uncertain World.
CIDR 2003
- M. Cherniack, H. Balakrishnan, M. Balazinska,
D. Carney, U. Cetintemel, Y. Xing, S. Zdonik, Scalable Distributed Stream Processing, CIDR 2003
-
D. J. Abadi, et al. The
Design of the Borealis Stream Processing Engine CIDR-2005
-
M.J. Stonebraker, U. Cetintemel, and
S. Zdonik, The
8 Requirements of Real-Time Stream Processing
ICDE 2005
-
M. Balazinska, H. Balakrishnan, S. Madden, and M.Stonebraker, Fault-Tolerance
in the Borealis Distributed Stream Processing System, SIGMOD 2005
- P.R. Pietzuch, et al. Network-Aware Operator Placement for Stream-Processing Systems, ICDE 2006
-
N. Tatbul, S. Zdonik,
Window-aware Load Shedding for Aggregation Queries over Data Streams, VLDB 2006
Security and Privacy
- M. Srivatsa and L. Liu,
"Securing Publish-Subscribe Overlay Services With EventGuard"
, ACM CCS 2005
-
M. Srivatsa, L. Xiong and L. Liu,
"TrustGuard: Countering Vulnerabilities in Reputation Management for Decentralized Overlay Networks" WWW 2005.
- L. Sweeney, k-anonymity: a model for protecting privacy.
International Journal on Uncertainty, Fuzziness and Knowledge-based Systems.
-
K. LeFevre, D. DeWitt, and R. Ramakrishnan. Incognito: Efficient Full-Domain K-Anonymity. In ACM SIGMOD 2005.
- A. Machanavajjhala, J.
Gehrke, D. Kifer, and M. Venkitasubramaniam.
l-Diversity: Privacy Beyond
k-Anonymity. ICDE-2006.
Miscellanious Materials