This is an archived course. A more recent version may be available at ocw.mit.edu.

Readings

This section provides information on the readings for the course. Most readings are from the "Red Book," otherwise known as Readings in Database Systems. Supplemental readings are also presented in the table.

Textbooks

The course readings will primarily be drawn from the following books:

Red Book = Amazon logo Hellerstein, Joseph M., and Michael Stonebraker, eds. Readings in Database Systems. 4th ed. Cambridge, MA: MIT Press, 2005. ISBN: 0262693143.

RG = Amazon logo Ramakrishnan, Raghu, and Johannes Gehrke. Database Management Systems. 2nd ed. New York, NY: McGraw-Hill, 2000. ISBN: 0072465352.

Note that the 3rd edition of Readings In Database Systems is a substantially different text (it does not include the same readings as the 4th edition).

SES # TOPICS READINGS
L1 Introduction  
L2 The Relational Model Red Book: Stonebraker, Michael, and Joseph Hellerstein. "What Goes Around Comes Around." Read Sections 1-4 (if you know something about XML, you may also enjoy reading Sections 10 and 11; they are classic Stonebraker).

Codd, E. F. "A relational model of data for large shared data banks." Communications of the ACM 13, no. 6 (1970): 377-387. Focus on Sections 1.3 and all of Section 2.

RG: pp. 57-63.
L3 Schema Normalization RG: Chapter 19.
L4 Database Design Red Book: Hellerstein, Joseph, and Michael Stonebraker. "The Anatomy of a Database System." Focus on Sections 1-4, though you should also read Sections 5.1, 5.2 and skim Section 6.

Astrahan, M. M., et al. "System R: Relational Approach to Database Management." ACM TODS 1, no. 2 (1976): 97-137. Read up to page 122; you may also skip the "Optimizer" section, pages 110 - 114.
L5 Indexing and Access Methods Red Book: pp. 273-289 and 344-358.

Red Book: Beckmann, N., et al. "The R*-Tree: An Efficient and Robust Access Method for Points and Rectangles."
L6 Buffer Pool Design and Memory Management Gray, Jim, and Goetz Graefe. "The Five-Minute Rule Ten Years Later, and Other Computer Storage Rules of Thumb." SIGMOD Record 26, no. 4 (1997): 63-68. (Also in Red Book.)

Chou, Hong-Tai, and David DeWitt. "An Evaluation of Buffer Management Strategies for Relational Database Systems." In Proceedings of the VLDB Conference, 1985.
L7 Join Algorithms Red Book: Shapiro, L. D. "Join Processing in Database Systems with Large Main Memories."
L8 Query Optimization Selinger, Patricia, M. Astrahan, D. Chamberlin, Raymond Lorie, and T. Price. "Access Path Selection in a Relational Database Management System." In Proceedings of ACM SIGMOD, Boston, MA, 1979, pp. 22-34. (Also in Red Book.)

Mannino, Michael, Paichen Chu, and Thomas Sager. "Statistical Profile Estimation in Database Systems." ACM Computing Surveys 20, no. 3 (1988): 191-221.
L9 Introduction to Transaction Processing RG: Chapter 16.
L10 Recovery Red Book: Mohan, C., et al. "ARIES: A Transaction Recovery Method Supporting Fine-Granularity Locking and Partial Rollbacks Using Write-Ahead Logging." Read Sections 1-7, and skim Sections 12 and 13.
L11 Optimistic Concurrency Control Kung, H. T., and John T. Robinson. "On Optimistic Methods for Concurrency Control." ACM Transactions on Database Systems 6, no. 2 (June 1981): 213-226. (Also in Red Book.)
L12 Recovery (cont.) Red Book: Mohan, C., et al. "ARIES: A Transaction Recovery Method Supporting Fine-Granularity Locking and Partial Rollbacks Using Write-Ahead Logging." Read Sections 1-7, and skim Sections 12 and 13.
L13 Degrees of Locking Gray, Jim, et al. "Granularity of Locking and Degrees of Consistency in a Shared Data Base." In Modelling in Data Base Management Systems: Proceeding of the IFIP Working Conference on Modelling in Data Base Management Systems. Edited by Nijssen, G. M. Freudenstadt, Germany, January 5-8, 1976. ISBN: 0720404592. (Also in Red Book.)
Q1 Exam 1  
L14 Distributed Databases - Guest Lecture Dewitt, David, and Jim Gray. "Parallel Database Systems: The Future of High Performance Database Processing." Communications of the ACM 35, no. 6 (1992): 85-98. (Also in Red Book.)
L15 Distributed Transactions Mohan, C., Bruce Lindsay, and R. Obermarck. "Transaction Management in the R* Distributed Database Management Systems." ACM Transactions On Database Systems 11, no. 4 (1986): 378-396. (Also in Red Book.)
L16 Replication in Database Systems Davidson, Susan, Hector Garcia-Molina, and Dale Skeen. "Consistency in a partitioned network: a survey." In ACM Computing Surveys 17, no. 3 (1985): 341-370.
L17 Search Engines Red Book: Brewer, Eric. "Combining Systems and Databases: A Search Engine Retrospective."

Dean, Jeffrey, and Sanjay Ghemawat. "MapReduce: Simplified Data Processing on Large Clusters." OSDI (2004): 137-150.
L18 Stream Processing Abadi, Daniel J., Don Carney, Ugur Cetintemel, Mitch Cherniack, Christian Convey, et al. "Aurora: a new model and architecture for data stream management." VLDB Journal 12, no. 2 (August 2003): 120-139. Read Sections 1-6. Note that this IS NOT the Aurora paper in the Red Book.
L19 Adaptive Query Processing - Guest Lecture Babu, Shivnath, and Pedro Bizarro. "Adaptive Query Processing in the Looking Glass." In Proceedings of CIDR. Asilomar, CA, January 4-7, 2005, pp. 238-249.

Avnur, Ron, and Joseph M. Hellerstein. "Eddies: Continuously Adaptive Query Processing." In Proc. of ACM SIGMOD. 2000, pp. 261-272. (Also in Red Book.)
L20 Online Query Processing Hellerstein, Joseph, Ron Avnur, and Vijayshankar Raman. "Informix under CONTROL: Online Query Processing." Data Mining and Knowledge Discovery 12 (2000): 281-314. (Also in Red Book.)
L21 Sensor Networks Gehrke, Johannes, and Samuel Madden. "Query Processing in Sensor Networks." In Pervasive Computing, Jan-March 2004, pp. 46-55.

Deshpande, Amol, Carlos Guestrin, Samuel Madden, Joseph Hellerstein, and Wei Hong. "Model-Driven Data Acquisition in Sensor Networks." In Proc. of International Conference on Very Large Databases. 2004.
L22 XML and Semantic Web Bergholz, Andre. "Extending your Markup: An XML Tutorial." IEEE Internet Computing 4, no. 4 (2000): 74-79.

Hunter, Jason. "X is For XQuery." Oracle Technology Network. Also published in Oracle Magazine, May/June 2003.

Halevy, Alon, Oren Etzioni, AnHai Doan, Zachary Ives, Jayant Madhavan, Luke McDowell, and Igor Tatarinov. "Crossing the Structure Chasm." In CIDR, Asilomar, CA, January 5-8, 2003.
Q2 Exam 2  
L23 XML Red Book: Stonebraker, Michael, and Joseph Hellerstein. "What Goes Around Comes Around."
P1 Final Project Presentation