| dc.contributor.advisor | Samuel Madden. | en_US |
| dc.contributor.author | Wu, Eugene, Ph. D. Massachusetts Institute of Technology | en_US |
| dc.contributor.other | Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. | en_US |
| dc.date.accessioned | 2011-01-26T14:30:36Z | |
| dc.date.available | 2011-01-26T14:30:36Z | |
| dc.date.copyright | 2010 | en_US |
| dc.date.issued | 2010 | en_US |
| dc.identifier.uri | http://hdl.handle.net/1721.1/60824 | |
| dc.description | Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2010. | en_US |
| dc.description | Cataloged from PDF version of thesis. | en_US |
| dc.description | Includes bibliographical references (p. 65-68). | en_US |
| dc.description.abstract | Many data-intensive websites are characterized by a dataset that grows much faster than the rate that users access the data and possibly high insertion rates. In such systems, the growing size of the dataset leads to a larger overhead for maintaining and accessing indexes even while the query workload becomes increasingly skewed. Additionally, the database index update costs can be a non-trivial proportion of the overall system cost. Shinobi introduces a cost model that takes index update costs account, and proposes database design algorithms that optimally partition tables and drop indexes from partitions that are not queried often, and that maintain these partitions as workloads change. We show a 60x performance improvement over traditionally indexed tables using a real-world query workload derived from a traffic monitoring application and over 8x improvement for a Wikipedia workload. | en_US |
| dc.description.statementofresponsibility | by Eugene Wu. | en_US |
| dc.format.extent | 68 p. | en_US |
| dc.language.iso | eng | en_US |
| dc.publisher | Massachusetts Institute of Technology | en_US |
| dc.rights | M.I.T. theses are protected by
copyright. They may be viewed from this source for any purpose, but
reproduction or distribution in any format is prohibited without written
permission. See provided URL for inquiries about permission. | en_US |
| dc.rights.uri | http://dspace.mit.edu/handle/1721.1/7582 | en_US |
| dc.subject | Electrical Engineering and Computer Science. | en_US |
| dc.title | Shinobi : insert-aware partitioning and indexing techniques for skewed database workloads | en_US |
| dc.title.alternative | Insert-aware partitioning and indexing techniques for skewed database workloads | en_US |
| dc.type | Thesis | en_US |
| dc.description.degree | S.M. | en_US |
| dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | |
| dc.identifier.oclc | 697282348 | en_US |