Partitioning Techniques for Fine-grained Indexing
Author(s)
Wu, Eugene; Madden, Samuel R.
DownloadMadden_Partitioning techniques.pdf (847.2Kb)
OPEN_ACCESS_POLICY
Open Access Policy
Creative Commons Attribution-Noncommercial-Share Alike
Terms of use
Metadata
Show full item recordAbstract
Many data-intensive websites use databases that grow
much faster than the rate that users access the data. Such growing
datasets lead to ever-increasing space and performance overheads
for maintaining and accessing indexes. Furthermore, there is often
considerable skew with popular users and recent data accessed
much more frequently. These observations led us to design Shinobi,
a system which uses horizontal partitioning as a mechanism for
improving query performance to cluster the physical data, and increasing
insert performance by only indexing data that is frequently
accessed. We present database design algorithms that optimally
partition tables, drop indexes from partitions that are infrequently
queried, and maintain these partitions as workloads change. We
show a 60× performance improvement over traditionally indexed
tables using a real-world query workload derived from a traffic
monitoring application
Description
URL to paper listed on conference site
Date issued
2011-04Department
Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory; Massachusetts Institute of Technology. Department of Electrical Engineering and Computer ScienceJournal
Proceedings of the International Conference on Data Engineering, ICDE 2011
Publisher
International Conference on Data Engineering
Citation
Wu, Eugene and Samuel Madden. "Partitioning Techniques for Fine-grained Indexing." International Conference on Data Engineering, ICDE 2011, Hannover, April 11-16, 2011.
Version: Author's final manuscript