| dc.contributor.author | Tatarowicz, Aubrey L. | |
| dc.contributor.author | Curino, Carlo | |
| dc.contributor.author | Jones, Evan P.C. | |
| dc.contributor.author | Madden, Sam | |
| dc.date.accessioned | 2021-11-08T18:29:54Z | |
| dc.date.available | 2021-11-08T18:29:54Z | |
| dc.date.issued | 2012-04 | |
| dc.identifier.uri | https://hdl.handle.net/1721.1/137764 | |
| dc.description.abstract | The standard way to get linear scaling in a distributed OLTP DBMS is to horizontally partition data across several nodes. Ideally, this partitioning will result in each query being executed at just one node, to avoid the overheads of distributed transactions and allow nodes to be added without increasing the amount of required coordination. For some applications, simple strategies, such as hashing on primary key, provide this property. Unfortunately, for many applications, including social networking and order-fulfillment, many-to-many relationships cause simple strategies to result in a large fraction of distributed queries. Instead, what is needed is a fine-grained partitioning, where related individual tuples (e.g., cliques of friends) are co-located in the same partition. Maintaining such a fine-grained partitioning requires the database to store a large amount of metadata about which partition each tuple resides in. We call such metadata a lookup table, and present the design of a data distribution layer that efficiently stores these tables and maintains them in the presence of inserts, deletes, and updates. We show that such tables can provide scalability for several difficult-to-partition database workloads, including Wikipedia, Twitter, and TPC-E. Our implementation provides 40% to 300% better performance on these workloads than either simple range or hash partitioning and shows greater potential for further scale-out. © 2012 IEEE. | en_US |
| dc.language.iso | en | |
| dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) | en_US |
| dc.relation.isversionof | 10.1109/icde.2012.26 | en_US |
| dc.rights | Creative Commons Attribution-Noncommercial-Share Alike | en_US |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | en_US |
| dc.source | Other repository | en_US |
| dc.title | Lookup Tables: Fine-Grained Partitioning for Distributed Databases | en_US |
| dc.type | Article | en_US |
| dc.identifier.citation | Tatarowicz, Aubrey L., Curino, Carlo, Jones, Evan P.C. and Madden, Sam. 2012. "Lookup Tables: Fine-Grained Partitioning for Distributed Databases." | |
| dc.contributor.department | Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory | |
| dc.eprint.version | Author's final manuscript | en_US |
| dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
| eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
| dc.date.updated | 2019-06-18T13:07:02Z | |
| dspace.date.submission | 2019-06-18T13:07:03Z | |
| mit.metadata.status | Authority Work and Publication Information Needed | en_US |