Show simple item record

dc.contributor.authorShanbhag, Anil Atmanand
dc.contributor.authorJindal, Alekh
dc.contributor.authorLu, Yi
dc.contributor.authorMadden, Samuel R
dc.date.accessioned2021-12-16T20:22:32Z
dc.date.available2021-11-05T15:33:27Z
dc.date.available2021-12-16T20:22:32Z
dc.date.issued2016
dc.identifier.urihttps://hdl.handle.net/1721.1/137530.2
dc.description.abstractData partitioning significantly improves the query performance in distributed database systems. A large number of techniques have been proposed to efficiently partition a dataset for a given query workload. However, many modern analytic applications involve ad-hoc or exploratory analysis where users do not have a representative query workload upfront. Furthermore, workloads change over time as businesses evolve or as analysts gain better understanding of their data. Static workload-based data partitioning techniques are therefore not suitable for such settings. In this paper, we describe the demonstration of Amoeba, a distributed storage system which uses adaptive multi-attribute data partitioning to efficiently support ad-hoc as well as recurring queries. Amoeba applies a robust partitioning algorithm such that ad-hoc queries on all attributes have similar performance gains. Thereafter, Amoeba adaptively repartitions the data based on the observed query sequence, i.e., the system improves over time. All along Amoeba offers both adaptivity (i.e., adjustments according to workload changes) as well as robustness (i.e., avoiding performance spikes due to workload changes). We propose to demonstrate Amoeba on scenarios from an internet-of- things startup that tracks user driving patterns. We invite the audience to interactively fire fast ad-hoc queries, observe multi-dimensional adaptivity, and play with a robust/reactive knob in Amoeba. The web front end displays the layout changes, runtime costs, and compares it to Spark with both default and workload-aware partitioning.en_US
dc.language.isoen
dc.relation.isversionof10.14778/3007263.3007311en_US
dc.rightsCreative Commons Attribution-NonCommercial-NoDerivs Licenseen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/en_US
dc.sourceVLDB Endowmenten_US
dc.titleAmoeba: A Shape changing Storage System for Big Dataen_US
dc.typeArticleen_US
dc.identifier.citationShanbhag, A, Jindal, A, Lu, Y and Madden, S. 2016. "Amoeba: A shape changing storage system for big data." Proceedings of the VLDB Endowment, 9 (13).en_US
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratoryen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.relation.journalProceedings of the VLDB Endowmenten_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dc.date.updated2021-01-29T18:10:49Z
dspace.orderedauthorsShanbhag, A; Jindal, A; Lu, Y; Madden, Sen_US
dspace.date.submission2021-01-29T18:10:56Z
mit.journal.volume9en_US
mit.journal.issue13en_US
mit.licensePUBLISHER_CC
mit.metadata.statusPublication Information Neededen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

VersionItemDateSummary

*Selected version