| dc.contributor.author | Zhou, Xinjing | |
| dc.contributor.author | Hao, Xiangpeng | |
| dc.contributor.author | Yu, Xiangyao | |
| dc.contributor.author | Stonebraker, Michael | |
| dc.date.accessioned | 2025-08-11T19:50:51Z | |
| dc.date.available | 2025-08-11T19:50:51Z | |
| dc.date.issued | 2025-05-24 | |
| dc.identifier.uri | https://hdl.handle.net/1721.1/162344 | |
| dc.description.abstract | Real-world DBMS workloads invariably exhibit skewed access patterns, where a small number of "hot" records are accessed much more frequently than the remaining "cold" records. Page-oriented data structures, such as B+trees, dynamic hash tables, heap files, and LSM-tree, are sub-optimal in terms of memory utilization under skewed access conditions. Hot records might be co-located with cold ones on pages in the data structure. Caching those lukewarm pages in the buffer pool lowers memory utilization due to the mismatch of caching granularity (page) and access granularity (record), leading to sub-optimal performance. Recently, the 2-Tree approach was proposed to improve caching efficiency for B+trees using record-level migration. In this paper, we generalize the 2-Tree approach to Tiered-Indexing that can be applied to common buffer-managed data structures to efficiently handle skew using record migration. Using this architecture, we extend hash tables, heap files, and LSM-trees with I/O-efficient record migration. Moreover, we design a general mechanism to ensure data structure consistency for Tiered-Indexing data structures during record migration using optimistic lock coupling. Compared to traditional 1-Tier and state-of-the-art record-caching designs, we observe significant throughput and memory utilization improvement across B+tree, hash table, heap file, and LSM-tree under skewed workloads. | en_US |
| dc.publisher | Springer Berlin Heidelberg | en_US |
| dc.relation.isversionof | https://doi.org/10.1007/s00778-025-00928-6 | en_US |
| dc.rights | Creative Commons Attribution | en_US |
| dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | en_US |
| dc.source | Springer Berlin Heidelberg | en_US |
| dc.title | Tiered-Indexing: Optimizing Access Methods for Skew | en_US |
| dc.type | Article | en_US |
| dc.identifier.citation | Zhou, X., Hao, X., Yu, X. et al. Tiered-Indexing: Optimizing Access Methods for Skew. The VLDB Journal 34, 45 (2025). | en_US |
| dc.contributor.department | Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory | en_US |
| dc.relation.journal | The VLDB Journal | en_US |
| dc.identifier.mitlicense | PUBLISHER_CC | |
| dc.eprint.version | Final published version | en_US |
| dc.type.uri | http://purl.org/eprint/type/JournalArticle | en_US |
| eprint.status | http://purl.org/eprint/status/PeerReviewed | en_US |
| dc.date.updated | 2025-07-20T03:14:56Z | |
| dc.language.rfc3066 | en | |
| dc.rights.holder | The Author(s) | |
| dspace.embargo.terms | N | |
| dspace.date.submission | 2025-07-20T03:14:56Z | |
| mit.journal.volume | 34 | en_US |
| mit.license | PUBLISHER_CC | |
| mit.metadata.status | Authority Work and Publication Information Needed | en_US |