Notice
This is not the latest version of this item. The latest version can be found at: https://dspace.mit.edu/handle/1721.1/132293.2
Benchmarking learned indexes
Author(s)
Marcus, R.; Stoian, M.; Kipf, A.; Misra, S.; van Renen, A.; Kemper, A.; Neumann, T.; Kraska, T.
Terms of use
Publisher with Creative Commons License: Creative Commons Attribution
Abstract
Recent advancements in learned index structures propose replacing existing index structures, like B-Trees, with approximate learned models. In this work, we present a unified benchmark that compares well-tuned implementations of three learned index structures against several state-of-the-art "traditional" baselines. Using four real-world datasets, we demonstrate that learned index structures can indeed outperform non-learned indexes in read-only in-memory workloads over a dense array. We investigate the impact of caching, pipelining, dataset size, and key size. We study the performance profile of learned index structures and build an explanation for why learned models achieve such good performance. Finally, we investigate other important properties of learned index structures, such as their performance in multi-threaded systems and their build times.
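The core mechanism being benchmarked is simple enough to sketch. Below is a minimal, hypothetical C++ illustration of the idea, assuming a sorted, non-empty, dense in-memory array of 64-bit keys: a linear model guesses a key's position, and a bounded binary search over the model's recorded worst-case error window finishes the lookup. The class name, the two-point line fit, and all member names are illustrative stand-ins, not the RMI, RadixSpline, or PGM implementations the paper actually evaluates.

#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical single-segment learned index over a sorted, non-empty,
// dense key array. A linear model predicts a key's position; a binary
// search over the model's worst-case error window corrects the guess.
class LinearLearnedIndex {
 public:
  explicit LinearLearnedIndex(std::vector<uint64_t> keys)
      : keys_(std::move(keys)) {
    // Fit a line through the first and last (key, position) pairs.
    // Real learned indexes minimize prediction error instead.
    const double range = static_cast<double>(keys_.back() - keys_.front());
    slope_ = range > 0 ? static_cast<double>(keys_.size() - 1) / range : 0.0;
    // Record the worst prediction error over the stored keys, so
    // lookups know how far the model can be off.
    for (std::size_t i = 0; i < keys_.size(); ++i) {
      const std::size_t p = Predict(keys_[i]);
      max_error_ = std::max(max_error_, p > i ? p - i : i - p);
    }
  }

  // Returns the position of the first element >= key (lower bound).
  std::size_t Lookup(uint64_t key) const {
    const std::size_t p = Predict(key);
    // Widen the error window by one slot on each side to cover query
    // keys that fall between stored keys.
    const std::size_t lo = p > max_error_ ? p - max_error_ - 1 : 0;
    const std::size_t hi = std::min(p + max_error_ + 2, keys_.size());
    auto it = std::lower_bound(keys_.begin() + lo, keys_.begin() + hi, key);
    return static_cast<std::size_t>(it - keys_.begin());
  }

 private:
  std::size_t Predict(uint64_t key) const {
    if (key <= keys_.front()) return 0;  // avoid unsigned underflow below
    double pos = slope_ * static_cast<double>(key - keys_.front());
    pos = std::min(std::max(pos, 0.0),
                   static_cast<double>(keys_.size() - 1));
    return static_cast<std::size_t>(pos);
  }

  std::vector<uint64_t> keys_;
  double slope_ = 0.0;
  std::size_t max_error_ = 0;
};

Built over a large sorted key array, Lookup costs a multiplication plus a search over a window that is typically far smaller than the full array, instead of a B-Tree's pointer-chasing descent; that trade-off is what the benchmark quantifies across its four real-world datasets.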
Journal
Proceedings of the VLDB Endowment
Publisher
VLDB Endowment