research-article

Adaptive Hybrid Indexes

Authors:

Alfons KemperAuthors Info & Claims

SIGMOD '22: Proceedings of the 2022 International Conference on Management of Data

Pages 1626 - 1639

https://doi.org/10.1145/3514221.3526121

Published: 11 June 2022 Publication History

Get Access

Abstract

While index structures are crucial components in high-performance query processing systems, they occupy a large fraction of the available memory. Recently-proposed compact indexes reduce this space overhead and thus speed up queries by allowing the database to keep larger working sets in memory. These compact indexes, however, are slower than performance-optimized in-memory indexes because they adopt encodings that trade performance for memory efficiency. Applying different encodings within a single index might allow optimizing both dimensions at the same time - however, it is not clear which encodings should be applied to which index parts at build-time.

To take advantage of multiple encodings in one index structure, we present a new framework forming the basis of workload-adaptive hybrid indexes which moves encoding decisions to run-time instead. By sampling incoming queries adaptively, it tracks accesses to index parts and keeps fine-grained statistics which are used for space- and performance-optimized encoding migrations. We evaluated our framework using B+-trees and tries, and examine the adaptation process and space/performance trade-off for real-world and synthetic workloads. For skewed workloads, our framework can reduce the space by up to 82% while retaining more than 90% of the original performance.

Supplementary Material

MP4 File (SIGMOD22-moddm280.mp4)

Database systems leverage index structures to improve query performance. However, indexes can also significantly increase a database's memory footprint, making them an ideal compression target. Nevertheless, there is a trade-off between space and performance as more compressed encodings often result in decreased query performance. We, therefore, propose a framework called Adaptive Hybrid Indexes (AHI) that allows optimizing indexes adaptively at run-time at a finer granularity by considering the workload properties. Instead of representing all index parts using one encoding, AHI can implement different encodings which are adaptively applied depending on the current workload. For example, frequently accessed parts could be stored in performance-optimized encodings, while rarely accessed parts could be represented in a more compressed fashion. Our experiments show that this approach can reduce index sizes significantly while retaining most of the performance under skewed workloads.

Download
152.01 MB

References

[1]

AWS EC2 Instances. https://aws.amazon.com/en/ec2/instance-types/high-memory [accessed 2021-03-01].

Abstract

Supplementary Material

References

Cited By

Index Terms

Recommendations

Hybrid Indexes for Spatial-Visual Search

Reducing the Storage Overhead of Main-Memory OLTP Databases with Hybrid Indexes

A Hybrid BitFunnel and Partitioned Elias-Fano Inverted Index

Comments

Information

Published In

Sponsors

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Funding Sources

Conference

Acceptance Rates

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Get Access

Login options

Full Access

View options

PDF

eReader

Figures

Other

Share

Share this Publication link

Share on social media

Affiliations