Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleNovember 2023
Cost-based Data Prefetching and Scheduling in Big Data Platforms over Tiered Storage Systems
ACM Transactions on Database Systems (TODS), Volume 48, Issue 4Article No.: 11, Pages 1–40https://doi.org/10.1145/3625389The use of storage tiering is becoming popular in data-intensive compute clusters due to the recent advancements in storage technologies. The Hadoop Distributed File System, for example, now supports storing data in memory, SSDs, and HDDs, while OctopusFS ...
- research-articleAugust 2023
Enabling Timely and Persistent Deletion in LSM-Engines
ACM Transactions on Database Systems (TODS), Volume 48, Issue 3Article No.: 8, Pages 1–40https://doi.org/10.1145/3599724Data-intensive applications have fueled the evolution of log-structured merge (LSM) based key-value engines that employ the out-of-place paradigm to support high ingestion rates with low read/write interference. These benefits, however, come at the cost ...
- research-articleApril 2022
Height Optimized Tries
ACM Transactions on Database Systems (TODS), Volume 47, Issue 1Article No.: 3, Pages 1–46https://doi.org/10.1145/3506692We present the Height Optimized Trie (HOT), a fast and space-efficient in-memory index structure. The core algorithmic idea of HOT is to dynamically vary the number of bits considered at each node, which enables a consistently high fanout and thereby good ...
- research-articleDecember 2018
Optimal Bloom Filters and Adaptive Merging for LSM-Trees
ACM Transactions on Database Systems (TODS), Volume 43, Issue 4Article No.: 16, Pages 1–48https://doi.org/10.1145/3276980In this article, we show that key-value stores backed by a log-structured merge-tree (LSM-tree) exhibit an intrinsic tradeoff between lookup cost, update cost, and main memory footprint, yet all existing designs expose a suboptimal and difficult to tune ...
- research-articleMarch 2015
Time- and Space-Efficient Sliding Window Top-k Query Processing
ACM Transactions on Database Systems (TODS), Volume 40, Issue 1Article No.: 1, Pages 1–44https://doi.org/10.1145/2736701A sliding window top-k (top-k/w) query monitors incoming data stream objects within a sliding window of size w to identify the k highest-ranked objects with respect to a given scoring function over time. Processing of such queries is challenging because,...
-
- research-articleJanuary 2014
Deletion without rebalancing in multiway search trees
ACM Transactions on Database Systems (TODS), Volume 39, Issue 1Article No.: 8, Pages 1–14https://doi.org/10.1145/2540068Some database systems that use a form of B-tree for the underlying data structure do not do rebalancing on deletion. This means that a bad sequence of deletions can create a very unbalanced tree. Yet such databases perform well in practice. Avoidance of ...
- research-articleJanuary 2014
Indexing for summary queries: Theory and practice
ACM Transactions on Database Systems (TODS), Volume 39, Issue 1Article No.: 2, Pages 1–39https://doi.org/10.1145/2508702Database queries can be broadly classified into two categories: reporting queries and aggregation queries. The former retrieves a collection of records from the database that match the query's conditions, while the latter returns an aggregate, such as ...
- research-articleApril 2013
ElasTraS: An elastic, scalable, and self-managing transactional database for the cloud
ACM Transactions on Database Systems (TODS), Volume 38, Issue 1Article No.: 5, Pages 1–45https://doi.org/10.1145/2445583.2445588A database management system (DBMS) serving a cloud platform must handle large numbers of application databases (or tenants) that are characterized by diverse schemas, varying footprints, and unpredictable load patterns. Scaling out using clusters of ...
- research-articleDecember 2012
Finding Alternative Shortest Paths in Spatial Networks
ACM Transactions on Database Systems (TODS), Volume 37, Issue 4Article No.: 29, Pages 1–31https://doi.org/10.1145/2389241.2389248Shortest path query is one of the most fundamental queries in spatial network databases. There exist algorithms that can process shortest path queries in real time. However, many complex applications require more than just the calculation of a single ...
- research-articleMarch 2012
A survey of B-tree logging and recovery techniques
ACM Transactions on Database Systems (TODS), Volume 37, Issue 1Article No.: 1, Pages 1–35https://doi.org/10.1145/2109196.2109197B-trees have been ubiquitous in database management systems for several decades, and they serve in many other storage systems as well. Their basic structure and their basic operations are well understood including search, insertion, and deletion. ...
- research-articleJuly 2010
A survey of B-tree locking techniques
ACM Transactions on Database Systems (TODS), Volume 35, Issue 3Article No.: 16, Pages 1–26https://doi.org/10.1145/1806907.1806908B-trees have been ubiquitous in database management systems for several decades, and they are used in other storage systems as well. Their basic structure and basic operations are well and widely understood including search, insertion, and deletion. ...
- research-articleJuly 2009
ODE: Ontology-assisted data extraction
ACM Transactions on Database Systems (TODS), Volume 34, Issue 2Article No.: 12, Pages 1–35https://doi.org/10.1145/1538909.1538914Online databases respond to a user query with result records encoded in HTML files. Data extraction, which is important for many applications, extracts the records from the HTML files automatically. We present a novel data extraction method, ODE (...
- research-articleSeptember 2008
Efficient online index construction for text databases
ACM Transactions on Database Systems (TODS), Volume 33, Issue 3Article No.: 19, Pages 1–33https://doi.org/10.1145/1386118.1386125Inverted index structures are a core element of current text retrieval systems. They can be constructed quickly using offline approaches, in which one or more passes are made over a static set of input data, and, at the completion of the process, an ...
- articleNovember 2007
An adaptive packed-memory array
ACM Transactions on Database Systems (TODS), Volume 32, Issue 4Pages 26–eshttps://doi.org/10.1145/1292609.1292616The packed-memory array (PMA) is a data structure that maintains a dynamic set of N elements in sorted order in a Θ(N)-sized array. The idea is to intersperse Θ(N) empty spaces or gaps among the elements so that only a small number of elements need to ...
- articleAugust 2007
Modeling and managing changes in text databases
ACM Transactions on Database Systems (TODS), Volume 32, Issue 3Pages 14–eshttps://doi.org/10.1145/1272743.1272744Large amounts of (often valuable) information are stored in web-accessible text databases. “Metasearchers” provide unified interfaces to query multiple such databases at once. For efficiency, metasearchers rely on succinct statistical summaries of the ...
- articleJune 2006
Maintenance of K-nn and spatial join queries on continuously moving points
ACM Transactions on Database Systems (TODS), Volume 31, Issue 2Pages 485–536https://doi.org/10.1145/1138394.1138396Cars, aircraft, mobile cell phones, ships, tanks, and mobile robots all have the common property that they are moving objects. A kinematic representation can be used to describe the location of these objects as a function of time. For example, a moving ...
- articleSeptember 2005
LH*RS---a highly-available scalable distributed data structure
ACM Transactions on Database Systems (TODS), Volume 30, Issue 3Pages 769–811https://doi.org/10.1145/1093382.1093386LH*RS is a high-availability scalable distributed data structure (SDDS). An LH*RS file is hash partitioned over the distributed RAM of a multicomputer, for example, a network of PCs, and supports the unavailability of any k ≥ 1 of its server nodes. The ...
- articleDecember 2004
Decoupling partitioning and grouping: Overcoming shortcomings of spatial indexing with bucketing
ACM Transactions on Database Systems (TODS), Volume 29, Issue 4Pages 789–830https://doi.org/10.1145/1042046.1042052The principle of decoupling the partitioning and grouping processes that form the basis of most spatial indexing methods that use tree directories of buckets is explored. The decoupling is designed to overcome the following drawbacks of traditional ...
- articleJune 2004
Proxy-based acceleration of dynamically generated content on the world wide web: An approach and implementation
ACM Transactions on Database Systems (TODS), Volume 29, Issue 2Pages 403–443https://doi.org/10.1145/1005566.1005571As Internet traffic continues to grow and websites become increasingly complex, performance and scalability are major issues for websites. Websites are increasingly relying on dynamic content generation applications to provide website visitors with ...