Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1645953.1646083acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Efficient joins with compressed bitmap indexes

Published: 02 November 2009 Publication History

Abstract

We present a new class of adaptive algorithms that use compressed bitmap indexes to speed up evaluation of the range join query in relational databases. We determine the best strategy to process a join query based on a fast sub-linear time computation of the join selectivity (the ratio of the number of tuples in the result to the total number of possible tuples). In addition, we use compressed bitmaps to represent the join output compactly: the space requirement for storing the tuples representing the join of two relations is asymptotically bounded by min(h; n.cb), where h is the number of tuple pairs in the result relation, n is the number of tuples in the smaller of the two relations, and cb is the cardinality of the larger column being joined. We present a theoretical analysis of our algorithms, as well as experimental results on large-scale synthetic and real data sets. Our implementations are efficient, and consistently outperform well-known approaches for a range of join selectivity factors. For instance, our count-only algorithm is up to three orders of magnitude faster than the sort-merge approach, and our best bitmap index-based algorithm is 1.2x-80x faster than the sort-merge algorithm, for various query instances. We achieve these speedups by exploiting several inherent performance advantages of compressed bitmap indexes for join processing: an implicit partitioning of the attributes, space-efficiency, and tolerance of high-cardinality relations.

References

[1]
G. Antoshenkov. Byte-aligned bitmap compression. U.S. Patent number 5,363,098, 1994.
[2]
P. Bizarro and H. Madeira. The Dimension-Join: A new index for data warehouses. In Proc. XVI SBBD, pages 259--273, Rio de Janeiro, Brazil, Oct 2001.
[3]
C.-Y. Chan and Y.E. Ioannidis. Bitmap index design and evaluation. SIGMOD Rec., 27(2):355--366, 1998.
[4]
C.-Y. Chan and Y.E. Ioannidis. An efficient bitmap encoding scheme for selection queries. SIGMOD Rec., 28(2):215--226, 1999.
[5]
D.J. DeWitt, J.F. Naughton, and D.A. Schneider. An evaluation of non-equijoin algorithms. In Proc. VLDB, pages 443--452, Barcelona, Spain, Sep 1991.
[6]
J. Gray et al. There goes the neighborhood: Relational algebra for spatial data search. Technical report, Microsoft Research, 2004. MSR-TR-2004-32.
[7]
T. Johnson. Performance measurements of compressed bitmap indices. In Proc. VLDB, pages 278--289, Edinburgh, Scotland, Sep 1999. Morgan Kaufmann.
[8]
N. Koudas. Space efficient bitmap indexing. In Proc. CIKM, pages 194--201, McLean, VA, Nov 2000. ACM.
[9]
N. Koudas and K.C. Sevcik. Size separation spatial join. SIGMOD Rec., 26(3):324--335, 1997.
[10]
Z. Li and K.A. Ross. PERF join: an alternative to two-way semijoin and bloomjoin. In Proc. CIKM, pages 137--144, Baltimore, MD, Nov--Dec 1995. ACM.
[11]
P. Mishra and M.H. Eich. Join processing in relational databases. ACM Comput. Surv., 24(1):63--113, 1992.
[12]
P. O'Neil and G. Graefe. Multi-table joins through bitmapped join indices. SIGMOD Rec., 24(3):8--11, 1995.
[13]
P. O'Neil and E. O'Neil. Database: principles, programming, and performance. Morgan Kaufmann, 2nd edition, 2000.
[14]
P. O'Neil and D. Quass. Improved query performance with variant indexes. SIGMOD Rec., 26(2):38--49, 1997.
[15]
V. Paxson. Bro: A system for detecting network intruders in real-time. Computer Networks, 31(23--24):2435--2463, 1999.
[16]
R. Power. Large catalogue query performance in relational databases. Publications of the Astronomical Society of Australia (PASA), 24(1):13--20, 2007.
[17]
T. Siqueira, R. Ciferri, V. Times, and C. Ciferri. Investigating the eff ects of spatial data redundancy in query performance over geographical data warehouses. In Proc. GEOINFO, Rio de Janeiro, Brazil, Dec 2008.
[18]
H.K.T. Wong, H.-F. Liu, F. Olken, D. Rotem, and L. Wong. Bit transposed files. In Proc. VLDB, pages 448--457, Stockholm, Sweden, Aug 1985.
[19]
K. Wu, E.J. Otoo, and A. Shoshani. On the performance of bitmap indices for high cardinality attributes. In Proc. VLDB, pages 24--35, Toronto, Canada, Sep 2004. Morgan Kaufmann.

Cited By

View all
  • (2021)Energy Efficiency vs. Performance of Analytical Queries: The case of Bitmap Join Indexes2021 IEEE International Conference on Big Data (Big Data)10.1109/BigData52589.2021.9671307(3066-3074)Online publication date: 15-Dec-2021
  • (2018)Particle swarm optimization for bitmap join indexes selection problem in data warehousesThe Journal of Supercomputing10.1007/s11227-013-1058-968:2(672-708)Online publication date: 31-Dec-2018
  • (2015)EMeD-PartProceedings of the International Conference on Intelligent Information Processing, Security and Advanced Communication10.1145/2816839.2816876(1-7)Online publication date: 23-Nov-2015
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
CIKM '09: Proceedings of the 18th ACM conference on Information and knowledge management
November 2009
2162 pages
ISBN:9781605585123
DOI:10.1145/1645953
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 02 November 2009

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. compressed bitmap index
  2. range join

Qualifiers

  • Research-article

Conference

CIKM '09
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)8
  • Downloads (Last 6 weeks)1
Reflects downloads up to 28 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2021)Energy Efficiency vs. Performance of Analytical Queries: The case of Bitmap Join Indexes2021 IEEE International Conference on Big Data (Big Data)10.1109/BigData52589.2021.9671307(3066-3074)Online publication date: 15-Dec-2021
  • (2018)Particle swarm optimization for bitmap join indexes selection problem in data warehousesThe Journal of Supercomputing10.1007/s11227-013-1058-968:2(672-708)Online publication date: 31-Dec-2018
  • (2015)EMeD-PartProceedings of the International Conference on Intelligent Information Processing, Security and Advanced Communication10.1145/2816839.2816876(1-7)Online publication date: 23-Nov-2015
  • (2013)Massive Parallel Join in NUMA ArchitectureProceedings of the 2013 IEEE International Congress on Big Data10.1109/BigData.Congress.2013.37(219-226)Online publication date: 27-Jun-2013

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media