Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2463676.2465329acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

DeltaNI: an efficient labeling scheme for versioned hierarchical data

Published: 22 June 2013 Publication History

Abstract

Main-memory database systems are emerging as the new backbone of business applications. Besides flat relational data representations also hierarchical ones are essential for these modern applications; therefore we devise a new indexing and versioning approach for hierarchies that is deeply integrated into the relational kernel.
We propose the DeltaNI index as a versioned pendant of the nested intervals (NI) labeling scheme. The index is space- and time-efficient and yields a gapless, fixed-size integer NI labeling for each version while also supporting branching histories. In contrast to a naive NI labeling, it facilitates even complex updates of the tree structure. As many query processing techniques that work on top of the NI labeling have already been proposed, our index can be used as a building block for processing various kinds of queries. We evaluate the performance of the index on large inputs consisting of millions of nodes and thousands of versions. Thereby we show that DeltaNI scales well and can deliver satisfying performance for large business scenarios.

References

[1]
S. Al-Khalifa, H. Jagadish, N. Koudas, J. Patel, D. Srivastava, and Y. Wu. Structural joins: A primitive for efficient XML query pattern matching. In ICDE, 2002.
[2]
B. Becker, S. Gschwind, T. Ohler, B. Seeger, and P. Widmayer. An asymptotically optimal multiversion B-tree. VLDB Journal, 5(4), 1996.
[3]
Boeing. 747 fun facts. http://www.boeing.com/commercial/747family/pf/pf_facts.html.
[4]
P. Boncz, S. Manegold, and J. Rittinger. Updating the pre/post plane in MonetDB/XQuery. In XIME-P, 2005.
[5]
N. Bruno, N. Koudas, and D. Srivastava. Holistic twig joins: optimal XML pattern matching. In SIGMOD, 2002.
[6]
P. Buneman, S. Khanna, K. Tajima, and W. Tan. Archiving scientific data. TODS, 29(1), 2004.
[7]
J. Celko. Trees & Hierarchy in SQL for Smarties. Morgan Kaufmann, 2004.
[8]
S. Chien, V. Tsotras, C. Zaniolo, and D. Zhang. Efficient complex query support for multiversion XML documents. EDBT, 2002.
[9]
S. Chien, V. Tsotras, C. Zaniolo, and D. Zhang. Supporting complex queries on multiversion XML documents. TOIT, 6(1), 2006.
[10]
S. Chien, V. J. Tsotras, and C. Zaniolo. XML document versioning. SIGMOD Record, 30(3), 2001.
[11]
G. Cobena, S. Abiteboul, and A. Marian. Detecting changes in XML documents. In ICDE, 2002.
[12]
E. Cohen, H. Kaplan, and T. Milo. Labeling dynamic XML trees. SIAM Journal on Computing, 39(5), 2010.
[13]
F. Farber, S. K. Cha, J. Primsch, C. Bornhovd, S. Sigg, and W. Lehner. SAP HANA database: data management for modern business applications. SIGMOD Record, 40(4), 2012.
[14]
T. Grust. Accelerating XPath location steps. In SIGMOD, 2002.
[15]
T. Grust, M. van Keulen, and J. Teubner. Staircase join: Teach a relational DBMS to watch its (axis) steps. In VLDB, 2003.
[16]
L. Jiang, B. Salzberg, D. Lomet, M. Barrena, et al. The BT-tree: A branched and temporal access method. In VLDB, 2000.
[17]
A. Kemper and T. Neumann. HyPer: A hybrid OLTP&OLAP main memory database system based on virtual memory snapshots. In ICDE, 2011.
[18]
S. Lanka and E. Mays. Fully persistent B+-trees. In SIGMOD, 1991.
[19]
Q. Li, B. Moon, et al. Indexing and querying XML data for regular path expressions. In VLDB, 2001.
[20]
D. Lomet and B. Salzberg. Access methods for multiversion data. In SIGMOD, 1989.
[21]
A. Marian, S. Abiteboul, G. Cobena, L. Mignet, et al. Change-centric management of versions in an XML warehouse. In VLDB, 2001.
[22]
A. Mendelzon, F. Rizzolo, and A. Vaisman. Indexing temporal XML documents. In VLDB, 2004.
[23]
P. O'Neil, E. O'Neil, S. Pal, I. Cseri, G. Schaller, and N. Westbury. Ordpaths: insert-friendly XML node labels. In SIGMOD, 2004.
[24]
F. Rizzolo and A. Vaisman. Temporal XML: modeling, indexing, and query processing. VLDB Journal, 17(5), 2008.
[25]
L. Rosado, A. Marquez, and J. Gonzalez. Representing versions in XML documents using versionstamp. ECDM, 2006.
[26]
L. Rusu, W. Rahayu, and D. Taniar. Maintaining versions of dynamic XML documents. WISE, 2005.
[27]
L. Rusu, W. Rahayu, and D. Taniar. Storage techniques for multi-versioned XML documents. In DASFAA, 2008.
[28]
V. Sans and D. Laurent. Prefix based numbering schemes for XML: techniques, applications and performances. PVLDB, 1(2), 2008.
[29]
A. Silberstein, H. He, K. Yi, and J. Yang. BOXes: Efficient maintenance of order-based labeling for dynamic XML data. In ICDE, 2005.
[30]
I. Stoica, R. Morris, D. Liben-Nowell, D. Karger, M. Kaashoek, F. Dabek, and H. Balakrishnan. Chord: a scalable peer-to-peer lookup protocol for internet applications. TON, 11(1), 2003.
[31]
Z. Vagena, M. Moro, and V. Tsotras. Supporting branched versions on XML documents. In RIDE, 2004.
[32]
Z. Vagena and V. Tsotras. Path-expression queries over multiversion XML documents. In WebDB, 2003.
[33]
A. Woss and V. Tsotras. Experimental evaluation of query processing techniques over multiversion XML documents. In WebDB, 2009.
[34]
L. Xu, T. Ling, H. Wu, and Z. Bao. DDE: from Dewey to a fully dynamic XML labeling scheme. In SIGMOD, 2009.
[35]
Y. Zhang, X. Wang, and Y. Zhang. A labeling scheme for temporal XML. In WISM, 2009.

Cited By

View all
  • (2022)Robust and scalable content-and-structure indexingThe VLDB Journal10.1007/s00778-022-00764-y32:4(689-715)Online publication date: 15-Oct-2022
  • (2021)Dynamic interleaving of content and structure for robust indexing of semi-structured hierarchical dataProceedings of the VLDB Endowment10.14778/3401960.340196313:10(1641-1653)Online publication date: 10-Mar-2021
  • (2019)A Scalable Index for Top-k Subtree Similarity QueriesProceedings of the 2019 International Conference on Management of Data10.1145/3299869.3319892(1624-1641)Online publication date: 25-Jun-2019
  • Show More Cited By

Index Terms

  1. DeltaNI: an efficient labeling scheme for versioned hierarchical data

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SIGMOD '13: Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
    June 2013
    1322 pages
    ISBN:9781450320375
    DOI:10.1145/2463676
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 22 June 2013

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. database indexing
    2. hierarchical data
    3. hierarchy indexing
    4. labeling schemes
    5. multiversion indexing
    6. nested intervals

    Qualifiers

    • Research-article

    Conference

    SIGMOD/PODS'13
    Sponsor:

    Acceptance Rates

    SIGMOD '13 Paper Acceptance Rate 76 of 372 submissions, 20%;
    Overall Acceptance Rate 785 of 4,003 submissions, 20%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)6
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 12 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2022)Robust and scalable content-and-structure indexingThe VLDB Journal10.1007/s00778-022-00764-y32:4(689-715)Online publication date: 15-Oct-2022
    • (2021)Dynamic interleaving of content and structure for robust indexing of semi-structured hierarchical dataProceedings of the VLDB Endowment10.14778/3401960.340196313:10(1641-1653)Online publication date: 10-Mar-2021
    • (2019)A Scalable Index for Top-k Subtree Similarity QueriesProceedings of the 2019 International Conference on Management of Data10.1145/3299869.3319892(1624-1641)Online publication date: 25-Jun-2019
    • (2017)Structural XML Query ProcessingACM Computing Surveys10.1145/309579850:5(1-41)Online publication date: 26-Sep-2017
    • (2017)Order IndexesThe VLDB Journal — The International Journal on Very Large Data Bases10.1007/s00778-016-0436-326:1(55-80)Online publication date: 1-Feb-2017
    • (2015)Indexing highly dynamic hierarchical dataProceedings of the VLDB Endowment10.14778/2794367.27943698:10(986-997)Online publication date: 1-Jun-2015
    • (2015)Supporting hierarchical data in SAP HANA2015 IEEE 31st International Conference on Data Engineering10.1109/ICDE.2015.7113376(1280-1291)Online publication date: Apr-2015
    • (2015)Towards a web-scale data management ecosystem demonstrated by SAP HANA2015 IEEE 31st International Conference on Data Engineering10.1109/ICDE.2015.7113374(1259-1267)Online publication date: Apr-2015
    • (2014)The Database Group at TUMACM SIGMOD Record10.1145/2694428.269443943:3(55-60)Online publication date: 4-Dec-2014

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media