Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2396761.2396806acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

G-SPARQL: a hybrid engine for querying large attributed graphs

Published: 29 October 2012 Publication History

Abstract

We propose a SPARQL-like language, G-SPARQL, for querying attributed graphs. The language expresses types of queries which of large interest for applications which model their data as large graphs such as: pattern matching, reachability and shortest path queries. Each query can combine both of structural predicates and value-based predicates (on the attributes of the graph nodes and edges). We describe an algebraic compilation mechanism for our proposed query language which is extended from the relational algebra and based on the basic construct of building SPARQL queries, the Triple Pattern. We describe a hybrid Memory/Disk representation of large attributed graphs where only the topology of the graph is maintained in memory while the data of the graph is stored in a relational database. The execution engine of our proposed query language splits parts of the query plan to be pushed inside the relational database while the execution of other parts of the query plan are processed using memory-based algorithms, as necessary. Experimental results on real datasets demonstrate the efficiency and the scalability of our approach and show that our approach outperforms native graph databases by several factors.

References

[1]
J. Cheng and J. Yu. On-line exact shortest distance query processing. In EDBT, 2009.
[2]
T. Cormen, R. Rivest, C. Leiserson, and C. Stein. Introduction to Algorithms. The MIT Press, third edition, 2009.
[3]
D. Abadi et al. Scalable Semantic Web Data Management Using Vertical Partitioning. In VLDB, 2007.
[4]
E. Cohen et al. Reachability and Distance Queries via 2-Hop Labels. SIAM J. Comput., 32(5), 2003.
[5]
H. Tong et al. Fast best-effort pattern matching in large attributed graphs. In KDD, 2007.
[6]
L. Zou et al. Answering pattern match queries in large graph databases via graph embedding. VLDB J., 32(5), 2011.
[7]
L. Zou et al. gStore: Answering SPARQL Queries via Subgraph Matching. PVLDB, 4(8), 2011.
[8]
R. Jin et al. 3-HOP: a high-compression indexing scheme for reachability query. In SIGMOD, 2009.
[9]
T. Grust, M. Mayr, J. Rittinger, S. Sakr, and J. Teubner. A SQL: 1999 code generator for the pathfinder XQuery compiler. In SIGMOD, 2007.
[10]
T. Grust, M. Mayr, J. Rittinger, and T. Schreiber. FERRY: database-supported program execution. In SIGMOD, 2009.
[11]
T. Grust, S. Sakr, and J. Teubner. XQuery on SQL Hosts. In VLDB, 2004.
[12]
T. Neumann and G. Weikum. RDF-3X: a RISC-style engine for RDF. PVLDB, 1(1), 2008.
[13]
J. Pérez, M. Arenas, and C. Gutierrez. Semantics and complexity of SPARQL. TODS, 34(3), 2009.
[14]
E. Prud'hommeaux and A. Seaborne. SPARQL Query Language for RDF, W3C Recommendation.
[15]
S. Sakr. GraphREL: A Decomposition-Based and Selectivity-Aware Relational Framework for Processing Sub-graph Queries. In DASFAA, 2009.
[16]
S. Sakr, S. Elnikety, and Y. He. G-SPARQL: A Hybrid Query Engine for Querying Large Attributed Graphs. Technical Report MSR-TR-2011--138, Microsoft Research. http://research.microsoft.com/apps/pubs/?id=157417.
[17]
M. Sarwat, S. Elnikety, Y. He, and G. Kliot. Horton: Online Query Execution Engine For Large Distributed Graphs . In ICDE, 2011.
[18]
F. Wei. TEDI: efficient shortest path query answering on graphs. In SIGMOD, 2010.
[19]
X. Yan, P. Yu, and J. Han. Graph indexing: a frequent structure-based approach. In SIGMOD, 2004.
[20]
S. Zhang, M. Hu, and J. Yang. TreePi: A Novel Graph Indexing Method. In ICDE, 2007.
[21]
P. Zhao and J. Han. On Graph Query Optimization in Large Networks. PVLDB, 3(1), 2010.

Cited By

View all

Index Terms

  1. G-SPARQL: a hybrid engine for querying large attributed graphs

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CIKM '12: Proceedings of the 21st ACM international conference on Information and knowledge management
    October 2012
    2840 pages
    ISBN:9781450311564
    DOI:10.1145/2396761
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 29 October 2012

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. graphs
    2. query
    3. sparql

    Qualifiers

    • Research-article

    Conference

    CIKM'12
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

    Upcoming Conference

    CIKM '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)9
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 21 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Knowledge Graphs QueryingACM SIGMOD Record10.1145/3615952.361595652:2(18-29)Online publication date: 11-Aug-2023
    • (2022)Finding Multidimensional Constraint Reachable Paths for Attributed GraphsICST Transactions on Scalable Information Systems10.4108/eetsis.v9i4.2581(e2)Online publication date: 22-Aug-2022
    • (2022)BanyanProceedings of the VLDB Endowment10.14778/3547305.354731115:10(2045-2057)Online publication date: 7-Sep-2022
    • (2020)Enhancing recursive graph querying on RDBMS with data clustering approachesProceedings of the 35th Annual ACM Symposium on Applied Computing10.1145/3341105.3375770(404-411)Online publication date: 30-Mar-2020
    • (2020)A GraphBLAS solution to the SIGMOD 2014 Programming Contest using multi-source BFS2020 IEEE High Performance Extreme Computing Conference (HPEC)10.1109/HPEC43674.2020.9286186(1-7)Online publication date: 22-Sep-2020
    • (2018)Querying GraphsSynthesis Lectures on Data Management10.2200/S00873ED1V01Y201808DTM05110:3(1-184)Online publication date: Oct-2018
    • (2018)An early look at the LDBC social network benchmark's business intelligence workloadProceedings of the 1st ACM SIGMOD Joint International Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)10.1145/3210259.3210268(1-11)Online publication date: 10-Jun-2018
    • (2018)Using partial evaluation in holistic subgraph searchFrontiers of Computer Science: Selected Publications from Chinese Universities10.1007/s11704-016-5522-612:5(966-983)Online publication date: 1-Oct-2018
    • (2018)Applications of Flexible Querying to Graph DataGraph Data Management10.1007/978-3-319-96193-4_4(97-142)Online publication date: 1-Nov-2018
    • (2017)IVD: Automatic Learning and Enforcement of Authorization Rules in Online Social Networks2017 IEEE Symposium on Security and Privacy (SP)10.1109/SP.2017.33(1094-1109)Online publication date: May-2017
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media