Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3210259.3210268acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
short-paper

An early look at the LDBC social network benchmark's business intelligence workload

Published: 10 June 2018 Publication History

Abstract

In this short paper, we provide an early look at the LDBC Social Network Benchmark's Business Intelligence (BI) workload which tests graph data management systems on a graph business analytics workload. Its queries involve complex aggregations and navigations (joins) that touch large data volumes, which is typical in BI workloads, yet they depend heavily on graph functionality such as connectivity tests and path finding. We outline the motivation for this new benchmark, which we derived from many interactions with the graph database industry and its users, and situate it in a scenario of social network analysis. The workload was designed by taking into account technical "chokepoints" identified by database system architects from academia and industry, which we also describe and map to the queries. We present reference implementations in openCypher, PGQL, SPARQL, and SQL, and preliminary results of SNB BI on a number of graph data management systems.

References

[1]
Günes Aluç, Olaf Hartig, M. Tamer Özsu, and Khuzaima Daudjee. 2014. Diversified Stress Testing of RDF Data Management Systems. In ISWC. 197--212.
[2]
Renzo Angles and others. 2018. G-CORE: A Core for Future Graph Query Languages. In SIGMOD.
[3]
Renzo Angles, Marcelo Arenas, Pablo Barceló, Aidan Hogan, Juan Reutter, and Domagoj Vrgoč. 2017. Foundations of Modern Query Languages for Graph Databases. ACM Comput. Surv. 50, 5, Article 68 (Sept. 2017), 40 pages.
[4]
Timothy G. Armstrong, Vamsi Ponnekanti, Dhruba Borthakur, and Mark Callaghan. 2013. LinkBench: a database benchmark based on the Facebook social graph. In SIGMOD. 1185--1196.
[5]
David A. Bader and Kamesh Madduri. 2005. Design and Implementation of the HPCS Graph Analysis Benchmark on Symmetric Multiprocessors. In HiPC. 465--476.
[6]
Sumita Barahmand and Shahram Ghandeharizadeh. 2013. BG: A Benchmark to Evaluate Interactive Social Networking Actions. In CIDR. http://cidrdb.org/cidr2013/Papers/CIDR13_Paper93.pdf
[7]
Scott Beamer, Krste Asanovic, and David A. Patterson. 2013. Direction-optimizing breadth-first search. Scientific Programming 21, 3-4 (2013), 137--148.
[8]
Christian Bizer and Andreas Schultz. 2009. The Berlin SPARQL Benchmark. Int. J. Semantic Web Inf. Syst. 5, 2 (2009), 1--24.
[9]
Peter A. Boncz, Thomas Neumann, and Orri Erling. 2013. TPC-H Analyzed: Hidden Messages and Lessons Learned from an Influential Benchmark. In TPCTC. 61--76.
[10]
Donko Donjerkovic and Raghu Ramakrishnan. 1999. Probabilistic Optimization of Top N Queries. In VLDB. 411--422. http://www.vldb.org/conf/1999/P40.pdf
[11]
Benedikt Elser and Alberto Montresor. 2013. An evaluation study of BigData frameworks for graph processing. In Big Data. 60--67.
[12]
Orri Erling and others. 2015. The LDBC Social Network Benchmark: Interactive Workload. In SIGMOD. 619--630.
[13]
Philip J. Fleming and John J. Wallace. 1986. How Not To Lie With Statistics: The Correct Way To Summarize Benchmark Results. Commun. ACM 29, 3 (1986), 218--221.
[14]
Nadime Francis and others. 2018. Cypher: An Evolving Query Language for Property Graphs. In SIGMOD.
[15]
Jim Gray and others. 1997. Data Cube: A Relational Aggregation Operator Generalizing Group-by, Cross-Tab, and Sub Totals. Data Min. Knowl. Discov. 1, 1 (1997), 29--53.
[16]
Andrey Gubichev and Peter A. Boncz. 2014. Parameter Curation for Benchmark Queries. In TPCTC (Lecture Notes in Computer Science), Vol. 8904. Springer, 113--129.
[17]
Yuanbo Guo, Zhengxiang Pan, and Jeff Heflin. 2005. LUBM: A benchmark for OWL knowledge base systems. J. Web Sem. 3, 2-3 (2005), 158--182.
[18]
Annegret Habel, Reiko Heckel, and Gabriele Taentzer. 1996. Graph Grammars with Negative Application Conditions. Fundam. Inform. 26, 3/4 (1996), 287--313.
[19]
Alexandru Iosup and others. 2016. LDBC Graphalytics: A Benchmark for Large-Scale Graph Analysis on Parallel and Distributed Platforms. PVLDB 9, 13 (2016), 1317--1328.
[20]
Norbert Martínez-Bazan, Sergio Gómez-Villamor, and Francesc Escale-Claveras. 2011. DEX: A high-performance graph database management system. In 2nd International Workshop on Graph Data Management: Techniques and Applications (GDM) at ICDE. 124--127.
[21]
Bruce Momjian. 2000. PostgreSQL: Introduction and Concepts. Addison-Wesley.
[22]
Mohamed Morsey, Jens Lehmann, Sören Auer, and Axel-Cyrille Ngonga Ngomo. 2011. DBpedia SPARQL Benchmark - Performance Assessment with Real Queries on Real Data. In ISWC. 454--469.
[23]
Lifeng Nai, Yinglong Xia, Ilie Gabriel Tanase, Hyesoon Kim, and Ching-Yung Lin. 2015. GraphBIG: understanding graph computing in the context of industrial solutions. In SC. 69:1--69:12.
[24]
Raghunath Othayoth Nambiar and Meikel Pöss. 2006. The Making of TPC-DS. In VLDB. 1049--1058. http://dl.acm.org/citation.cfm?id=1164217
[25]
Thomas Neumann and Guido Moerkotte. 2009. A Framework for Reasoning about Share Equivalence and Its Integration into a Plan Generator. In BTW. 7--26. http://subs.emis.de/LNI/Proceedings/Proceedings144/article5220.html
[26]
Jorge Pérez and others. 2009. Semantics and complexity of SPARQL. ACM Trans. Database Syst. 34, 3 (2009).
[27]
Meikel Pöss and Chris Floyd. 2000. New TPC Benchmarks for Decision Support and Web Commerce. SIGMOD Record 29, 4 (2000), 64--71.
[28]
Arnau Prat-Pérez and David Domínguez-Sal. 2014. How community-like is the structure of synthetically generated graphs?. In GRADES at SIGMOD. 7:1--7:9.
[29]
Sherif Sakr, Sameh Elnikety, and Yuxiong He. 2012. G-SPARQL: a hybrid engine for querying large attributed graphs. In CIKM. 335--344.
[30]
Michael Schmidt, Thomas Hornung, Michael Meier, Christoph Pinkel, and Georg Lausen. 2009. SP2Bench: A SPARQL Performance Benchmark. In Semantic Web Information Management - A Model-Based Perspective. 371--393.
[31]
Bin Shao, Yatao Li, Haixun Wang, and Huanhuan Xia. 2017. Trinity Graph Engine and its Applications. IEEE Data Eng. Bull. 40, 3 (2017), 18--29. http://sites.computer.org/debull/A17sept/p18.pdf
[32]
Gábor Szárnyas, Benedek Izsó, István Ráth, and Dániel Varró. 2017. The Train Benchmark: cross-technology performance evaluation of continuous model queries. Softw. Syst. Model. (2017).
[33]
Oskar van Rest, Sungpack Hong, Jinha Kim, Xuming Meng, and Hassan Chafi. 2016. PGQL: a property graph query language. In GRADES at SIGMOD.
[34]
Hilmi Yildirim, Vineet Chaoji, and Mohammed J. Zaki. 2012. GRAIL: a scalable index for reachability queries in very large graphs. VLDB J. 21, 4 (2012), 509--534.

Cited By

View all
  • (2024)The Linked Data Benchmark Council (LDBC): Driving Competition and Collaboration in the Graph Data Management SpacePerformance Evaluation and Benchmarking10.1007/978-3-031-68031-1_7(90-106)Online publication date: 22-Sep-2024
  • (2023)Distributed Asynchronous Regular Path Queries (RPQs) on GraphsProceedings of the 24th International Middleware Conference: Industrial Track10.1145/3626562.3626833(35-41)Online publication date: 11-Dec-2023
  • (2023)Knowledge Graphs QueryingACM SIGMOD Record10.1145/3615952.361595652:2(18-29)Online publication date: 11-Aug-2023
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
GRADES-NDA '18: Proceedings of the 1st ACM SIGMOD Joint International Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)
June 2018
94 pages
ISBN:9781450356954
DOI:10.1145/3210259
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 10 June 2018

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Short-paper

Funding Sources

Conference

SIGMOD/PODS '18
Sponsor:

Acceptance Rates

GRADES-NDA '18 Paper Acceptance Rate 10 of 26 submissions, 38%;
Overall Acceptance Rate 29 of 61 submissions, 48%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)16
  • Downloads (Last 6 weeks)3
Reflects downloads up to 21 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)The Linked Data Benchmark Council (LDBC): Driving Competition and Collaboration in the Graph Data Management SpacePerformance Evaluation and Benchmarking10.1007/978-3-031-68031-1_7(90-106)Online publication date: 22-Sep-2024
  • (2023)Distributed Asynchronous Regular Path Queries (RPQs) on GraphsProceedings of the 24th International Middleware Conference: Industrial Track10.1145/3626562.3626833(35-41)Online publication date: 11-Dec-2023
  • (2023)Knowledge Graphs QueryingACM SIGMOD Record10.1145/3615952.361595652:2(18-29)Online publication date: 11-Aug-2023
  • (2023)Demystifying Graph Databases: Analysis and Taxonomy of Data Organization, System Designs, and Graph QueriesACM Computing Surveys10.1145/360493256:2(1-40)Online publication date: 15-Sep-2023
  • (2023)Better Distributed Graph Query Planning With Scouting QueriesProceedings of the 6th Joint Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)10.1145/3594778.3594884(1-9)Online publication date: 18-Jun-2023
  • (2023)Microarchitectural Analysis of Graph BI Queries on RDBMSProceedings of the 19th International Workshop on Data Management on New Hardware10.1145/3592980.3595321(102-106)Online publication date: 18-Jun-2023
  • (2023)The Graph Database Interface: Scaling Online Transactional and Analytical Graph Workloads to Hundreds of Thousands of CoresProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis10.1145/3581784.3607068(1-18)Online publication date: 12-Nov-2023
  • (2022)A Contemporary Review on Utilizing Semantic Web Technologies in Healthcare, Virtual Communities, and Ontology-Based Information Processing SystemsElectronics10.3390/electronics1103045311:3(453)Online publication date: 3-Feb-2022
  • (2022)The LDBC Social Network BenchmarkProceedings of the VLDB Endowment10.14778/3574245.357427016:4(877-890)Online publication date: 1-Dec-2022
  • (2022)μ-Bench: Real-world Micro Benchmarking for SPARQL Query Processing over Knowledge GraphsProceedings of the 11th International Joint Conference on Knowledge Graphs10.1145/3579051.3579054(39-47)Online publication date: 27-Oct-2022
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media