short-paper

An early look at the LDBC social network benchmark's business intelligence workload

Authors:

Gábor Szárnyas,

Arnau Prat-Pérez,

József Marton,

Marcus Paradies,

Moritz Kaufmann,

János Benjamin AntalAuthors Info & Claims

GRADES-NDA '18: Proceedings of the 1st ACM SIGMOD Joint International Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)

Article No.: 9, Pages 1 - 11

https://doi.org/10.1145/3210259.3210268

Published: 10 June 2018 Publication History

Abstract

In this short paper, we provide an early look at the LDBC Social Network Benchmark's Business Intelligence (BI) workload which tests graph data management systems on a graph business analytics workload. Its queries involve complex aggregations and navigations (joins) that touch large data volumes, which is typical in BI workloads, yet they depend heavily on graph functionality such as connectivity tests and path finding. We outline the motivation for this new benchmark, which we derived from many interactions with the graph database industry and its users, and situate it in a scenario of social network analysis. The workload was designed by taking into account technical "chokepoints" identified by database system architects from academia and industry, which we also describe and map to the queries. We present reference implementations in openCypher, PGQL, SPARQL, and SQL, and preliminary results of SNB BI on a number of graph data management systems.

References

[1]

Günes Aluç, Olaf Hartig, M. Tamer Özsu, and Khuzaima Daudjee. 2014. Diversified Stress Testing of RDF Data Management Systems. In ISWC. 197--212.

Digital Library

[2]

Renzo Angles and others. 2018. G-CORE: A Core for Future Graph Query Languages. In SIGMOD.

Digital Library

[3]

Renzo Angles, Marcelo Arenas, Pablo Barceló, Aidan Hogan, Juan Reutter, and Domagoj Vrgoč. 2017. Foundations of Modern Query Languages for Graph Databases. ACM Comput. Surv. 50, 5, Article 68 (Sept. 2017), 40 pages.

Digital Library

[4]

Timothy G. Armstrong, Vamsi Ponnekanti, Dhruba Borthakur, and Mark Callaghan. 2013. LinkBench: a database benchmark based on the Facebook social graph. In SIGMOD. 1185--1196.

Digital Library

[5]

David A. Bader and Kamesh Madduri. 2005. Design and Implementation of the HPCS Graph Analysis Benchmark on Symmetric Multiprocessors. In HiPC. 465--476.

Digital Library

[6]

Sumita Barahmand and Shahram Ghandeharizadeh. 2013. BG: A Benchmark to Evaluate Interactive Social Networking Actions. In CIDR. http://cidrdb.org/cidr2013/Papers/CIDR13_Paper93.pdf

[7]

Scott Beamer, Krste Asanovic, and David A. Patterson. 2013. Direction-optimizing breadth-first search. Scientific Programming 21, 3-4 (2013), 137--148.

Digital Library

[8]

Christian Bizer and Andreas Schultz. 2009. The Berlin SPARQL Benchmark. Int. J. Semantic Web Inf. Syst. 5, 2 (2009), 1--24.

[9]

Peter A. Boncz, Thomas Neumann, and Orri Erling. 2013. TPC-H Analyzed: Hidden Messages and Lessons Learned from an Influential Benchmark. In TPCTC. 61--76.

Digital Library

[10]

Donko Donjerkovic and Raghu Ramakrishnan. 1999. Probabilistic Optimization of Top N Queries. In VLDB. 411--422. http://www.vldb.org/conf/1999/P40.pdf

Digital Library

[11]

Benedikt Elser and Alberto Montresor. 2013. An evaluation study of BigData frameworks for graph processing. In Big Data. 60--67.

[12]

Orri Erling and others. 2015. The LDBC Social Network Benchmark: Interactive Workload. In SIGMOD. 619--630.

Digital Library

[13]

Philip J. Fleming and John J. Wallace. 1986. How Not To Lie With Statistics: The Correct Way To Summarize Benchmark Results. Commun. ACM 29, 3 (1986), 218--221.

Digital Library

[14]

Nadime Francis and others. 2018. Cypher: An Evolving Query Language for Property Graphs. In SIGMOD.

Digital Library

[15]

Jim Gray and others. 1997. Data Cube: A Relational Aggregation Operator Generalizing Group-by, Cross-Tab, and Sub Totals. Data Min. Knowl. Discov. 1, 1 (1997), 29--53.

Digital Library

[16]

Andrey Gubichev and Peter A. Boncz. 2014. Parameter Curation for Benchmark Queries. In TPCTC (Lecture Notes in Computer Science), Vol. 8904. Springer, 113--129.

[17]

Yuanbo Guo, Zhengxiang Pan, and Jeff Heflin. 2005. LUBM: A benchmark for OWL knowledge base systems. J. Web Sem. 3, 2-3 (2005), 158--182.

Digital Library

[18]

Annegret Habel, Reiko Heckel, and Gabriele Taentzer. 1996. Graph Grammars with Negative Application Conditions. Fundam. Inform. 26, 3/4 (1996), 287--313.

Digital Library

[19]

Alexandru Iosup and others. 2016. LDBC Graphalytics: A Benchmark for Large-Scale Graph Analysis on Parallel and Distributed Platforms. PVLDB 9, 13 (2016), 1317--1328.

Digital Library

[20]

Norbert Martínez-Bazan, Sergio Gómez-Villamor, and Francesc Escale-Claveras. 2011. DEX: A high-performance graph database management system. In 2nd International Workshop on Graph Data Management: Techniques and Applications (GDM) at ICDE. 124--127.

Digital Library

[21]

Bruce Momjian. 2000. PostgreSQL: Introduction and Concepts. Addison-Wesley.

Digital Library

[22]

Mohamed Morsey, Jens Lehmann, Sören Auer, and Axel-Cyrille Ngonga Ngomo. 2011. DBpedia SPARQL Benchmark - Performance Assessment with Real Queries on Real Data. In ISWC. 454--469.

Digital Library

[23]

Lifeng Nai, Yinglong Xia, Ilie Gabriel Tanase, Hyesoon Kim, and Ching-Yung Lin. 2015. GraphBIG: understanding graph computing in the context of industrial solutions. In SC. 69:1--69:12.

Digital Library

[24]

Raghunath Othayoth Nambiar and Meikel Pöss. 2006. The Making of TPC-DS. In VLDB. 1049--1058. http://dl.acm.org/citation.cfm?id=1164217

Digital Library

[25]

Thomas Neumann and Guido Moerkotte. 2009. A Framework for Reasoning about Share Equivalence and Its Integration into a Plan Generator. In BTW. 7--26. http://subs.emis.de/LNI/Proceedings/Proceedings144/article5220.html

[26]

Jorge Pérez and others. 2009. Semantics and complexity of SPARQL. ACM Trans. Database Syst. 34, 3 (2009).

Digital Library

[27]

Meikel Pöss and Chris Floyd. 2000. New TPC Benchmarks for Decision Support and Web Commerce. SIGMOD Record 29, 4 (2000), 64--71.

Digital Library

[28]

Arnau Prat-Pérez and David Domínguez-Sal. 2014. How community-like is the structure of synthetically generated graphs?. In GRADES at SIGMOD. 7:1--7:9.

Digital Library

[29]

Sherif Sakr, Sameh Elnikety, and Yuxiong He. 2012. G-SPARQL: a hybrid engine for querying large attributed graphs. In CIKM. 335--344.

Digital Library

[30]

Michael Schmidt, Thomas Hornung, Michael Meier, Christoph Pinkel, and Georg Lausen. 2009. SP2Bench: A SPARQL Performance Benchmark. In Semantic Web Information Management - A Model-Based Perspective. 371--393.

Digital Library

[31]

Bin Shao, Yatao Li, Haixun Wang, and Huanhuan Xia. 2017. Trinity Graph Engine and its Applications. IEEE Data Eng. Bull. 40, 3 (2017), 18--29. http://sites.computer.org/debull/A17sept/p18.pdf

[32]

Gábor Szárnyas, Benedek Izsó, István Ráth, and Dániel Varró. 2017. The Train Benchmark: cross-technology performance evaluation of continuous model queries. Softw. Syst. Model. (2017).

[33]

Oskar van Rest, Sungpack Hong, Jinha Kim, Xuming Meng, and Hassan Chafi. 2016. PGQL: a property graph query language. In GRADES at SIGMOD.

Digital Library

[34]

Hilmi Yildirim, Vineet Chaoji, and Mohammed J. Zaki. 2012. GRAIL: a scalable index for reachability queries in very large graphs. VLDB J. 21, 4 (2012), 509--534.

Digital Library

Cited By

Szárnyas GBebee BBirler ADeutsch AFletcher GGabb HGosnell DGreen AGuo ZHare KHidders JIosup AKiryakov AKovatchev TLi XLibkin LLin HLuo XPrat-Pérez APüroja DQi Svan Rest OSteer BSzakállas DTong BWaudby JWu MYang BYu WZhang CZhang JZhou YBoncz P(2024)The Linked Data Benchmark Council (LDBC): Driving Competition and Collaboration in the Graph Data Management SpacePerformance Evaluation and Benchmarking10.1007/978-3-031-68031-1_7(90-106)Online publication date: 22-Sep-2024
https://doi.org/10.1007/978-3-031-68031-1_7
Faltín TTrigonakis VBerdai AFusco LIorgulescu CLee JYaghob JHong SChafi H(2023)Distributed Asynchronous Regular Path Queries (RPQs) on GraphsProceedings of the 24th International Middleware Conference: Industrial Track10.1145/3626562.3626833(35-41)Online publication date: 11-Dec-2023
https://dl.acm.org/doi/10.1145/3626562.3626833
Khan A(2023)Knowledge Graphs QueryingACM SIGMOD Record10.1145/3615952.361595652:2(18-29)Online publication date: 11-Aug-2023
https://dl.acm.org/doi/10.1145/3615952.3615956
Show More Cited By

An early look at the LDBC social network benchmark's business intelligence workload
1. Information systems
  1. Data management systems
    1. Database management system engines
2. Theory of computation
  1. Theory and algorithms for application domains
    1. Database theory

Recommendations

The LDBC Social Network Benchmark: Business Intelligence Workload

The Social Network Benchmark's Business Intelligence workload (SNB BI) is a comprehensive graph OLAP benchmark targeting analytical data systems capable of supporting graph workloads. This paper marks the finalization of almost a decade of research in ...
The LDBC Social Network Benchmark: Interactive Workload
SIGMOD '15: Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data

The Linked Data Benchmark Council (LDBC) is now two years underway and has gathered strong industrial participation for its mission to establish benchmarks, and benchmarking practices for evaluating graph data management systems. The LDBC introduced a ...
A generic construct based workload model for business intelligence benchmark

Benchmarks are vital tools in the performance measurement and evaluation of computer hardware and software systems. Standard benchmarks such as the TREC, TPC, SPEC, SAP, Oracle, Microsoft, IBM, Wisconsin, AS^3AP, OO1, OO7, XOO7 benchmarks have been used ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences

GRADES-NDA '18: Proceedings of the 1st ACM SIGMOD Joint International Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)

June 2018

94 pages

ISBN:9781450356954

DOI:10.1145/3210259

Editors:
Akhil Arora
American Express Big Data Labs
,
Arnab Bhattacharya
Indian Institute of Technology, Kanpur, India
,
George Fletcher
Eindhoven University of Technology
,
Josep Lluis Larriba Pey
UPC
,
Shourya Roy
American Express Big Data Labs
,
Robert West
EPFL

Copyright © 2018 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

SIGMOD: ACM Special Interest Group on Management of Data

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 10 June 2018

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Qualifiers

Short-paper

Funding Sources

Magyar Tudományos Akadémia

Conference

SIGMOD/PODS '18

Sponsor:

SIGMOD

SIGMOD/PODS '18: International Conference on Management of Data

June 10, 2018

Texas, Houston

Acceptance Rates

GRADES-NDA '18 Paper Acceptance Rate 10 of 26 submissions, 38%;

Overall Acceptance Rate 29 of 61 submissions, 48%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

16
Total Citations
View Citations
345
Total Downloads

Downloads (Last 12 months)16
Downloads (Last 6 weeks)3

Reflects downloads up to 21 Nov 2024

Other Metrics

View Author Metrics

Citations

Cited By

Szárnyas GBebee BBirler ADeutsch AFletcher GGabb HGosnell DGreen AGuo ZHare KHidders JIosup AKiryakov AKovatchev TLi XLibkin LLin HLuo XPrat-Pérez APüroja DQi Svan Rest OSteer BSzakállas DTong BWaudby JWu MYang BYu WZhang CZhang JZhou YBoncz P(2024)The Linked Data Benchmark Council (LDBC): Driving Competition and Collaboration in the Graph Data Management SpacePerformance Evaluation and Benchmarking10.1007/978-3-031-68031-1_7(90-106)Online publication date: 22-Sep-2024
https://doi.org/10.1007/978-3-031-68031-1_7
Faltín TTrigonakis VBerdai AFusco LIorgulescu CLee JYaghob JHong SChafi H(2023)Distributed Asynchronous Regular Path Queries (RPQs) on GraphsProceedings of the 24th International Middleware Conference: Industrial Track10.1145/3626562.3626833(35-41)Online publication date: 11-Dec-2023
https://dl.acm.org/doi/10.1145/3626562.3626833
Khan A(2023)Knowledge Graphs QueryingACM SIGMOD Record10.1145/3615952.361595652:2(18-29)Online publication date: 11-Aug-2023
https://dl.acm.org/doi/10.1145/3615952.3615956
Besta MGerstenberger RPeter EFischer MPodstawski MBarthels CAlonso GHoefler T(2023)Demystifying Graph Databases: Analysis and Taxonomy of Data Organization, System Designs, and Graph QueriesACM Computing Surveys10.1145/360493256:2(1-40)Online publication date: 15-Sep-2023
https://dl.acm.org/doi/10.1145/3604932
Faltín TTrigonakis VBerdai AFusco LIorgulescu CHong SChafi HHartig OYoshida Y(2023)Better Distributed Graph Query Planning With Scouting QueriesProceedings of the 6th Joint Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)10.1145/3594778.3594884(1-9)Online publication date: 18-Jun-2023
https://dl.acm.org/doi/10.1145/3594778.3594884
Sen RTian Y(2023)Microarchitectural Analysis of Graph BI Queries on RDBMSProceedings of the 19th International Workshop on Data Management on New Hardware10.1145/3592980.3595321(102-106)Online publication date: 18-Jun-2023
https://dl.acm.org/doi/10.1145/3592980.3595321
Besta MGerstenberger RFischer MPodstawski MBlach NEgeli BMitenkov GChlapek WMichalewicz MNiewiadomski HMueller JHoefler TMohror KArnold DBadia R(2023)The Graph Database Interface: Scaling Online Transactional and Analytical Graph Workloads to Hundreds of Thousands of CoresProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis10.1145/3581784.3607068(1-18)Online publication date: 12-Nov-2023
https://dl.acm.org/doi/10.1145/3581784.3607068
Narayanasamy SSrinivasan KHu YMasilamani SHuang K(2022)A Contemporary Review on Utilizing Semantic Web Technologies in Healthcare, Virtual Communities, and Ontology-Based Information Processing SystemsElectronics10.3390/electronics1103045311:3(453)Online publication date: 3-Feb-2022
https://doi.org/10.3390/electronics11030453
Szárnyas GWaudby JSteer BSzakállas DBirler AWu MZhang YBoncz P(2022)The LDBC Social Network BenchmarkProceedings of the VLDB Endowment10.14778/3574245.357427016:4(877-890)Online publication date: 1-Dec-2022
https://dl.acm.org/doi/10.14778/3574245.3574270
Saleem MAkhter AVahdati SNgonga Ngomo A(2022)μ-Bench: Real-world Micro Benchmarking for SPARQL Query Processing over Knowledge GraphsProceedings of the 11th International Joint Conference on Knowledge Graphs10.1145/3579051.3579054(39-47)Online publication date: 27-Oct-2022
https://dl.acm.org/doi/10.1145/3579051.3579054
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents