Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3569966.3570004acmotherconferencesArticle/Chapter ViewAbstractPublication PagescsseConference Proceedingsconference-collections
research-article

Isolate-Set-Based In-Memory Parallel Subgraph Matching Framework

Published: 20 December 2022 Publication History

Abstract

Subgraph matching is a classical graph task, which finds all subgraphs that are isomorphic to the query graph in a labeled data graph. According to the theoretical basis, there are three types of subgraph matching methods: exploration-based methods, state-space representation methods, and multi-way join methods. Each of these methods has its advantages, but their common feature is a large amount of computation. In order to improve the computation efficiency, some representative subgraph matching methods have their own parallel optimization methods, but these parallel optimization methods are not universal. To overcome the drawbacks and improve the efficiency of parallel computing, we propose an isolate-set-based parallel subgraph matching framework. The parallel framework can flexibly adapt different subgraph matching methods without changing the sequential methods themselves. We partition the search tree into several decoupled subtrees and prune the subtrees to reduce computation overheads. And then, parallel subgraph matching is implemented on the decoupled pruned subtrees. To demonstrate the proposed method’s effectiveness empirically, we parallelize several representative subgraph matching methods in our framework and compare their performance with the sequential methods. The results show that our parallel framework is adapted to representative sequential methods, outperforms the sequential methods by a large margin, and achieves a speedup of 10x-50x on a 64-core x86 machine.

References

[1]
[1] SAHU S, MHEDHBI A, SALIHOGLU S, et al. The ubiquity of large graphs and surprising challenges of graph processing[J]. Proceedings of the VLDB Endowment, 2017, 11(4): 420-431.
[2]
[2] CONTE D, FOGGIA P, SANSONE C, et al. Thirty years of graph matching in pattern recognition[J]. International journal of pattern recognition and artificial intelligence, 2004, 18(03): 265-298.
[3]
[3] LEWIS H R. Computers and intractability. a guide to the theory of np-completeness[M]. JSTOR, 1983.
[4]
[4] TRAN H N, KIM J J, HE B. Fast subgraph matching on large graphs using graphics processors[C]//International Conference on Database Systems for Advanced Applications. Springer, 2015: 299-315.
[5]
[5] AFRATI F N, FOTAKIS D, ULLMAN J D. Enumerating subgraph instances using map-reduce[C]//2013 IEEE 29th International Conference on Data Engineering (ICDE). IEEE, 2013: 62-73.
[6]
[6] LAI L, QIN L, LIN X, et al. Scalable subgraph enumeration in mapreduce[J]. Proceedings of the VLDB Endowment, 2015, 8(10): 974-985.
[7]
[7] KYROLA A, BLELLOCH G, GUESTRIN C. Graphchi: Large-scale graph computation on just a PC[C]//10th USENIX Symposium on Operating Systems Design and Implementation (OSDI 12). 2012: 31-46.
[8]
[8] SUN S, LUO Q. Parallelizing recursive backtracking based subgraph matching on a single machine[C]//2018 IEEE 24th International Conference on Parallel and Distributed Systems (ICPADS). IEEE, 2018: 1-9.
[9]
[9] KIMMIG R, MEYERHENKE H, STRASH D. Shared memory parallel subgraph enumeration[C]//2017 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW). IEEE, 2017: 519-529.
[10]
[10] SUN Z, WANG H, WANG H, et al. Efficient subgraph matching on billion node graphs[A]. 2012.
[11]
[11] CARLETTI V, FOGGIA P, SAGGESE A, et al. Challenging the time complexity of exact subgraph isomorphism for huge and dense graphs with vf3[J]. IEEE transactions on pattern analysis and machine intelligence, 2017, 40(4): 804-818.
[12]
[12] SOLNON C. Experimental evaluation of subgraph isomorphism solvers[C]//International Workshop on Graph-Based Representations in Pattern Recognition. Springer, 2019: 1-13.
[13]
[13] GOTTLOB G, GROHE M, MUSLIU N, et al. Hypertree decompositions: Structure, algorithms, and applications[C]//International Workshop on Graph-Theoretic Concepts in Computer Science. Springer, 2005: 1-15.
[14]
[14] SUN S, LUO Q. In-memory subgraph matching: An in-depth study[C]//Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data. 2020: 1083-1098.
[15]
[15] SHANG H, ZHANG Y, LIN X, et al. Taming verification hardness: an efficient algorithm for testing subgraph isomorphism[J]. Proceedings of the VLDB Endowment, 2008, 1(1): 364-375.
[16]
[16] BONNICI V, GIUGNO R, PULVIRENTI A, et al. A subgraph isomorphism algorithm and its application to biochemical data[J]. BMC bioinformatics, 2013, 14(7): 1-13.
[17]
[17] JüTTNER A, MADARASI P. Vf2++—an improved subgraph isomorphism algorithm[J]. Discrete Applied Mathematics, 2018, 242: 69-81.
[18]
[18] ZHANG S, LI S, YANG J. Gaddi: distance index based subgraph matching in biological networks[C]//Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology. 2009: 192-203.
[19]
[19] ZHAO P, HAN J. On graph query optimization in large networks[J]. Proceedings of the VLDB Endowment, 2010, 3(1-2): 340-351.
[20]
[20] RIVERO C R, JAMIL H M. Efficient and scalable labeled subgraph matching using sgmatch[J]. Knowledge and Information Systems, 2017, 51(1): 61-87.
[21]
[21] HE H, SINGH A K. Graphs-at-a-time: query language and access methods for graph databases[C]//Proceedings of the 2008 ACM SIGMOD international conference on Management of data. 2008: 405-418.
[22]
[22] HAN W S, LEE J, LEE J H. Turboiso: towards ultrafast and robust subgraph isomorphism search in large graph databases[C]//Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data. 2013: 337-348.
[23]
[23] BI F, CHANG L J, LIN X M, et al. Efficient subgraph matching by postponing cartesian products[C/OL]//ACM SIGMOD International Conference on Management of Data. 2016: 1199-1214.
[24]
[24] BHATTARAI B, LIU H, HUANG H H. Ceci: Compact embedding cluster index for scalable subgraph matching[C]//Proceedings of the 2019 International Conference on Management of Data. 2019: 1447-1462.
[25]
[25] HAN M, KIM H, GU G, et al. Efficient subgraph matching: Harmonizing dynamic programming, adaptive matching order, and failing set together[C/OL]//International Conference on Management of Data: ACM SIGMOD International Conference on Management of Data (SIGMOD). 2019: 1429-1446.
[26]
[26] CARLETTI V, FOGGIA P, RITROVATO P, et al. A parallel algorithm for subgraph isomorphism[C]//International Workshop on Graph-Based Representations in Pattern Recognition. Springer, 2019: 141-151.
[27]
[27] ZENG L, ZOU L, ÖZSU M T, et al. Gsi: Gpu-friendly subgraph isomorphism[C]//2020 IEEE 36th International Conference on Data Engineering (ICDE). IEEE, 2020: 1249-1260.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
CSSE '22: Proceedings of the 5th International Conference on Computer Science and Software Engineering
October 2022
753 pages
ISBN:9781450397780
DOI:10.1145/3569966
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 20 December 2022

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. isolated set
  2. parallel subgraph matching
  3. shared memory algorithms

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

CSSE 2022

Acceptance Rates

Overall Acceptance Rate 33 of 74 submissions, 45%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 58
    Total Downloads
  • Downloads (Last 12 months)17
  • Downloads (Last 6 weeks)1
Reflects downloads up to 07 Mar 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media