Abstract
Graph pattern matching is a powerful mechanism for searching on network data. Most of the graph pattern matching tools available are based on subgraph isomorphism, i.e. finding a one-to-one correspondence between nodes of a query graph and nodes of a target graph. Often this approach is not flexible enough, since it does not allow the query to represent sets of nodes of any size that share the same profile of connectivity. We propose a novel powerful graph matching approach that overcomes the existing limits and allows the user to define complex patterns in a simple and intuitive way. In our approach, queries are expressed as graphs, whose nodes and edges specify structural constraints and filtering criteria. We show that, despite its simplicity, the proposed approach can solve a large variety of practical problems.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
Our approach can also be applied to undirected graphs or multi-graphs. For simplicity of exposition, in this paper we refer to directed graphs only.
- 2.
In this formulation we do not make any distinction between fixed nodes and variable nodes since the number of candidate valid associations depends on f.
- 3.
Filtering conditions can be imposed on edges as well, though for simplicity of exposition this is not considered in our description.
- 4.
References
Angles, R., Arenas, M., Barceló, P., Hogan, A., Reutter, J., Vrgoč, D.: Foundations of modern query languages for graph databases. ACM Comput. Surv. (CSUR) 50(5), 68 (2017)
Cohen, E., Datar, M., Fujiwara, S., Gionis, A., Indyk, P., Motwani, R., Ullman, J.D., Yang, C.: Finding interesting associations without support pruning. IEEE Trans. Knowl. Data Eng. 13(1), 64–78 (2001)
Cordella, L.P., Foggia, P., Sansone, C., Vento, M.: A (sub) graph isomorphism algorithm for matching large graphs. IEEE Trans. Pattern Anal. Mach. Intell. 26(10), 1367–1372 (2004)
Di Natale, R., Ferro, A., Giugno, R., Mongiovì, M., Pulvirenti, A., Shasha, D.: Sing: Subgraph search in non-homogeneous graphs. BMC Bioinform. 11(1), 1 (2010)
Fan, W., Li, J., Ma, S., Tang, N., Wu, Y., Wu, Y.: Graph pattern matching: from intractable to polynomial time. Proc. VLDB Endow. 3(1–2), 264–275 (2010)
Gamma, E., Helm, R., Johnson, R., Vlissides, J.: Design Patterns: Elements of Reusable Object-oriented Software. Addison-Wesley, Boston (1994)
He, H., Singh, A.K.: Closure-tree: an index structure for graph queries. In: Proceedings of International Conference on Data Engineering (ICDE), pp. 38–38. IEEE (2006)
Henzinger, M.R., Henzinger, T.A., Kopke, P.W.: Computing simulations on finite and infinite graphs. In: Proceedings of Annual Symposium on Foundations of Computer Science, pp. 453–462. IEEE (1995)
Ma, S., Cao, Y., Fan, W., Huai, J., Wo, T.: Capturing topology in graph pattern matching. Proc. VLDB Endow. 5(4), 310–321 (2011)
Ma, S., Cao, Y., Fan, W., Huai, J., Wo, T.: Strong simulation: capturing topology in graph pattern matching. ACM Trans. Database Syst. (TODS) 39(1), 4 (2014)
Mongiovì, M., Di Natale, R., Giugno, R., Pulvirenti, A., Ferro, A., Sharan, R.: Sigma: a set-cover-based inexact graph matching algorithm. J. Bioinform. Comput. Biol. 8(02), 199–218 (2010)
Mongiovì, M., Pappalardo, G., Tramontana, E.: Specifying and identifying widely used crosscutting concerns. Knowl.-Based Syst. 126, 20–32 (2017)
Peng, P., Zou, L., Chen, L., Lin, X., Zhao, D.: Answering subgraph queries over massive disk resident graphs. World Wide Web 19(3), 417–448 (2016)
Pienta, R., Tamersoy, A., Tong, H., Chau, D.H.: Mage: Matching approximate patterns in richly-attributed graphs. In: Proceedings of International Conference on Big Data (Big Data), pp. 585–590. IEEE (2014)
Pinter, R.Y., Rokhlenko, O., Yeger-Lotem, E., Ziv-Ukelson, M.: Alignment of metabolic pathways. Bioinformatics (Oxford, England) 21(16), 3401–8 (2005)
Zhang, S., Hu, M., Yang, J.: Treepi: A novel graph indexing method. In: Proceedings of International Conference on Data Engineering (ICDE), pp. 966–975. IEEE (2007)
Zhang, S., Li, S., Yang, J.: GADDI: distance index based subgraph matching in biological networks. In: Proceedings of International Conference on Extending Database Technology: Advances in Database Technology, pp. 192–203. ACM (2009)
Zhang, S., Yang, J., Jin, W.: Sapper: subgraph indexing and approximate matching in large graphs. Proc. VLDB Endow. 3(1–2), 1185–1194 (2010)
Acknowledgement
This work is supported by the CLARA - CLoud plAtform and smart underground imaging for natural Risk Assessment - project, SCN 00451, funded by the Italian Ministry of Education, Universities and Research, within the “Smart Cities and Communities and Social Innovation” initiative.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Fornaia, A., Mongiovì, M., Pappalardo, G., Tramontana, E. (2019). A General Powerful Graph Pattern Matching System for Data Analysis. In: Aiello, L., Cherifi, C., Cherifi, H., Lambiotte, R., Lió, P., Rocha, L. (eds) Complex Networks and Their Applications VII. COMPLEX NETWORKS 2018. Studies in Computational Intelligence, vol 813. Springer, Cham. https://doi.org/10.1007/978-3-030-05414-4_4
Download citation
DOI: https://doi.org/10.1007/978-3-030-05414-4_4
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-05413-7
Online ISBN: 978-3-030-05414-4
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)