Nothing Special   »   [go: up one dir, main page]

skip to main content
article

Efficient subgraph join based on connectivity similarity

Published: 01 July 2015 Publication History

Abstract

Graph is a widely accepted model of complex data representation. Graph model has been applied in many real applications including social networks, chemistry and pattern recognition, etc. The existence of noisy and inconsistent data makes graph similarity join imperative. The graph similarity join problem studied in this paper is to find graph pairs that can be joined due to similarity metrics. Exact graph join based on edit distance has been proved to be NP-hard, thus making approximate similarity join essential. In the paper, we propose a connectivity-similarity-based matching method, which is a new measure to evaluate graph similarity. We also apply a strategy called vertex similarity upper bound filtering in order to obtain a set of promising candidate pairs, which turns out to improve join efficiency. We perform experiments on real and synthetic graph databases to test proposed method, which is proven to achieve both good result quality and high efficiency among approximate join methods.

References

[1]
Bunke, H., Allermann, G.: Inexact graph matching for structural pattern recognition. Pattern Recogn. Lett. 1(4), 245---253 (1983)
[2]
Cho, J., Shivakumar, N., Garcia-Molina H.: Finding Replicated Web Collections. SIGMOD Conf. pp. 355---366 (2000)
[3]
Cyr, C.M., Kimia, B.B.: 3D object recognition using shape similarity-based aspect graph. ICCV pp. 254---261 (2001)
[4]
Fan, W., Li, J., Ma, S., Tang, N., Wu, Y.: Graph pattern matching: from intractable to polynomial time. PVLDB 3(1), 264---275 (2010)
[5]
Fan, W., Li, J., Ma, S., Wang, H., Wu, Y.: Graph homomorphism revisited for graph matching. VLDB 3(1), 1161---1172 (2010)
[6]
Fankhauser, S., Riesen, K., Bunke, H.: Speeding up graph edit distance computation through fast bipartite matching. GbRPR pp. 102---111 (2011)
[7]
Joshi, S.: A bag of paths model for measuring structural similarity in Web documents. KDD pp. 577---582 (2003)
[8]
Justice, D., Hero, A.O.: A binary linear programming formulation of the graph edit distance. IEEE Trans. Pattern Anal. Mach. Intell. 28(8), 1200---1214 (2006)
[9]
Ma, S., Cao, Y., Fan, W., Huai, J., Wo, T.: Capturing topology in graph pattern matching. PVLDB 5(4), 310---321 (2011)
[10]
Melnik, S., Garcia-Molina, H., Rahm, E.: Similarity flooding: A versatile graph matching algorithm. ICDE pp. 117--128 (2002)
[11]
Sanfeliu, A., Fu, K.-S.: A distance measure between attributed relational graphs for pattern recognition. IEEE Trans Syst. Man Cybern. 13(3), 353---362 (1983)
[12]
Schenker, A., Last, M., Bunke, H., Kandel, A.: Classification of Web documents using graph matching. IJPRAI 18(3), 1553---1559 (2004)
[13]
Shang, H., Lin, X., Zhang, Y., Yu, J.X., Wang, W.: Connected substructure similarity search. SIGMOD Conf. pp. 903---914 (2010)
[14]
Wang, G., Wang, B., Yang, X., Yu, G.: Efficiently indexing large sparse graphs for similarity search. IEEE Trans. Knowl. Data Eng. 24(3), 440--451 (2012)
[15]
Williams, D. W., Huan, J., Wang, W.: Graph database indexing using structured graph decomposition. ICDE pp. 976---985 (2007)
[16]
Yan, X., Yu, P.S., Han, J.: Substructure similarity search in graph databases. SIGMOD Conf. 766---777 (2005)
[17]
Zeng, Z., Tung, A.K., Wang, J., Feng, J., Zhou, L.: Comparing stars: on approximating graph edit distance. PVLDB 2(1), 25---36 (2009)
[18]
Zeng, Z., Tung, A. K., Wang, J., Feng, J., Zhou, L.: Edit Distance Evaluation on Graph Structures. Technical Report TRA6/08. National University of Singapore ( 2008)
[19]
Zhao, X., Xiao, C., Lin, X., Wang, W.: Efficient graph similarity joins with edit distance constraints. ICDE pp. 834--845 (2012)
[20]
Zhu, Y., Qin, L., Yu, J. X., Ke, Y., Lin, X.: High efficiency and quality: large graphs matching. CIKM pp. 1755---1764 (2011)
[21]
Zou, L., Chen, L., Özsu, M.T.: Distance-join: pattern match query in a large graph databases. VLDB 2(1), 886--897 (2009)

Cited By

View all
  1. Efficient subgraph join based on connectivity similarity

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image World Wide Web
    World Wide Web  Volume 18, Issue 4
    July 2015
    421 pages

    Publisher

    Kluwer Academic Publishers

    United States

    Publication History

    Published: 01 July 2015

    Author Tags

    1. Connectivity similarity
    2. Graph similarity joins
    3. Upper bound of vertex similarity

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 12 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all

    View Options

    View options

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media