Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1109/ICDM.2006.79guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

GraphRank: Statistical Modeling and Mining of Significant Subgraphs in the Feature Space

Published: 18 December 2006 Publication History

Abstract

We propose a technique for evaluating the statistical significance of frequent subgraphs in a database. A graph is represented by a feature vector that is a histogram over a set of basis elements. The set of basis elements is chosen based on domain knowledge and consists generally of vertices, edges, or small graphs. A given subgraph is transformed to a feature vector and the significance of the subgraph is computed by considering the significance of occurrence of the corresponding vector. The probability of occurrence of the vector in a random vector is computed based on the prior probability of the basis elements. This is then used to obtain a probability distribution on the support of the vector in a database of random vectors. The statistical significance of the vector/subgraph is then defined as the p-value of its observed support. We develop efficient methods for computing p-values and lower bounds. A simplified model is further proposed to improve the efficiency. We also address the problem of feature vector mining, a generalization of itemset mining where counts are associated with items and the goal is to find significant sub-vectors. We present an algorithm that explores closed frequent sub-vectors to find significant ones. Experimental results show that the proposed techniques are effective, efficient, and useful for ranking frequent subgraphs by their statistical significance.

Cited By

View all
  • (2017)Network Motif DiscoveryIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2016.256661829:3(513-528)Online publication date: 1-Mar-2017
  • (2014)Mining statistically significant connected subgraphs in vertex labeled graphsProceedings of the 2014 ACM SIGMOD International Conference on Management of Data10.1145/2588555.2588574(1003-1014)Online publication date: 18-Jun-2014
  • (2013)A polynomial-time maximum common subgraph algorithm for outerplanar graphs and its application to chemoinformaticsAnnals of Mathematics and Artificial Intelligence10.5555/2593030.259304869:4(343-376)Online publication date: 1-Dec-2013
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Guide Proceedings
ICDM '06: Proceedings of the Sixth International Conference on Data Mining
December 2006
1209 pages
ISBN:0769527019

Publisher

IEEE Computer Society

United States

Publication History

Published: 18 December 2006

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 18 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2017)Network Motif DiscoveryIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2016.256661829:3(513-528)Online publication date: 1-Mar-2017
  • (2014)Mining statistically significant connected subgraphs in vertex labeled graphsProceedings of the 2014 ACM SIGMOD International Conference on Management of Data10.1145/2588555.2588574(1003-1014)Online publication date: 18-Jun-2014
  • (2013)A polynomial-time maximum common subgraph algorithm for outerplanar graphs and its application to chemoinformaticsAnnals of Mathematics and Artificial Intelligence10.5555/2593030.259304869:4(343-376)Online publication date: 1-Dec-2013
  • (2013)Extraction of statistically significant malware behaviorsProceedings of the 29th Annual Computer Security Applications Conference10.1145/2523649.2523659(69-78)Online publication date: 9-Dec-2013
  • (2012)Significant motifs in time seriesStatistical Analysis and Data Mining10.5555/3160825.31608295:1(35-53)Online publication date: 1-Feb-2012
  • (2012)Indexing and mining topological patterns for drug discoveryProceedings of the 15th International Conference on Extending Database Technology10.1145/2247596.2247666(562-565)Online publication date: 27-Mar-2012
  • (2008)Mining significant graph patterns by leap searchProceedings of the 2008 ACM SIGMOD international conference on Management of data10.1145/1376616.1376662(433-444)Online publication date: 9-Jun-2008

View Options

View options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media