Edge sample and discard: A new algorithm for counting triangles in large dynamic graphs

G Han, H Sethu - Proceedings of the 2017 IEEE/ACM International …, 2017 - dl.acm.org
G Han, H Sethu
Proceedings of the 2017 IEEE/ACM International Conference on Advances in …, 2017dl.acm.org
Monitoring key properties of dynamic graphs in real time can help detect spam accounts in
social networks, identify anomalies and intrusions, and inform resource management.
Traditional frameworks for dynamic graphs have relied on processing only the stream of
edges added into or deleted from an evolving graph, but not any additional related
information such as the degrees or neighbor lists of nodes associated with the edges. In this
paper, we propose a new edge sampling framework for big-graph analytics in dynamic …
Monitoring key properties of dynamic graphs in real time can help detect spam accounts in social networks, identify anomalies and intrusions, and inform resource management. Traditional frameworks for dynamic graphs have relied on processing only the stream of edges added into or deleted from an evolving graph, but not any additional related information such as the degrees or neighbor lists of nodes associated with the edges. In this paper, we propose a new edge sampling framework for big-graph analytics in dynamic graphs which enhances the traditional model and offers significantly improved trade-off between accuracy and computational/memory costs in the running estimates of a graph's properties. To demonstrate this higher accuracy using the framework, we present a new sampling algorithm, called Edge Sample and Discard, which generates an unbiased estimate of the total number of triangles in a fully dynamic graph. The experimental results show that, with the help of the neighborhood information of the sampled edges, the accuracy achieved by our algorithm is substantially better than the accuracy achieved by current state-of-the-art algorithms.
ACM Digital Library