research-article

Influence Estimation and Maximization in Continuous-Time Diffusion Networks

Authors:

Manuel Gomez-Rodriguez,

Le Song,

Nan Du,

Hongyuan Zha,

Bernhard SchölkopfAuthors Info & Claims

ACM Transactions on Information Systems (TOIS), Volume 34, Issue 2

Article No.: 9, Pages 1 - 33

https://doi.org/10.1145/2824253

Published: 08 February 2016 Publication History

Get Access

Abstract

If a piece of information is released from a set of media sites, can it spread, in 1 month, to a million web pages? Can we efficiently find a small set of media sites among millions that can maximize the spread of the information, in 1 month? The two problems are called influence estimation and maximization problems respectively, which are very challenging since both the time-sensitive nature of the problems and the issue of scalability need to be addressed simultaneously. In this article, we propose two algorithms for influence estimation in continuous-time diffusion networks. The first one uses continuous-time Markov chains to estimate influence exactly on networks with exponential, or, more generally, phase-type transmission functions, but does not scale to large-scale networks, and the second one is a highly efficient randomized algorithm, which estimates the influence of every node in a network with general transmission functions, |ν| nodes and |ε| edges to an accuracy of ϵ using n = O(1/ϵ²) randomizations and up to logarithmic factors O(n|ε|+n|ν| computations. We then show that finding the set of most influential source nodes in a continuous time diffusion network is an NP-hard problem and develop an efficient greedy algorithm with provable near-optimal performance. When used as subroutines in the influence maximization algorithm, the exact influence estimation algorithm is guaranteed to find a set of C nodes with an influence of at least (1 − 1/e)OPT and the randomized algorithm is guaranteed to find a set with an influence of at least 1 − 1/e)OPT − 2Cε, where OPT is the optimal value. Experiments on both synthetic and real-world data show that the proposed algorithms significantly improve over previous state-of-the-art methods in terms of the accuracy of the estimated influence and the quality of the selected nodes to maximize the influence, and the randomized algorithm can easily scale up to networks of millions of nodes.

References

[1]

S. Asmussen and O. Nerman. 1996. Fitting phase-type distributions via the EM algorithm. Scandinavian Journal of Statistics 23, 4 (1996), 419--441.

Abstract

References

Cited By

Index Terms

Recommendations

Scalable influence maximization for prevalent viral marketing in large-scale social networks

Maximizing influence under influence loss constraint in social networks

Towards Time-Discounted Influence Maximization

Comments

Information

Published In

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Funding Sources

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Login options

Full Access

View options

PDF

eReader

Figures

Other

Share

Share this Publication link

Share on social media

Affiliations