Influence maximization in near-linear time: A martingale approach
Proceedings of the 2015 ACM SIGMOD international conference on management of …, 2015•dl.acm.org
Given a social network G and a positive integer k, the influence maximization problem asks
for k nodes (in G) whose adoptions of a certain idea or product can trigger the largest
expected number of follow-up adoptions by the remaining nodes. This problem has been
extensively studied in the literature, and the state-of-the-art technique runs in O ((k+ l)(n+ m)
log n ε2) expected time and returns a (1-1 e-ε)-approximate solution with at least 1-1/nl
probability. This paper presents an influence maximization algorithm that provides the same …
for k nodes (in G) whose adoptions of a certain idea or product can trigger the largest
expected number of follow-up adoptions by the remaining nodes. This problem has been
extensively studied in the literature, and the state-of-the-art technique runs in O ((k+ l)(n+ m)
log n ε2) expected time and returns a (1-1 e-ε)-approximate solution with at least 1-1/nl
probability. This paper presents an influence maximization algorithm that provides the same …
Given a social network G and a positive integer k, the influence maximization problem asks for k nodes (in G) whose adoptions of a certain idea or product can trigger the largest expected number of follow-up adoptions by the remaining nodes. This problem has been extensively studied in the literature, and the state-of-the-art technique runs in O((k+l) (n+m) log n ε2) expected time and returns a (1-1 e-ε)-approximate solution with at least 1 - 1/n l probability.
This paper presents an influence maximization algorithm that provides the same worst-case guarantees as the state of the art, but offers significantly improved empirical efficiency. The core of our algorithm is a set of estimation techniques based on martingales, a classic statistical tool. Those techniques not only provide accurate results with small computation overheads, but also enable our algorithm to support a larger class of information diffusion models than existing methods do. We experimentally evaluate our algorithm against the states of the art under several popular diffusion models, using real social networks with up to 1.4 billion edges. Our experimental results show that the proposed algorithm consistently outperforms the states of the art in terms of computation efficiency, and is often orders of magnitude faster.