Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3087556.3087574acmconferencesArticle/Chapter ViewAbstractPublication PagesspaaConference Proceedingsconference-collections
research-article
Open access

Bicriteria Distributed Submodular Maximization in a Few Rounds

Published: 24 July 2017 Publication History

Abstract

We study the problem of efficiently optimizing submodular functions under cardinality constraints in distributed setting. Recently, several distributed algorithms for this problem have been introduced which either achieve a sub-optimal solution or they run in super-constant number of rounds of computation. Unlike previous work, we aim to design distributed algorithms in multiple rounds with almost optimal approximation guarantees at the cost of outputting a larger number of elements. Toward this goal, we present a distributed algorithm that, for any ε > 0 and any constant r, outputs a set S of O(rk1/r) items in r rounds, and achieves a (1-ε)-approximation of the value of the optimum set with k items. This is the first distributed algorithm that achieves an approximation factor of (1-ε) running in less than log 1/ε number of rounds. We also prove a hardness result showing that the output of any 1-ε approximation distributed algorithm limited to one distributed round should have at least Ω(k/ε) items. In light of this hardness result, our distributed algorithm in one round, r = 1, is asymptotically tight in terms of the output size. We support the theoretical guarantees with an extensive empirical study of our algorithm showing that achieving almost optimum solutions is indeed possible in a few rounds for large-scale real datasets.

References

[1]
Gutenberg. search project gutenberg. https://www.gutenberg.org/ebooks/, 2016.
[2]
Z. Abbassi, V. S. Mirrokni, and M. Thakur. Diversity maximization under matroid constraints. In KDD, KDD '13, pages 32--40, New York, NY, USA, 2013. ACM.
[3]
D. Achlioptas. Database-friendly random projections: Johnson-lindenstrauss with binary coins. J. of computer and System Sciences, 2003.
[4]
A. Badanidiyuru, B. Mirzasoleiman, A. Karbasi, and A. Krause. Streaming submodular maximization: Massive data summarization on the fly. In KDD, 2014.
[5]
R. Barbosa, A. Ene, H. L. Nguyen, and J. Ward. The power of randomization: Distributed submodular maximization on massive datasets. In ICML, pages 1236--1244, 2015.
[6]
R. Barbosa, A. Ene, H. L. Nguyen, and J. Ward. A new framework for distributed submodular maximization, 2016.
[7]
M. Bateni, H. Esfandiari, and V. S. Mirrokni. Distributed coverage maximization via sketching. CoRR, abs/1612.02327, 2016.
[8]
G. E. Blelloch, H. V. Simhadri, and K. Tangwongsan. Parallel and i/o efficient set covering algorithms. In SPAA, pages 82--90, 2012.
[9]
F. Chierichetti, R. Kumar, and A. Tomkins. Max-cover in map-reduce. In WWW, pages 231--240, 2010.
[10]
G. Cormode, H. J. Karloff, and A. Wirth. Set cover algorithms for very large datasets. In CIKM, pages 479--488, 2010.
[11]
U. Feige. A threshold of ln n for approximating set cover. J. ACM, 45(4):634--652, July 1998.
[12]
U. Feige, V. S. Mirrokni, and J. Vondrak. Maximizing non-monotone submodular functions. SIAM Journal on Computing, 40(4):1133--1153, 2011.
[13]
B. J. Frey and D. Dueck. Mixture modeling by affinity propagation. In NIPS, pages 379--386, 2005.
[14]
W. Gasarch and S. Fletcher. The egg game. http://www.cs.umd.edu/ gasarch/BLOGPAPERS/egg.pdf.
[15]
A. Guillory and J. A. Bilmes. Active semi-supervised learning using submodular functions. arXiv preprint arXiv:1202.3726, 2012.
[16]
M. Hoffman, F. R. Bach, and D. M. Blei. Online learning for latent dirichlet allocation. In NIPS, 2010.
[17]
R. kiveris, S. Lattanzi, V. Mirrokni, V. Rastogi, and S. Vasilvitski. Connected components in mapreduce and beyond. In ACM SOCC, 2014.
[18]
R. Kumar, B. Moseley, S. Vassilvitskii, and A. Vattani. Fast greedy algorithms in mapreduce and streaming. In SPAA, pages 1--10, 2013.
[19]
S. Lattanzi, B. Moseley, S. Suri, and S. Vassilvitskii. Filtering: a method for solving graph problems in mapreduce. In SPAA, pages 85--94, 2011.
[20]
H. Lin and J. A. Bilmes. A class of submodular functions for document summarization. In HLT, pages 510--520, 2011.
[21]
V. S. Mirrokni and M. Zadimoghaddam. Randomized composable core-sets for distributed submodular maximization. In STOC, pages 153--162, 2015.
[22]
B. Mirzasoleiman, A. Badanidiyuru, A. Karbasi, J. Vondrák, and A. Krause. Lazier than lazy greedy. In AAAI, pages 1812--1818, 2015.
[23]
B. Mirzasoleiman, A. Karbasi, R. Sarkar, and A. Krause. Distributed submodular maximization: Identifying representative elements in massive data. In NIPS, pages 2049--2057, 2013.
[24]
G. L. Nemhauser, L. A. Wolsey, and M. L. Fisher. An analysis of approximations for maximizing submodular set functions. Mathematical Programming, 14(1):265--294, 1978.
[25]
R.v Reh\r u\v rek and P. Sojka. Software Framework for Topic Modelling with Large Corpora. In LREC, Valletta, Malta, 2010.
[26]
A. Torralba, R. Fergus, and W. T. Freeman. 80 million tiny images: A large data set for nonparametric object and scene recognition. IEEE TPAMI, 2008.
[27]
J. Yang and J. Leskovec. Defining and evaluating network communities based on ground-truth. Knowledge and Information Systems, 42(1):181--213, 2015.

Cited By

View all
  • (2024)Federated Submodular Maximization With Differential PrivacyIEEE Internet of Things Journal10.1109/JIOT.2023.332480111:2(1827-1839)Online publication date: 15-Jan-2024
  • (2023)Algorithms for Cardinality-Constrained Monotone DR-Submodular Maximization with Low Adaptivity and Query ComplexityJournal of Optimization Theory and Applications10.1007/s10957-023-02353-7200:1(194-214)Online publication date: 18-Dec-2023
  • (2021)Robust Maximization of Correlated Submodular Functions Under Cardinality and Matroid ConstraintsIEEE Transactions on Automatic Control10.1109/TAC.2021.306165666:12(6148-6155)Online publication date: Dec-2021
  • Show More Cited By

Index Terms

  1. Bicriteria Distributed Submodular Maximization in a Few Rounds

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SPAA '17: Proceedings of the 29th ACM Symposium on Parallelism in Algorithms and Architectures
    July 2017
    392 pages
    ISBN:9781450345934
    DOI:10.1145/3087556
    This work is licensed under a Creative Commons Attribution International 4.0 License.

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 24 July 2017

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. bicriteria approximation algorithms
    2. distributed algorithms
    3. submodular maximization

    Qualifiers

    • Research-article

    Conference

    SPAA '17
    Sponsor:

    Acceptance Rates

    SPAA '17 Paper Acceptance Rate 31 of 127 submissions, 24%;
    Overall Acceptance Rate 447 of 1,461 submissions, 31%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)28
    • Downloads (Last 6 weeks)5
    Reflects downloads up to 10 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Federated Submodular Maximization With Differential PrivacyIEEE Internet of Things Journal10.1109/JIOT.2023.332480111:2(1827-1839)Online publication date: 15-Jan-2024
    • (2023)Algorithms for Cardinality-Constrained Monotone DR-Submodular Maximization with Low Adaptivity and Query ComplexityJournal of Optimization Theory and Applications10.1007/s10957-023-02353-7200:1(194-214)Online publication date: 18-Dec-2023
    • (2021)Robust Maximization of Correlated Submodular Functions Under Cardinality and Matroid ConstraintsIEEE Transactions on Automatic Control10.1109/TAC.2021.306165666:12(6148-6155)Online publication date: Dec-2021
    • (2020)Continuously Tracking Core Items in Data Streams with Probabilistic Decays2020 IEEE 36th International Conference on Data Engineering (ICDE)10.1109/ICDE48307.2020.00072(769-780)Online publication date: Apr-2020
    • (2020)Two-Stage Submodular Maximization Problem Beyond Non-negative and MonotoneTheory and Applications of Models of Computation10.1007/978-3-030-59267-7_13(144-155)Online publication date: 9-Oct-2020
    • (2019)An exponential speedup in parallel running time for submodular maximization without loss in approximationProceedings of the Thirtieth Annual ACM-SIAM Symposium on Discrete Algorithms10.5555/3310435.3310454(283-302)Online publication date: 6-Jan-2019
    • (2019)Submodular maximization with nearly-optimal approximation and adaptivity in nearly-linear timeProceedings of the Thirtieth Annual ACM-SIAM Symposium on Discrete Algorithms10.5555/3310435.3310453(274-282)Online publication date: 6-Jan-2019
    • (2019)Submodular maximization with matroid and packing constraints in parallelProceedings of the 51st Annual ACM SIGACT Symposium on Theory of Computing10.1145/3313276.3316389(90-101)Online publication date: 23-Jun-2019
    • (2018)The adaptive complexity of maximizing a submodular functionProceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing10.1145/3188745.3188752(1138-1151)Online publication date: 20-Jun-2018

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media