Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3323165.3323172acmconferencesArticle/Chapter ViewAbstractPublication PagesspaaConference Proceedingsconference-collections
announcement

Scalable Diversity Maximization via Small-size Composable Core-sets (Brief Announcement)

Published: 17 June 2019 Publication History

Abstract

In this paper, we study the diversity maximization problem (a.k.a. maximum dispersion problem) in which given a set of n objects in a metric space, one wants to find a subset of k distinct objects with the maximum sum of pairwise distances. We address this problem using the distributed framework known as randomized composable core-sets[3]. Unlike previous work, we study small-size core-set algorithms allowing minimum possible intermediate output size (and hence achieving large speed-up in the computation and increased parallelism), and at the same time, improving significantly over the approximation guarantees of state-of-the-art core-set-based algorithms. In particular, we present a simple distributed algorithm that achieves an almost optimal communication complexity, and asymptotically achieves approximation factor of 1/2, matching the best known global approximation factor for this problem. Our algorithms are scalable and practical as shown by our extensive empirical evaluation with large datasets and they can be easily used in the major distributed computing systems like MapReduce. Furthermore, we show empirically that, in real-life instances, using small-size core-set algorithms allows speed-ups up to >68 in running time w.r.t. to large-size core-sets while achieving close-to-optimal solutions with approximation factor of >90%.

References

[1]
M. Ghadiri, V. Mirrokni, S. A. Zadeh, and M. Zadimoghaddam. Scalable feature selection via distributed diversity maximization. In to appear in AAAI, 2017.
[2]
P. Indyk, S. Mahabadi, M. Mahdian, and V. Mirrokni. Composable core-sets for diversity and coverage maximization. In PODS, 2014.
[3]
V. S. Mirrokni and M. Zadimoghaddam. Randomized composable core-sets for distributed submodular maximization. In STOC, 2015.

Cited By

View all
  • (2023)Almost Optimal Massively Parallel Algorithms for k-Center Clustering and Diversity MaximizationProceedings of the 35th ACM Symposium on Parallelism in Algorithms and Architectures10.1145/3558481.3591077(239-247)Online publication date: 17-Jun-2023
  • (2022)Maximizing single attribute diversity in group selectionAnnals of Operations Research10.1007/s10479-022-04764-7320:1(535-540)Online publication date: 25-May-2022
  • (2020)A General Coreset-Based Approach to Diversity Maximization under Matroid ConstraintsACM Transactions on Knowledge Discovery from Data10.1145/340244814:5(1-27)Online publication date: 5-Aug-2020
  • Show More Cited By

Index Terms

  1. Scalable Diversity Maximization via Small-size Composable Core-sets (Brief Announcement)

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SPAA '19: The 31st ACM Symposium on Parallelism in Algorithms and Architectures
    June 2019
    410 pages
    ISBN:9781450361842
    DOI:10.1145/3323165
    Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

    Sponsors

    In-Cooperation

    • EATCS: European Association for Theoretical Computer Science

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 17 June 2019

    Check for updates

    Author Tags

    1. distributed algorithms
    2. diversity maximization
    3. randomized composable core-sets
    4. small-size core-sets

    Qualifiers

    • Announcement

    Conference

    SPAA '19

    Acceptance Rates

    SPAA '19 Paper Acceptance Rate 34 of 109 submissions, 31%;
    Overall Acceptance Rate 447 of 1,461 submissions, 31%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)8
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 10 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Almost Optimal Massively Parallel Algorithms for k-Center Clustering and Diversity MaximizationProceedings of the 35th ACM Symposium on Parallelism in Algorithms and Architectures10.1145/3558481.3591077(239-247)Online publication date: 17-Jun-2023
    • (2022)Maximizing single attribute diversity in group selectionAnnals of Operations Research10.1007/s10479-022-04764-7320:1(535-540)Online publication date: 25-May-2022
    • (2020)A General Coreset-Based Approach to Diversity Maximization under Matroid ConstraintsACM Transactions on Knowledge Discovery from Data10.1145/340244814:5(1-27)Online publication date: 5-Aug-2020
    • (2020)Efficient Approaches to k Representative G-Skyline QueriesACM Transactions on Knowledge Discovery from Data10.1145/339750314:5(1-27)Online publication date: 6-Jul-2020
    • (2020)Adversarial Attacks on Graph Neural NetworksACM Transactions on Knowledge Discovery from Data10.1145/339452014:5(1-31)Online publication date: 21-Jun-2020
    • (2020)Automatic Shape Feature Recognition for Ceramic FindsJournal on Computing and Cultural Heritage 10.1145/338673013:3(1-21)Online publication date: 20-Jul-2020
    • (2020)Open Workflows for Polychromatic Reconstruction of Historical Sculptural Monuments in 3DJournal on Computing and Cultural Heritage 10.1145/338631413:3(1-16)Online publication date: 5-Aug-2020
    • (2020)MiSoSouPACM Transactions on Knowledge Discovery from Data10.1145/338565314:5(1-31)Online publication date: 21-Jun-2020
    • (2020)From the Engraved Tablet to the Digital Tablet, History of a 15th-Century Music ScoreJournal on Computing and Cultural Heritage 10.1145/338378213:3(1-18)Online publication date: 5-Aug-2020
    • (2020)“Let Them Talk!”Journal on Computing and Cultural Heritage 10.1145/338277313:3(1-30)Online publication date: 5-Aug-2020
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media