Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

Theoretically and Practically Efficient Maximum Defective Clique Search

Published: 30 September 2024 Publication History

Abstract

The study of k-defective cliques, defined as induced subgraphs that differ from cliques by at most k missing edges, has attracted much attention in graph analysis due to their relevance in various applications, including social network analysis and implicit interaction predictions. However, determining the maximum k-defective clique in graphs has been proven to be an NP-hard problem, presenting significant challenges in finding an efficient solution. To address this problem, we develop a theoretically and practically efficient algorithm that leverages newly-designed branch reduction rules and a pivot-based branching technique. Our analysis establishes that the time complexity of the proposed algorithm is bounded by O(mγk n), where γk is a real value strictly less than 2 (e.g., when k= 1, 2, and 3, γk= 1.466, 1.755, and 1.889, respectively). To our knowledge, this algorithm achieves the best worst-case time complexity to date compared to state-of-the-art solutions. Moreover, to further reduce unnecessary branches, we propose a time-efficient upper bound-based pruning technique, which is obtained by manipulating information such as the number of distinct colors assigned to vertices and the presence of non-neighbors among them. Additionally, we employ an ordering-based heuristic approach as a preprocessing step to improve computational efficiency. Finally, we conduct extensive experiments on a diverse set of over 300 graphs to evaluate the efficiency of the proposed solutions. The results demonstrate that our algorithm achieves a speedup of 3 orders of magnitude over state-of-the-art solutions in processing most of real-world graphs.

References

[1]
Eytan Adar and Christopher Ré. 2007. Managing Uncertainty in Social Networks. IEEE Data Eng. Bull., Vol. 30, 2 (2007), 15--22.
[2]
Charu C. Aggarwal. 2011. Social Network Data Analytics. Springer.
[3]
Balabhaskar Balasundaram, Sergiy Butenko, and Illya V. Hicks. 2011. Clique Relaxations in Social Network Analysis: The Maximum k-Plex Problem. Oper. Res., Vol. 59, 1 (2011), 133--142.
[4]
Vladimir Batagelj and Matjaz Zaversnik. 2003. An O(m) Algorithm for Cores Decomposition of Networks. CoRR, Vol. cs.DS/0310049 (2003).
[5]
Punam Bedi and Chhavi Sharma. 2016. Community detection in social networks. Wiley interdisciplinary reviews: Data mining and knowledge discovery, Vol. 6, 3 (2016), 115--135.
[6]
Vladimir Boginski, Sergiy Butenko, and Panos M Pardalos. 2005. Statistical analysis of financial networks. Computational statistics & data analysis, Vol. 48, 2 (2005), 431--443.
[7]
Vladimir Boginski, Sergiy Butenko, and Panos M Pardalos. 2006. Mining market data: A network approach. Computers & Operations Research, Vol. 33, 11 (2006), 3171--3184.
[8]
Immanuel M. Bomze, Marco Budinich, Panos M. Pardalos, and Marcello Pelillo. 1999. The Maximum Clique Problem. In Handbook of Combinatorial Optimization. 1--74.
[9]
Coenraad Bron and Joep Kerbosch. 1973. Finding All Cliques of an Undirected Graph (Algorithm 457). Commun. ACM, Vol. 16, 9 (1973), 575--576.
[10]
Lijun Chang. 2020. Efficient maximum clique computation and enumeration over large sparse graphs. VLDB J., Vol. 29, 5 (2020), 999--1022.
[11]
Lijun Chang. 2023. Efficient Maximum k-Defective Clique Computation with Improved Time Complexity. Proc. ACM Manag. Data, Vol. 1, 3 (2023), 209:1--209:26.
[12]
Lijun Chang, Mouyi Xu, and Darren Strash. 2022. Efficient Maximum k-Plex Computation over Large Sparse Graphs. Proc. VLDB Endow., Vol. 16, 2 (2022), 127--139.
[13]
Xiaoyu Chen, Yi Zhou, Jin-Kao Hao, and Mingyu Xiao. 2021. Computing maximum k-defective cliques in massive graphs. Comput. Oper. Res., Vol. 127 (2021), 105131.
[14]
Qiangqiang Dai, Rong-Hua Li, Meihao Liao, and Guoren Wang. 2023. Maximal Defective Clique Enumeration. Proc. ACM Manag. Data, Vol. 1, 1 (2023), 77:1--77:26.
[15]
Nan Du, Bin Wu, Xin Pei, Bai Wang, and Liutong Xu. 2007. Community detection in large-scale social networks. In WebKDD and SNA-KDD workshop. 16--25.
[16]
David Eppstein, Maarten Löffler, and Darren Strash. 2010. Listing All Maximal Cliques in Sparse Graphs in Near-Optimal Time. In ISAAC, Vol. 6506. 403--414.
[17]
V. Fomin Fedor and Kratsch Dieter. 2010. Exact Exponential Algorithms. Springer.
[18]
Jian Gao, Jiejiang Chen, Minghao Yin, Rong Chen, and Yiyuan Wang. 2018. An Exact Algorithm for Maximum k-Plexes in Massive Graphs. In IJCAI. 1449--1455.
[19]
Jian Gao, Zhenghang Xu, Ruizhi Li, and Minghao Yin. 2022. An Exact Algorithm with New Upper Bounds for the Maximum k-Defective Clique Problem in Massive Sparse Graphs. In AAAI. 10174--10183.
[20]
Timo Gschwind, Stefan Irnich, and Isabel Podlinski. 2018. Maximum weight relaxed cliques and Russian Doll Search revisited. Discret. Appl. Math., Vol. 234 (2018), 131--138.
[21]
Shweta Jain and C. Seshadhri. 2020. Provably and Efficiently Approximating Near-cliques using the Turán Shadow: PEANUTS. In WWW. 1966--1976.
[22]
Hua Jiang, Dongming Zhu, Zhichao Xie, Shaowen Yao, and Zhang-Hua Fu. 2021. A New Upper Bound Based on Vertex Partitioning for the Maximum K-plex Problem. In IJCAI. 1689--1696.
[23]
Richard M. Karp. 1972. Reducibility Among Combinatorial Problems. In Complexity of Computer Computations. 85--103.
[24]
Ravi Kumar, Prabhakar Raghavan, Sridhar Rajagopalan, Dandapani Sivakumar, Andrew Tompkins, and Eli Upfal. 2000. The Web as a graph. In PODS. 1--10.
[25]
Ailsa H. Land and Alison G. Doig. 2010. An Automatic Method for Solving Discrete Programming Problems. In 50 Years of Integer Programming 1958--2008 - From the Early Years to the State-of-the-Art. 105--132.
[26]
R. M. R. Lewis. 2016. A Guide to Graph Colouring - Algorithms and Applications. Springer.
[27]
Chu-Min Li, Hua Jiang, and Felip Manyà. 2017. On minimization of the number of branches in branch-and-bound algorithms for the maximum clique problem. Comput. Oper. Res., Vol. 84 (2017), 1--15.
[28]
Chu-Min Li, Zhiwen Fang, and Ke Xu. 2013. Combining MaxSAT reasoning and incremental upper bound for the maximum clique problem. In ICTAI. 939--946.
[29]
Chu Min Li and Zhe Quan. 2010. An Efficient Branch-and-Bound Algorithm Based on MaxSAT for the Maximum Clique Problem. In AAAI. 128--133.
[30]
Don R Lick and Arthur T White. 1970. k-Degenerate graphs. Canadian Journal of Mathematics, Vol. 22, 5 (1970), 1082--1096.
[31]
Guimei Liu and Limsoon Wong. 2008. Effective Pruning Techniques for Mining Quasi-Cliques. In ECML/PKDD, Vol. 5212. 33--49.
[32]
R. Duncan Luce. 1950. Connectivity and generalized cliques in sociometric group structure. Psychometrika, Vol. 15, 2 (1950), 169--190.
[33]
R. Duncan Luce and Albert D. Perry. 1949. A method of matrix analysis of group structure. Psychometrika, Vol. 14, 2 (1949), 95--116.
[34]
Fabrizio Marinelli, Andrea Pizzuti, and Fabrizio Rossi. 2021. LP-based dual bounds for the maximum quasi-clique problem. Discret. Appl. Math., Vol. 296 (2021), 118--140.
[35]
Benjamin McClosky and Illya V. Hicks. 2012. Combinatorial algorithms for the maximum k-plex problem. J. Comb. Optim., Vol. 23, 1 (2012), 29--49.
[36]
Zhuqi Miao and Balabhaskar Balasundaram. 2020. An Ellipsoidal Bounding Scheme for the Quasi-Clique Number of a Graph. INFORMS J. Comput., Vol. 32, 3 (2020), 763--778.
[37]
Robert J. Mokken et al. 1979. Cliques, clubs and clans. Quality & Quantity, Vol. 13, 2 (1979), 161--173.
[38]
Hannes Moser, Rolf Niedermeier, and Manuel Sorge. 2012. Exact combinatorial algorithms and experiments for finding maximum k-plexes. J. Comb. Optim., Vol. 24, 3 (2012), 347--373.
[39]
Kevin A. Naudé. 2016. Refined pivot selection for maximal clique enumeration in graphs. Theor. Comput. Sci., Vol. 613 (2016), 28--37.
[40]
Foad Mahdavi Pajouh, Zhuqi Miao, and Balabhaskar Balasundaram. 2014. A branch-and-bound approach for maximum quasi-cliques. Ann. Oper. Res., Vol. 216, 1 (2014), 145--161.
[41]
Panos M Pardalos and Jue Xue. 1994. The maximum clique problem. Journal of global Optimization, Vol. 4 (1994), 301--328.
[42]
Grigory Pastukhov, Alexander Veremyev, Vladimir Boginski, and Oleg A. Prokopyev. 2018. On maximum degree-based (γ)-quasi-clique problem: Complexity and exact approaches. Networks, Vol. 71, 2 (2018), 136--152.
[43]
Jeffrey Pattillo, Alexander Veremyev, Sergiy Butenko, and Vladimir Boginski. 2013. On the maximum quasi-clique problem. Discret. Appl. Math., Vol. 161, 1--2 (2013), 244--257.
[44]
Jeffrey Pattillo, Nataly Youssef, and Sergiy Butenko. 2011. Clique relaxation models in social network analysis. In Handbook of Optimization in Complex Networks: Communication and Social Networks. Springer, 143--162.
[45]
Jeffrey Pattillo, Nataly Youssef, and Sergiy Butenko. 2013. On clique relaxation models in network analysis. Eur. J. Oper. Res., Vol. 226, 1 (2013), 9--18.
[46]
Eric M. Phizicky and Stanley Fields. 1995. Protein-protein interactions: methods for detection and analysis. Microbiological reviews, Vol. 59, 1 (1995), 94--123.
[47]
Celso C. Ribeiro and José A. Riveaux. 2019. An exact algorithm for the maximum quasi-clique problem. Int. Trans. Oper. Res., Vol. 26, 6 (2019), 2199--2229.
[48]
Ryan A. Rossi and Nesreen K. Ahmed. 2015. The Network Data Repository with Interactive Graph Analytics and Visualization. In AAAI. 4292--4293.
[49]
Ryan A. Rossi, David F. Gleich, and Assefaw Hadish Gebremedhin. 2015. Parallel Maximum Clique Algorithms with Applications to Network Analysis. SIAM J. Sci. Comput., Vol. 37, 5 (2015).
[50]
Pablo San Segundo, Alvaro Lopez, and Panos M. Pardalos. 2016. A new exact maximum clique algorithm for large and massive sparse graphs. Comput. Oper. Res., Vol. 66 (2016), 81--94.
[51]
Stephen B Seidman. 1983. Network structure and minimum degree. Social networks, Vol. 5, 3 (1983), 269--287.
[52]
Stephen B. Seidman and Brian L. Foster. 1978. A graph-theoretic generalization of the clique concept. Journal of Mathematical sociology, Vol. 6, 1 (1978), 139--154.
[53]
Etsuji Tomita and Toshikatsu Kameda. 2007. An Efficient Branch-and-bound Algorithm for Finding a Maximum Clique with Computational Experiments. J. Glob. Optim., Vol. 37, 1 (2007), 95--111.
[54]
Etsuji Tomita, Sora Matsuzaki, Atsuki Nagao, Hiro Ito, and Mitsuo Wakatsuki. 2017. A Much Faster Algorithm for Finding a Maximum Clique with Computational Experiments. J. Inf. Process., Vol. 25 (2017), 667--677.
[55]
Etsuji Tomita, Yoichi Sutani, Takanori Higashi, Shinya Takahashi, and Mitsuo Wakatsuki. 2010. A Simple and Faster Branch-and-Bound Algorithm for Finding a Maximum Clique. In WALCOM, Vol. 5942. 191--203.
[56]
Etsuji Tomita, Akira Tanaka, and Haruhisa Takahashi. 2006. The worst-case time complexity for generating all maximal cliques and computational experiments. Theor. Comput. Sci., Vol. 363, 1 (2006), 28--42.
[57]
Svyatoslav Trukhanov, Chitra Balasubramaniam, Balabhaskar Balasundaram, and Sergiy Butenko. 2013. Algorithms for detecting optimal hereditary structures in graphs, with application to clique relaxations. Comput. Optim. Appl., Vol. 56, 1 (2013), 113--130.
[58]
Alexander Veremyev, Oleg A. Prokopyev, Sergiy Butenko, and Eduardo L. Pasiliao. 2016. Exact MIP-based approaches for finding maximum quasi-cliques and dense subgraphs. Comput. Optim. Appl., Vol. 64, 1 (2016), 177--214.
[59]
Gérard Verfaillie, Michel Lema^itre, and Thomas Schiex. 1996. Russian Doll Search for Solving Constraint Optimization Problems. In AAAI. 181--187.
[60]
Jianxin Wang, Min Li, Youping Deng, and Yi Pan. 2010. Recent advances in clustering methods for protein interaction networks. BMC genomics, Vol. 11, 3 (2010), 1--19.
[61]
Mingyu Xiao, Weibo Lin, Yuanshun Dai, and Yifeng Zeng. 2017. A Fast Algorithm to Compute Maximum k-Plexes in Social Network Analysis. In AAAI. 919--925.
[62]
Mihalis Yannakakis. 1978. Node- and Edge-Deletion NP-Complete Problems. In STOC. 253--264.
[63]
Haiyuan Yu, Alberto Paccanaro, Valery Trifonov, and Mark Gerstein. 2006. Predicting interactions in protein networks by completing defective cliques. Bioinform., Vol. 22, 7 (2006), 823--829.
[64]
Yi Zhou, Shan Hu, Mingyu Xiao, and Zhang-Hua Fu. 2021. Improving Maximum k-plex Solver via Second-Order Reduction and Graph Color Bounding. In AAAI. 12453--12460.
[65]
David Zuckerman. 2006. Linear degree extractors and the inapproximability of max clique and chromatic number. In STOC. 681--690.

Index Terms

  1. Theoretically and Practically Efficient Maximum Defective Clique Search

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image Proceedings of the ACM on Management of Data
    Proceedings of the ACM on Management of Data  Volume 2, Issue 4
    SIGMOD
    September 2024
    458 pages
    EISSN:2836-6573
    DOI:10.1145/3698442
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 30 September 2024
    Published in PACMMOD Volume 2, Issue 4

    Permissions

    Request permissions for this article.

    Author Tags

    1. branch-and-bound
    2. cohesive subgraph search
    3. k-defective clique

    Qualifiers

    • Research-article

    Funding Sources

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 97
      Total Downloads
    • Downloads (Last 12 months)97
    • Downloads (Last 6 weeks)17
    Reflects downloads up to 08 Feb 2025

    Other Metrics

    Citations

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media