Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

Network Embedding for Community Detection in Attributed Networks

Published: 13 May 2020 Publication History

Abstract

Community detection aims to partition network nodes into a set of clusters, such that nodes are more densely connected to each other within the same cluster than other clusters. For attributed networks, apart from the denseness requirement of topology structure, the attributes of nodes in the same community should also be homogeneous. Network embedding has been proved extremely useful in a variety of tasks, such as node classification, link prediction, and graph visualization, but few works dedicated to unsupervised embedding of node features specified for clustering task, which is vital for community detection and graph clustering. By post-processing with clustering algorithms like k-means, most existing network embedding methods can be applied to clustering tasks. However, the learned embeddings are not designed for clustering task, they only learn topological and attributed information of networks, and no clustering-oriented information is explored. In this article, we propose an algorithm named Network Embedding for node Clustering (NEC) to learn network embedding for node clustering in attributed graphs. Specifically, the presented work introduces a framework that simultaneously learns graph structure-based representations and clustering-oriented representations together. The framework consists of the following three modules: graph convolutional autoencoder module, soft modularity maximization module, and self-clustering module. Graph convolutional autoencoder module learns node embeddings based on topological structure and node attributes. We introduce soft modularity, which can be easily optimized using gradient descent algorithms, to exploit the community structure of networks. By integrating clustering loss and embedding loss, NEC can jointly optimize node cluster labels assignment and learn representations that keep local structure of network. This model can be effectively optimized using stochastic gradient algorithm. Empirical experiments on real-world networks and synthetic networks validate the feasibility and effectiveness of our algorithm on community detection task compared with network embedding based methods and traditional community detection methods.

References

[1]
Leman Akoglu, Hanghang Tong, Brendan Meeder, and Christos Faloutsos. 2012. Pics: Parameter-free identification of cohesive subgroups in large attributed graphs. In Proceedings of the 2012 SIAM International Conference on Data Mining. SIAM, 439--450.
[2]
Mikhail Belkin and Partha Niyogi. 2002. Laplacian eigenmaps and spectral techniques for embedding and clustering. In Proceedings of the 14th International Conference on Neural Information Processing Systems: Natural and Synthetic. 585--591.
[3]
Vandana Bhatia and Rinkle Rani. 2019. A distributed overlapping community detection model for large graphs using autoencoder. Future Generation Computer Systems 94 (2019), 16--26.
[4]
Christopher M. Bishop. 2006. Machine learning and pattern recognition. In Information Science and Statistics. Springer.
[5]
Vincent D. Blondel, Jean-Loup Guillaume, Renaud Lambiotte, and Etienne Lefebvre. 2008. Fast unfolding of communities in large networks. Journal of statistical mechanics: Theory and experiment 2008, 10 (2008), P10008.
[6]
Aleksandar Bojchevski and Stephan Günnemann. 2018. Bayesian robust attributed graph clustering: Joint learning of partial anomalies and group structure. In Proceedings of the 32nd AAAI Conference on Artificial Intelligence.
[7]
Deng Cai, Xiaofei He, Jiawei Han, and Thomas S. Huang. 2011. Graph regularized nonnegative matrix factorization for data representation. IEEE Transactions on Pattern Analysis and Machine Intelligence 33, 8 (2011), 1548--1560.
[8]
Shaosheng Cao, Wei Lu, and Qiongkai Xu. 2015. Grarep: Learning graph representations with global structural information. In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management. ACM, 891--900.
[9]
Shaosheng Cao, Wei Lu, and Qiongkai Xu. 2016. Deep neural networks for learning graph representations. In Proceedings of the 13th AAAI Conference on Artificial Intelligence. 1145--1152.
[10]
Sandro Cavallari, Vincent W. Zheng, Hongyun Cai, Kevin Chen-Chuan Chang, and Erik Cambria. 2017. Learning community embedding with community detection and node embedding on graphs. In Proceedings of the 2017 ACM Conference on Information and Knowledge Management. ACM, 377--386.
[11]
Jianlong Chang, Lingfeng Wang, Gaofeng Meng, Shiming Xiang, and Chunhong Pan. 2017. Deep adaptive image clustering. In Proceedings of the 2017 IEEE International Conference on Computer Vision.
[12]
Hong Cheng, Yang Zhou, and Jeffrey Xu Yu. 2011. Clustering large attributed graphs: A balance between structural and attribute similarities. ACM Transactions on Knowledge Discovery from Data 5, 2 (2011), 12.
[13]
J.-J. Daudin, Franck Picard, and Stéphane Robin. 2008. A mixture model for random graphs. Statistics and Computing 18, 2 (2008), 173--183.
[14]
Jin Di, Ge Meng, Zhixuan Li, Wenhuan Lu, and Francoise Fogelman-Soulie. 2017. Using deep learning for community discovery in social networks. In Proceedings of the IEEE 29th International Conference on Tools with Artificial Intelligence.
[15]
Santo Fortunato. 2010. Community detection in graphs. Physics Reports 486, 3--5 (2010), 75--174.
[16]
Anna Goldenberg, Alice X. Zheng, Stephen E. Fienberg, Edoardo M. Airoldi, et al. 2010. A survey of statistical network models. Foundations and Trends® in Machine Learning 2, 2 (2010), 129--233.
[17]
Aditya Grover and Jure Leskovec. 2016. node2vec: Scalable feature learning for networks. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 855--864.
[18]
Ting Guo, Jia Wu, Xingquan Zhu, and Chengqi Zhang. 2017b. Combining structured node content and topology information for networked graph clustering. ACM Trans. Knowl. Discov. Data 11, 3, Article 29 (March 2017), 29.
[19]
Xifeng Guo, Long Gao, Xinwang Liu, and Jianping Yin. 2017a. Improved deep embedded clustering with local structure preservation. In Proceedings of the 26th International Joint Conference on Artificial Intelligence. 1753--1759.
[20]
Will Hamilton, Zhitao Ying, and Jure Leskovec. 2017. Inductive representation learning on large graphs. In Proceedings of the 31st International Conference on Neural Information Processing Systems. 1025--1035.
[21]
Dongxiao He, Dayou Liu, Di Jin, and Weixiong Zhang. 2015. A stochastic model for detecting heterogeneous link communities in complex networks. In Proceedings of the 29th AAAI Conference on Artificial Intelligence.
[22]
Paul W. Holland, Kathryn Blackmond Laskey, and Samuel Leinhardt. 1983. Stochastic blockmodels: First steps. Social Networks 5, 2 (1983), 109--137.
[23]
Pengwei Hu, Keith Chan, and Tiantian He. 2017a. Deep graph clustering in social network. In Proceedings of the 26th International Conference on World Wide Web Companion. 1425--1426.
[24]
Weihua Hu, Takeru Miyato, Seiya Tokui, Eiichi Matsumoto, and Masashi Sugiyama. 2017b. Learning discrete representations via information maximizing self-augmented training. In Proceedings of the 34th International Conference on Machine Learning.
[25]
Di Jin, Zheng Chen, Dongxiao He, and Weixiong Zhang. 2015. Modeling with node degree preservation can accurately find communities. In Proceedings of the 29th AAAI Conference on Artificial Intelligence.
[26]
Brian Karrer and Mark E. J. Newman. 2011. Stochastic blockmodels and community structure in networks. Physical Review E 83, 1 (2011), 016107.
[27]
Abeer Khan, Lukasz Golab, Mehdi Kargar, Jaroslaw Szlichta, and Morteza Zihayat. 2020. Compact group discovery in attributed graphs and social networks. Information Processing and Management 57, 2 (2020), 102054.
[28]
Diederik P. Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. In Procededings of the 3rd International Conference on Learning Representations (ICLR’15), Yoshua Bengio and Yann LeCun (Eds.). http://arxiv.org/abs/1412.6980
[29]
Thomas N. Kipf and Max Welling. 2016a. Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907 (2016).
[30]
Thomas N. Kipf and Max Welling. 2016b. Variational graph auto-encoders. arXiv preprint arXiv:1611.07308 (2016).
[31]
Andrea Lancichinetti, Santo Fortunato, and Filippo Radicchi. 2008. Benchmark graphs for testing community detection algorithms. Physical Review E 78, 4 (2008), 046110.
[32]
Laurens van der Maaten and Geoffrey Hinton. 2008. Visualizing data using t-SNE. Journal of Machine Learning Research 9, Nov (2008), 2579--2605.
[33]
Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013).
[34]
Erxue Min, Guo Xifeng, Liu Qiang, Zhang Gen, Cui Jianjing, and Long Jun. 2018. A survey of clustering with deep learning: From the perspective of network architecture. IEEE Access 6 (2018), 39501--39514.
[35]
Mark E. J. Newman. 2006. Modularity and community structure in networks. Proceedings of the National Academy of Sciences 103, 23 (2006), 8577--8582.
[36]
Mark E. J. Newman and Aaron Clauset. 2016. Structure and inference in annotated networks. Nature Communications 7, 1 (2016), 11863.
[37]
Mark E. J. Newman and Michelle Girvan. 2004. Finding and evaluating community structure in networks. Physical Review E 69, 2 (2004), 026113.
[38]
Andrew Y. Ng, Michael I. Jordan, and Yair Weiss. 2002. On spectral clustering: Analysis and an algorithm. In Proceedings of the 14th International Conference on Neural Information Processing Systems. 849--856.
[39]
Kamal Nigam and Rayid Ghani. 2000. Analyzing the effectiveness and applicability of co-training. In Proceedings of the 9th International Conference on Information and Knowledge Management. ACM, 86--93.
[40]
Günce Orman, Vincent Labatut, Marc Plantevit, and Jean-François Boulicaut. 2015. Interpreting communities based on the evolution of a dynamic attributed network. Social Network Analysis and Mining 5, 1 (2015), 20.
[41]
Adam Paszke, Sam Gross, Soumith Chintala, Gregory Chanan, Edward Yang, Zachary DeVito, Zeming Lin, Alban Desmaison, Luca Antiga, and Adam Lerer. 2017. Automatic differentiation in PyTorch. In NIPS-W.
[42]
Bryan Perozzi, Rami Al-Rfou, and Steven Skiena. 2014. Deepwalk: Online learning of social representations. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 701--710.
[43]
Michael D. Plummer and László Lovász. 1986. Matching Theory. Vol. 29. Elsevier.
[44]
Ioannis Psorakis, Stephen Roberts, Mark Ebden, and Ben Sheldon. 2011. Overlapping community detection using bayesian non-negative matrix factorization. Physical Review E 83, 6 (2011), 066114.
[45]
Usha Nandini Raghavan, Réka Albert, and Soundar Kumara. 2007. Near linear time algorithm to detect community structures in large-scale networks. Physical Review E 76, 3 (2007), 036106.
[46]
Martin Rosvall and Carl T. Bergstrom. 2008. Maps of random walks on complex networks reveal community structure. Proceedings of the National Academy of Sciences 105, 4 (2008), 1118--1123.
[47]
Sam T. Roweis and Lawrence K. Saul. 2000. Nonlinear dimensionality reduction by locally linear embedding. Science 290, 5500 (2000), 2323--2326.
[48]
Heli Sun, Hongxia Du, Jianbin Huang, Zhongbin Sun, Liang He, Xiaolin Jia, and Zhongmeng Zhao. 2018. Detecting semantic-based communities in node-attributed graphs. Computational Intelligence 34, 4 (2018), 1199--1222.
[49]
Jian Tang, Meng Qu, Mingzhe Wang, Ming Zhang, Jun Yan, and Qiaozhu Mei. 2015. Line: Large-scale information network embedding. In Proceedings of the 24th International Conference on World Wide Web. International World Wide Web Conferences Steering Committee, 1067--1077.
[50]
Joshua B. Tenenbaum, Vin De Silva, and John C. Langford. 2000. A global geometric framework for nonlinear dimensionality reduction. Science 290, 5500 (2000), 2319--2323.
[51]
Fei Tian, Bin Gao, Qing Cui, Enhong Chen, and Tie-Yan Liu. 2014. Learning deep representations for graph clustering. In Proceedings of the 28th AAAI Conference on Artificial Intelligence. 1293--1299.
[52]
Ulrike Von Luxburg. 2007. A tutorial on spectral clustering. Statistics and Computing 17, 4 (2007), 395--416.
[53]
Daixin Wang, Peng Cui, and Wenwu Zhu. 2016. Structural deep network embedding. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 1225--1234.
[54]
Zhenwen Wang, Yanli Hu, Weidong Xiao, and Bin Ge. 2013. Overlapping community detection using a generative model for networks. Physica A Statistical Mechanics and Its Applications 392, 20 (2013), 5218--5230.
[55]
Junyuan Xie, Ross Girshick, and Ali Farhadi. 2016. Unsupervised deep embedding for clustering analysis. In Proceedings of the International Conference on Machine Learning. 478--487.
[56]
Xiaowei Xu, Nurcan Yuruk, Zhidan Feng, and Thomas A. J. Schweiger. 2007. Scan: A structural clustering algorithm for networks. In Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 824--833.
[57]
Xiao Xu, Cheng, Fujimaki, and Muraoka. 2017. Efficient nonparametric and asymptotic Bayesian model selection methods for attributed graph clustering. Knowledge and Information Systems 53, 1 (2017), 239--268.
[58]
Zhiqiang Xu, Yiping Ke, Yi Wang, Hong Cheng, and James Cheng. 2012. A model-based approach to attributed graph clustering. In Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data. ACM, 505--516.
[59]
Zhiqiang Xu, Yiping Ke, Yi Wang, Hong Cheng, and James Cheng. 2014. GBAGC: A general bayesian framework for attributed graph clustering. ACM Transactions on Knowledge Discovery from Data 9, 1 (2014), 5.
[60]
Cheng Yang, Zhiyuan Liu, Deli Zhao, Maosong Sun, and Edward Chang. 2015. Network representation learning with rich text information. In Proceedings of the 24th International Joint Conference on Artificial Intelligence.
[61]
Jaewon Yang and Jure Leskovec. 2013. Overlapping community detection at scale: A nonnegative matrix factorization approach. In Proceedings of the 6th ACM International Conference on Web Search and Data Mining. ACM, 587--596.
[62]
Jaewon Yang, Julian McAuley, and Jure Leskovec. 2013. Community detection in networks with node attributes. In Proceedings of the IEEE 13th International Conference on Data Mining. IEEE, 1151--1156.
[63]
J. Yang, D. Parikh, and D. Batra. 2016. Joint unsupervised learning of deep representations and image clusters. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. 5147--5156.
[64]
Liang Yang, Xiaochun Cao, Dongxiao He, Chuan Wang, Xiao Wang, and Weixiong Zhang. 2016. Modularity based community detection with deep learning. In Proceedings of the 25th International Joint Conference on Artificial Intelligence. 2252--2258.
[65]
Xiao Yu, Xiang Ren, Yizhou Sun, Quanquan Gu, Bradley Sturt, Urvashi Khandelwal, Brandon Norick, and Jiawei Han. 2014. Personalized entity recommendation: A heterogeneous information network approach. In Proceedings of the 7th ACM International Conference on Web Search and Data Mining. ACM, 283--292.
[66]
Hugo Zanghi, Stevenn Volant, and Christophe Ambroise. 2010. Clustering based on random graph model embedding vertex features. Pattern Recognition Letters 31, 9 (2010), 830--836.
[67]
Fuzheng Zhang, Nicholas Jing Yuan, Defu Lian, Xing Xie, and Wei-Ying Ma. 2016. Collaborative knowledge base embedding for recommender systems. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 353--362.
[68]
Yang Zhou, Hong Cheng, and Jeffrey Xu Yu. 2009. Graph clustering based on structural/attribute similarities. Proceedings of the VLDB Endowment 2, 1 (2009), 718--729.

Cited By

View all
  • (2025)Attribute enhanced random walk for community detection in attributed networksNeurocomputing10.1016/j.neucom.2024.128826615(128826)Online publication date: Jan-2025
  • (2024)Heterogeneous graph community detection method based on K-nearest neighbor graph neural networkIntelligent Data Analysis10.3233/IDA-23035628:6(1445-1466)Online publication date: 15-Nov-2024
  • (2024)SNCA: Semi-Supervised Node Classification for Evolving Large Attributed GraphsBig Data Mining and Analytics10.26599/BDMA.2024.90200337:3(794-808)Online publication date: Sep-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Transactions on Knowledge Discovery from Data
ACM Transactions on Knowledge Discovery from Data  Volume 14, Issue 3
June 2020
381 pages
ISSN:1556-4681
EISSN:1556-472X
DOI:10.1145/3388473
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 May 2020
Online AM: 07 May 2020
Accepted: 01 February 2020
Revised: 01 December 2019
Received: 01 August 2019
Published in TKDD Volume 14, Issue 3

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Community detection
  2. attributed network
  3. network embedding
  4. representation learning

Qualifiers

  • Research-article
  • Research
  • Refereed

Funding Sources

  • Fundamental Research Funds for the Central Universities of China
  • National Science Foundation of China
  • Science and Technology Foundation of Shenzhen City

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)240
  • Downloads (Last 6 weeks)17
Reflects downloads up to 19 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2025)Attribute enhanced random walk for community detection in attributed networksNeurocomputing10.1016/j.neucom.2024.128826615(128826)Online publication date: Jan-2025
  • (2024)Heterogeneous graph community detection method based on K-nearest neighbor graph neural networkIntelligent Data Analysis10.3233/IDA-23035628:6(1445-1466)Online publication date: 15-Nov-2024
  • (2024)SNCA: Semi-Supervised Node Classification for Evolving Large Attributed GraphsBig Data Mining and Analytics10.26599/BDMA.2024.90200337:3(794-808)Online publication date: Sep-2024
  • (2024)ProtoMGAE: Prototype-Aware Masked Graph Auto-Encoder for Graph Representation LearningACM Transactions on Knowledge Discovery from Data10.1145/364914318:6(1-22)Online publication date: 12-Apr-2024
  • (2024)Revisiting Modularity Maximization for Graph Clustering: A Contrastive Learning PerspectiveProceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3637528.3671967(1968-1979)Online publication date: 25-Aug-2024
  • (2024)Iterative embedding and reweighting of complex networks reveals community structureScientific Reports10.1038/s41598-024-68152-w14:1Online publication date: 26-Jul-2024
  • (2024)Lorentz equivariant model for knowledge-enhanced hyperbolic collaborative filteringKnowledge-Based Systems10.1016/j.knosys.2024.111590291(111590)Online publication date: May-2024
  • (2024)Deep graph clustering by integrating community structure with neighborhood informationInformation Sciences10.1016/j.ins.2024.120951(120951)Online publication date: Jun-2024
  • (2024)Interest-driven community detection on attributed heterogeneous information networksInformation Fusion10.1016/j.inffus.2024.102525111(102525)Online publication date: Nov-2024
  • (2024)Community detection algorithm for social network based on node intimacy and graph embedding modelEngineering Applications of Artificial Intelligence10.1016/j.engappai.2024.107947132(107947)Online publication date: Jun-2024
  • Show More Cited By

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media