Abstract
Community detection is a crucial task that enables the extraction of valuable knowledge and patterns from complex networks. However, the node labels are often unavailable in most real-world applications, which poses limitations on practical implementations. It is common to deal with graphs that have side information. How to effectively fuse the topology information of a graph and the attribute information of nodes also brings great challenges to community discovery task. In this paper, a graph contrastive learning algorithm with cross-encoder is proposed for community discovery. The proposed algorithm introduces graph convolution neural network and graph attention network to fuse the topology information and the attribute information of nodes. Then it employs cross-encoder to obtain the embedding vectors of nodes and the neural networks are trained by contrastive learning which can learn the embedding vectors of nodes from different views. The proposed algorithm and the comparison algorithms are conducted on multiple real data sets. The experimental results show that the proposed algorithm outperforms the comparison algorithms on the most experimental data sets. The experimental results prove that the proposed algorithm is effective for community discovery.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Notes
References
Ando R, Zhang T (2006) Learning on graph with laplacian regularization. Adv Neural Inf Process Syst 19:25–32
Belkin M, Niyogi P, Sindhwani V (2006) Manifold regularization: A geometric framework for learning from labeled and unlabeled examples. J Mach Learn Res 7(11):2399–2434
Bo D, Wang X, Shi C, Zhu M, Lu E, Cui P (2020) Structural deep clustering network. In: Proceedings of the web conference 2020, pp 1400–1410
Brody S, Alon U, Yahav E (2021) How attentive are graph attention networks? In: International conference on learning representations
Cai H, Zheng VW, Chang KCC (2018) A comprehensive survey of graph embedding: Problems, techniques, and applications. IEEE Trans Knowl Data Eng 30(9):1616–1637
Chen C, Zhang M, Zhang Y, Ma W, Liu Y, Ma S (2020) Efficient heterogeneous collaborative filtering without negative sampling for recommendation. In: Proceedings of the AAAI conference on artificial intelligence, vol 34, pp 19–26
Chen T, Kornblith S, Norouzi M, Hinton G (2020) A simple framework for contrastive learning of visual representations. In: International conference on machine learning, pp 1597–1607
Corso G, Cavalleri L, Beaini D, Liò P, Veličković P (2020) Principal neighbourhood aggregation for graph nets. Adv Neural Inf Process Syst 33:13260–13271
Deng Z, Choi KS, Chung FL, Wang S (2010) Enhanced soft subspace clustering integrating within-cluster and between-cluster information. Pattern Recogn 43(3):767–781
Girvan M, Newman ME (2002) Community structure in social and biological networks. Proc Natl Acad Sci 99(12):7821–7826
Goyal P, Ferrara E (2018) Graph embedding techniques, applications, and performance: A survey. Knowl-Based Syst 151:78–94
Grill JB, Strub F, Altché F, Tallec C, Richemond PH, Buchatskaya E, Doersch C, Pires BA, Guo ZD, Azar MG, et al (2020) Bootstrap your own latent: A new approach to self-supervised learning. In: Advances in neural information processing systems
Gui Q, Deng R, Xue P, Cheng X (2018) A community discovery algorithm based on boundary nodes and label propagation. Pattern Recogn Lett 109:103–109
Hamilton WL, Ying R, Leskovec J (2017) Inductive representation learning on large graphs. In: Advances in neural information processing systems, pp 1024–1034
He C, Zheng Y, Cheng J, Tang Y, Chen G, Liu H (2022) Semi-supervised overlapping community detection in attributed graph with graph convolutional autoencoder. Inf Sci 608:1464–1479
He K, Fan H, Wu Y, Xie S, Girshick R (2020) Momentum contrast for unsupervised visual representation learning. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 9729–9738
Hu F, Zhu Y, Wu S, Huang W, Wang L, Tan T (2021) Graphair: Graph representation learning with neighborhood aggregation and interaction. Pattern Recogn 112:107745
Jin D, Yu Z, Jiao P, Pan S, He D, Wu J, Yu P, Zhang W (2021) A survey of community detection approaches: From statistical modeling to deep learning. IEEE Trans Knowl Data Eng
Khandekar R, Kortsarz G, Mirrokni V (2014) On the advantage of overlapping clusters for minimizing conductance. Algorithmica 69(4):844–863
Kipf TN, Welling M (2016) Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907
Kipf TN, Welling M (2017) Semi-supervised classification with graph convolutional networks. In: International conference on learning representations
Lee DD, Seung HS (1999) Learning the parts of objects by non-negative matrix factorization. Nature 401(6755):788–791
Lee H, Lee J, Ng JYH, Natsev P (2020) Large scale video representation learning via relational graph clustering. In: Proceedings of the IEEE/CVF Conference on computer vision and pattern recognition, pp 6807–6816
Li X, Jia M, Islam MT, Yu L, Xing L (2020) Self-supervised feature learning via exploiting multi-modal data for retinal disease diagnosis. IEEE Trans Med Imaging 39(12):4023–4033
Liu F, Xue S, Wu J, Zhou C, Hu W, Paris C, Nepal S, Yang J, Yu PS (2020) Deep learning for community detection: progress, challenges and opportunities. In: Proceedings of the 29th international joint conference on artificial intelligence, pp 4981– 4987
Liu H, Fen L, Jian J, Chen L (2018) Overlapping community discovery algorithm based on hierarchical agglomerative clustering. Int J Pattern Recognit Artif Intell 32(03):1850008
Liu X, Zhang F, Hou Z, Mian L, Wang Z, Zhang J, Tang J (2020) Network representation learning: A macro and micro outlook. AI Open 1:57–81
Liu X, Zhang F, Hou Z, Wang Z, Mian L, Zhang J, Tang J (2021) Self-supervised learning: Generative or contrastive. IEEE Trans Knowl Data Eng
Lu Z, Sun X, Wen Y, Cao G, La Porta T (2014) Algorithms and applications for community detection in weighted networks. IEEE Trans Parallel Distrib Syst 26(11):2916–2926
Lv Z, Zhang S, Xiu W (2020) Solving the security problem of intelligent transportation system with deep learning. IEEE Trans Intell Transp Syst. https://doi.org/10.1109/TITS.2020.2980864
Maas AL, Hannun AY, Ng AY (2013) Rectifier nonlinearities improve neural network acoustic models. In: Proceedings of the 30th international conference on machine learning, pp 3–8. Journal of Machine Learning Research
Maćkiewicz A, Ratajczak W (1993) Principal components analysis (pca). Computers & Geosciences 19(3):303–342
Mikolov T, Chen K, Corrado G, Dean J (2013) Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781
Newman ME (2004) Fast algorithm for detecting community structure in networks. Phys Rev E 69(6):066133
Newman ME (2006) Modularity and community structure in networks. Proc Natl Acad Sci 103(23):8577–8582
Park N, Rossi R, Koh E, Burhanuddin IA, Kim S, Du F, Ahmed N, Faloutsos C (2022) Cgc: Contrastive graph clustering for community detection and tracking. In: Proceedings of the ACM web conference 2022, pp 1115–1126
Peng Z, Huang W, Luo M, Zheng Q, Rong Y, Xu T, Huang J (2020) Graph representation learning via graphical mutual information maximization. In: Proceedings of The Web Conference 2020, pp 259–270
Perozzi B, Al-Rfou R, Skiena S (2014) Deepwalk: Online learning of social representations. In: Proceedings of the 20th ACM SIGKDD international conference on knowledge discovery and data mining, pp 701–710
Qiu J, Dong Y, Ma H, Li J, Wang C, Wang K, Tang J (2019) Netsmf: Large-scale network embedding as sparse matrix factorization. In: The World Wide Web Conference, pp. 1509–1520
Qiu J, Dong Y, Ma H, Li J, Wang K, Tang J (2018) Network embedding as matrix factorization: Unifying deepwalk, line, pte, and node2vec. In: Proceedings of the eleventh ACM international conference on web search and data mining, pp 459–467
Rong Y, Xu T, Huang J, Huang W, Cheng H, Ma Y, Wang Y, Derr T, Wu L, Ma T (2020) Deep graph learning: Foundations, advances and applications. In: Proceedings of the 26th ACM SIGKDD international conference on knowledge discovery & data mining, pp 3555–3556
Rossi RA, Ahmed NK, Koh E, Kim S, Rao A, Abbasi-Yadkori Y (2020) A structural graph representation learning framework. In: Proceedings of the 13th international conference on web search and data mining, pp 483–491
Roweis ST, Saul LK (2000) Nonlinear dimensionality reduction by locally linear embedding. Science 290(5500):2323–2326
Su X, Xue S, Liu F, Wu J, Yang J, Zhou C, Hu W, Paris C, Nepal S, Jin D, et al (2022) A comprehensive survey on community detection with deep learning. IEEE Trans Neural Netw Learn Syst
Sun FY, Hoffmann J, Verma V, Tang J (2020) Infograph: Unsupervised and semi-supervised graph-level representation learning via mutual information maximization. In: International conference on learning representations
Tang J, Qu M, Wang M, Zhang M, Yan J, Mei Q (2015) Line: Large-scale information network embedding. In: Proceedings of the 24th international conference on world wide web, pp 1067–1077
Veličković P, Cucurull G, Casanova A, Romero A, Lio P, Bengio Y (2018) Graph attention networks. In: International conference on learning representations
Velickovic P, Fedus W, Hamilton WL, Liò P, Bengio Y, Hjelm RD (2019) Deep graph infomax. In: International conference on learning representations
Wang C, Pan S, Celina PY, Hu R, Long G, Zhang C (2022) Deep neighbor-aware embedding for node clustering in attributed graphs. Pattern Recogn 122:108230
Wang M, Zuo W, Wang Y (2016) An improved density peaks-based clustering method for social circle discovery in social networks. Neurocomputing 179:219–227
Wang X, Bo D, Shi C, Fan S , Ye Y, Yu PS (2020) A survey on heterogeneous graph embedding: Methods, techniques, applications and sources. arXiv:2011.14867
Wang X, Qi GJ (2021) Contrastive learning with stronger augmentations. arXiv:2104.07713
Wang Y, Min Y, Chen X, Wu J (2021) Multi-view graph contrastive representation learning for drug-drug interaction prediction. In: Proceedings of the web conference 2021, pp 2921–2933
Wu X, Kumar V, Quinlan JR, Ghosh J, Yang Q, Motoda H, McLachlan GJ, Ng A, Liu B, Philip SY et al (2008) Top 10 algorithms in data mining. Knowl Inf Syst 14(1):1–37
Xie T, Wang B, Kuo CCJ (2022) Graphhop: An enhanced label propagation method for node classification. IEEE Trans Neural Netw Learn Syst
Xing S, Shan X, Fanzhen L, Jia W, Jian Y, Chuan Z, Wenbin H, Cecile P, Surya N, Di J et al (2022) A comprehensive survey on community detection with deep learning. IEEE Trans Neural Netw Learn Syst. https://doi.org/10.1109/TNNLS.2021.3137396
Xu S, Liu S, Feng L (2020) Manifold graph embedding with structure information propagation for community discovery. Knowl-Based Syst 208:106448
Yang Z, Ding M, Zhou C, Yang H, Zhou J, Tang, J (2020) Understanding negative sampling in graph representation learning. In: Proceedings of the 26th ACM SIGKDD international conference on knowledge discovery & data mining, pp 1666–1676
You Y, Chen T, Sui Y, Chen T, Wang Z, Shen Y (2020) Graph contrastive learning with augmentations. In: Advances in neural information processing systems
Zhang H, Lyu MR, King I (2015) Exploiting k-degree locality to improve overlapping community detection. In: Twenty-fourth international joint conference on artificial intelligence, pp 2394–2400
Zhang X, Xu S, Lin W, Wang S (2023) Constrained social community recommendation. In: Proceedings of the 29th ACM SIGKDD conference on knowledge discovery and data mining, pp 5586–5596
Zhou J, Cui G, Hu S, Zhang Z, Yang C, Liu Z, Wang L, Li C, Sun M (2020) Graph neural networks: A review of methods and applications. AI Open 1:57–81
Zhu Y, Xu Y, Yu F, Liu Q, Wu S, Wang L (2021) Graph contrastive learning with adaptive augmentation. In: Proceedings of the web conference 2021, pp 2069–2080
Acknowledgements
This work was supported in part by the National Science Research Project of Department of Education in Guizhou Province Grant No. \(\left( \text {Grant No. KY}\left[ 2020 \right] 062 \right) \), in part by the National Natural Science Foundation of China under Grant 61672439, in part by the Fundamental Research Funds for the Central Universities under Grant 20720181004.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Shan, Z., Zhang, D. & Lei, Y. Graph contrastive learning with cross-encoder for community discovery. Appl Intell 54, 2211–2224 (2024). https://doi.org/10.1007/s10489-024-05287-3
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-024-05287-3