Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3159652.3159706acmconferencesArticle/Chapter ViewAbstractPublication PageswsdmConference Proceedingsconference-collections
research-article

Network Embedding as Matrix Factorization: Unifying DeepWalk, LINE, PTE, and node2vec

Published: 02 February 2018 Publication History

Abstract

Since the invention of word2vec, the skip-gram model has significantly advanced the research of network embedding, such as the recent emergence of the DeepWalk, LINE, PTE, and node2vec approaches. In this work, we show that all of the aforementioned models with negative sampling can be unified into the matrix factorization framework with closed forms. Our analysis and proofs reveal that: (1) DeepWalk empirically produces a low-rank transformation of a network's normalized Laplacian matrix; (2) LINE, in theory, is a special case of DeepWalk when the size of vertices' context is set to one; (3) As an extension of LINE, PTE can be viewed as the joint factorization of multiple networks» Laplacians; (4) node2vec is factorizing a matrix related to the stationary distribution and transition probability tensor of a 2nd-order random walk. We further provide the theoretical connections between skip-gram based network embedding algorithms and the theory of graph Laplacian. Finally, we present the NetMF method as well as its approximation algorithm for computing network embedding. Our method offers significant improvements over DeepWalk and LINE for conventional network mining tasks. This work lays the theoretical foundation for skip-gram based network embedding methods, leading to a better understanding of latent network representation learning.

References

[1]
Sanjeev Arora, Yuanzhi Li, Yingyu Liang, Tengyu Ma, and Andrej Risteski. 2016. A latent variable model approach to pmi-based word embeddings. TACL Vol. 4 (2016), 385--399.
[2]
Yoshua Bengio, Aaron Courville, and Pierre Vincent. 2013. Representation learning: A review and new perspectives. IEEE TPAMI, Vol. 35, 8 (2013), 1798--1828.
[3]
Austin R. Benson, David F. Gleich, and Jure Leskovec. 2015. Tensor spectral clustering for partitioning higher-order network structures SDM. SIAM, 118--126.
[4]
Austin R. Benson, David F. Gleich, and Lek-Heng Lim. 2017. The Spacey Random Walk: A stochastic Process for Higher-Order Data. SIAM Rev. Vol. 59, 2 (2017), 321--345.
[5]
Matthew Brand and Kun Huang. 2003. A unifying theorem for spectral embedding and clustering. AISTATS.
[6]
Shaosheng Cao, Wei Lu, and Qiongkai Xu. 2015. GraRep: Learning graph representations with global structural information CIKM. ACM, 891--900.
[7]
Shaosheng Cao, Wei Lu, and Qiongkai Xu. 2016. Deep Neural Networks for Learning Graph Representations. AAAI. 1145--1152.
[8]
Shiyu Chang, Wei Han, Jiliang Tang, Guo-Jun Qi, Charu C. Aggarwal, and Thomas S Huang. 2015. Heterogeneous network embedding via deep architectures KDD. ACM, 119--128.
[9]
Dehua Cheng, Yu Cheng, Yan Liu, Richard Peng, and Shang-Hua Teng. 2015. Efficient sampling for Gaussian graphical models via spectral sparsification COLT. 364--390.
[10]
Kewei Cheng, Jundong Li, and Huan Liu. 2017. Unsupervised Feature Selection in Signed Social Networks KDD. ACM.
[11]
Fan R.K. Chung. 1997. Spectral graph theory. Number 92. American Mathematical Soc.
[12]
Yuxiao Dong, Nitesh V. Chawla, and Ananthram Swami. 2017. metapath2vec: Scalable Representation Learning for Heterogeneous Networks KDD.
[13]
David Easley and Jon Kleinberg. 2010. Networks, Crowds, and Markets: Reasoning about a Highly Connected World. Cambridge University Press.
[14]
Rong-En Fan, Kai-Wei Chang, Cho-Jui Hsieh, Xiang-Rui Wang, and Chih-Jen Lin. 2008. LIBLINEAR: A library for large linear classification. JMLR, Vol. 9, Aug (2008), 1871--1874.
[15]
David F. Gleich, Lek-Heng Lim, and Yongyang Yu. 2015. Multilinear pagerank. SIAM J. Matrix Anal. Appl. Vol. 36, 4 (2015), 1507--1541.
[16]
Aditya Grover and Jure Leskovec. 2016. node2vec: Scalable feature learning for networks. KDD. ACM, 855--864.
[17]
William L. Hamilton, Rex Ying, and Jure Leskovec. 2017. Inductive Representation Learning on Large Graphs. NIPS.
[18]
Tatsunori B. Hashimoto, David Alvarez-Melis, and Tommi S. Jaakkola. 2016. Word embeddings as metric recovery in semantic spaces. TACL Vol. 4 (2016), 273--286.
[19]
Roger A. Horn and Charles R. Johnson. 1991. Topics in Matrix Analysis. Cambridge University Press.
[20]
Yann Jacob, Ludovic Denoyer, and Patrick Gallinari. 2014. Learning latent representations of nodes for classifying in heterogeneous social networks WSDM. ACM, 373--382.
[21]
Thomas N. Kipf and Max Welling. 2016. Semi-Supervised Classification with Graph Convolutional Networks. arXiv preprint arXiv:1609.02907 (2016).
[22]
Richard B. Lehoucq, Danny C. Sorensen, and Chao Yang. 1998. ARPACK users' guide: solution of large-scale eigenvalue problems with implicitly restarted Arnoldi methods. SIAM.
[23]
Jure Leskovec, Kevin J. Lang, and Michael Mahoney. 2010. Empirical comparison of algorithms for network community detection WWW. ACM, 631--640.
[24]
Omer Levy and Yoav Goldberg. 2014. Neural Word Embedding as Implicit Matrix Factorization. NIPS. 2177--2185.
[25]
Hang Li, Haozheng Wang, Zhenglu Yang, and Masato Odagaki. 2017. Variation Autoencoder Based Network Representation Learning for Classification ACL. 56.
[26]
László Lovász. 1993. Random walks on graphs. Combinatorics, Paul erdos is eighty Vol. 2 (1993), 1--46.
[27]
Qing Lu and Lise Getoor. 2003. Link-based Classification. In ICML.
[28]
Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013 a. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013).
[29]
Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S. Corrado, and Jeff Dean. 2013 b. Distributed Representations of Words and Phrases and their Compositionality. NIPS. 3111--3119.
[30]
Mingdong Ou, Peng Cui, Jian Pei, Ziwei Zhang, and Wenwu Zhu. 2016. Asymmetric Transitivity Preserving Graph Embedding. KDD. 1105--1114.
[31]
Bryan Perozzi, Rami Al-Rfou, and Steven Skiena. 2014. DeepWalk: Online learning of social representations KDD. ACM, 701--710.
[32]
S. Yu Philip, Jiawei Han, and Christos Faloutsos. 2010. Link mining: Models, algorithms, and applications. Springer.
[33]
Jianbo Shi and Jitendra Malik. 2000. Normalized cuts and image segmentation. IEEE PAMI, Vol. 22, 8 (2000), 888--905.
[34]
A.N. Shiryaev and A. Lyasoff. 2012. Problems in Probability. Springer New York. showLCCN2012937045
[35]
Chris Stark, Bobby-Joe Breitkreutz, Andrew Chatr-Aryamontri, Lorrie Boucher, Rose Oughtred, Michael S. Livstone, Julie Nixon, Kimberly Van Auken, Xiaodong Wang, Xiaoqi Shi, et almbox. 2010. The BioGRID interaction database: 2011 update. Nucleic acids research Vol. 39, suppl_1 (2010), D698--D704.
[36]
Jian Tang, Meng Qu, and Qiaozhu Mei. 2015 a. PTE: Predictive text embedding through large-scale heterogeneous text networks KDD. ACM, 1165--1174.
[37]
Jian Tang, Meng Qu, Mingzhe Wang, Ming Zhang, Jun Yan, and Qiaozhu Mei. 2015 b. LINE: Large-scale information network embedding. WWW. 1067--1077.
[38]
Lei Tang and Huan Liu. 2009. Relational learning via latent social dimensions. KDD. ACM, 817--826.
[39]
Lei Tang, Suju Rajan, and Vijay K. Narayanan. 2009. Large scale multi-label classification via metalabeler WWW. ACM, 211--220.
[40]
Kristina Toutanova, Dan Klein, Christopher D. Manning, and Yoram Singer. 2003. Feature-rich part-of-speech tagging with a cyclic dependency network NAACL. Association for Computational Linguistics, 173--180.
[41]
Lloyd N. Trefethen and David Bau III. 1997. Numerical linear algebra. Vol. Vol. 50. Siam.
[42]
Grigorios Tsoumakas, Ioannis Katakis, and Ioannis Vlahavas. 2009. Mining multi-label data. Data mining and knowledge discovery handbook. Springer, 667--685.
[43]
Cunchao Tu, Han Liu, Zhiyuan Liu, and Maosong Sun. 2017. CANE: Context-aware network embedding for relation modeling ACL.
[44]
Cunchao Tu, Weicheng Zhang, Zhiyuan Liu, and Maosong Sun. 2016. Max-Margin DeepWalk: Discriminative Learning of Network Representation. IJCAI. 3889--3895.
[45]
Ulrike Von Luxburg. 2007. A tutorial on spectral clustering. Statistics and computing Vol. 17, 4 (2007), 395--416.
[46]
Daixin Wang, Peng Cui, and Wenwu Zhu. 2016. Structural deep network embedding. In KDD. ACM, 1225--1234.
[47]
Cheng Yang, Zhiyuan Liu, Deli Zhao, Maosong Sun, and Edward Y. Chang. 2015. Network Representation Learning with Rich Text Information IJCAI. 2111--2117.
[48]
Zhilin Yang, William W. Cohen, and Ruslan Salakhutdinov. 2016 a. Revisiting Semi-Supervised Learning with Graph Embeddings ICML. 40--48.
[49]
Zhilin Yang, Jie Tang, and William W. Cohen. 2016 b. Multi-Modal Bayesian Embeddings for Learning Social Knowledge Graphs. IJCAI. 2287--2293.

Cited By

View all
  • (2025)Predicting drug combination side effects based on a metapath-based heterogeneous graph neural networkBMC Bioinformatics10.1186/s12859-024-06028-626:1Online publication date: 15-Jan-2025
  • (2025)Directed Link Prediction Using GNN With Local and Global Feature FusionIEEE Transactions on Network Science and Engineering10.1109/TNSE.2024.349843412:1(409-422)Online publication date: Jan-2025
  • (2025)Time- and Space-Efficiently Sketching Billion-Scale Attributed NetworksIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.350825637:2(966-978)Online publication date: Feb-2025
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
WSDM '18: Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining
February 2018
821 pages
ISBN:9781450355810
DOI:10.1145/3159652
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 02 February 2018

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. graph spectral
  2. matrix factorization
  3. network embedding
  4. representation learning
  5. social network

Qualifiers

  • Research-article

Funding Sources

  • National Basic Research Program of China
  • NSFC

Conference

WSDM 2018

Acceptance Rates

WSDM '18 Paper Acceptance Rate 81 of 514 submissions, 16%;
Overall Acceptance Rate 498 of 2,863 submissions, 17%

Upcoming Conference

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)344
  • Downloads (Last 6 weeks)22
Reflects downloads up to 16 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2025)Predicting drug combination side effects based on a metapath-based heterogeneous graph neural networkBMC Bioinformatics10.1186/s12859-024-06028-626:1Online publication date: 15-Jan-2025
  • (2025)Directed Link Prediction Using GNN With Local and Global Feature FusionIEEE Transactions on Network Science and Engineering10.1109/TNSE.2024.349843412:1(409-422)Online publication date: Jan-2025
  • (2025)Time- and Space-Efficiently Sketching Billion-Scale Attributed NetworksIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.350825637:2(966-978)Online publication date: Feb-2025
  • (2025)Network-to-Network: Self-Supervised Network Representation Learning via Position PredictionIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.349339137:3(1354-1365)Online publication date: Mar-2025
  • (2025)Strong and weak random walks on signed networksnpj Complexity10.1038/s44260-025-00027-12:1Online publication date: 3-Feb-2025
  • (2025)Attention-augmented multi-domain cooperative graph representation learning for molecular interaction predictionNeural Networks10.1016/j.neunet.2025.107265(107265)Online publication date: Feb-2025
  • (2025)Complex network structural analysis based on information supplementation graph contrastive learningKnowledge-Based Systems10.1016/j.knosys.2024.112833309(112833)Online publication date: Jan-2025
  • (2025)Semantic graph neural network with multi-measure learning for semi-supervised classificationEngineering Applications of Artificial Intelligence10.1016/j.engappai.2024.109647140(109647)Online publication date: Jan-2025
  • (2025)Counterfactual Learning on Graphs: A SurveyMachine Intelligence Research10.1007/s11633-024-1519-z22:1(17-59)Online publication date: 24-Jan-2025
  • (2025)IntroductionGraph Neural Network Methods and Applications in Scene Understanding10.1007/978-981-97-9933-6_1(1-24)Online publication date: 4-Jan-2025
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media