research-article

Network Embedding as Matrix Factorization: Unifying DeepWalk, LINE, PTE, and node2vec

Authors:

Jie TangAuthors Info & Claims

WSDM '18: Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining

Pages 459 - 467

https://doi.org/10.1145/3159652.3159706

Published: 02 February 2018 Publication History

Abstract

Since the invention of word2vec, the skip-gram model has significantly advanced the research of network embedding, such as the recent emergence of the DeepWalk, LINE, PTE, and node2vec approaches. In this work, we show that all of the aforementioned models with negative sampling can be unified into the matrix factorization framework with closed forms. Our analysis and proofs reveal that: (1) DeepWalk empirically produces a low-rank transformation of a network's normalized Laplacian matrix; (2) LINE, in theory, is a special case of DeepWalk when the size of vertices' context is set to one; (3) As an extension of LINE, PTE can be viewed as the joint factorization of multiple networks» Laplacians; (4) node2vec is factorizing a matrix related to the stationary distribution and transition probability tensor of a 2nd-order random walk. We further provide the theoretical connections between skip-gram based network embedding algorithms and the theory of graph Laplacian. Finally, we present the NetMF method as well as its approximation algorithm for computing network embedding. Our method offers significant improvements over DeepWalk and LINE for conventional network mining tasks. This work lays the theoretical foundation for skip-gram based network embedding methods, leading to a better understanding of latent network representation learning.

References

[1]

Sanjeev Arora, Yuanzhi Li, Yingyu Liang, Tengyu Ma, and Andrej Risteski. 2016. A latent variable model approach to pmi-based word embeddings. TACL Vol. 4 (2016), 385--399.

[2]

Yoshua Bengio, Aaron Courville, and Pierre Vincent. 2013. Representation learning: A review and new perspectives. IEEE TPAMI, Vol. 35, 8 (2013), 1798--1828.

Digital Library

[3]

Austin R. Benson, David F. Gleich, and Jure Leskovec. 2015. Tensor spectral clustering for partitioning higher-order network structures SDM. SIAM, 118--126.

[4]

Austin R. Benson, David F. Gleich, and Lek-Heng Lim. 2017. The Spacey Random Walk: A stochastic Process for Higher-Order Data. SIAM Rev. Vol. 59, 2 (2017), 321--345.

Digital Library

[5]

Matthew Brand and Kun Huang. 2003. A unifying theorem for spectral embedding and clustering. AISTATS.

[6]

Shaosheng Cao, Wei Lu, and Qiongkai Xu. 2015. GraRep: Learning graph representations with global structural information CIKM. ACM, 891--900.

Digital Library

[7]

Shaosheng Cao, Wei Lu, and Qiongkai Xu. 2016. Deep Neural Networks for Learning Graph Representations. AAAI. 1145--1152.

Digital Library

[8]

Shiyu Chang, Wei Han, Jiliang Tang, Guo-Jun Qi, Charu C. Aggarwal, and Thomas S Huang. 2015. Heterogeneous network embedding via deep architectures KDD. ACM, 119--128.

Digital Library

[9]

Dehua Cheng, Yu Cheng, Yan Liu, Richard Peng, and Shang-Hua Teng. 2015. Efficient sampling for Gaussian graphical models via spectral sparsification COLT. 364--390.

[10]

Kewei Cheng, Jundong Li, and Huan Liu. 2017. Unsupervised Feature Selection in Signed Social Networks KDD. ACM.

Digital Library

[11]

Fan R.K. Chung. 1997. Spectral graph theory. Number 92. American Mathematical Soc.

[12]

Yuxiao Dong, Nitesh V. Chawla, and Ananthram Swami. 2017. metapath2vec: Scalable Representation Learning for Heterogeneous Networks KDD.

Digital Library

[13]

David Easley and Jon Kleinberg. 2010. Networks, Crowds, and Markets: Reasoning about a Highly Connected World. Cambridge University Press.

Digital Library

[14]

Rong-En Fan, Kai-Wei Chang, Cho-Jui Hsieh, Xiang-Rui Wang, and Chih-Jen Lin. 2008. LIBLINEAR: A library for large linear classification. JMLR, Vol. 9, Aug (2008), 1871--1874.

Digital Library

[15]

David F. Gleich, Lek-Heng Lim, and Yongyang Yu. 2015. Multilinear pagerank. SIAM J. Matrix Anal. Appl. Vol. 36, 4 (2015), 1507--1541.

Digital Library

[16]

Aditya Grover and Jure Leskovec. 2016. node2vec: Scalable feature learning for networks. KDD. ACM, 855--864.

Digital Library

[17]

William L. Hamilton, Rex Ying, and Jure Leskovec. 2017. Inductive Representation Learning on Large Graphs. NIPS.

[18]

Tatsunori B. Hashimoto, David Alvarez-Melis, and Tommi S. Jaakkola. 2016. Word embeddings as metric recovery in semantic spaces. TACL Vol. 4 (2016), 273--286.

[19]

Roger A. Horn and Charles R. Johnson. 1991. Topics in Matrix Analysis. Cambridge University Press.

Digital Library

[20]

Yann Jacob, Ludovic Denoyer, and Patrick Gallinari. 2014. Learning latent representations of nodes for classifying in heterogeneous social networks WSDM. ACM, 373--382.

Digital Library

[21]

Thomas N. Kipf and Max Welling. 2016. Semi-Supervised Classification with Graph Convolutional Networks. arXiv preprint arXiv:1609.02907 (2016).

[22]

Richard B. Lehoucq, Danny C. Sorensen, and Chao Yang. 1998. ARPACK users' guide: solution of large-scale eigenvalue problems with implicitly restarted Arnoldi methods. SIAM.

[23]

Jure Leskovec, Kevin J. Lang, and Michael Mahoney. 2010. Empirical comparison of algorithms for network community detection WWW. ACM, 631--640.

Digital Library

[24]

Omer Levy and Yoav Goldberg. 2014. Neural Word Embedding as Implicit Matrix Factorization. NIPS. 2177--2185.

Digital Library

[25]

Hang Li, Haozheng Wang, Zhenglu Yang, and Masato Odagaki. 2017. Variation Autoencoder Based Network Representation Learning for Classification ACL. 56.

[26]

László Lovász. 1993. Random walks on graphs. Combinatorics, Paul erdos is eighty Vol. 2 (1993), 1--46.

[27]

Qing Lu and Lise Getoor. 2003. Link-based Classification. In ICML.

Digital Library

[28]

Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013 a. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013).

[29]

Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S. Corrado, and Jeff Dean. 2013 b. Distributed Representations of Words and Phrases and their Compositionality. NIPS. 3111--3119.

Digital Library

[30]

Mingdong Ou, Peng Cui, Jian Pei, Ziwei Zhang, and Wenwu Zhu. 2016. Asymmetric Transitivity Preserving Graph Embedding. KDD. 1105--1114.

Digital Library

[31]

Bryan Perozzi, Rami Al-Rfou, and Steven Skiena. 2014. DeepWalk: Online learning of social representations KDD. ACM, 701--710.

Digital Library

[32]

S. Yu Philip, Jiawei Han, and Christos Faloutsos. 2010. Link mining: Models, algorithms, and applications. Springer.

Digital Library

[33]

Jianbo Shi and Jitendra Malik. 2000. Normalized cuts and image segmentation. IEEE PAMI, Vol. 22, 8 (2000), 888--905.

Digital Library

[34]

A.N. Shiryaev and A. Lyasoff. 2012. Problems in Probability. Springer New York. showLCCN2012937045

[35]

Chris Stark, Bobby-Joe Breitkreutz, Andrew Chatr-Aryamontri, Lorrie Boucher, Rose Oughtred, Michael S. Livstone, Julie Nixon, Kimberly Van Auken, Xiaodong Wang, Xiaoqi Shi, et almbox. 2010. The BioGRID interaction database: 2011 update. Nucleic acids research Vol. 39, suppl_1 (2010), D698--D704.

[36]

Jian Tang, Meng Qu, and Qiaozhu Mei. 2015 a. PTE: Predictive text embedding through large-scale heterogeneous text networks KDD. ACM, 1165--1174.

Digital Library

[37]

Jian Tang, Meng Qu, Mingzhe Wang, Ming Zhang, Jun Yan, and Qiaozhu Mei. 2015 b. LINE: Large-scale information network embedding. WWW. 1067--1077.

Digital Library

[38]

Lei Tang and Huan Liu. 2009. Relational learning via latent social dimensions. KDD. ACM, 817--826.

Digital Library

[39]

Lei Tang, Suju Rajan, and Vijay K. Narayanan. 2009. Large scale multi-label classification via metalabeler WWW. ACM, 211--220.

Digital Library

[40]

Kristina Toutanova, Dan Klein, Christopher D. Manning, and Yoram Singer. 2003. Feature-rich part-of-speech tagging with a cyclic dependency network NAACL. Association for Computational Linguistics, 173--180.

Digital Library

[41]

Lloyd N. Trefethen and David Bau III. 1997. Numerical linear algebra. Vol. Vol. 50. Siam.

[42]

Grigorios Tsoumakas, Ioannis Katakis, and Ioannis Vlahavas. 2009. Mining multi-label data. Data mining and knowledge discovery handbook. Springer, 667--685.

[43]

Cunchao Tu, Han Liu, Zhiyuan Liu, and Maosong Sun. 2017. CANE: Context-aware network embedding for relation modeling ACL.

[44]

Cunchao Tu, Weicheng Zhang, Zhiyuan Liu, and Maosong Sun. 2016. Max-Margin DeepWalk: Discriminative Learning of Network Representation. IJCAI. 3889--3895.

Digital Library

[45]

Ulrike Von Luxburg. 2007. A tutorial on spectral clustering. Statistics and computing Vol. 17, 4 (2007), 395--416.

Digital Library

[46]

Daixin Wang, Peng Cui, and Wenwu Zhu. 2016. Structural deep network embedding. In KDD. ACM, 1225--1234.

Digital Library

[47]

Cheng Yang, Zhiyuan Liu, Deli Zhao, Maosong Sun, and Edward Y. Chang. 2015. Network Representation Learning with Rich Text Information IJCAI. 2111--2117.

Digital Library

[48]

Zhilin Yang, William W. Cohen, and Ruslan Salakhutdinov. 2016 a. Revisiting Semi-Supervised Learning with Graph Embeddings ICML. 40--48.

Digital Library

[49]

Zhilin Yang, Jie Tang, and William W. Cohen. 2016 b. Multi-Modal Bayesian Embeddings for Learning Social Knowledge Graphs. IJCAI. 2287--2293.

Digital Library

Cited By

Tian LWang QZhou ZLiu XZhang MYan G(2025)Predicting drug combination side effects based on a metapath-based heterogeneous graph neural networkBMC Bioinformatics10.1186/s12859-024-06028-626:1Online publication date: 15-Jan-2025
https://doi.org/10.1186/s12859-024-06028-6
Zhang YShen XXie YWong KXie WPeng C(2025)Directed Link Prediction Using GNN With Local and Global Feature FusionIEEE Transactions on Network Science and Engineering10.1109/TNSE.2024.349843412:1(409-422)Online publication date: Jan-2025
https://doi.org/10.1109/TNSE.2024.3498434
Wu WLi SJiang MLuo CLi F(2025)Time- and Space-Efficiently Sketching Billion-Scale Attributed NetworksIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.350825637:2(966-978)Online publication date: Feb-2025
https://doi.org/10.1109/TKDE.2024.3508256
Show More Cited By

Index Terms

Network Embedding as Matrix Factorization: Unifying DeepWalk, LINE, PTE, and node2vec

Recommendations

Dynamic Network Embedding via Temporal Path Adjacency Matrix Factorization
CIKM '22: Proceedings of the 31st ACM International Conference on Information & Knowledge Management

Network embedding has been widely investigated to learn low dimensional nodes representation of networks, and serves for many downstream machine learning tasks. Previous network embedding studies mainly focus on static networks, and cannot adapt well to ...
A General View for Network Embedding as Matrix Factorization
WSDM '19: Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining

We propose a general view that demonstrates the relationship between network embedding approaches and matrix factorization. Unlike previous works that present the equivalence for the approaches from a skip-gram model perspective, we provide a more ...
Accelerating Dynamic Network Embedding with Billions of Parameter Updates to Milliseconds
KDD '23: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

Network embedding, a graph representation learning method illustrating network topology by mapping nodes into lower-dimension vectors, is challenging to accommodate the ever-changing dynamic graphs in practice. Existing research is mainly based on node-...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences

WSDM '18: Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining

February 2018

821 pages

ISBN:9781450355810

DOI:10.1145/3159652

General Chairs:
Yi Chang
Jilin University, Huawei Inc.
,
Chengxiang Zhai
University of Illinois Urbana-Champaign
,
Program Chairs:
Yan Liu
University of Southern California
,
Yoelle Maarek
Amazon

Copyright © 2018 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 02 February 2018

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

National Basic Research Program of China
NSFC

Conference

WSDM 2018

Sponsor:

WSDM 2018: The Eleventh ACM International Conference on Web Search and Data Mining

February 5 - 9, 2018

CA, Marina Del Rey, USA

Acceptance Rates

WSDM '18 Paper Acceptance Rate 81 of 514 submissions, 16%;

Overall Acceptance Rate 498 of 2,863 submissions, 17%

Upcoming Conference

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

589
Total Citations
View Citations
4,548
Total Downloads

Downloads (Last 12 months)344
Downloads (Last 6 weeks)22

Reflects downloads up to 16 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Tian LWang QZhou ZLiu XZhang MYan G(2025)Predicting drug combination side effects based on a metapath-based heterogeneous graph neural networkBMC Bioinformatics10.1186/s12859-024-06028-626:1Online publication date: 15-Jan-2025
https://doi.org/10.1186/s12859-024-06028-6
Zhang YShen XXie YWong KXie WPeng C(2025)Directed Link Prediction Using GNN With Local and Global Feature FusionIEEE Transactions on Network Science and Engineering10.1109/TNSE.2024.349843412:1(409-422)Online publication date: Jan-2025
https://doi.org/10.1109/TNSE.2024.3498434
Wu WLi SJiang MLuo CLi F(2025)Time- and Space-Efficiently Sketching Billion-Scale Attributed NetworksIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.350825637:2(966-978)Online publication date: Feb-2025
https://doi.org/10.1109/TKDE.2024.3508256
Liu JZhang CHe ZZhang WLi N(2025)Network-to-Network: Self-Supervised Network Representation Learning via Position PredictionIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.349339137:3(1354-1365)Online publication date: Mar-2025
https://doi.org/10.1109/TKDE.2024.3493391
Babul STian YLambiotte R(2025)Strong and weak random walks on signed networksnpj Complexity10.1038/s44260-025-00027-12:1Online publication date: 3-Feb-2025
https://doi.org/10.1038/s44260-025-00027-1
Wang ZMeng JLi HDai QLin XLuan Y(2025)Attention-augmented multi-domain cooperative graph representation learning for molecular interaction predictionNeural Networks10.1016/j.neunet.2025.107265(107265)Online publication date: Feb-2025
https://doi.org/10.1016/j.neunet.2025.107265
Cai BWang JTang XLi XHu NHu YLiu MMiao Q(2025)Complex network structural analysis based on information supplementation graph contrastive learningKnowledge-Based Systems10.1016/j.knosys.2024.112833309(112833)Online publication date: Jan-2025
https://doi.org/10.1016/j.knosys.2024.112833
Lin JWan YXu JQi X(2025)Semantic graph neural network with multi-measure learning for semi-supervised classificationEngineering Applications of Artificial Intelligence10.1016/j.engappai.2024.109647140(109647)Online publication date: Jan-2025
https://doi.org/10.1016/j.engappai.2024.109647
Guo ZWu ZXiao TAggarwal CLiu HWang S(2025)Counterfactual Learning on Graphs: A SurveyMachine Intelligence Research10.1007/s11633-024-1519-z22:1(17-59)Online publication date: 24-Jan-2025
https://doi.org/10.1007/s11633-024-1519-z
Liu WHao HWang HZou ZXing WLiu WHao HWang HZou ZXing W(2025)IntroductionGraph Neural Network Methods and Applications in Scene Understanding10.1007/978-981-97-9933-6_1(1-24)Online publication date: 4-Jan-2025
https://doi.org/10.1007/978-981-97-9933-6_1
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten