research-article

Scalable Global Alignment Graph Kernel Using Random Features: From Node Embedding to Graph Embedding

Authors:

Ian En-Hsu Yen,

Charu AggarwalAuthors Info & Claims

KDD '19: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

Pages 1418 - 1428

https://doi.org/10.1145/3292500.3330918

Published: 25 July 2019 Publication History

Abstract

Graph kernels are widely used for measuring the similarity between graphs. Many existing graph kernels, which focus on local patterns within graphs rather than their global properties, suffer from significant structure information loss when representing graphs. Some recent global graph kernels, which utilizes the alignment of geometric node embeddings of graphs, yield state-of-the-art performance. However, these graph kernels are not necessarily positive-definite. More importantly, computing the graph kernel matrix will have at least quadratic time complexity in terms of the number and the size of the graphs. In this paper, we propose a new family of global alignment graph kernels, which take into account the global properties of graphs by using geometric node embeddings and an associated node transportation based on earth mover's distance. Compared to existing global kernels, the proposed kernel is positive-definite. Our graph kernel is obtained by defining a distribution over random graphs, which can naturally yield random feature approximations. The random feature approximations lead to our graph embeddings, which is named as "random graph embeddings" (RGE). In particular, RGE is shown to achieve (quasi-)linear scalability with respect to the number and the size of the graphs. The experimental results on nine benchmark datasets demonstrate that RGE outperforms or matches twelve state-of-the-art graph classification algorithms.

References

[1]

Rami Al-Rfou, Dustin Zelle, and Bryan Perozzi. 2019. DDGK: Learning Graph Representations for Deep Divergence Graph Kernels. arXiv:1904.09671 (2019).

Digital Library

[2]

James Atwood, Siddharth Pal, Don Towsley, and Ananthram Swami. 2016. Sparse Diffusion-Convolutional Neural Networks. In NIPS.

Digital Library

[3]

Francis Bach. 2017. On the equivalence between kernel quadrature rules and random feature expansions. Journal of Machine Learning Research 18, 21 (2017), 1--38.

Digital Library

[4]

Karsten M Borgwardt and Hans-Peter Kriegel. 2005. Shortest-path kernels on graphs. In Data Mining, Fifth IEEE International Conference on. IEEE, 8--pp.

Digital Library

[5]

Francois Bourgeois and Jean-Claude Lassalle. 1971. An extension of the Munkres algorithm for the assignment problem to rectangular matrices. Commun. ACM 14, 12 (1971), 802--804.

Digital Library

[6]

Pin-Yu Chen and Lingfei Wu. 2017. Revisiting spectral graph clustering with generative community models. In ICDM. 51--60.

[7]

Rong-En Fan, Kai-Wei Chang, Cho-Jui Hsieh, Xiang-Rui Wang, and Chih-Jen Lin. 2008. LIBLINEAR: A library for large linear classification. Journal of machine learning research 9, Aug (2008), 1871--1874.

Digital Library

[8]

Matthew Fisher, Manolis Savva, and Pat Hanrahan. 2011. Characterizing struc- tural relationships in scenes using graph kernels. ACM Transactions on Graphics (TOG) 30, 4 (2011), 34.

Digital Library

[9]

Thomas Gartner, Peter Flach, and Stefan Wrobel. 2003. On graph kernels: Hard- ness results and efficient alternatives. In Learning Theory and Kernel Machines. Springer, 129--143.

[10]

David Haussler. 1999. Convolution kernels on discrete structures. Technical Report. Department of Computer Science, University of California at Santa Cruz.

[11]

Frank L Hitchcock. 1941. The distribution of a product from several sources to numerous localities. Studies in Applied Mathematics 20, 1--4 (1941), 224--230.

[12]

Tama's Horvath, Thomas Gartner, and Stefan Wrobel. 2004. Cyclic pattern kernels for predictive graph mining. In KDD. ACM, 158--167.

Digital Library

[13]

Catalin Ionescu, Alin Popa, and Cristian Sminchisescu. 2017. Large-scale data- dependent kernel approximation. In Artificial Intelligence and Statistics. 19--27.

[14]

Fredrik Johansson, Vinay Jethava, Devdatt Dubhashi, and Chiranjib Bhat- tacharyya. 2014. Global graph kernels using geometric embeddings. In ICML.

Digital Library

[15]

Fredrik D Johansson and Devdatt Dubhashi. 2015. Learning with similarity functions on graphs using matchings of geometric embeddings. In KDD. ACM, 467--476.

Digital Library

[16]

Risi Kondor and Horace Pan. 2016. The multiscale laplacian graph kernel. In NIPS. 2990--2998.

Digital Library

[17]

Nils M Kriege, Pierre-Louis Giscard, and Richard Wilson. 2016. On valid optimal assignment kernels and applications to graph classification. In NIPS. 1623--1631. {18} Matt Kusner, Yu Sun, Nicholas Kolkin, and Kilian Weinberger. 2015. From word embeddings to document distances. In ICML. 957--966.

Digital Library

[18]

Quoc Le, Tama's Sarlo's, and Alex Smola. 2013. Fastfood-approximating kernel expansions in loglinear time. In ICML, Vol. 85.

Digital Library

[19]

La'szlo Lova'sz. 1979. On the Shannon capacity of a graph. IEEE Transactions on Information theory 25, 1 (1979), 1--7.

Digital Library

[20]

Fatemeh Mokhtari and Gholam-Ali Hossein-Zadeh. 2013. Decoding brain states using backward edge elimination and graph kernels in fMRI connectivity net- works. Journal of neuroscience methods 212, 2 (2013), 259--268.

[21]

Mathias Niepert, Mohamed Ahmed, and Konstantin Kutzkov. 2016. Learning convolutional neural networks for graphs. In ICML. 2014--2023.

Digital Library

[22]

Giannis Nikolentzos, Polykarpos Meladianos, and Michalis Vazirgiannis. 2017. Matching Node Embeddings for Graph Similarity. In AAAI. 2429--2435.

Digital Library

[23]

Ali Rahimi and Benjamin Recht. 2008. Random features for large-scale kernel machines. In NIPS. 1177--1184.

Digital Library

[24]

Liva Ralaivola, Sanjay J Swamidass, Hiroto Saigo, and Pierre Baldi. 2005. Graph kernels for chemical informatics. Neural networks 18, 8 (2005), 1093--1110.

Digital Library

[25]

Yossi Rubner, Carlo Tomasi, and Leonidas J Guibas. 2000. The earth mover's distance as a metric for image retrieval. International journal of computer vision 40, 2 (2000), 99--121.

Digital Library

[26]

Alessandro Rudi and Lorenzo Rosasco. 2017. Generalization properties of learning with random features. In NIPS. 3218--3228.

Digital Library

[27]

Nino Shervashidze and Karsten M Borgwardt. 2009. Fast subtree kernels on graphs. In NIPS. 1660--1668.

Digital Library

[28]

Nino Shervashidze, Pascal Schweitzer, Erik Jan van Leeuwen, Kurt Mehlhorn, and Karsten M Borgwardt. 2011. Weisfeiler-lehman graph kernels. Journal of Machine Learning Research 12, Sep (2011), 2539--2561.

Digital Library

[29]

NinoShervashidze,SVNVishwanathan,TobiasPetri,KurtMehlhorn,andKarsten Borgwardt. 2009. Efficient graphlet kernels for large graph comparison. In AIStats. 488--495.

[30]

Aman Sinha and John C Duchi. 2016. Learning kernels with random features. In NIPS. 1298--1306.

Digital Library

[31]

Justin Solomon, Raif Rustamov, Leonidas Guibas, and Adrian Butscher. 2016. Continuous-flow graph transportation distances. arXiv:1603.06927 (2016).

[32]

Andreas Stathopoulos and James R McCombs. 2010. PRIMME: preconditioned iterative multimethod eigensolveraATmethods and software description. ACM Transactions on Mathematical Software (TOMS) 37, 2 (2010), 21.

Digital Library

[33]

S Vichy N Vishwanathan, Nicol N Schraudolph, Risi Kondor, and Karsten M Borgwardt. 2010. Graph kernels. Journal of Machine Learning Research 11 (2010), 1201--1242.

[34]

Ulrike Von Luxburg. 2007. A tutorial on spectral clustering. Statistics and computing 17, 4 (2007), 395--416.

Digital Library

[35]

Cynthia Wagner, Gerard Wagener, Radu State, and Thomas Engel. 2009. Malware analysis with graph kernels and support vector machines. In Malicious and Unwanted Software, 2009 4th International Conference on. IEEE, 63--68.

[36]

Ling Wang and Hichem Sahbi. 2013. Directed acyclic graph kernels for action recognition. In ICCV. IEEE, 3168--3175.

Digital Library

[37]

Bo Wu, Yang Liu, Bo Lang, and Lei Huang. 2017. DGCNN: Disordered Graph Convolutional Neural Network Based on the Gaussian Mixture Model. arXiv:1712.03563 (2017).

[38]

Lingfei Wu, Pin-Yu Chen, Ian En-Hsu Yen, Fangli Xu, Yinglong Xia, and Charu Aggarwal. 2018. Scalable spectral clustering using random binning features. In KDD. ACM, 2506--2515.

Digital Library

[39]

Lingfei Wu, Eloy Romero, and Andreas Stathopoulos. 2017. PRIMME_SVDS: A high-performance preconditioned SVD solver for accurate large-scale computa- tions. SIAM Journal on Scientific Computing 39, 5 (2017), S248--S271.

Digital Library

[40]

Lingfei Wu, Ian EH Yen, Jie Chen, and Rui Yan. 2016. Revisiting random binning features: Fast convergence and strong parallelizability. In KDD. ACM, 1265--1274.

Digital Library

[41]

Lingfei Wu, Ian En-Hsu Yen, Fangli Xu, Pradeep Ravikuma, and Michael Wit- brock. 2018. D2KE: From Distance to Kernel and Embedding. arXiv preprint arXiv:1802.04956 (2018).

[42]

Lingfei Wu, Ian En-Hsu Yen, Jinfeng Yi, Fangli Xu, Qi Lei, and Michael Witbrock. 2018. Random Warping Series: A Random Features Method for Time-Series Embedding. In International Conference on Artificial Intelligence and Statistics. 793--802.

[43]

Pinar Yanardag and SVN Vishwanathan. 2015. Deep graph kernels. In KDD. ACM, 1365--1374.

Digital Library

[44]

Pinar Yanardag and SVN Vishwanathan. 2015. A structural smoothing framework for robust graph comparison. In NIPS. 2134--2142.

Digital Library

[45]

Zhen Zhang, Mianzhi Wang, Yijian Xiang, Yan Huang, and Arye Nehorai. 2018. RetGK: Graph Kernels based on Return Probabilities of Random Walks. In NIPS. 3968--3978.

Digital Library

Cited By

Li QYou TChen JZhang YDu C(2024)BioDynGrap: Biomedical event prediction via interpretable learning framework for heterogeneous dynamic graphsExpert Systems with Applications10.1016/j.eswa.2023.122964244(122964)Online publication date: Jun-2024
https://doi.org/10.1016/j.eswa.2023.122964
Welke PThiessen MJogl FGärtner TKrause ABrunskill ECho KEngelhardt BSabato SScarlett J(2023)Expectation-complete graph representations with homomorphismsProceedings of the 40th International Conference on Machine Learning10.5555/3618408.3619944(36910-36925)Online publication date: 23-Jul-2023
https://dl.acm.org/doi/10.5555/3618408.3619944
Choromanski KKrause ABrunskill ECho KEngelhardt BSabato SScarlett J(2023)Taming graph kernels with random featuresProceedings of the 40th International Conference on Machine Learning10.5555/3618408.3618645(5964-5977)Online publication date: 23-Jul-2023
https://dl.acm.org/doi/10.5555/3618408.3618645
Show More Cited By

Index Terms

Scalable Global Alignment Graph Kernel Using Random Features: From Node Embedding to Graph Embedding
1. Computing methodologies
  1. Machine learning
    1. Machine learning approaches
      1. Kernel methods

Recommendations

A generalized Weisfeiler-Lehman graph kernel
Abstract
After more than one decade, Weisfeiler-Lehman graph kernels are still among the most prevalent graph kernels due to their remarkable predictive performance and time complexity. They are based on a fast iterative partitioning of vertices, ...
Investigation of kernel functions in EDA-GK
GECCO '18: Proceedings of the Genetic and Evolutionary Computation Conference Companion

We have proposed EDA-GK, Estimation of Distribution Algorithms with Graph Kernels. The EDA-GK is designed for solving graph-related problems, where individuals can be represented by graphs. By using graph kernels, the EDA-GK can be solved for graph-...
Dual-decoder graph autoencoder for unsupervised graph representation learning
Abstract
Unsupervised graph representation learning is a challenging task that embeds graph data into a low-dimensional space without label guidance. Recently, graph autoencoders have been proven to be an effective way to solve this problem in ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences

KDD '19: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

July 2019

3305 pages

ISBN:9781450362016

DOI:10.1145/3292500

General Chairs:
Ankur Teredesai
KenSci
,
Vipin Kumar
University of Minnesota
,
Program Chairs:
Ying Li
EV Analysis Corporation
,
Rómer Rosales
LinkedIn
,
Evimaria Terzi
Boston University
,
George Karypis
University of Minnesota

Copyright © 2019 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 25 July 2019

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

KDD '19

Sponsor:

KDD '19: The 25th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

August 4 - 8, 2019

AK, Anchorage, USA

Acceptance Rates

KDD '19 Paper Acceptance Rate 110 of 1,200 submissions, 9%;

Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

22
Total Citations
View Citations
1,053
Total Downloads

Downloads (Last 12 months)41
Downloads (Last 6 weeks)5

Reflects downloads up to 30 Nov 2024

Other Metrics

View Author Metrics

Citations

Cited By

Li QYou TChen JZhang YDu C(2024)BioDynGrap: Biomedical event prediction via interpretable learning framework for heterogeneous dynamic graphsExpert Systems with Applications10.1016/j.eswa.2023.122964244(122964)Online publication date: Jun-2024
https://doi.org/10.1016/j.eswa.2023.122964
Welke PThiessen MJogl FGärtner TKrause ABrunskill ECho KEngelhardt BSabato SScarlett J(2023)Expectation-complete graph representations with homomorphismsProceedings of the 40th International Conference on Machine Learning10.5555/3618408.3619944(36910-36925)Online publication date: 23-Jul-2023
https://dl.acm.org/doi/10.5555/3618408.3619944
Choromanski KKrause ABrunskill ECho KEngelhardt BSabato SScarlett J(2023)Taming graph kernels with random featuresProceedings of the 40th International Conference on Machine Learning10.5555/3618408.3618645(5964-5977)Online publication date: 23-Jul-2023
https://dl.acm.org/doi/10.5555/3618408.3618645
Zheng YZhang ZLuo QXie ZYu D(2023)A Truss-Based Framework for Graph Similarity ComputationJournal of Database Management10.4018/JDM.32208734:1(1-18)Online publication date: 25-Apr-2023
https://dl.acm.org/doi/10.4018/JDM.322087
Ling XWu LWang SMa TXu FLiu AWu CJi S(2023)Multilevel Graph Matching Networks for Deep Graph Similarity LearningIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2021.310223434:2(799-813)Online publication date: Feb-2023
https://doi.org/10.1109/TNNLS.2021.3102234
Liu ZWang CLou YFeng H(2023)Fast Unsupervised Graph Embedding via Graph Zoom Learning2023 IEEE 39th International Conference on Data Engineering (ICDE)10.1109/ICDE55515.2023.00196(2551-2564)Online publication date: Apr-2023
https://doi.org/10.1109/ICDE55515.2023.00196
Bibi NMaqbool ARana TAfzal FAkgül AEl Din S(2023)Enhancing Semantic Code Search with Deep Graph MatchingIEEE Access10.1109/ACCESS.2023.3263878(1-1)Online publication date: 2023
https://doi.org/10.1109/ACCESS.2023.3263878
Liu PJi LYe FFu B(2023)GraphLSurvComputer Methods and Programs in Biomedicine10.1016/j.cmpb.2023.107433231:COnline publication date: 1-Apr-2023
https://dl.acm.org/doi/10.1016/j.cmpb.2023.107433
Gao YChowdhury TWu LWu LZhao L(2022)Modeling Health Stage Development of Patients with Dynamic Attributed Graphs in Online Health CommunitiesIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2022.3144083(1-1)Online publication date: 2022
https://doi.org/10.1109/TKDE.2022.3144083
Chen TSun QZhan CLiu CYu HYu B(2022)Deep H-GCN: Fast Analog IC Aging-Induced Degradation EstimationIEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems10.1109/TCAD.2021.310725041:7(1990-2003)Online publication date: Jul-2022
https://doi.org/10.1109/TCAD.2021.3107250
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents