research-article

A Min-Max Optimization Framework For Online Graph Classification

Authors:

Peilin ZhaoAuthors Info & Claims

CIKM '15: Proceedings of the 24th ACM International on Conference on Information and Knowledge Management

Pages 643 - 652

https://doi.org/10.1145/2806416.2806548

Published: 17 October 2015 Publication History

Abstract

Traditional online learning for graph node classification adapts graph regularization into ridge regression, which may not be suitable when data is adversarially generated. To solve this issue, we propose a more general min-max optimization framework for online graph node classification. The derived online algorithm can achieve a min-max regret compared with the optimal linear model found offline. However, this algorithm assumes that the label is provided for every node, while label is scare and labeling is usually either too time-consuming or expensive in real-world applications. To save labeling effort, we propose a novel confidence-based query approach to prioritize the informative labels. Our theoretical result shows that an online algorithm learning on these selected labels can achieve comparable mistake bound with the fully-supervised online counterpart. To take full advantage of these labels, we propose an aggressive algorithm, which can update the model even if no error occurs. Theoretical analysis shows that the mistake bound of the proposed method, thanks to the aggressive update trials, is better than conservative competitor in expectation. We finally empirically evaluate it on several real-world graph databases. Encouraging experimental results further demonstrate the effectiveness of our method.

References

[1]

J. Abernethy, A. Agarwal, and P. L. Bartlett. A stochastic view of optimal regret through minimax duality. In Proceedings of the 22nd Annual Conference on Learning Theory, 2009.

[2]

M. S. Bartlett. An inverse matrix adjustment arising in discriminant analysis. The Annals of Mathematical Statistics, pages 107--111, 1951.

[3]

M. Belkin, P. Niyogi, and V. Sindhwani. Manifold regularization: A geometric framework for learning from labeled and unlabeled examples. The Journal of Machine Learning Research, 7:2399--2434, 2006.

Digital Library

[4]

G. Cavallanti, N. Cesa-Bianchi, and C. Gentile. Learning noisy linear classifiers via adaptive and selective sampling. Machine learning, 83(1):71--102, 2011.

Digital Library

[5]

N. Cesa-Bianchi, A. Conconi, and C. Gentile. A second-orde perceptron algorithm. SIAM Journal on Computing, 34(3):640--668, 2005.

Digital Library

[6]

N. Cesa-Bianchi, C. Gentile, and F. Orabona. Robust bounds for classification via selective sampling. In ICML-09, pages 121--128, 2009.

Digital Library

[7]

N. Cesa-Bianchi, C. Gentile, and L. Zaniboni. Worst-case analysis of selective sampling for linear classification. The Journal of Machine Learning Research 7:1205--1230, 2006.

Digital Library

[8]

O. Chapelle, B. Schölkopf, A. Zien, et al. Semi-supervised learning, volume 2. MIT press Cambridge, 2006.

[9]

K. Crammer, O. Dekel, J. Keshet, S. Shalev-Shwartz, and Y. Singer. Online passive-aggressive algorithms. JMLR, 7:551--585, 2006.

Digital Library

[10]

K. Crammer, M. Dredze, and F. Pereira. Confidence-weighted linear classification for text categorization. JMLR, 13(1):1891--1926, 2012.

Digital Library

[11]

C. Desrosiers and G. Karypis. Within-network classification using local structure similarity. In Machine Learning and Knowledge Discovery in Databases, pages 260--275. 2009.

Digital Library

[12]

C. Eckart and G. Young. The approximation of one matrix by another of lower rank. Psychometrika, 1(3):211--218, 1936.

[13]

J. Forster. On relative loss bounds in generalized linear regression. In Fundamentals of Computation Theory, pages 269--280, 1999.

Digital Library

[14]

Y. Freund, H. S. Seung, E. Shamir, and N. Tishby. Selective sampling using the query by committee algorithm. Machine learning, 28(2--3):133--168, 1997.

Digital Library

[15]

C. Gentile. The robustness of the p-norm algorithms. Machine Learning, 53(3):265--299, 2003.

Digital Library

[16]

G. Giacinto, F. Roli, and L. Didaci. Fusion of multiple classifiers for intrusion detection in computer networks. Pattern recognition letters, 24(12):1795--1803, 2003.

Digital Library

[17]

A. B. Goldberg, X. Zhu, A. Furger, and J.-M. Xu. Oasis: Online active semi-supervised learning. In AAAI, 2011.

[18]

Q. Gu, C. Aggarwal, J. Liu, and J. Han. Selective sampling on graphs for classification. In Proceedings of the 19th ACM SIGKDD, pages 131--139, 2013.

Digital Library

[19]

M. Herbster, G. Lever, and M. Pontil. Online prediction on large diameter graphs. In NIPS-09, pages 649--656, 2009.

[20]

M. Herbster and M. Pontil. Prediction on a graph with a perceptron. In NIPS-06, pages 577--584, 2006.

Digital Library

[21]

M. Herbster, M. Pontil, and L. Wainer. Online learning over graphs. In ICML-05, pages 305--312, 2005.

Digital Library

[22]

S. C. Hoi, J. Wang, and P. Zhao. Libol: A library for online learning algorithms. The Journal of Machine Learning Research, 15(1):495--499, 2014.

Digital Library

[23]

M. Ji, J. Han, and M. Danilevsky. Ranking-based classification of heterogeneous information networks. In Proceedings of the 17th ACM SIGKDD, pages 1298--1306, 2011.

Digital Library

[24]

D. Kushnir. Active-transductive learning with label-adapte kernels. In Proceedings of the 20th ACM SIGKDD, pages 462--471, 2014.

Digital Library

[25]

F. Orabona and N. Cesa-Bianchi. Better algorithms for selective sampling. In ICML-11, pages 433--440, 2011.

[26]

F. Orabona and K. Crammer. New adaptive algorithms for online classification. In NIPS-10, pages 1840--1848, 2010.

[27]

M. Pennacchiotti and A.-M. Popescu. A machine learning approach to twitter user classification. ICWSM, 11:281--288, 2011.

[28]

A. A. Ross, K. Nandakumar, and A. K. Jain. Handbook of multibiometrics, volume 6. Springer Science & Business Media, 2006.

Digital Library

[29]

N. Slonim, E. Yom-Tov, and K. Crammer. Active online classification via information maximization. In IJCAI, volume 22, page 1498, 2011.

Digital Library

[30]

A. J. Smola and R. Kondor. Kernels and regularization on graphs. In Learning theory and kernel machines, pages 144--158. Springer, 2003.

[31]

E. Takimoto and M. K. Warmuth. The last-step minimax algorithm. In Algorithmic Learning Theory, pages 279--290, 2000.

Digital Library

[32]

V. Vovk. Competitive on-line linear regression. NIPS-98, pages 364--370, 1998.

Digital Library

[33]

J. Wang, P. Zhao, and S. C. Hoi. Exact soft confidence-weighted learning. In ICML-12, pages 121--128, 2012.

[34]

Z. Yang, J. Tang, and Y. Zhang. Active learning for streaming networked data. In CIKM-14, pages 1129--1138, 2014.

Digital Library

[35]

P. Zhao and S. C. Hoi. Cost-sensitive online active learning with application to malicious url detection. In Proceedings of the 19th ACM SIGKDD, pages 919--927. ACM, 2013.

Digital Library

[36]

P. Zhao, S. C. Hoi, and R. Jin. Double updating online learning. The Journal of Machine Learning Research, 12:1587--1615, 2011.

Digital Library

[37]

P. Zhao, S. C. H. Hoi, R. Jin, and T. Yang. Online AUC maximization. In Proceedings of the 28th International Conference on Machine Learning, ICML 2011, Bellevue, Washington, USA, June 28 - July 2, 2011, pages 233--240, 2011.

Digital Library

[38]

P. Zhao, S. C. H. Hoi, J. Wang, and B. Li. Online transfer learning. Artificial Intelligence, 216:76--102, 2014

Digital Library

Cited By

Li DLi QYe YXu S(2021)Arms Race in Adversarial Malware Detection: A SurveyACM Computing Surveys10.1145/348449155:1(1-35)Online publication date: 23-Nov-2021
https://dl.acm.org/doi/10.1145/3484491
Yang PLi PDemartini GZuccon GCulpepper JHuang ZTong H(2021)Adversarial Kernel Sampling on Class-imbalanced Data StreamsProceedings of the 30th ACM International Conference on Information & Knowledge Management10.1145/3459637.3482227(2352-2362)Online publication date: 26-Oct-2021
https://dl.acm.org/doi/10.1145/3459637.3482227
Yang PLi XLi P(2021)Graph-based Adversarial Online Kernel Learning with Adaptive Embedding2021 IEEE International Conference on Data Mining (ICDM)10.1109/ICDM51629.2021.00091(797-806)Online publication date: Dec-2021
https://doi.org/10.1109/ICDM51629.2021.00091
Show More Cited By

Index Terms

A Min-Max Optimization Framework For Online Graph Classification
1. Computing methodologies
  1. Machine learning
2. Information systems
  1. Information systems applications
    1. Data mining

Recommendations

Semi-supervised Graph Embedding for Multi-label Graph Node Classification
Web Information Systems Engineering – WISE 2019
Abstract
The graph convolution network (GCN) is a widely-used facility to realize graph-based semi-supervised learning, which usually integrates node, features, and graph topologic information to build learning models. However, as for multi-label learning ...
Multi-label graph node classification with label attentive neighborhood convolution
Highlights
- A novel neural network-based method for multi-label graph node classification.
- ...
Abstract
Learning with graph structured data is of great significance for many practical applications. A crucial and fundamental task in graph learning is node classification. In reality, graph nodes are often encoded with various attributes. ...
A Novel Online Stacked Ensemble for Multi-Label Stream Classification
CIKM '18: Proceedings of the 27th ACM International Conference on Information and Knowledge Management

As data streams become more prevalent, the necessity for online algorithms that mine this transient and dynamic data becomes clearer. Multi-label data stream classification is a supervised learning problem where each instance in the data stream is ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences

CIKM '15: Proceedings of the 24th ACM International on Conference on Information and Knowledge Management

October 2015

1998 pages

ISBN:9781450337946

DOI:10.1145/2806416

General Chairs:
James Bailey
The University of Melbourne
,
Alistair Moffat
The University of Melbourne
,
Program Chairs:
Charu C. Aggarwal
IBM
,
Maarten de Rijke
University of Amsterdam
,
Ravi Kumar
Google
,
Vanessa Murdock
Microsoft
,
Timos Sellis
RMIT University
,
Jeffrey Xu Yu
Chinese University of Hong Kong

Copyright © 2015 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 October 2015

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

Institute for Infocomm Research, Agency for Science, Technology & Research (A*STAR), Singapore

Conference

CIKM'15

Sponsor:

CIKM'15: 24th ACM International Conference on Information and Knowledge Management

October 18 - 23, 2015

Melbourne, Australia

Acceptance Rates

CIKM '15 Paper Acceptance Rate 165 of 646 submissions, 26%;

Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Sponsor:
sigir
sigir

The 34th ACM International Conference on Information and Knowledge Management

November 10 - 14, 2025

Seoul , Republic of Korea

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

9
Total Citations
View Citations
375
Total Downloads

Downloads (Last 12 months)13
Downloads (Last 6 weeks)1

Reflects downloads up to 16 Dec 2024

Other Metrics

View Author Metrics

Citations

Cited By

Li DLi QYe YXu S(2021)Arms Race in Adversarial Malware Detection: A SurveyACM Computing Surveys10.1145/348449155:1(1-35)Online publication date: 23-Nov-2021
https://dl.acm.org/doi/10.1145/3484491
Yang PLi PDemartini GZuccon GCulpepper JHuang ZTong H(2021)Adversarial Kernel Sampling on Class-imbalanced Data StreamsProceedings of the 30th ACM International Conference on Information & Knowledge Management10.1145/3459637.3482227(2352-2362)Online publication date: 26-Oct-2021
https://dl.acm.org/doi/10.1145/3459637.3482227
Yang PLi XLi P(2021)Graph-based Adversarial Online Kernel Learning with Adaptive Embedding2021 IEEE International Conference on Data Mining (ICDM)10.1109/ICDM51629.2021.00091(797-806)Online publication date: Dec-2021
https://doi.org/10.1109/ICDM51629.2021.00091
Yang PZhao PGao X(2018)Bandit online learning on graphs via adaptive optimizationProceedings of the 27th International Joint Conference on Artificial Intelligence10.5555/3304889.3305076(2991-2997)Online publication date: 13-Jul-2018
https://dl.acm.org/doi/10.5555/3304889.3305076
Yang PZhao PGao X(2017)Robust Online Multi-Task Learning with Correlative and Personalized StructuresIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2017.270310629:11(2510-2521)Online publication date: 4-Oct-2017
https://dl.acm.org/doi/10.1109/TKDE.2017.2703106
Chen LYe YBourlai T(2017)Adversarial Machine Learning in Malware Detection: Arms Race between Evasion Attack and Defense2017 European Intelligence and Security Informatics Conference (EISIC)10.1109/EISIC.2017.21(99-106)Online publication date: Sep-2017
https://doi.org/10.1109/EISIC.2017.21
Chen LYe Y(2017)SecMD: Make Machine Learning More Secure Against Adversarial Malware AttacksAI 2017: Advances in Artificial Intelligence10.1007/978-3-319-63004-5_7(76-89)Online publication date: 9-Jul-2017
https://doi.org/10.1007/978-3-319-63004-5_7
Yang PZhao PHai ZLiu WHoi SLi X(2016)Efficient multi-class selective sampling on graphsProceedings of the Thirty-Second Conference on Uncertainty in Artificial Intelligence10.5555/3020948.3021031(805-814)Online publication date: 25-Jun-2016
https://dl.acm.org/doi/10.5555/3020948.3021031
Aggarwal CHe GZhao P(2016)Edge classification in networks2016 IEEE 32nd International Conference on Data Engineering (ICDE)10.1109/ICDE.2016.7498311(1038-1049)Online publication date: May-2016
https://doi.org/10.1109/ICDE.2016.7498311

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents