Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2806416.2806548acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

A Min-Max Optimization Framework For Online Graph Classification

Published: 17 October 2015 Publication History

Abstract

Traditional online learning for graph node classification adapts graph regularization into ridge regression, which may not be suitable when data is adversarially generated. To solve this issue, we propose a more general min-max optimization framework for online graph node classification. The derived online algorithm can achieve a min-max regret compared with the optimal linear model found offline. However, this algorithm assumes that the label is provided for every node, while label is scare and labeling is usually either too time-consuming or expensive in real-world applications. To save labeling effort, we propose a novel confidence-based query approach to prioritize the informative labels. Our theoretical result shows that an online algorithm learning on these selected labels can achieve comparable mistake bound with the fully-supervised online counterpart. To take full advantage of these labels, we propose an aggressive algorithm, which can update the model even if no error occurs. Theoretical analysis shows that the mistake bound of the proposed method, thanks to the aggressive update trials, is better than conservative competitor in expectation. We finally empirically evaluate it on several real-world graph databases. Encouraging experimental results further demonstrate the effectiveness of our method.

References

[1]
J. Abernethy, A. Agarwal, and P. L. Bartlett. A stochastic view of optimal regret through minimax duality. In Proceedings of the 22nd Annual Conference on Learning Theory, 2009.
[2]
M. S. Bartlett. An inverse matrix adjustment arising in discriminant analysis. The Annals of Mathematical Statistics, pages 107--111, 1951.
[3]
M. Belkin, P. Niyogi, and V. Sindhwani. Manifold regularization: A geometric framework for learning from labeled and unlabeled examples. The Journal of Machine Learning Research, 7:2399--2434, 2006.
[4]
G. Cavallanti, N. Cesa-Bianchi, and C. Gentile. Learning noisy linear classifiers via adaptive and selective sampling. Machine learning, 83(1):71--102, 2011.
[5]
N. Cesa-Bianchi, A. Conconi, and C. Gentile. A second-orde perceptron algorithm. SIAM Journal on Computing, 34(3):640--668, 2005.
[6]
N. Cesa-Bianchi, C. Gentile, and F. Orabona. Robust bounds for classification via selective sampling. In ICML-09, pages 121--128, 2009.
[7]
N. Cesa-Bianchi, C. Gentile, and L. Zaniboni. Worst-case analysis of selective sampling for linear classification. The Journal of Machine Learning Research 7:1205--1230, 2006.
[8]
O. Chapelle, B. Schölkopf, A. Zien, et al. Semi-supervised learning, volume 2. MIT press Cambridge, 2006.
[9]
K. Crammer, O. Dekel, J. Keshet, S. Shalev-Shwartz, and Y. Singer. Online passive-aggressive algorithms. JMLR, 7:551--585, 2006.
[10]
K. Crammer, M. Dredze, and F. Pereira. Confidence-weighted linear classification for text categorization. JMLR, 13(1):1891--1926, 2012.
[11]
C. Desrosiers and G. Karypis. Within-network classification using local structure similarity. In Machine Learning and Knowledge Discovery in Databases, pages 260--275. 2009.
[12]
C. Eckart and G. Young. The approximation of one matrix by another of lower rank. Psychometrika, 1(3):211--218, 1936.
[13]
J. Forster. On relative loss bounds in generalized linear regression. In Fundamentals of Computation Theory, pages 269--280, 1999.
[14]
Y. Freund, H. S. Seung, E. Shamir, and N. Tishby. Selective sampling using the query by committee algorithm. Machine learning, 28(2--3):133--168, 1997.
[15]
C. Gentile. The robustness of the p-norm algorithms. Machine Learning, 53(3):265--299, 2003.
[16]
G. Giacinto, F. Roli, and L. Didaci. Fusion of multiple classifiers for intrusion detection in computer networks. Pattern recognition letters, 24(12):1795--1803, 2003.
[17]
A. B. Goldberg, X. Zhu, A. Furger, and J.-M. Xu. Oasis: Online active semi-supervised learning. In AAAI, 2011.
[18]
Q. Gu, C. Aggarwal, J. Liu, and J. Han. Selective sampling on graphs for classification. In Proceedings of the 19th ACM SIGKDD, pages 131--139, 2013.
[19]
M. Herbster, G. Lever, and M. Pontil. Online prediction on large diameter graphs. In NIPS-09, pages 649--656, 2009.
[20]
M. Herbster and M. Pontil. Prediction on a graph with a perceptron. In NIPS-06, pages 577--584, 2006.
[21]
M. Herbster, M. Pontil, and L. Wainer. Online learning over graphs. In ICML-05, pages 305--312, 2005.
[22]
S. C. Hoi, J. Wang, and P. Zhao. Libol: A library for online learning algorithms. The Journal of Machine Learning Research, 15(1):495--499, 2014.
[23]
M. Ji, J. Han, and M. Danilevsky. Ranking-based classification of heterogeneous information networks. In Proceedings of the 17th ACM SIGKDD, pages 1298--1306, 2011.
[24]
D. Kushnir. Active-transductive learning with label-adapte kernels. In Proceedings of the 20th ACM SIGKDD, pages 462--471, 2014.
[25]
F. Orabona and N. Cesa-Bianchi. Better algorithms for selective sampling. In ICML-11, pages 433--440, 2011.
[26]
F. Orabona and K. Crammer. New adaptive algorithms for online classification. In NIPS-10, pages 1840--1848, 2010.
[27]
M. Pennacchiotti and A.-M. Popescu. A machine learning approach to twitter user classification. ICWSM, 11:281--288, 2011.
[28]
A. A. Ross, K. Nandakumar, and A. K. Jain. Handbook of multibiometrics, volume 6. Springer Science & Business Media, 2006.
[29]
N. Slonim, E. Yom-Tov, and K. Crammer. Active online classification via information maximization. In IJCAI, volume 22, page 1498, 2011.
[30]
A. J. Smola and R. Kondor. Kernels and regularization on graphs. In Learning theory and kernel machines, pages 144--158. Springer, 2003.
[31]
E. Takimoto and M. K. Warmuth. The last-step minimax algorithm. In Algorithmic Learning Theory, pages 279--290, 2000.
[32]
V. Vovk. Competitive on-line linear regression. NIPS-98, pages 364--370, 1998.
[33]
J. Wang, P. Zhao, and S. C. Hoi. Exact soft confidence-weighted learning. In ICML-12, pages 121--128, 2012.
[34]
Z. Yang, J. Tang, and Y. Zhang. Active learning for streaming networked data. In CIKM-14, pages 1129--1138, 2014.
[35]
P. Zhao and S. C. Hoi. Cost-sensitive online active learning with application to malicious url detection. In Proceedings of the 19th ACM SIGKDD, pages 919--927. ACM, 2013.
[36]
P. Zhao, S. C. Hoi, and R. Jin. Double updating online learning. The Journal of Machine Learning Research, 12:1587--1615, 2011.
[37]
P. Zhao, S. C. H. Hoi, R. Jin, and T. Yang. Online AUC maximization. In Proceedings of the 28th International Conference on Machine Learning, ICML 2011, Bellevue, Washington, USA, June 28 - July 2, 2011, pages 233--240, 2011.
[38]
P. Zhao, S. C. H. Hoi, J. Wang, and B. Li. Online transfer learning. Artificial Intelligence, 216:76--102, 2014

Cited By

View all
  • (2021)Arms Race in Adversarial Malware Detection: A SurveyACM Computing Surveys10.1145/348449155:1(1-35)Online publication date: 23-Nov-2021
  • (2021)Adversarial Kernel Sampling on Class-imbalanced Data StreamsProceedings of the 30th ACM International Conference on Information & Knowledge Management10.1145/3459637.3482227(2352-2362)Online publication date: 26-Oct-2021
  • (2021)Graph-based Adversarial Online Kernel Learning with Adaptive Embedding2021 IEEE International Conference on Data Mining (ICDM)10.1109/ICDM51629.2021.00091(797-806)Online publication date: Dec-2021
  • Show More Cited By

Index Terms

  1. A Min-Max Optimization Framework For Online Graph Classification

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      CIKM '15: Proceedings of the 24th ACM International on Conference on Information and Knowledge Management
      October 2015
      1998 pages
      ISBN:9781450337946
      DOI:10.1145/2806416
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 17 October 2015

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. graph node classification
      2. min-max optimization
      3. online learning
      4. sampling

      Qualifiers

      • Research-article

      Funding Sources

      • Institute for Infocomm Research, Agency for Science, Technology & Research (A*STAR), Singapore

      Conference

      CIKM'15
      Sponsor:

      Acceptance Rates

      CIKM '15 Paper Acceptance Rate 165 of 646 submissions, 26%;
      Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

      Upcoming Conference

      CIKM '25

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)13
      • Downloads (Last 6 weeks)1
      Reflects downloads up to 16 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2021)Arms Race in Adversarial Malware Detection: A SurveyACM Computing Surveys10.1145/348449155:1(1-35)Online publication date: 23-Nov-2021
      • (2021)Adversarial Kernel Sampling on Class-imbalanced Data StreamsProceedings of the 30th ACM International Conference on Information & Knowledge Management10.1145/3459637.3482227(2352-2362)Online publication date: 26-Oct-2021
      • (2021)Graph-based Adversarial Online Kernel Learning with Adaptive Embedding2021 IEEE International Conference on Data Mining (ICDM)10.1109/ICDM51629.2021.00091(797-806)Online publication date: Dec-2021
      • (2018)Bandit online learning on graphs via adaptive optimizationProceedings of the 27th International Joint Conference on Artificial Intelligence10.5555/3304889.3305076(2991-2997)Online publication date: 13-Jul-2018
      • (2017)Robust Online Multi-Task Learning with Correlative and Personalized StructuresIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2017.270310629:11(2510-2521)Online publication date: 4-Oct-2017
      • (2017)Adversarial Machine Learning in Malware Detection: Arms Race between Evasion Attack and Defense2017 European Intelligence and Security Informatics Conference (EISIC)10.1109/EISIC.2017.21(99-106)Online publication date: Sep-2017
      • (2017)SecMD: Make Machine Learning More Secure Against Adversarial Malware AttacksAI 2017: Advances in Artificial Intelligence10.1007/978-3-319-63004-5_7(76-89)Online publication date: 9-Jul-2017
      • (2016)Efficient multi-class selective sampling on graphsProceedings of the Thirty-Second Conference on Uncertainty in Artificial Intelligence10.5555/3020948.3021031(805-814)Online publication date: 25-Jun-2016
      • (2016)Edge classification in networks2016 IEEE 32nd International Conference on Data Engineering (ICDE)10.1109/ICDE.2016.7498311(1038-1049)Online publication date: May-2016

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media