Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

Batch Mode Active Learning for Networked Data

Published: 01 February 2012 Publication History

Abstract

We study a novel problem of batch mode active learning for networked data. In this problem, data instances are connected with links and their labels are correlated with each other, and the goal of batch mode active learning is to exploit the link-based dependencies and node-specific content information to actively select a batch of instances to query the user for learning an accurate model to label unknown instances in the network. We present three criteria (i.e., minimum redundancy, maximum uncertainty, and maximum impact) to quantify the informativeness of a set of instances, and formalize the batch mode active learning problem as selecting a set of instances by maximizing an objective function which combines both link and content information. As solving the objective function is NP-hard, we present an efficient algorithm to optimize the objective function with a bounded approximation rate. To scale to real large networks, we develop a parallel implementation of the algorithm. Experimental results on both synthetic datasets and real-world datasets demonstrate the effectiveness and efficiency of our approach.

References

[1]
Anguelov, D., Taskar, B., Chatalbashev, V., Koller, D., Gupta, D., Heitz, G., and Ng, A. 2005. Discriminative learning of markov random fields for segmentation of 3d scan data. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 169--176.
[2]
Attenberg, J. and Provost, F. 2010. Active inference and learning for classifying streams. In Proceedings of the Budgeted Learning Workshop in International Conference on Machine Learning (ICML Workshop).
[3]
Besag, J. 1986. On the statistical analysis of dirty pictures. J. Roy. Statist. Soc. 259--302.
[4]
Beygelzimer, A., Dasgupta, S., and Langford, J. 2009. Importance weighted active learning. In Proceedings of the International Conference on Machine Learning (ICML). 49--56.
[5]
Bilgic, M. and Getoor, L. 2008. Effective label acquisition for collective classification. In Proceedings of the ACM Special Interest Group on Knowledge Discovery and Data Mining (SIGKDD). 43--51.
[6]
Bilgic, M. and Getoor, L. 2009. Reflect and correct: A misclassification prediction approach to active inference. ACM Trans. Knowl. Discov. Data (TKDD) 3, 4, 1--32.
[7]
Bilgic, M. and Getoor, L. 2010. Active inference for collective classification. In Proceedings of the National Conference on Artificial Intelligence (AAAI).
[8]
Bilgic, M., Mihalkova, L., and Getoor, L. 2010. Active learning for networked data. In Proceedings of the International Conference on Machine Learning (ICML).
[9]
Boykov, Y., Veksler, O., and Zabih, R. 2001. Fast approximate energy minimization via graph cuts. IEEE Trans. Pattern Anal. Mach. Intell. 23, 1222--1239.
[10]
Brinker, K. 2003. Incorporating diversity in active learning with support vector machines. In Proceedings of the International Conference on Machine Learning (ICML). 59--66.
[11]
Cannon, L. E. 1969. A cellular computer to implement the kalman filter algorithm. Ph.D. thesis, Montana State University.
[12]
Cesa-Bianchi, N., Gentile, C., Vitale, F., and Zappella, G. 2010. Active learning on trees and graphs. Tech. rep., MIT Press.
[13]
Chakrabarti, S., Dom, B., and Indyk, P. 1998. Enhanced hypertext categorization using hyperlinks. In Proceedings of the ACM Special Interest Group on Management of Data (SIGMOD). 307--318.
[14]
Craven, M., DiPasquo, D., Freitag, D., McCallum, A., Mitchell, T., Nigam, K., and Slattery, S. 1998. Learning to extract symbolic knowledge from the world wide web. In Proceedings of the National Conference on Artificial Intelligence (AAAI). 509--516.
[15]
Frank, A. and Asuncion, A. 2010. UCI machine learning repository. http://archive.ics.uci.edu/ml/citation_policy.html.
[16]
Getoor, L., Segal, E., Taskar, B., and Koller, D. 2001. Probabilistic models of text and link structure for hypertext classification. In Proceedings of the International Joint Conference on Artificial Intelligence Workshop on “Text Learning: Beyond Supervision” (IJCAI Workshop). 24--29.
[17]
Gropp, W., Lusk, E., and Skjellum, A. 1994. Using MPI: Portable Parallel Programming with the Message Passing Interface. MIT Press.
[18]
Guo, Y. and Schuurmans, D. 2008. Discriminative batch mode active learning. In Proceedings of the Conference on Advances in Neural Information Processing Systems (NIPS). 593--600.
[19]
Harpale, A. S. and Yang, Y. 2008. Personalized active learning for collaborative filtering. In Proceedings of the Special Interest Group on Information Retrieval (SIGIR). 91--98.
[20]
Heß, A. and Kushmerick, N. 2004. Iterative ensemble classification for relational data: A case study of semantic web services. In Proceedings of the European Conference on Machine Learning (ECML). 156--167.
[21]
Hoi, S. C. H., Jin, R., and Lyu, M. R. 2006a. Large-Scale text categorization by batch mode active learning. In Proceedings of the World Wide Web Conference (WWW). 633--642.
[22]
Hoi, S. C. H., Jin, R., Zhu, J., and Lyu, M. R. 2006b. Batch mode active learning and its application to medical image classification. In Proceedings of the International Conference on Machine Learning (ICML). 417--424.
[23]
Hoi, S. C. H., Jin, R., Zhu, J., and Lyu, M. R. 2008. Semi-Supervised svm batch mode active learning for image retrieval. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 1--7.
[24]
Huang, K. and Wang, F. 1997. Design patterns for parallel computations of master-slave model. In Proceedings of the International Conference on Information, Communications and Signal Processing (ICICS). 1508--1512.
[25]
Hummel, R. and Zucker, S. 1983. On the foundations of relaxation labeling processes. IEEE Trans. Pattern Anal. Mach. Intell. 5, 3, 267--287.
[26]
Jensen, D. and Neville, J. 2002. Linkage and autocorrelation cause feature selection bias in relational learning. In Proceedings of the International Conference on Machine Learning (ICML). 259--266.
[27]
Jensen, D., Neville, J., and Gallagher, B. 2004. Why collective inference improves relational classification. In Proceedings of ACM Special Interest Group on Knowledge Discovery and Data Mining (SIGKDD). 593--598.
[28]
Joshi, A. J., Porikli, F., and Papanikolopoulos, N. 2010. Multi-Class batch-mode active learning for image classification. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA). 1873--1878.
[29]
Kawahara, Y., Nagano, K., Tsuda, K., and Bilmes, J. 2009. Submodularity cuts and applications. In Proceedings of the Conference on Advances in Neural Information Processing Systems (NIPS).
[30]
Kschischang, F. R., Frey, B. J., and Loeliger, H. 2001. Factor graphs and the sum-product algorithm. IEEE Trans. Inf. Theory 47, 498--519.
[31]
Lafferty, J. 2001. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proceedings of the International Conference on Machine Learning (ICML). 282--289.
[32]
Li, Y. and Jain, A. K. 1998. Classification of text documents. In Proceedings of the International Conference on Pattern Recognition (ICPR). 1295.
[33]
Liben-Nowell, D. and Kleinberg, J. 2007. The link-prediction problem for social networks. J. Amer. Soc. Inf. Sci. Technol. 58, 7, 1019--1031.
[34]
Lu, Q. and Getoor, L. 2003. Link-Based classification. In Proceedings of the 20th International Conference on Machine Learning (ICML). 496--503.
[35]
Macskassy, S. A. 2007. Improving learning in networked data by combining explicit and mined links. In Proceedings of the National Conference on Artificial Intelligence (AAAI). 590--595.
[36]
Macskassy, S. A. 2009. Using graph-based metrics with empirical risk minimization to speed up active learning on networked data. In Proceedings of the ACM Special Interest Group on Knowledge Discovery and Data Mining (SIGKDD). 597--606.
[37]
Macskassy, S. A. and Provost, F. 2007. Classification in networked data: A toolkit and a univariate case study. J. Mach. Learn. Res. 8, 935--983.
[38]
Namata, G. M., Sen, P., Bilgic, M., and Getoor, L. 2009. Collective classification for text classification. In Text Mining: Classification, Clustering, and Applications. Taylor and Francis Group.
[39]
Nemhauser, G., Wolsey, L., and Fisher, M. 1978. An analysis of the approximations for maximizing submodular set functions. Math. Program. 14, 265--294.
[40]
Neville, J. and Jensen, D. 2000. Iterative classificataion in relational data. In Proceedings of the National Conference on Artificial Intelligence Workshop on Statistical Relational Learing (AAAI Workshop). 42--49.
[41]
Nodelman, U., Shelton, C., and Koller, D. 2003. Learning continuous time bayesian networks. In Proceedings of the Annual Conference on Uncertainty in Artificial Intelligence (UAI). 451--458.
[42]
Özdogan, C. 2006. Cannon’s matrix-matrix multiplication with mpi’s topologies. siber.cankaya.edu.tr/ozdogan/GraduateParallelComputing/ceng505/.
[43]
Pease, M. C. 1967. Matrix inversion using parallel processing. J. ACM 14, 4, 757--764.
[44]
Rajan, S., Yankov, D., Gaffney, S. J., and Ratnaparkhi, A. 2010. A large-scale active learning system for topical categorization on the web. In Proceedings of the World Wide Web Conference (WWW). 791--800.
[45]
Rattigan, M. J., Maier, M., and JJensen, D. 2007. Exploiting network structure for active inference in collective classification. In Proceedings of the International Conference on Data Mining Workshop (ICDM Workshop). 429--434.
[46]
Roy, N. and McCallum, A. 2001. Toward optimal active learning through sampling estimation of error reduction. In Proceedings of the 18th International Conference on Machine Learning (ICML). 441--448.
[47]
Sen, P., Namata, G. M., Bilgic, M., Getoor, L., Gallagher, B., and Eliassi-Rad, T. 2008. Collective classification in network data. AI Mag. 29, 3, 93--106.
[48]
Settles, B. 2010. Active learning literture survey. Tech. rep. 1648, Computer Science Department, University of Wisconsin-Madison.
[49]
Shi, L. and Zhao, Y. 2010. Batch mode sparse active learning. In Proceedings of the International Conference on Data Mining Workshop (ICDM Workshop). 875--882.
[50]
Slattery, S. and Craven, M. 1998. Combining statistical and relational methods for learning in hypertext domains. In Proceedings of the 8th International Conference on Inductive Logic Programming (ILP). 38--52.
[51]
Steven, C. H. H., Rong, J., and R, R. L. M. 2009. Batch mode active learning with applications to text categorization and image retrieval. IEEE Trans. Knowl. Data Engin. 21, 1233--1248.
[52]
Tan, C., Tang, J., Sun, J., Lin, Q., and Wang, F. 2010. Social action tracking via noise tolerant time-varying factor graphs. In Proceedings of the ACM Special Interest Group on Knowledge Discovery and Data Mining (SIGKDD). 1049--1058.
[53]
Tang, J., Sun, J., Wang, C., and Yang, Z. 2009. Social influence analysis in large-scale networks. In Proceedings of the ACM Special Interest Group on Knowledge Discovery and Data Mining (SIGKDD). 807--816.
[54]
Taskar, B., Segal, E., and Koller, D. 2001. Probabilistic classification and clustering in relational data. In Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI). 870--878.
[55]
Taskar, B., Abbeel, P., and D.Kooler. 2002. Discriminative probabilistic models for relational data. In Proceedings of the Annual Conference on Uncertainty in Artificial Intelligence (UAI). 485--492.
[56]
Tong, S. and Chang, E. 2001. Support vector machine active learning for image retrieval. In Proceedings of the ACM Multimedia Conference (MULTIMEDIA). 107--118.
[57]
Xu, L., Wilkinson, D., Southey, F., and Schuurmans, D. 2006. Discriminative unsupervised learning of structured predictors. In Proceedings of the International Conference on Machine Learning (ICML). 1057--1064.
[58]
Xu, Z., Hogan, C., and Bauer, R. 2009. Greedy is not enough: An efficient batch mode active learning algorithm. In Proceedings of the International Conference on Data Mining Workshop (ICDM Worshop). 326--331.
[59]
Yang, T., Jin, R., Chi, Y., and Zhu, S. 2009. Combining link and content for community detection: A discriminative approach. In Proceedings of the ACM Special Interest Group on Knowledge Discovery and Data Mining (SIGKDD). 927--936.
[60]
Yedidia, J., Freeman, W., and Weiss, Y. 2005. Constructing free-energy approximations and generalized belief propagation algorithms. IEEE Trans. Inf. Theory 51, 7, 2282--2312.
[61]
Zhu, X. 2005. Semi-Supervised learning with graphs. Ph.D. thesis, Carnegie Mellon University. CMU-LTI-05-192.
[62]
Zhu, X., Ghahramani, Z., and Lafferty, J. 2003a. Semi-Supervised learning using gaussian fields and harmonic functions. In Proceedings of the International Conference on Machine Learning (ICML). 912--919.
[63]
Zhu, X., Lafferty, J., and Ghahramani, Z. 2003b. Combining active learning and semi-supervised learning using gaussian fields and harmonic functions. In Proceedings of the International Conference on Machine Learning Workshop (ICML Workshop). 58--65.

Cited By

View all
  • (2023)Adaptive batch mode active learning with deep similarityEgyptian Informatics Journal10.1016/j.eij.2023.10041224:4(100412)Online publication date: Dec-2023
  • (2021)A Novel Query Strategy-Based Rank Batch-Mode Active Learning Method for High-Resolution Remote Sensing Image ClassificationRemote Sensing10.3390/rs1311223413:11(2234)Online publication date: 7-Jun-2021
  • (2021)Active Learning for Effectively Fine-Tuning Transfer Learning to Downstream TaskACM Transactions on Intelligent Systems and Technology10.1145/344634312:2(1-24)Online publication date: 11-Feb-2021
  • Show More Cited By

Index Terms

  1. Batch Mode Active Learning for Networked Data

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Transactions on Intelligent Systems and Technology
      ACM Transactions on Intelligent Systems and Technology  Volume 3, Issue 2
      February 2012
      455 pages
      ISSN:2157-6904
      EISSN:2157-6912
      DOI:10.1145/2089094
      Issue’s Table of Contents
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 01 February 2012
      Accepted: 01 August 2011
      Revised: 01 June 2011
      Received: 01 March 2011
      Published in TIST Volume 3, Issue 2

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. Batch mode active learning
      2. combine link and content
      3. network classification

      Qualifiers

      • Research-article
      • Research
      • Refereed

      Funding Sources

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)10
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 02 Oct 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2023)Adaptive batch mode active learning with deep similarityEgyptian Informatics Journal10.1016/j.eij.2023.10041224:4(100412)Online publication date: Dec-2023
      • (2021)A Novel Query Strategy-Based Rank Batch-Mode Active Learning Method for High-Resolution Remote Sensing Image ClassificationRemote Sensing10.3390/rs1311223413:11(2234)Online publication date: 7-Jun-2021
      • (2021)Active Learning for Effectively Fine-Tuning Transfer Learning to Downstream TaskACM Transactions on Intelligent Systems and Technology10.1145/344634312:2(1-24)Online publication date: 11-Feb-2021
      • (2021)Attent: Active Attributed Network AlignmentProceedings of the Web Conference 202110.1145/3442381.3449886(3896-3906)Online publication date: 19-Apr-2021
      • (2021)SEAL: Semisupervised Adversarial Active Learning on Attributed GraphsIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2020.300968232:7(3136-3147)Online publication date: Jul-2021
      • (2021)Power Load flow analysis for Active Islanding Mode2021 Fifth International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud) (I-SMAC)10.1109/I-SMAC52330.2021.9640725(1670-1674)Online publication date: 11-Nov-2021
      • (2020)Context-Aware Query Selection for Active Learning in Event RecognitionIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2018.287869642:3(554-567)Online publication date: 1-Mar-2020
      • (2020)Active Semi-Supervised Learning for Diffusions on GraphsICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)10.1109/ICASSP40776.2020.9054300(9075-9079)Online publication date: May-2020
      • (2020)Batch Active Learning With Two-Stage SamplingIEEE Access10.1109/ACCESS.2020.29793158(46518-46528)Online publication date: 2020
      • (2019)Identifying malicious social media contents using multi-view Context-Aware active learningFuture Generation Computer Systems10.1016/j.future.2019.03.015Online publication date: May-2019
      • Show More Cited By

      View Options

      Get Access

      Login options

      Full Access

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media