Nothing Special   »   [go: up one dir, main page]

skip to main content
article

A two-phase hybrid of semi-supervised and active learning approach for sequence labeling

Published: 01 March 2013 Publication History

Abstract

In recent years, many NLP systems and tasks are developed using machine learning methods. In order to achieve the best performance, these systems are generally trained on a large human annotated corpus. Since annotating such corpora is a very expensive and time-consuming procedure, manually annotating corpora is become one of the significant issues in many text based tasks such as text mining, semantic annotation, Named Entity Recognition and generally Information Extraction. Semi-supervised Learning and Active Learning are two distinct approaches that deal with reduction of labeling costs. Based on their natures, Active and semi-supervised learning can produce better results when they are jointly applied. In this paper we propose a combined Semi-Supervised and Active Learning approach for Sequence Labeling which extremely reduces manual annotation cost in a way that only highly uncertain tokens need to be manually labeled and other sequences and subsequences are labeled automatically. The proposed approach reduces manual annotation cost around 90% compare with a supervised learning and 30% in contrast with a similar fully active learning approach. Conditional Random Field CRF is chosen as the underlying learning model due to its promising performance in many sequence labeling tasks. In addition we proposed a confidence measure based on the model's variance reduction that reaches a considerable accuracy for finding informative samples.

References

[1]
A. Esuli, D. Marcheggiani and F. Sebastiani, Sentence-based active learning strategies for information extraction, in: Proceedings of the 1st Italian Information Retrieval Workshop (IIR'10), Padova, Italy, 2010.
[2]
A. McCallum and K. Nigam, Employing EM and pool-based active learning for text classification, in: Proceedings of the International Conference on Machine Learning (ICML), Morgan Kaufmann, (1998), 350-358.
[3]
A. McCallum, MALLET: A machine learning for language toolkit, in, 2002.
[4]
B. Settles, Active learning literature survey, in: University of Wisconsin-Madison, 2009.
[5]
B. Settles and M. Craven, An analysis of active learning strategies for sequence labeling tasks, in: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP) (2008), 1069-1078.
[6]
C. Campbell, N. Cristianini and A. Smola, Query learning with large margin classifiers, in: Proceedings of ICML'00 (2000), 111-118.
[7]
D.D. Lewis and J. Catlett, Heterogeneous uncertainty sampling for supervised learning, in: Proceedings of the 11th International Conference on Machine Learning (1994), 148-156.
[8]
D. Nadeau and S. Sekine, A survey of named entity recognition and classification, Linguisticae Investigation 30 (2007), 2-26.
[9]
E.F.T.K. Sang and F.D. Meulder, Introduction to the CoNLL - 2003 shared task: Language-independent named entity recognition, in: Proceedings of CoNLL-2003, Edmonton, Canada, (2003), 155-158.
[10]
F. Olsson, On privacy preservation in text and document-based active learning for named entity recognition, in: Proceeding of the ACM First International Workshop on Privacy and Anonymity for very Large Databases, Hong Kong, China, 2009.
[11]
H. Cheng, R. Zhang, Y. Peng, J. Mao and P.-N. Tan, Maximum margin active learning for sequence labeling with different length, in: Proceedings of the 8th Industrial Conference on Advances in Data Mining: Medical Applications, E-Commerce, Marketing and Theoretical Aspects (2008), 345-359.
[12]
H. Hassanzadeh and M.R. Keyvanpour, A variance based active learning approach for named entity recognition, in: International Conference on Intelligent Computing and Information Science, in Communications in Computer and Information Science (CCIS), Springer Berlin Heidelberg 135 (2011), 347-352.
[13]
H. Seung, M. Opper and H. Sompolinsky, Query by committee, in: Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory (1992), 287-294.
[14]
H. Wallach, Conditional random fields: An introduction, in, rapport technique MS-CIS-04-21, department of computer and information science, University of Pennsylvania, 2004.
[15]
I. Muslea, S. Minton and C. Knoblock, Active + semisupervised learning = robust multi-view learning, in: Proceedings of International Conference on Machine Learning (ICML), Sydney, Australia, (2002), 435-442.
[16]
J. Friedman, On bias, variance, 0/1-loss and the curse-of dimensionality, Data Mining Knowledge Discover 1 (1996), 55-77.
[17]
J. Lafferty, A. McCallum and F. Pereira, Conditional random fields: Probabilistic models for segmenting and labeling sequence data, in: Proceeding of 18th International Conference on Machine Learning (2001), 282-289.
[18]
K. Tomanek, Resource-aware annotation through active learning, Doctor of Philosophy Thesis, Technical University of Dortmund, 2010.
[19]
K. Tomanek and F. Olsson, A web survey on the use of active learning to support annotation of text data, in: Proceedings of the NAACL HLT Workshop on Active Learning for Natural Language Processing, Boulder, Colorado, (2009), 45-48.
[20]
K. Tomanek and U. Hahn, Semi-supervised active learning for sequence labeling, in: Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP of the AFNLP (2009), 1039-1047.
[21]
K. Tumer and J. Ghosh, Analysis of decision boundaries in linearly combined neural classifiers, Pattern Recognition 29 (1996), 341-348.
[22]
K. Tumer and J. Ghosh, Error correlation and error reduction in ensemble classifier, Connection Sci 8 (1996), 385-404.
[23]
K. Xu, S.S. Liao, R.Y.K. Lau, L. Liao and H. Tang, Self-teaching semantic annotation method for knowledge discovery from text, in: 42nd Hawaii International Conference on System Sciences, 2009.
[24]
Linguistic, Linguistic data consortium, Message Understanding Conference 7 (2001), LDC2001T02.
[25]
L. Rabiner, A tutorial on hidden Markov models and selected applications inspeech recognition, Proceedings of the IEEE 77 (1989), 257-286.
[26]
L. Yao, C. Sun, X. Wang and X. Wang, Combining self learning and active learning for chinese named entity recognition, Journal of Software 5 (2010), 530-537.
[27]
R. Klinger, C. Kolarik, J. Fluck, M. Hofmann-Apitius and C. Friedrich, Detection of IUPAC and IUPAC-like chemical names, Bioinformatics 24 (2008), 268-276.
[28]
R. Kohavi and D. Wolpert, Bias plus variance decomposition for zeroone loss function, in: Proceedings of the 13th International Conference on Machine Learning (1996), 275-283.
[29]
X. Zhu and A.B. Goldberg, Introduction to semi-supervised learning, Morgan & Claypool, 2009.
[30]
Y. Altun, I. Tsochantaridis and T. Hofmann, Hidden markov support vector machines, in: Proceedings of the Twentieth International Conference on Machine Learning (ICML-2003), Washington DC, (2003).
[31]
Y. Freund, H. Seung and E. Tishby, Selective sampling using the query by committee algorithm, Machine Learning 28 (1997), 133-168.
[32]
Y. Qi, P. Kuksa, R. Collobert, K. Sadamasa, K. Kavukcuoglu and J. Weston, Semi-supervised sequence labeling with self-learned features, in: Proceedings of the 2009 Ninth IEEE International Conference on Data Mining (ICDM09) (2009), 428-437.

Cited By

View all
  • (2022)Enhancing Personalized Recommendation by Transductive Support Vector Machine and Active LearningSecurity and Communication Networks10.1155/2022/17055272022Online publication date: 1-Jan-2022
  • (2018)Semi-supervised learning combining transductive support vector machine with active learningNeurocomputing10.1016/j.neucom.2015.08.087173:P3(1288-1298)Online publication date: 31-Dec-2018
  1. A two-phase hybrid of semi-supervised and active learning approach for sequence labeling

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image Intelligent Data Analysis
      Intelligent Data Analysis  Volume 17, Issue 2
      March 2013
      183 pages

      Publisher

      IOS Press

      Netherlands

      Publication History

      Published: 01 March 2013

      Author Tags

      1. Active Learning
      2. Named Entity Recognition
      3. Semi-Supervised Learning
      4. Sequence Labeling

      Qualifiers

      • Article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 14 Nov 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2022)Enhancing Personalized Recommendation by Transductive Support Vector Machine and Active LearningSecurity and Communication Networks10.1155/2022/17055272022Online publication date: 1-Jan-2022
      • (2018)Semi-supervised learning combining transductive support vector machine with active learningNeurocomputing10.1016/j.neucom.2015.08.087173:P3(1288-1298)Online publication date: 31-Dec-2018

      View Options

      View options

      Get Access

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media