article

A two-phase hybrid of semi-supervised and active learning approach for sequence labeling

Authors:

Hamed Hassanzadeh,

Mohammadreza KeyvanpourAuthors Info & Claims

Intelligent Data Analysis, Volume 17, Issue 2

Pages 251 - 270

Published: 01 March 2013 Publication History

Abstract

In recent years, many NLP systems and tasks are developed using machine learning methods. In order to achieve the best performance, these systems are generally trained on a large human annotated corpus. Since annotating such corpora is a very expensive and time-consuming procedure, manually annotating corpora is become one of the significant issues in many text based tasks such as text mining, semantic annotation, Named Entity Recognition and generally Information Extraction. Semi-supervised Learning and Active Learning are two distinct approaches that deal with reduction of labeling costs. Based on their natures, Active and semi-supervised learning can produce better results when they are jointly applied. In this paper we propose a combined Semi-Supervised and Active Learning approach for Sequence Labeling which extremely reduces manual annotation cost in a way that only highly uncertain tokens need to be manually labeled and other sequences and subsequences are labeled automatically. The proposed approach reduces manual annotation cost around 90% compare with a supervised learning and 30% in contrast with a similar fully active learning approach. Conditional Random Field CRF is chosen as the underlying learning model due to its promising performance in many sequence labeling tasks. In addition we proposed a confidence measure based on the model's variance reduction that reaches a considerable accuracy for finding informative samples.

References

[1]

A. Esuli, D. Marcheggiani and F. Sebastiani, Sentence-based active learning strategies for information extraction, in: Proceedings of the 1st Italian Information Retrieval Workshop (IIR'10), Padova, Italy, 2010.

[2]

A. McCallum and K. Nigam, Employing EM and pool-based active learning for text classification, in: Proceedings of the International Conference on Machine Learning (ICML), Morgan Kaufmann, (1998), 350-358.

[3]

A. McCallum, MALLET: A machine learning for language toolkit, in, 2002.

[4]

B. Settles, Active learning literature survey, in: University of Wisconsin-Madison, 2009.

[5]

B. Settles and M. Craven, An analysis of active learning strategies for sequence labeling tasks, in: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP) (2008), 1069-1078.

Digital Library

[6]

C. Campbell, N. Cristianini and A. Smola, Query learning with large margin classifiers, in: Proceedings of ICML'00 (2000), 111-118.

[7]

D.D. Lewis and J. Catlett, Heterogeneous uncertainty sampling for supervised learning, in: Proceedings of the 11th International Conference on Machine Learning (1994), 148-156.

[8]

D. Nadeau and S. Sekine, A survey of named entity recognition and classification, Linguisticae Investigation 30 (2007), 2-26.

[9]

E.F.T.K. Sang and F.D. Meulder, Introduction to the CoNLL - 2003 shared task: Language-independent named entity recognition, in: Proceedings of CoNLL-2003, Edmonton, Canada, (2003), 155-158.

[10]

F. Olsson, On privacy preservation in text and document-based active learning for named entity recognition, in: Proceeding of the ACM First International Workshop on Privacy and Anonymity for very Large Databases, Hong Kong, China, 2009.

Digital Library

[11]

H. Cheng, R. Zhang, Y. Peng, J. Mao and P.-N. Tan, Maximum margin active learning for sequence labeling with different length, in: Proceedings of the 8th Industrial Conference on Advances in Data Mining: Medical Applications, E-Commerce, Marketing and Theoretical Aspects (2008), 345-359.

[12]

H. Hassanzadeh and M.R. Keyvanpour, A variance based active learning approach for named entity recognition, in: International Conference on Intelligent Computing and Information Science, in Communications in Computer and Information Science (CCIS), Springer Berlin Heidelberg 135 (2011), 347-352.

[13]

H. Seung, M. Opper and H. Sompolinsky, Query by committee, in: Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory (1992), 287-294.

[14]

H. Wallach, Conditional random fields: An introduction, in, rapport technique MS-CIS-04-21, department of computer and information science, University of Pennsylvania, 2004.

[15]

I. Muslea, S. Minton and C. Knoblock, Active + semisupervised learning = robust multi-view learning, in: Proceedings of International Conference on Machine Learning (ICML), Sydney, Australia, (2002), 435-442.

[16]

J. Friedman, On bias, variance, 0/1-loss and the curse-of dimensionality, Data Mining Knowledge Discover 1 (1996), 55-77.

[17]

J. Lafferty, A. McCallum and F. Pereira, Conditional random fields: Probabilistic models for segmenting and labeling sequence data, in: Proceeding of 18th International Conference on Machine Learning (2001), 282-289.

Digital Library

[18]

K. Tomanek, Resource-aware annotation through active learning, Doctor of Philosophy Thesis, Technical University of Dortmund, 2010.

[19]

K. Tomanek and F. Olsson, A web survey on the use of active learning to support annotation of text data, in: Proceedings of the NAACL HLT Workshop on Active Learning for Natural Language Processing, Boulder, Colorado, (2009), 45-48.

Digital Library

[20]

K. Tomanek and U. Hahn, Semi-supervised active learning for sequence labeling, in: Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP of the AFNLP (2009), 1039-1047.

[21]

K. Tumer and J. Ghosh, Analysis of decision boundaries in linearly combined neural classifiers, Pattern Recognition 29 (1996), 341-348.

[22]

K. Tumer and J. Ghosh, Error correlation and error reduction in ensemble classifier, Connection Sci 8 (1996), 385-404.

[23]

K. Xu, S.S. Liao, R.Y.K. Lau, L. Liao and H. Tang, Self-teaching semantic annotation method for knowledge discovery from text, in: 42nd Hawaii International Conference on System Sciences, 2009.

[24]

Linguistic, Linguistic data consortium, Message Understanding Conference 7 (2001), LDC2001T02.

[25]

L. Rabiner, A tutorial on hidden Markov models and selected applications inspeech recognition, Proceedings of the IEEE 77 (1989), 257-286.

[26]

L. Yao, C. Sun, X. Wang and X. Wang, Combining self learning and active learning for chinese named entity recognition, Journal of Software 5 (2010), 530-537.

[27]

R. Klinger, C. Kolarik, J. Fluck, M. Hofmann-Apitius and C. Friedrich, Detection of IUPAC and IUPAC-like chemical names, Bioinformatics 24 (2008), 268-276.

Digital Library

[28]

R. Kohavi and D. Wolpert, Bias plus variance decomposition for zeroone loss function, in: Proceedings of the 13th International Conference on Machine Learning (1996), 275-283.

[29]

X. Zhu and A.B. Goldberg, Introduction to semi-supervised learning, Morgan & Claypool, 2009.

Digital Library

[30]

Y. Altun, I. Tsochantaridis and T. Hofmann, Hidden markov support vector machines, in: Proceedings of the Twentieth International Conference on Machine Learning (ICML-2003), Washington DC, (2003).

[31]

Y. Freund, H. Seung and E. Tishby, Selective sampling using the query by committee algorithm, Machine Learning 28 (1997), 133-168.

Digital Library

[32]

Y. Qi, P. Kuksa, R. Collobert, K. Sadamasa, K. Kavukcuoglu and J. Weston, Semi-supervised sequence labeling with self-learned features, in: Proceedings of the 2009 Ninth IEEE International Conference on Data Mining (ICDM09) (2009), 428-437.

Digital Library

Cited By

Wang XLi YChen JYang J(2022)Enhancing Personalized Recommendation by Transductive Support Vector Machine and Active LearningSecurity and Communication Networks10.1155/2022/17055272022Online publication date: 1-Jan-2022
https://dl.acm.org/doi/10.1155/2022/1705527
Wang XWen JAlam SJiang ZWu Y(2018)Semi-supervised learning combining transductive support vector machine with active learningNeurocomputing10.1016/j.neucom.2015.08.087173:P3(1288-1298)Online publication date: 31-Dec-2018
https://dl.acm.org/doi/10.1016/j.neucom.2015.08.087

A two-phase hybrid of semi-supervised and active learning approach for sequence labeling
1. Computing methodologies
2. Hardware
  1. Power and energy
    1. Power estimation and optimization

Recommendations

Consistency-Based Semi-supervised Active Learning: Towards Minimizing Labeling Cost
Computer Vision – ECCV 2020
Abstract
Active learning (AL) combines data labeling and model training to minimize the labeling cost by prioritizing the selection of high value data that can best improve model performance. In pool-based active learning, accessible unlabeled data are not ...
Combining active learning and semi-supervised for improving learning performance
ISABEL '11: Proceedings of the 4th International Symposium on Applied Sciences in Biomedical and Communication Technologies

In many learning tasks, there are abundant unlabeled samples but the number of labeled training samples is limited, because labeling the samples requires the efforts of human annotators and expertise. There are three major techniques for labeling the ...
Semi-supervised Dictionary Active Learning for Pattern Classification
Pattern Recognition and Computer Vision
Abstract
Gathering labeled data is one of the most time-consuming and expensive tasks in supervised machine learning. In practical applications, there are usually quite limited labeled training samples but abundant unlabeled data that is easy to collect. ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Intelligent Data Analysis

Intelligent Data Analysis Volume 17, Issue 2

March 2013

183 pages

ISSN:1088-467X

Issue’s Table of Contents

Publisher

IOS Press

Netherlands

Publication History

Published: 01 March 2013

Author Tags

Qualifiers

Article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

2
Total Citations
View Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 03 Mar 2025

Other Metrics

View Author Metrics

Citations

Cited By

Wang XLi YChen JYang J(2022)Enhancing Personalized Recommendation by Transductive Support Vector Machine and Active LearningSecurity and Communication Networks10.1155/2022/17055272022Online publication date: 1-Jan-2022
https://dl.acm.org/doi/10.1155/2022/1705527
Wang XWen JAlam SJiang ZWu Y(2018)Semi-supervised learning combining transductive support vector machine with active learningNeurocomputing10.1016/j.neucom.2015.08.087173:P3(1288-1298)Online publication date: 31-Dec-2018
https://dl.acm.org/doi/10.1016/j.neucom.2015.08.087

View Options

View options

Figures

Tables

Media

View Issue’s Table of Contents