Nothing Special   »   [go: up one dir, main page]

skip to main content
10.3115/1075218.1075252dlproceedingsArticle/Chapter ViewAbstractPublication PagesaclConference Proceedingsconference-collections
Article
Free access

Part-of-speech tagging based on hidden Markov model assuming joint independence

Published: 03 October 2000 Publication History

Abstract

In this paper we present part-of-speech taggers based on hidden Markov models, which adopt a less strict Markov assumption to consider rich contexts. In models whose parameters are very specific like lexicalized ones, sparse-data problem is very serious and also conditional probabilities tend to be estimated unreliably. To overcome data-sparseness, a simplified version of the well-known back-off smoothing method is used. To mitigate unreliable estimation problem, our models assume joint independence instead of conditional independence because joint probabilities have the same degree of estimation reliability. In experiments for the Brown corpus, models with rich contexts achieve relatively high accuracy and some models assuming joint independence show better results than the corresponding HMMs.

References

[1]
E. Brill. 1994. Some Advances in Transformation-Based Part of Speech Tagging. In Proc. of the 12th Nat'l Conf. on Artificial Intelligence(AAAI-94), 722--27.]]
[2]
E. Charniak, C. Hendrickson, N. Jacobson, and M. Perkowitz. 1993. Equations for Part-of-Speech Tagging. In Proc. of the 11th Nat'l Conf. on Artificial Intelligence(AAAI-93), 784--789.]]
[3]
S. F. Chen. 1996. Building Probabilistic Models for Natural Language. Doctoral Dissertation, Harvard University, USA.]]
[4]
E. Cinlar. 1975. Introduction to Stochastic Processes. Prentice-Hall, New Jersey.]]
[5]
R. O. Duda and R. E. Hart. 1973. Pattern Classification and Scene Analysis. John Wiley.]]
[6]
W. N. Francis and H. Kučera. 1982. Frequency Analysis of English Usage: Lexicon and Grammar. Houghton Mifflin Company, Boston, Massachusetts.]]
[7]
I. J. Good. 1953. "The Population Frequencies of Species and the Estimation of Population Parameters," In Biometrika, 40(3--4):237--264.]]
[8]
S. M. Katz. 1987. Estimation of Probabilities from Sparse Data for the Language Model Component of a Speech Recognizer. In IEEE Transactions on Acoustics, Speech and Signal Processing(ASSP), 35(3):400--401.]]
[9]
S.-Z. Lee, J.-D. Kim, W.-H. Ryu, and H.-C. Rim. 1999. A Part-of-Speech Tagging Model Using Lexical Rules Based on Corpus Statistics. In Proc. of the International Conference on Computer Processing of Oriental Languages(ICCPOL-99), 385--390.]]
[10]
S.-Z. Lee. 1999. New Statistical Models for Automatic POS Tagging. Doctoral Dissertation, Korea University, Korea.]]
[11]
Y.-C. Lin, T.-H. Chiang, and K.-Y. Su. 1992. Discrimination Oriented Probabilistic Tagging. In Proc. of the 5th International Conference: Research on Computational Linguistics(ROCLING-V), 85--96.]]
[12]
B. Merialdo. 1991. Tagging Text with a Probabilistic Model. In Proc. of the International Conference on Acoustic, Speech and Signal Processing(ICASSP-91), 809--812.]]
[13]
T. M. Mitchell. 1997. Machine Learning. New York: McGraw-Hill.]]
[14]
L. Padró. 1996. POS Tagging Using Relaxation Labeling. Research Report LSI-96-10-R. Department de Llenguatgatges i Sistemes Informàtics, Universitat Politècnica de Catalunya.]]
[15]
A. Ratnaparkhi. 1996. A Maximum Entropy Model for Part-of-Speech Tagging. In Proc. of the Empirical Methods in Natural Language Processing Conference(EMNLP-96), 133--142.]]
[16]
C. Samuelsson. 1993. Morphological Tagging Based Entirely on Bayesian Inference. In Proc. of the 9th Nordic Conference on Computational Linguistics, 225--238.]]
[17]
H. Schmid. 1994. Part-of-Speech Tagging with Neural Networks. In Proc. of the International Conference on Computational Linguistics(COLING-94), 172--176.]]

Cited By

View all
  • (2005)Part-of-speech tagging using virtual evidence and negative trainingProceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing10.3115/1220575.1220633(459-466)Online publication date: 6-Oct-2005
  • (2003)Feature-rich part-of-speech tagging with a cyclic dependency networkProceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 110.3115/1073445.1073478(173-180)Online publication date: 27-May-2003

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
ACL '00: Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
October 2000
598 pages

Publisher

Association for Computational Linguistics

United States

Publication History

Published: 03 October 2000

Qualifiers

  • Article

Acceptance Rates

Overall Acceptance Rate 85 of 443 submissions, 19%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)357
  • Downloads (Last 6 weeks)13
Reflects downloads up to 22 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2005)Part-of-speech tagging using virtual evidence and negative trainingProceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing10.3115/1220575.1220633(459-466)Online publication date: 6-Oct-2005
  • (2003)Feature-rich part-of-speech tagging with a cyclic dependency networkProceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 110.3115/1073445.1073478(173-180)Online publication date: 27-May-2003

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media