Nothing Special   »   [go: up one dir, main page]

skip to main content
10.3115/1034678.1034712dlproceedingsArticle/Chapter ViewAbstractPublication PagesaclConference Proceedingsconference-collections
Article
Free access

A second-order Hidden Markov Model for part-of-speech tagging

Published: 20 June 1999 Publication History

Abstract

This paper describes an extension to the hidden Markov model for part-of-speech tagging using second-order approximations for both contextual and lexical probabilities. This model increases the accuracy of the tagger to state of the art levels. These approximations make use of more contextual information than standard statistical systems. New methods of smoothing the estimated probabilities are also introduced to address the sparse data problem.

References

[1]
Eric Brill. 1994. A report of recent progress in transformation-based error-driven learning. Proceedings of the Twelfth National Conference on Artifical Intelligence, pages 722--727.
[2]
Eric Brill. 1995. Transformation-based error-driven learning and natural language processing: A case study in part of speech tagging. Computational Linguistics, 21(4): 543--565.
[3]
Eugene Charniak, Curtis Hendrickson, Neil Jacobson, and Mike Perkowitz. 1993. Equations for part-of-speech tagging. Proceedings of the Eleventh National Conference on Artificial Intelligence, pages 784--789.
[4]
Walter Daelemans, Jakub Zavrel, Peter Berck, and Steven Gillis. 1996. MBT: A memory-based part of speech tagger-generator. Proceedings of the Fourth Workshop on Very Large Corpora, pages 14--27.
[5]
William A. Gale and Kenneth W. Church. 1994. What's wrong with adding one? In Corpus-Based Research into Language. Rodolpi, Amsterdam.
[6]
I. J. Good. 1953. The population frequencies of species and the estimation of population parameters. Biometrika, 40: 237--264.
[7]
Frederick Jelinek and Robert L. Mercer. 1980. Interpolated estimation of markov source parameters from sparse data. Proceedings of the Workshop on Pattern Recognition in Practice.
[8]
Salva M. Katz. 1987. Estimation of probabilities from sparse data for the language model component of a speech recognizer. IEEE Transactions on Acoustics, Speech and Signal Processing, 35(3): 400--401.
[9]
Julian Kupiec. 1992. Robust part-of-speech tagging using a hidden Markov model. Computer Speech and Language, 6(3): 225--242.
[10]
Mitchell Marcus, Beatrice Santorini, and Mary Ann Marcinkiewicz. 1993. Building a large annotated corpus of English: The Penn Treebank. Computational Linguistics, 19(2): 313--330.
[11]
Andrei Mikheev. 1996. Unsupervised learning of word-category guessing rules. Proceedings of the 34th Annual Meeting of the Association for Compuatational Linguistics, pages 327--334.
[12]
Andrei Mikheev. 1997. Automatic rule induction for unknown-word guessing. Computational Linguistics, 23(3): 405--423.
[13]
Lawrence R. Rabiner. 1989. A tutorial on hidden Markov models and selected applications in speech recognition. Proceeding of the IEEE, pages 257--286.
[14]
Adwait Ratnaparkhi. 1996. A maximum entropy model for part-of-speech tagging. Proceedings of the Conference on Empirical Methods in Natural Language Processing, pages 133--142.
[15]
Dan Roth and Dmitry Zelenko. 1998. Part of speech tagging using a network of linear separators. Proceedings of COLING-ACL '98, pages 1136--1142.
[16]
Scott M. Thede. 1998. Predicting part-of-speech information about unknown words using statistical methods. Proceedings of COLING-ACL '98, pages 1505--1507.
[17]
Gökhan Tür and Kemal Oflazer. 1998. Tagging English by path voting constraints. Proceedings of COLING-ACL '98, pages 1277--1281.
[18]
Evelyne Tzoukermann and Dragomir R. Radev. 1996. Using word class for part-of-speech disambiguation. Proceedings of the Fourth Workshop on Very Large Corpora, pages 1--13.
[19]
Hans van Halteren, Jakub Zavrel, and Walter Daelemans. 1998. Improving data driven wordclass tagging by system combination. Proceedings of COLING-ACL '98, pages 491--497.
[20]
Ralph Weischedel, Marie Meeter, Richard Schwartz, Lance Ramshaw, and Jeff Palmucci. 1993. Coping with ambiguity and unknown words through probabilitic models. Computational Linguistics, 19: 359--382.

Cited By

View all
  • (2023)Unleashing the Power of Large Language Models: A Hands-On TutorialProceedings of the 15th Annual Meeting of the Forum for Information Retrieval Evaluation10.1145/3632754.3632943(149-152)Online publication date: 15-Dec-2023
  • (2023)Learning the Structure of Commands by Detecting Random Tokens Using Markov ModelProceedings of the 2023 8th International Conference on Machine Learning Technologies10.1145/3589883.3589892(61-67)Online publication date: 10-Mar-2023
  • (2020)LAMAR: LiDAR based Multi-inhabitant Activity RecognitionMobiQuitous 2020 - 17th EAI International Conference on Mobile and Ubiquitous Systems: Computing, Networking and Services10.1145/3448891.3450334(1-9)Online publication date: 7-Dec-2020
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
ACL '99: Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
June 1999
642 pages
ISBN:1558606093

Publisher

Association for Computational Linguistics

United States

Publication History

Published: 20 June 1999

Qualifiers

  • Article

Acceptance Rates

Overall Acceptance Rate 85 of 443 submissions, 19%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)67
  • Downloads (Last 6 weeks)9
Reflects downloads up to 24 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Unleashing the Power of Large Language Models: A Hands-On TutorialProceedings of the 15th Annual Meeting of the Forum for Information Retrieval Evaluation10.1145/3632754.3632943(149-152)Online publication date: 15-Dec-2023
  • (2023)Learning the Structure of Commands by Detecting Random Tokens Using Markov ModelProceedings of the 2023 8th International Conference on Machine Learning Technologies10.1145/3589883.3589892(61-67)Online publication date: 10-Mar-2023
  • (2020)LAMAR: LiDAR based Multi-inhabitant Activity RecognitionMobiQuitous 2020 - 17th EAI International Conference on Mobile and Ubiquitous Systems: Computing, Networking and Services10.1145/3448891.3450334(1-9)Online publication date: 7-Dec-2020
  • (2019)Toward an Effective Igbo Part-of-Speech TaggerACM Transactions on Asian and Low-Resource Language Information Processing10.1145/331494218:4(1-26)Online publication date: 21-May-2019
  • (2018)Experimental Study of Chinese POS TaggingProceedings of the 2018 2nd International Conference on Computer Science and Artificial Intelligence10.1145/3297156.3297158(1-5)Online publication date: 8-Dec-2018
  • (2018)Towards Deep Player Behavior Models in MMORPGsProceedings of the 2018 Annual Symposium on Computer-Human Interaction in Play10.1145/3242671.3242706(381-392)Online publication date: 23-Oct-2018
  • (2015)IrishScience of Computer Programming10.1016/j.scico.2014.11.017105:C(26-43)Online publication date: 1-Jul-2015
  • (2014)Evaluation of information extraction techniques to label extracted data from e-commerce web pagesProceedings of the 23rd International Conference on World Wide Web10.1145/2567948.2579703(1275-1278)Online publication date: 7-Apr-2014
  • (2014)Second-Order Belief Hidden Markov ModelsProceedings of the Third International Conference on Belief Functions: Theory and Applications - Volume 876410.1007/978-3-319-11191-9_31(284-293)Online publication date: 26-Sep-2014
  • (2012)Low-dimensional discriminative rerankingProceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies10.5555/2382029.2382144(699-709)Online publication date: 3-Jun-2012
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media