Abstract
This paper describes a supervised algorithm for word sense disambiguation based on hierarchies of decision lists. This algorithm supports a useful degree of conditional branching while minimizing the training data fragmentation typical of decision trees. Classifications are based on a rich set of collocational, morphological and syntactic contextual features, extracted automatically from training data and weighted sensitive to the nature of the feature and feature class. The algorithm is evaluated comprehensively in the SENSEVAL framework, achieving the top performance of all participating supervised systems on the 36 test words where training data is available.
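To illustrate the basic building block the abstract refers to, the following is a minimal sketch of a single, flat decision list, not the hierarchical algorithm of the paper. It assumes binary contextual features, ranks each feature by a smoothed log-odds score for its most frequent sense, and classifies with the first matching rule. The function names, the feature strings such as `w:river`, the smoothing constant `alpha`, and the toy training data are all hypothetical and chosen only for illustration.

```python
import math
from collections import Counter, defaultdict


def train_decision_list(examples, alpha=0.1):
    """Build a flat decision list from (feature_set, sense) pairs.

    Illustrative assumption: each rule is scored by the smoothed log-odds
    of the feature's most frequent sense. The paper's actual feature
    weighting and hierarchy are not reproduced here.
    """
    counts = defaultdict(Counter)  # feature -> sense -> count
    sense_totals = Counter()
    for features, sense in examples:
        sense_totals[sense] += 1
        for feature in features:
            counts[feature][sense] += 1

    senses = sorted(sense_totals)
    default = sense_totals.most_common(1)[0][0]
    if len(senses) < 2:
        # Nothing to disambiguate: always fall back to the only sense.
        return [], default

    rules = []
    for feature, by_sense in counts.items():
        total = sum(by_sense.values())
        best = max(senses, key=lambda s: by_sense[s])
        # Additive smoothing keeps the probability strictly below 1.
        p_best = (by_sense[best] + alpha) / (total + alpha * len(senses))
        score = math.log(p_best / (1.0 - p_best))
        rules.append((score, feature, best))

    # Strongest evidence first; ties are broken by feature name.
    rules.sort(key=lambda rule: (-rule[0], rule[1]))
    return rules, default


def classify(rules, default, features):
    """Return the sense of the first (strongest) rule whose feature is present."""
    for _score, feature, sense in rules:
        if feature in features:
            return sense
    return default


# Toy training data for the ambiguous word "bass" (hypothetical).
examples = [
    ({"w:fish", "w:river"}, "fish"),
    ({"w:fish", "w:caught"}, "fish"),
    ({"w:guitar", "w:play"}, "music"),
    ({"w:play", "w:band"}, "music"),
    ({"w:play", "w:river"}, "fish"),
]
rules, default = train_decision_list(examples)
print(classify(rules, default, {"w:play", "w:river"}))  # fish
print(classify(rules, default, {"w:play"}))             # music
```

In the first query both `w:play` and `w:river` are present, but only the single strongest matching rule decides the outcome, which is the defining property of a decision list. A hierarchical variant would, under this reading, let some rules route to further lists rather than directly to a sense.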
Cite this article
Yarowsky, D. Hierarchical Decision Lists for Word Sense Disambiguation. Computers and the Humanities 34, 179–186 (2000). https://doi.org/10.1023/A:1002674829964