Nothing Special   »   [go: up one dir, main page]

skip to main content
10.3115/1117794.1117820dlproceedingsArticle/Chapter ViewAbstractPublication PagesemnlpConference Proceedingsconference-collections
Article
Free access

One sense per collocation and genre/topic variations

Published: 07 October 2000 Publication History

Abstract

This paper revisits the one sense per collocation hypothesis using fine-grained sense distinctions and two different corpora. We show that the hypothesis is weaker for fine-grained sense distinctions (70% vs. 99% reported earlier on 2-way ambiguities). We also show that one sense per collocation does hold across corpora, but that collocations vary from one corpus to the other, following genre and topic variations. This explains the low results when performing word sense disambiguation across corpora. In fact, we demonstrate that when two independent corpora share a related genre/topic, the word sense disambiguation results would be better. Future work on word sense disambiguation will have to take into account genre and topic as important parameters on their models.

References

[1]
Agirre, E. and D. Martinez. Exploring automatic word sense disambiguation with decision lists and the Web. Proceedings of the COLING Workshop on Semantic Annotation and Intelligent Content. Saarbrücken, Germany. 2000.
[2]
Brants, T. TnT-A Statistical Part-of-Speech Tagger. In Proceedings of the Sixth Applied Natural Language Processing Conference, Seattle, WA. 2000.
[3]
Escudero, G., L. Màrquez and G. Rigau. On the Portability and Tuning of Supervised Word Sense Disambiguation Systems. In Proceedings of the Joint Sigdat Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, Hong Kong. 2000.
[4]
Francis, W. M. and H. Kucera. Brown Corpus Manual of Information. Department of Linguistics, Brown University. Also available at http://khnt.hit.uib.no/icame/manuals/brown/. 1964.
[5]
Gale, W., K. W. Church, and D. Yarowsky. A Method for Disambiguating Word Senses in a Large Corpus, Computers and the Humanities, 26, 415--439, 1993.
[6]
Ide, N. and J. Veronis. Introduction to the Special Issue on Word Sense Disambiguation: The State of the Art. Computational Linguistics, 24(1), 1--40, 1998.
[7]
Karlgren, J. and D. Cutting. Recognizing Text Genres with Simple Metrics Using Discriminant Analysis. Proceedings of the International Conference on Computational Linguistics. 1994
[8]
Krovetz, R. More Than One Sense Per Discourse, Proceedings of SENSEVAL and the Lexicography Loop Workshop, http://www.itri.brighton.ac.uk/events/senseval/PROCEEDINGS/. 1998
[9]
Leacock, C., M. Chodorow, and G. A. Miller. Using Corpus Statistics and WordNet Relations for Sense Identification. Computational Linguistics, 24(1), 147--166, 1998.
[10]
Miller, G. A., R. Beckwith, C. Fellbaum, D. Gross, and K. Miller. Five Papers on WordNet. Special Issue of International Journal of Lexicography, 3(4), 1990.
[11]
Miller, G. A., C. Leacock, R. Tengi, and R. T. Bunker, A Semantic Concordance. Proceedings of the ARPA Workshop on Human Language Technology, 1993.
[12]
Ng, H. T. and H. B. Lee. Integrating Multiple Knowledge Sources to Disambiguate Word Sense: An Exemplar-based Approach. Proceedings of the 34th Annual Meeting of the Association for Computational Linguistics. 1996.
[13]
Ng, H. T., C. Y. Lim and S. K. Foo. A Case Study on Inter-Annotator Agreement for Word Sense Disambiguation. Proceedings of the Siglex-ACL Workshop on Standarizing Lexical Resources. 1999.
[14]
Yarowsky, D. One Sense per Collocation. Proc. of the 5th DARPA Speech and Natural Language Workshop. 1993
[15]
Yarowsky, D. Decision Lists for Lexical Ambiguity Resolution: Application to Accent Restoration in Spanish and French. Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics, pp. 88--95. 1994.
[16]
Yarowsky, D. Unsupervised Word Sense Disambiguation Rivaling Supervised Methods. Proceedings of the 33rd Annual Meeting of the Association for Computational Linguistics. Cambridge, MA, pp. 189--196, 1995.
[17]
Yarowsky, D. Homograph Disambiguation in Text-to-speech Synthesis. J Hirschburg, R. Sproat and J. Van Santen (eds.) Progress in Speech Synthesis, Springer-Vorlag, pp. 159--175. 1996.

Cited By

View all
  • (2017)Improving selection of synsets from WordNet for domain-specific word sense disambiguationComputer Speech and Language10.1016/j.csl.2016.06.00341:C(128-145)Online publication date: 1-Jan-2017
  • (2012)Learning word sense disambiguation in biomedical text with difference between training and test distributionsInternational Journal of Data Mining and Bioinformatics10.1504/IJDMB.2012.0481996:2(216-237)Online publication date: 1-Jul-2012
  • (2009)SemEval-2010 task 17Proceedings of the Workshop on Semantic Evaluations: Recent Achievements and Future Directions10.5555/1621969.1621991(123-128)Online publication date: 4-Jun-2009
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
EMNLP '00: Proceedings of the 2000 Joint SIGDAT conference on Empirical methods in natural language processing and very large corpora: held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics - Volume 13
October 2000
233 pages

Publisher

Association for Computational Linguistics

United States

Publication History

Published: 07 October 2000

Qualifiers

  • Article

Acceptance Rates

Overall Acceptance Rate 73 of 234 submissions, 31%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)35
  • Downloads (Last 6 weeks)8
Reflects downloads up to 16 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2017)Improving selection of synsets from WordNet for domain-specific word sense disambiguationComputer Speech and Language10.1016/j.csl.2016.06.00341:C(128-145)Online publication date: 1-Jan-2017
  • (2012)Learning word sense disambiguation in biomedical text with difference between training and test distributionsInternational Journal of Data Mining and Bioinformatics10.1504/IJDMB.2012.0481996:2(216-237)Online publication date: 1-Jul-2012
  • (2009)SemEval-2010 task 17Proceedings of the Workshop on Semantic Evaluations: Recent Achievements and Future Directions10.5555/1621969.1621991(123-128)Online publication date: 4-Jun-2009
  • (2009)Supervised domain adaption for WSDProceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics10.5555/1609067.1609071(42-50)Online publication date: 30-Mar-2009
  • (2009)Learning word sense disambiguation in biomedical text with difference between training and test distributionsProceedings of the third international workshop on Data and text mining in bioinformatics10.1145/1651318.1651330(59-66)Online publication date: 6-Nov-2009
  • (2009)Coping with Distribution Change in the Same Domain Using Similarity-Based Instance WeightingProceedings of the 1st Asian Conference on Machine Learning: Advances in Machine Learning10.1007/978-3-642-05224-8_27(354-366)Online publication date: 3-Nov-2009
  • (2008)Word sense disambiguation using OntoNotesProceedings of the Conference on Empirical Methods in Natural Language Processing10.5555/1613715.1613845(1002-1010)Online publication date: 25-Oct-2008
  • (2008)On robustness and domain adaptation using SVD for word sense disambiguationProceedings of the 22nd International Conference on Computational Linguistics - Volume 110.5555/1599081.1599084(17-24)Online publication date: 18-Aug-2008
  • (2003)Syntactic features and word similarity for supervised metonymy resolutionProceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 110.3115/1075096.1075104(56-63)Online publication date: 7-Jul-2003
  • (2002)MEANINGProceedings of the 2002 COLING workshop: A roadmap for computational linguistics - Volume 1310.3115/1118754.1118758(1-7)Online publication date: 31-Aug-2002
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media