Nothing Special   »   [go: up one dir, main page]

skip to main content
10.3115/976744.976755dlproceedingsArticle/Chapter ViewAbstractPublication PageseaclConference Proceedingsconference-collections
Article
Free access

An endogeneous corpus-based method for structural noun phrase disambiguation

Published: 21 April 1993 Publication History

Abstract

In this paper, we describe a method for structural noun phrase disambiguation which mainly relies on the examination of the text corpus under analysis and doesn't need to integrate any domain-dependent lexico- or syntactico-semantic information. This method is implemented in the Terminology Extraction Sottware LEXTER. We first explain why the integration of LEXTER in the LEXTER-K project, which aims at building a tool for knowledge extraction from large technical text corpora, requires improving the quality of the terminolgy extracted by LEXTER. Then we briefly describe the way LEXTER works and show what kind of disambiguation it has to perform when parsing "maximal-length" noun phrases. We introduce a method of disambiguation which relies on a very simple idea: whenever LEXTER has to choose among several competing noun sub-groups in order to disambiguate a maximal-length noun phrase, it checks each of these sub-groups if it occurs anywhere else in the corpus in a non-ambiguous situation, and then it makes a choice. The half-a-million words corpus analysis resulted in an efficient strategy of disambiguation. The average rates are:27% no disambiguation70% correct disambiguation3% wrong disambiguation

References

[1]
{Andreewsky et al., 1977} Alexandre Andreewsky, Christian Fluhr and Fathi Debilli. Computational Learning of Semantic Lexical Relations for the Generation and Automatical Analysis of Content. Information Processing 77, 1977
[2]
{Bod, 1992} Rens Bod. A Computational Model of Language Performance: Data Oriented Parsing. In Proceedings of COLING-92, Nantes, August 1992
[3]
{Bourigault, 1992a} Didier Bourigault. LEXTER, un Logical d'EXtraction de TERminologie. In Proceedings of the 2nd symposium of TermNet, Avignon, May 1992
[4]
{Bourigault, 1992b} Didier Bourigault. Surface Grammatical Analysis for the Extraction of Terminological Noun Phrases. In Proceedings of COLING-92, Nantes, August 1992
[5]
{Jensen and Binot, 1987} Karen Jensen and Jean-Louis Binot. Disambiguating Prepositional Phrase Attachments by Using On-Line Dictionary Definition. Computational Linguistics 13 (3--4), 1987
[6]
{Wermter, 1989} Stefan Wermter. Integration of Semantic and Syntactic Constraints for Structural Noun Phrases Disambiguation. In Proceedings of the 11th IJCAI, 1989, Detroit
[7]
{Zernik, 1992a} Uri Zernik. Shipping Departments vs. Shipping Pacemakers: Using Thematic Analysis to Improve Tagging Accuracy. In Proceedings of AAAI-92, July 1992, San Jose
[8]
{Zernik, 1992b} Uri Zernik. Closed Yesterday and Closed Minds: Asking the Right Questions of the Corpus To Distinguish Thematic from Sentential Relations. In Proceedings of COLING-92, August 1992, Nantes

Cited By

View all
  • (2002)TExtractorProceedings of the second international conference on Human Language Technology Research10.5555/1289189.1289237(393-398)Online publication date: 24-Mar-2002
  • (1999)Term extraction + term clusteringProceedings of the ninth conference on European chapter of the Association for Computational Linguistics10.3115/977035.977039(15-22)Online publication date: 8-Jun-1999
  • (1999)Projecting corpus-based semantic links on a thesaurusProceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics10.3115/1034678.1034739(389-396)Online publication date: 20-Jun-1999
  • Show More Cited By
  1. An endogeneous corpus-based method for structural noun phrase disambiguation

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image DL Hosted proceedings
      EACL '93: Proceedings of the sixth conference on European chapter of the Association for Computational Linguistics
      April 1993
      494 pages
      ISBN:9054340142

      Publisher

      Association for Computational Linguistics

      United States

      Publication History

      Published: 21 April 1993

      Qualifiers

      • Article

      Acceptance Rates

      Overall Acceptance Rate 100 of 360 submissions, 28%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)41
      • Downloads (Last 6 weeks)7
      Reflects downloads up to 19 Nov 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2002)TExtractorProceedings of the second international conference on Human Language Technology Research10.5555/1289189.1289237(393-398)Online publication date: 24-Mar-2002
      • (1999)Term extraction + term clusteringProceedings of the ninth conference on European chapter of the Association for Computational Linguistics10.3115/977035.977039(15-22)Online publication date: 8-Jun-1999
      • (1999)Projecting corpus-based semantic links on a thesaurusProceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics10.3115/1034678.1034739(389-396)Online publication date: 20-Jun-1999
      • (1997)Recycling the results of robustComputer-Assisted Information Searching on Internet - Volume 210.5555/2857665.2857697(751-760)Online publication date: 25-Jun-1997
      • (1997)Expansion of multi-word terms for indexing and retrieval using morphology and syntaxProceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics10.3115/976909.979621(24-31)Online publication date: 7-Jul-1997
      • (1996)Symbolic word clustering for medium-size corporaProceedings of the 16th conference on Computational linguistics - Volume 110.3115/992628.992713(490-495)Online publication date: 5-Aug-1996
      • (1994)FASTRIntelligent Multimedia Information Retrieval Systems and Management - Volume 110.5555/2856823.2856828(34-47)Online publication date: 11-Oct-1994
      • (1994)Recycling terms into a partial parserProceedings of the fourth conference on Applied natural language processing10.3115/974358.974384(113-118)Online publication date: 13-Oct-1994

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media