DOI: 10.3115/1034678.1034737

Corpus-based identification of non-anaphoric noun phrases

Published: 20 June 1999

Abstract

Coreference resolution involves finding antecedents for anaphoric discourse entities, such as definite noun phrases. But many definite noun phrases are not anaphoric because their meaning can be understood from general world knowledge (e.g., "the White House" or "the news media"). We have developed a corpus-based algorithm for automatically identifying definite noun phrases that are non-anaphoric, which has the potential to improve the efficiency and accuracy of coreference resolution systems. Our algorithm generates lists of non-anaphoric noun phrases and noun phrase patterns from a training corpus and uses them to recognize non-anaphoric noun phrases in new texts. Using 1600 MUC-4 terrorism news articles as the training corpus, our approach achieved 78% recall and 87% precision at identifying such noun phrases in 50 text documents.
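The abstract describes learning lists of non-anaphoric noun phrases and noun phrase patterns from a training corpus, applying them to new texts, and scoring the result with recall and precision. The following is a minimal sketch of that general idea, not the authors' algorithm: the abstract does not say how the lists or patterns are built, so the helper names (`head_of`, `learn`, `is_non_anaphoric`, `precision_recall`), the "last token is the head" rule, the `min_pattern_count` threshold, and the toy data are all assumptions made for illustration.

```python
from collections import Counter


def head_of(np_text):
    """Crude head guess: the last token of the noun phrase (an assumption)."""
    return np_text.lower().split()[-1]


def learn(training_nps, min_pattern_count=2):
    """Build a phrase list and a set of head-noun patterns.

    training_nps: definite NPs that some upstream step (not shown, hypothetical)
    has judged non-anaphoric in the training corpus.
    """
    phrases = {np_text.lower() for np_text in training_nps}
    head_counts = Counter(head_of(p) for p in phrases)
    # A head seen in several distinct phrases is promoted to a pattern.
    patterns = {h for h, c in head_counts.items() if c >= min_pattern_count}
    return phrases, patterns


def is_non_anaphoric(np_text, phrases, patterns):
    """Flag an NP if it matches a learned phrase or a learned head pattern."""
    text = np_text.lower()
    return text in phrases or head_of(text) in patterns


def precision_recall(predicted, gold):
    """Precision and recall of a predicted set against a gold set."""
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall


# Toy data, invented for this sketch only.
training = [
    "the White House",
    "the news media",
    "the national government",
    "the Salvadoran government",
]
phrases, patterns = learn(training)

new_nps = ["the White House", "the Peruvian government", "the bomb"]
predicted = {n for n in new_nps if is_non_anaphoric(n, phrases, patterns)}
gold = {"the White House", "the Peruvian government", "the FMLN"}
precision, recall = precision_recall(predicted, gold)
```

In this toy run the exact-match list catches "the White House", the head pattern catches "the Peruvian government", and "the bomb" is left for the coreference resolver. Keeping the phrase list and the pattern set separate mirrors the abstract's distinction between noun phrases and noun phrase patterns.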



Published In

ACL '99: Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
June 1999
642 pages
ISBN: 1558606093

Publisher

Association for Computational Linguistics

United States

Acceptance Rates

Overall Acceptance Rate 85 of 443 submissions, 19%

Cited By

  • (2012) Automatically acquiring fine-grained information status distinctions in German. Proceedings of the 13th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pp. 232-236. DOI: 10.5555/2392800.2392842. Online publication date: 5-Jul-2012
  • (2011) Learning the information status of noun phrases in spoken dialogues. Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 1069-1080. DOI: 10.5555/2145432.2145547. Online publication date: 27-Jul-2011
  • (2011) Methodological Review. Journal of Biomedical Informatics 44:6, pp. 1113-1122. DOI: 10.1016/j.jbi.2011.08.006. Online publication date: 1-Dec-2011
  • (2010) Dependency-driven anaphoricity determination for coreference resolution. Proceedings of the 23rd International Conference on Computational Linguistics, pp. 599-607. DOI: 10.5555/1873781.1873849. Online publication date: 23-Aug-2010
  • (2010) Supervised noun phrase coreference research. Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp. 1396-1411. DOI: 10.5555/1858681.1858823. Online publication date: 11-Jul-2010
  • (2009) Global learning of noun phrase anaphoricity in coreference resolution via label propagation. Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2, pp. 978-986. DOI: 10.5555/1699571.1699640. Online publication date: 6-Aug-2009
  • (2009) Supervised models for coreference resolution. Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2, pp. 968-977. DOI: 10.5555/1699571.1699639. Online publication date: 6-Aug-2009
  • (2009) Graph-cut-based anaphoricity determination for coreference resolution. Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pp. 575-583. DOI: 10.5555/1620754.1620838. Online publication date: 31-May-2009
  • (2009) A chain-starting classifier of definite NPs in Spanish. Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics: Student Research Workshop, pp. 46-53. DOI: 10.5555/1609179.1609185. Online publication date: 2-Apr-2009
  • (2008) Coreference-inspired coherence modeling. Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers, pp. 41-44. DOI: 10.5555/1557690.1557702. Online publication date: 16-Jun-2008
