Nothing Special   »   [go: up one dir, main page]

skip to main content
10.3115/981863.981888dlproceedingsArticle/Chapter ViewAbstractPublication PagesaclConference Proceedingsconference-collections
Article
Free access

A new statistical parser based on bigram lexical dependencies

Published: 24 June 1996 Publication History

Abstract

This paper describes a new statistical parser which is based on probabilities of dependencies between head-words in the parse tree. Standard bigram probability estimation techniques are extended to calculate probabilities of dependencies between pairs of words. Tests using Wall Street Journal data show that the method performs at least as well as SPATTER (Magerman 95; Jelinek et al. 94), which has the best published results for a statistical parser on this task. The simplicity of the approach means the model trains on 40,000 sentences in under 15 minutes. With a beam search strategy parsing speed can be improved to over 200 sentences a minute with negligible loss in accuracy.

References

[1]
E. Black et al. 1991. A Procedure for Quantitatively Comparing the Syntactic Coverage of English Grammars. Proceedings of the February 1991 DARPA Speech and Natural Language Workshop.
[2]
T. Briscoe and J. Carroll. 1993. Generalized LR Parsing of Natural Language (Corpora) with Unification-Based Grammars. Computational Linguistics, 19(1):25--60.
[3]
K. Church. 1988. A Stochastic Parts Program and Noun Phrase Parser for Unrestricted Text. Second Conference on Applied Natural Language Processing, ACL.
[4]
M. Collins and J. Brooks. 1995. Prepositional Phrase Attachment through a Backed-off Model. Proceedings of the Third Workshop on Very Large Corpora, pages 27--38.
[5]
D. Hindle and M. Rooth. 1993. Structural Ambiguity and Lexical Relations. Computational Linguistics, 19(1):103--120.
[6]
F. Jelinek. 1990. Self-organized Language Modeling for Speech Recognition. In Readings in Speech Recognition. Edited by Waibel and Lee. Morgan Kaufmann Publishers.
[7]
F. Jelinek, J. Lafferty, D. Magerman, R. Mercer, A. Ratnaparkhi, S. Roukos. 1994. Decision Tree Parsing using a Hidden Derivation Model. Proceedings of the 1994 Human Language Technology Workshop, pages 272--277.
[8]
J. Lafferty, D. Sleator and, D. Temperley. 1992. Grammatical Trigrams: A Probabilistic Model of Link Grammar. Proceedings of the 1992 AAAI Fall Symposium on Probabilistic Approaches to Natural Language.
[9]
D. Magerman. 1995. Statistical Decision-Tree Models for Parsing. Proceedings of the 33rd Annual Meeting of the Association for Computational Linguistics, pages 276--283.
[10]
D. Magerman and M. Marcus. 1991. Pearl: A Probabilistic Chart Parser. Proceedings of the 1991 European ACL Conference, Berlin, Germany.
[11]
M. Marcus, B. Santorini and M. Marcinkiewicz. 1993. Building a Large Annotated Corpus of English: the Penn Treebank. Computational Linguistics, 19(2):313--330.
[12]
F. Pereira and Y. Schabes. 1992. Inside-Outside Reestimation from Partially Bracketed Corpora. Proceedings of the 30th Annual Meeting of the Association for Computational Linguistics, pages 128--135.
[13]
L. Ramshaw and M. Marcus. 1995. Text Chunking using Transformation-Based Learning. Proceedings of the Third Workshop on Very Large Corpora, pages 82--94.
[14]
A. Ratnaparkhi. 1996. A Maximum Entropy Model for Part-Of-Speech Tagging. Conference on Empirical Methods in Natural Language Processing, May 1996.
[15]
M. M. Wood. 1993. Categorial Grammars, Routledge.

Cited By

View all

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
ACL '96: Proceedings of the 34th annual meeting on Association for Computational Linguistics
June 1996
399 pages
  • Program Chairs:
  • Aravind Joshi,
  • Martha Palmer

Publisher

Association for Computational Linguistics

United States

Publication History

Published: 24 June 1996

Qualifiers

  • Article

Acceptance Rates

Overall Acceptance Rate 85 of 443 submissions, 19%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)31
  • Downloads (Last 6 weeks)7
Reflects downloads up to 26 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2019)Ordered tree decomposition for hrg rule extractionComputational Linguistics10.1162/coli_a_0035045:2(339-379)Online publication date: 1-Jun-2019
  • (2016)Word sense disambiguation based sentiment lexicons for sentiment classificationKnowledge-Based Systems10.1016/j.knosys.2016.07.030110:C(224-232)Online publication date: 15-Oct-2016
  • (2013)Towards graphical models for text processingKnowledge and Information Systems10.1007/s10115-012-0552-336:1(1-21)Online publication date: 1-Jul-2013
  • (2012)Syntax-aware phrase-based statistical machine translationProceedings of the Seventh Workshop on Statistical Machine Translation10.5555/2393015.2393055(292-297)Online publication date: 7-Jun-2012
  • (2012)Rediscovering ACL discoveries through the lens of ACL anthology network citing sentencesProceedings of the ACL-2012 Special Workshop on Rediscovering 50 Years of Discoveries10.5555/2390507.2390509(1-12)Online publication date: 10-Jul-2012
  • (2011)BioNLP Shared Task 2011Proceedings of the BioNLP Shared Task 2011 Workshop10.5555/2107691.2107707(112-120)Online publication date: 24-Jun-2011
  • (2011)Improving dependency parsing with semantic classesProceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 210.5555/2002736.2002871(699-703)Online publication date: 19-Jun-2011
  • (2011)Dynamic programming algorithms for transition-based dependency parsersProceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 110.5555/2002472.2002558(673-682)Online publication date: 19-Jun-2011
  • (2011)Applying semantic-based probabilistic context-free grammar to medical language processing - A preliminary study on parsing medication sentencesJournal of Biomedical Informatics10.1016/j.jbi.2011.08.00944:6(1068-1075)Online publication date: 1-Dec-2011
  • (2010)Head-modifier relation based non-lexical reordering model for phrase-based translationProceedings of the 23rd International Conference on Computational Linguistics: Posters10.5555/1944566.1944652(748-756)Online publication date: 23-Aug-2010
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media