DOI: 10.3115/1219840.1219879

What to do when lexicalization fails: parsing German with suffix analysis and smoothing

Published: 25 June 2005

Abstract

In this paper, we present an unlexicalized parser for German which employs smoothing and suffix analysis to achieve a labelled bracket F-score of 76.2, higher than previously reported results on the NEGRA corpus. In addition to the high accuracy of the model, the use of smoothing in an unlexicalized parser allows us to better examine the interplay between smoothing and parsing results.
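
For readers unfamiliar with the evaluation metric, a labelled bracket F-score is conventionally the harmonic mean of labelled precision and labelled recall over constituent brackets. The following is only a minimal sketch of that general definition, not the evaluation code used in the paper; the function name `labelled_bracket_f1`, the `(label, start, end)` bracket representation, and the toy data are assumptions made for illustration.

```python
from collections import Counter


def labelled_bracket_f1(gold, predicted):
    """Return (precision, recall, F1) over labelled brackets.

    Each bracket is a (label, start, end) tuple. Multisets are used so
    that repeated brackets (e.g. from unary chains) are matched at most
    as many times as they occur on both sides.
    """
    gold_counts = Counter(gold)
    pred_counts = Counter(predicted)
    # Multiset intersection: brackets present in both gold and prediction.
    matched = sum((gold_counts & pred_counts).values())
    precision = matched / len(predicted) if predicted else 0.0
    recall = matched / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical toy trees (not taken from NEGRA): one bracket is mislabelled.
gold = [("S", 0, 5), ("NP", 0, 2), ("VP", 2, 5), ("NP", 3, 5)]
pred = [("S", 0, 5), ("NP", 0, 2), ("VP", 2, 5), ("PP", 3, 5)]
p, r, f = labelled_bracket_f1(gold, pred)
print(f"precision={p:.2f} recall={r:.2f} f1={f:.2f}")
```

Scores such as the 76.2 reported above are this quantity expressed as a percentage and aggregated over a test set; the exact aggregation and any bracket exclusions depend on the evaluation tool, which this sketch does not model.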

Published In

ACL '05: Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
June 2005
657 pages
General Chair: Kevin Knight

Publisher

Association for Computational Linguistics

United States

Qualifiers

  • Article

Acceptance Rates

ACL '05 Paper Acceptance Rate 77 of 423 submissions, 18%;
Overall Acceptance Rate 85 of 443 submissions, 19%
