Nothing Special   »   [go: up one dir, main page]

skip to main content
10.3115/1220355.1220395dlproceedingsArticle/Chapter ViewAbstractPublication PagescolingConference Proceedingsconference-collections
Article
Free access

A deterministic word dependency analyzer enhanced with preference learning

Published: 23 August 2004 Publication History

Abstract

Word dependency is important in parsing technology. Some applications such as Information Extraction from biological documents benefit from word dependency analysis even without phrase labels. Therefore, we expect an accurate dependency analyzer trainable without using phrase labels is useful. Although such an English word dependency analyzer was proposed by Yamada and Matsumoto, its accuracy is lower than state-of-the-art phrase structure parsers because of the lack of top-down information given by phrase labels. This paper shows that the dependency analyzer can be improved by introducing a Root-Node Finder and a Prepositional-Phrase Attachment Resolver. Experimental results show that these modules based on Preference Learning give better scores than Collins' Model 3 parser for these subproblems. We expect this method is also applicable to phrase structure parsers.

References

[1]
Eugene Charniak. 2000. A maximum-entropyinspired parser. In Proceedings of the North American Chapter of the Association for Computational Linguistics, pages 132--139.
[2]
Michael Collins and Nigel Duffy. 2002. New ranking algorithms for parsing and tagging: Kernels over discrete structures, and the voted perceptron. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL), pages 263--270.
[3]
Michael Collins. 1997. Three generative, lexicalised models for statistical parsing. In Proceedings of the Annual Meeting of the Association for Computational Linguistics, pages 16--23.
[4]
Michael Collins. 1999. Head-Driven Statistical Models for Natural Language Parsing. Ph.D. thesis, Univ. of Pennsylvania.
[5]
Nello Cristianini and John Shawe-Taylor. 2000. An Introduction to Support Vector Machines. Cambridge University Press.
[6]
Jason M. Eisner. 1996. Three new probabilistic models for dependency parsing: An exploration. In Proceedings of the International Conference on Computational Linguistics, pages 340--345.
[7]
Ralf Herbrich, Thore Graepel, Peter Bollmann-Sdorra, and Klaus Obermayer. 1998. Learning preference relations for information retrieval. In Proceedings of ICML-98 Workshop on Text Categorization and Machine Learning, pages 80--84.
[8]
Ralf Herbrich, Thore Graepel, and Klaus Obermayer, 2000. Large Margin Rank Boundaries for Ordinal Regression, chapter 7, pages 115--132. MIT Press.
[9]
Julia Hockenmaier and Mark Steedman. 2002. Generative models for statistical parsing with combinatory categorial grammar. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, pages 335--342.
[10]
Hideki Isozaki and Hideto Kazawa. 2002. Efficient support vector classifiers for named entity recognition. In Proceedings of COLING-2002, pages 390--396.
[11]
Thorsten Joachims. 1999. Making large-scale support vector machine learning practical. In B. Schölkopf, C. J. C. Burges, and A. J. Smola, editors, Advances in Kernel Methods, chapter 16, pages 170--184. MIT Press.
[12]
Thorsten Joachims. 2002. Optimizing search engines using clickthrough data. In Proceedings of the ACM Conference on Knowledge Discovery and Data Mining.
[13]
Taku Kudo and Yuji Matsumoto. 2001. Chunking with support vector machines. In Proceedings of NAACL-2001, pages 192--199.
[14]
Taku Kudo and Yuji Matsumoto. 2002. Japanese dependency analysis using cascaded chunking. In Proceedings of CoNLL, pages 63--69.
[15]
Mitchell P. Marcus, Beatrice Santorini, and Mary A. Marcinkiewicz. 1993. Building a large annotated corpus of english: the penn treebank. Computational Linguistics, 19(2):313--330.
[16]
Adwait Ratnaparkhi. 1996. A maximum entropy part-of-speech tagger. In Proceedings of the Conference on Empirical Methods in Natural Language Processing.
[17]
Libin Shen and Aravind K. Joshi. 2003. An SVM based voting algorithm with application to parse reranking. In Proceedings of the Seventh Conference on Natural Language Learning, pages 9--16.
[18]
Daniel Sleator and Davy Temperley. 1991. Parsing English with a Link grammar. Technical Report CMU-CS-91-196, Carnegie Mellon University.
[19]
Kiyoshi Sudo, Satoshi Sekine, and Ralph Grishman. 2003. An improved extraction pattern representation model for automatic IE pattern acquisition. In Proceedings of the Annual Meeting of the Association for Cimputational Linguistics, pages 224--231.
[20]
Jun Suzuki, Tsutomu Hirao, Yutaka Sasaki, and Eisaku Maeda. 2003. Hierarchical direct acyclic graph kernel: Methods for structured natural language data. In Proceedings of ACL-2003, pages 32--39.
[21]
Vladimir N. Vapnik. 1995. The Nature of Statistical Learning Theory. Springer.
[22]
Hiroyasu Yamada and Yuji Matsumoto. 2003. Statistical dependency analysis. In Proceedings of the International Workshop on Parsing Technologies, pages 195--206.

Cited By

View all
  • (2012)Head-driven transition-based parsing with top-down predictionProceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 110.5555/2390524.2390617(657-665)Online publication date: 8-Jul-2012
  • (2007)Minimally lexicalized dependency parsingProceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions10.5555/1557769.1557829(205-208)Online publication date: 25-Jun-2007
  1. A deterministic word dependency analyzer enhanced with preference learning

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image DL Hosted proceedings
      COLING '04: Proceedings of the 20th international conference on Computational Linguistics
      August 2004
      1411 pages

      Publisher

      Association for Computational Linguistics

      United States

      Publication History

      Published: 23 August 2004

      Qualifiers

      • Article

      Acceptance Rates

      COLING '04 Paper Acceptance Rate 1,411 of 1,411 submissions, 100%;
      Overall Acceptance Rate 1,537 of 1,537 submissions, 100%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)28
      • Downloads (Last 6 weeks)18
      Reflects downloads up to 13 Nov 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2012)Head-driven transition-based parsing with top-down predictionProceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 110.5555/2390524.2390617(657-665)Online publication date: 8-Jul-2012
      • (2007)Minimally lexicalized dependency parsingProceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions10.5555/1557769.1557829(205-208)Online publication date: 25-Jun-2007

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Get Access

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media