Ambiguous Supertagging Using a Feature Structure

François Toussenel²¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3206))

Included in the following conference series:

International Conference on Text, Speech and Dialogue

878 Accesses

Abstract

Tree Adjoining Grammar parsers can use a statistical supertagger as a preprocessor to help disambiguate the category of words and thus speed up the parsing phase dramatically. However, since the errors in supertagging propagate to the latter, it is vital to keep the word error rate of the supertagger reasonably low. With very large tagsets coming from extracted grammars, this error rate can be of almost 20% (whereas the error rate of part of speech tagging is under 5%), using standard Hidden Markov Model techniques. To address this problem, we can trade some ambiguity in the supertagger output for a higher accuracy. We propose a new approach to introduce ambiguity in the supertags, looking for a suitable trade-off. The method is based on a representation of the supertags as a feature structure, and consists in grouping the values, or some of the values, of certain features.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Enhancing Practical TAG Parsing Efficiency by Capturing Redundancy

Supertagging for a Statistical HPSG Parser for Spanish

A deterministic parsing algorithm for ambiguous regular expressions

Article 04 February 2020

References

Joshi, A.K., Bangalore, S.: Disambiguation of super parts of speech (or supertags): Almost parsing. In: International Conference on Computational Linguistics (COLING 1994), Kyoto University, Japan (August 1994)
Google Scholar
Bangalore, S.: Complexity of lexical descriptions and its relevance for partial parsing. Ph.D. thesis, University of Pennsylvania, Philadelphia (1997)
Google Scholar
Chen, J.: Towards Efficient Statistical Parsing using Lexicalized Grammatical Information. Ph.D. thesis, University of Delaware (2001)
Google Scholar
Marcus, M.P., Santorini, B., Marcinkiewicz, M.A.: Building a large annotated corpus of English: The Penn treebank. Computational Linguistics 19, 313–330 (1993)
Google Scholar
Nasr, A., Rambow, O., Chen, J., Bangalore, S.: Context-Free Parsing of a Tree Adjoining Grammar Using Finite State Machines. In: Sixth International Workshop on Tree Adjoining Grammars and Related Frameworks, Venice, Italy (2002)
Google Scholar
Chen, J., Bangalore, S., Collins, M., Rambow, O.: Reranking an n-gram supertagger. In: Proceedings of the Sixth International Workshop on Tree Adjoining Grammars and Related Frameworks, Venice, Italy (2002)
Google Scholar
Chen, J., Bangalore, S., Vijay-Shanker, K.: New models for improving supertag disambiguation. In: Proceedings of the Ninth Conference of the European Chapter of the Assocation for Computational Linguistics, Bergen, Norway (1999)
Google Scholar
Bangalore, S., Joshi, A.K.: Supertagging: An approach to almost parsing. Computational Linguistics 25, 237–265 (1999)
Google Scholar
Xia, F.: Automatic grammar generation from two different perspectives. Ph.D. thesis, Department of Computer and Information Science, University of Pennsylvania (2001)
Google Scholar
Kinyon, A.: Hypertags. In: Proceedings of COLING 2000, Saarbrücken, Germany (2000)
Google Scholar
Candito, M.H.: Représentation modulaire et paramétrable de grammaires électroniques lexicalisées. Ph.D. thesis, University Paris 7 (1999)
Google Scholar

Download references

Author information

Authors and Affiliations

Lattice / University Paris 7,
François Toussenel

Authors

François Toussenel
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Informatics, Masaryk University, Brno, Czech Republic
Petr Sojka
Faculty of Informatics, Masaryk University, Botanická 68a, CZ-602 00, Brno, Czech Republic
Ivan Kopeček
Faculty of Informatics, Department of Computer Graphics and Design, Masaryk University, Botanická 68a, 602 00, Brno, Czech Republic
Karel Pala

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Toussenel, F. (2004). Ambiguous Supertagging Using a Feature Structure. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2004. Lecture Notes in Computer Science(), vol 3206. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30120-2_28

Download citation

DOI: https://doi.org/10.1007/978-3-540-30120-2_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23049-6
Online ISBN: 978-3-540-30120-2
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Ambiguous Supertagging Using a Feature Structure

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Enhancing Practical TAG Parsing Efficiency by Capturing Redundancy

Supertagging for a Statistical HPSG Parser for Spanish

A deterministic parsing algorithm for ambiguous regular expressions

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Ambiguous Supertagging Using a Feature Structure

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Enhancing Practical TAG Parsing Efficiency by Capturing Redundancy

Supertagging for a Statistical HPSG Parser for Spanish

A deterministic parsing algorithm for ambiguous regular expressions

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation