Nothing Special   »   [go: up one dir, main page]

skip to main content
10.5555/2187681.2187684dlproceedingsArticle/Chapter ViewAbstractPublication PagesenlgConference Proceedingsconference-collections
research-article
Free access

Text simplification using typed dependencies: a comparison of the robustness of different generation strategies

Published: 28 September 2011 Publication History

Abstract

We present a framework for text simplification based on applying transformation rules to a typed dependency representation produced by the Stanford parser. We test two approaches to regeneration from typed dependencies: (a) gen-light, where the transformed dependency graphs are linearised using the word order and morphology of the original sentence, with any changes coded into the transformation rules, and (b) gen-heavy, where the Stanford dependencies are reduced to a DSyntS representation and sentences are generating formally using the RealPro surface realiser. The main contribution of this paper is to compare the robustness of these approaches in the presence of parsing errors, using both a single parse and an n-best parse setting in an overgenerate and rank approach. We find that the gen-light approach is robust to parser error, particularly in the n-best parse setting. On the other hand, parsing errors cause the realiser in the gen-heavy approach to order words and phrases in ways that are disliked by our evaluators.

References

[1]
Richard Anderson and Peter Freebody. 1981. Vocabulary knowledge. In John Guthrie, editor, Comprehension and Teaching: Research Reviews, pages 77--117. International Reading Association, Newark, DE.
[2]
Richard Anderson. 1981. A proposal to continue a center for the study of reading. Technical Report 487, University of Illinois, Center for the Study of Reading, Urbana-Champaign.
[3]
Regina Barzilay and Lillian Lee. 2003. Learning to paraphrase: An unsupervised approach using multiple-sequence alignment. In HLT-NAACL 2003: Main Proceedings, pages 16--23.
[4]
Regina Barzilay and Kathleen McKeown. 2005. Sentence fusion for multidocument news summarization. Computational Linguistics, 31(3):297--328.
[5]
Isabel L. Beck, Margaret G. McKeown, Gale M. Sinatra, and Jane A. Loxterman. 1991. Revising social studies text from a text-processing perspective: Evidence of improved comprehensibility. Reading Research Quarterly, pages 251--276.
[6]
John Carroll, Guido Minnen, Yvonne Canning, Siobhan Devlin, and John Tait. 1998. Practical simplification of English newspaper text to assist aphasic readers. In Proceedings of AAAI98 Workshop on Integrating Artificial Intelligence and Assistive Technology, pages 7--10, Madison, Wisconsin.
[7]
Raman Chandrasekar and Bangalore Srinivas. 1997. Automatic induction of rules for text simplification. Knowledge-Based Systems, 10:183--190.
[8]
Raman Chandrasekar, Christine Doran, and Bangalore Srinivas. 1996. Motivations and methods for text simplification. In Proceedings of the 16th International Conference on Computational Linguistics (COLING '96), pages 1041--1044, Copenhagen, Denmark.
[9]
Trevor Cohn and Mirella Lapata. 2009. Sentence compression as tree transduction. Journal of Artificial Intelligence Research, 34(1):637--674.
[10]
Trevor Cohn, Chris Callison-Burch, and Mirella Lapata. 2008. Constructing corpora for the development and evaluation of paraphrase systems. Computational Linguistics, 34(4):597--614.
[11]
Meredyth Daneman and Patricia Carpenter. 1980. Individual differences in working memory and reading. Journal of Verbal Learning and Verbal Behavior, 19:450--466.
[12]
Marie-Catherine De Marneffe, Bill MacCartney, and Christopher D. Manning. 2006. Generating typed dependency parses from phrase structure parses. In Proceedings of LREC, volume 6, pages 449--454. Citeseer.
[13]
Siobhan Devlin and John Tait. 1998. The use of a psycholinguistic database in the simplification of text for aphasic readers. In J. Nerbonne, editor, Linguistic Databases, pages 161--173. CSLI Publications, Stanford, California.
[14]
Michel Galley and Kathleen McKeown. 2007. Lexicalized Markov grammars for sentence compression. In Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Proceedings of the Main Conference, pages 180--187, Rochester, New York, April. Association for Computational Linguistics.
[15]
Caroline Gasperin, Erick Maziero, and Sandra Aluísio. 2010. Challenging choices for text simplification. Computational Processing of the Portuguese Language, pages 40--50.
[16]
Ali Ibrahim, Boris Katz, and Jimmy Lin. 2003. Extracting paraphrases from aligned corpora. In Proceedings of The Second International Workshop on Paraphrasing.
[17]
Nobuhiro Kaji, Daisuke Kawahara, Sadao Kurohash, and Satoshi Sato. 2002. Verb paraphrase based on case frame alignment. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL'02), pages 215--222, Philadelphia, USA.
[18]
Kevin Knight and Daniel Marcu. 2000. Statistics-based summarization---step one: Sentence compression. In Proceeding of The American Association for Artificial Intelligence Conference (AAAI-2000), pages 703--710.
[19]
Emiel Krahmer, Erwin Marsi, and Paul van Pelt. 2008. Query-based sentence fusion is better defined and leads to more preferred results than generic sentence fusion. In Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers, pages 193--196. Association for Computational Linguistics.
[20]
James J. L'Allier. 1980. An evaluation study of a computer-based lesson that adjusts reading level by monitoring on task reader characteristics. Ph.D. thesis, University of Minnesota, Minneapolis, MN.
[21]
Benoit Lavoie and Owen Rambow. 1997. A fast and portable realizer for text generation systems. In Proceedings of the fifth conference on Applied natural language processing, pages 265--268. Association for Computational Linguistics.
[22]
Tracy Linderholm, Michelle G. Everson, Paul van den Broek, Maureen Mischinski, Alex Crittenden, and J. Samuels. 2000. Effects of Causal Text Revisions on More-and Less-Skilled Readers' Comprehension of Easy and Difficult Texts. Cognition and Instruction, 18(4):525--556.
[23]
Erwin Marsi and Emiel Krahmer. 2005. Explorations in sentence fusion. In Proceedings of the European Workshop on Natural Language Generation, pages 109--117.
[24]
Jana Mason and Janet Kendall. 1979. Facilitating reading comprehension through text structure manipulation. Alberta Journal of Medical Psychology, 24:68--76.
[25]
George A. Miller, Richard Beckwith, Christiane D. Fellbaum, Derek Gross, and Katherine Miller. 1993. Five Papers on WordNet. Technical report, Princeton University, Princeton, N. J.
[26]
Sarah E. Petersen. 2007. Natural language processing tools for reading level assessment and text simplification for bilingual education. Ph.D. thesis, University of Washington, Seattle, WA.
[27]
Stephen P. Quigley and Peter V. Paul. 1984. Language and Deafness. College-Hill Press, San Diego, California.
[28]
Philip Quinlan. 1992. The Oxford Psycholinguistic Database. Oxford University Press, U. K.
[29]
Stefan Riezler, Tracy H. King, Richard Crouch, and Annie Zaenen. 2003. Statistical sentence condensation using ambiguity packing and stochastic disambiguation methods for lexical-functional grammar. In Proceedings of the Human Language Technology Conference and the 3rd Meeting of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL'03), Edmonton, Canada.
[30]
Advaith Siddharthan and Napoleon Katsos. 2010. Reformulating discourse connectives for non-expert readers. In Proceedings of the 11th Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT 2010), Los Angeles, CA.
[31]
Advaith Siddharthan. 2003. Preserving discourse structure when simplifying text. In Proceedings of the European Natural Language Generation Workshop (ENLG), 11th Conference of the European Chapter of the Association for Computational Linguistics (EACL'03), pages 103--110, Budapest, Hungary.
[32]
Advaith Siddharthan. 2006. Syntactic simplification and text cohesion. Research on Language and Computation, 4(1):77--109.
[33]
Advaith Siddharthan. 2010. Complex lexico-syntactic reformulation of sentences using typed dependency representations. In Proceedings of the 6th International Natural Language Generation Conference (INLG 2010), pages 125--133. Dublin, Ireland.

Cited By

View all
  • (2019)Text Analysis in Adversarial SettingsACM Computing Surveys10.1145/331033152:3(1-36)Online publication date: 18-Jun-2019
  • (2015)Making It SimplextACM Transactions on Accessible Computing10.1145/27380466:4(1-36)Online publication date: 11-May-2015
  • (2014)Transforming graph-based sentence representations to alleviate overfitting in relation extractionProceedings of the 2014 ACM symposium on Document engineering10.1145/2644866.2644875(53-62)Online publication date: 16-Sep-2014
  • Show More Cited By
  1. Text simplification using typed dependencies: a comparison of the robustness of different generation strategies

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image DL Hosted proceedings
      ENLG '11: Proceedings of the 13th European Workshop on Natural Language Generation
      September 2011
      336 pages

      Publisher

      Association for Computational Linguistics

      United States

      Publication History

      Published: 28 September 2011

      Qualifiers

      • Research-article

      Acceptance Rates

      Overall Acceptance Rate 33 of 78 submissions, 42%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)30
      • Downloads (Last 6 weeks)6
      Reflects downloads up to 21 Nov 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2019)Text Analysis in Adversarial SettingsACM Computing Surveys10.1145/331033152:3(1-36)Online publication date: 18-Jun-2019
      • (2015)Making It SimplextACM Transactions on Accessible Computing10.1145/27380466:4(1-36)Online publication date: 11-May-2015
      • (2014)Transforming graph-based sentence representations to alleviate overfitting in relation extractionProceedings of the 2014 ACM symposium on Document engineering10.1145/2644866.2644875(53-62)Online publication date: 16-Sep-2014
      • (2013)ERNESTAProceedings of the 14th international conference on Computational Linguistics and Intelligent Text Processing - Volume 210.1007/978-3-642-37256-8_39(476-487)Online publication date: 24-Mar-2013
      • (2012)A hybrid system for Spanish text simplificationProceedings of the Third Workshop on Speech and Language Processing for Assistive Technologies10.5555/2392855.2392865(75-84)Online publication date: 7-Jun-2012
      • (2012)SBProceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation10.5555/2387636.2387716(482-486)Online publication date: 7-Jun-2012

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media