DOI: 10.5555/2002736.2002835

Putting it simply: a context-aware approach to lexical simplification

Published: 19 June 2011

Abstract

We present a method for lexical simplification. Simplification rules are learned from a comparable corpus, and the rules are applied in a context-aware fashion to input sentences. Our method is unsupervised. Furthermore, it does not require any alignment or correspondence among the complex and simple corpora. We evaluate the simplification according to three criteria: preservation of grammaticality, preservation of meaning, and degree of simplification. Results show that our method outperforms an established simplification baseline for both meaning preservation and simplification, while maintaining a high level of grammaticality.
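
The idea of applying substitution rules only where the surrounding context supports them can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not specify the similarity measure or how rules are represented, so the hand-written `rules` table (the paper learns its rules from a comparable corpus), the bag-of-words co-occurrence vectors, the cosine score, and the `threshold` value are all assumptions made for illustration.

```python
from collections import Counter
from math import sqrt


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def context_vectors(corpus: list[str]) -> dict[str, Counter]:
    """For each word, count the other words that share a sentence with it."""
    vectors: dict[str, Counter] = {}
    for sentence in corpus:
        tokens = sentence.lower().split()
        for i, word in enumerate(tokens):
            vectors.setdefault(word, Counter()).update(tokens[:i] + tokens[i + 1:])
    return vectors


def simplify(sentence: str, rules: dict[str, list[str]],
             vectors: dict[str, Counter], threshold: float = 0.1) -> str:
    """Replace a word only if some candidate fits the sentence context."""
    tokens = sentence.lower().split()
    out = []
    for i, word in enumerate(tokens):
        context = Counter(tokens[:i] + tokens[i + 1:])
        best, best_score = word, threshold  # keep the original word by default
        for candidate in rules.get(word, []):
            score = cosine(context, vectors.get(candidate, Counter()))
            if score > best_score:
                best, best_score = candidate, score
        out.append(best)
    return " ".join(out)


# Hypothetical toy data, standing in for a corpus of simple text.
simple_corpus = [
    "the students got the books from the library",
    "she got a letter from the bank",
    "the company bought a smaller company",
]
# Hypothetical hand-written rule table: complex word -> simpler candidates.
rules = {"acquired": ["got", "bought"]}
vectors = context_vectors(simple_corpus)

print(simplify("the students acquired the books", rules, vectors))
print(simplify("the company acquired a smaller company", rules, vectors))
print(simplify("he acquired wisdom", rules, vectors))
```

In this sketch the same rule yields different replacements in different sentences, and the word is left unchanged when no candidate's usage overlaps with the input context. That captures the general behaviour the abstract describes, under the stated assumptions.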

Published In

HLT '11: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
June 2011
765 pages
ISBN: 9781932432886

Publisher

Association for Computational Linguistics

United States

Qualifiers

  • Research-article

Acceptance Rates

Overall Acceptance Rate: 240 of 768 submissions (31%)

Article Metrics

  • Downloads (Last 12 months): 36
  • Downloads (Last 6 weeks): 7
Reflects downloads up to 23 Nov 2024.

Cited By

  • (2019) Unsupervised Clinical Language Translation. Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 3121-3131. DOI: 10.1145/3292500.3330710. Online publication date: 25-Jul-2019.
  • (2015) A plug-in to aid online reading in Spanish. Proceedings of the 12th International Web for All Conference, pp. 1-4. DOI: 10.1145/2745555.2746661. Online publication date: 18-May-2015.
  • (2015) Measuring text simplification with the crowd. Proceedings of the 12th International Web for All Conference, pp. 1-9. DOI: 10.1145/2745555.2746658. Online publication date: 18-May-2015.
  • (2015) Making It Simplext. ACM Transactions on Accessible Computing 6:4, pp. 1-36. DOI: 10.1145/2738046. Online publication date: 11-May-2015.
  • (2015) From senses to texts. Artificial Intelligence 228:C, pp. 95-128. DOI: 10.1016/j.artint.2015.07.005. Online publication date: 1-Nov-2015.
  • (2014) Evaluation of DysWebxia. Proceedings of the 11th Web for All Conference, pp. 1-10. DOI: 10.1145/2596695.2596697. Online publication date: 7-Apr-2014.
  • (2013) Comparing resources for Spanish lexical simplification. Proceedings of the First international conference on Statistical Language and Speech Processing, pp. 236-247. DOI: 10.1007/978-3-642-39593-2_21. Online publication date: 29-Jul-2013.
  • (2013) Automatic text simplification in Spanish. Proceedings of the 14th international conference on Computational Linguistics and Intelligent Text Processing - Volume 2, pp. 488-500. DOI: 10.1007/978-3-642-37256-8_40. Online publication date: 24-Mar-2013.
  • (2012) Towards automatic lexical simplification in Spanish. Proceedings of the First Workshop on Predicting and Improving Text Readability for target reader populations, pp. 8-16. DOI: 10.5555/2390916.2390919. Online publication date: 7-Jun-2012.
  • (2012) UOW-SHEF. Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation, pp. 477-481. DOI: 10.5555/2387636.2387715. Online publication date: 7-Jun-2012.
