Dependency Analysis and CBR to Bridge the Generation Gap in Template-Based NLG

Virginia Francisco¹,
Raquel Hervás¹ &
Pablo Gervás¹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4394))

Included in the following conference series:

International Conference on Intelligent Text Processing and Computational Linguistics

1528 Accesses
1 Citations

Abstract

The present paper describes how dependency analysis can be used to automatically extract from a corpus a set of cases - and an accompanying vocabulary - which enable a template-based generator to achieve reasonable coverage over conceptual messages beyond the explicit scope of the templates defined in it. Details are provided on the actual process of partial automation that has been applied to obtain the case base, together with the various ingredients of the template-based generator, which applies case-based reasoning techniques. This module resorts to the taxonomy of concepts in WordNet to compute similarity between concepts involved in the texts. A case retrieval net is used as a memory model. The set of data to be converted into text acts as a query to the system. The process of solving a given query may involve several retrieval processes - to obtain a set of cases that together constitute a good solution for transcribing the data in the query as text messages - and a process of knowledge-intensive adaptation which resorts to a knowledge base to identify appropriate substitutions and completions for the concepts that appear in the cases, using the query as a source. We describe this case-based solution for selecting an appropriate set of templates to render a given set of data as text, we present numeric results of system performance in the domain of press articles, and we discuss its advantages and shortcomings.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Data Extraction Using NLP Techniques and Its Transformation to Linked Data

Constructing of Semantically Dependent Patterns Based on SpaCy and StanfordNLP Libraries

Speeding up Natural Language Parsing by Reusing Partial Results

References

Meteer, M.W.: The generation gap: the problem of expressibility in text planning. PhD thesis, Amherst, MA, USA (1990)
Google Scholar
Aamodt, A., Plaza, E.: Case-based reasoning: Foundational issues, methodological variations, and system approaches (1994)
Google Scholar
Lenz, M., Burkhard, H.D.: Case Retrieval Nets: Basic Ideas and Extensions. In: KI - Kunstliche Intelligenz, pp. 227–239 (1996)
Google Scholar
Hervás, R., Gervás, P.: Case Retrieval Nets for Heuristic Lexicalization in Natural Language Generation. In: Bento, C., Cardoso, A., Dias, G. (eds.) EPIA 2005. LNCS (LNAI), vol. 3808, Springer, Heidelberg (2005)
Chapter Google Scholar
Hervás, R., Gervás, P.: Case-based reasoning for knowledge-intensive template selection during text generation. In: Proc. of the 8th European Conference on Case-Based Reasoning, Springer, Heidelberg (2006)
Google Scholar
Bateman, J.A., Kasper, R.T., Moore, J.D., Whitney, R.A.: A General Organization of Knowledge for Natural Language Processing: the PENMAN upper model (1990)
Google Scholar
Mahesh, K.: Ontology development for machine translation: Ideology and methodology. Technical Report MCCS-96-292 (1996)
Google Scholar
Miller, G.A.: Wordnet: a lexical database for English. Commun. ACM 38, 39–41 (1995)
Article Google Scholar
Barzilay, R., Lee, L.: Bootstrapping lexical choice via multiple-sequence alignment. In: Proc. of the EMNLP’02, pp. 164–171 (2002)
Google Scholar
Ide, N., Veroni, J.: Word Sense Disambiguation: The State of the Art. Computational Linguistics, 1–40 (1998)
Google Scholar
Nelson Francis, W., Kucera, H.: Computing Analysis of Present-day American English. Brown University Press, Providence (1967)
Google Scholar
Maxwell, D., Schubert, K.: Metataxis in Practice: Dependency Syntax for Multilingual Machine Translation. Foris Publications (1989)
Google Scholar
Kouylekov, M., Magnini, B.: Tree edit distance for recognizing textual entailment: Estimating the cost of insertion. In: Proceedings of the Second PASCAL Challenges Workshop on Recognising Textual Entailment, Venezia, Italia (2006)
Google Scholar
Herrera, J., Peñas, A., Rodrigo, A., Verdejo, F.: UNED at PASCAL RTE-2 Challenge. In: Proceedings of the Second PASCAL Challenges Workshop on Recognising Textual Entailment, Venezia, Italia (2006)
Google Scholar
Lin, D.: Dependency-based evaluation of MINIPAR. In: Proc. of Workshop on the Evaluation of Parsing Systems, Granada, Spain, May (1998)
Google Scholar
McRoy, S., Channarukul, S., Ali, S.: A Natural Language Generation Component for Dialog Systems. In: Cox, M. (ed.) Working Notes of the AAAI Workshop on Mixed-Initiative Intelligence (AAAI99) (1999)
Google Scholar

Download references

Author information

Authors and Affiliations

Departamento de Ingeniería del Software e Inteligencia Artificial, Universidad Complutense de Madrid, Spain
Virginia Francisco, Raquel Hervás & Pablo Gervás

Authors

Virginia Francisco
View author publications
You can also search for this author in PubMed Google Scholar
Raquel Hervás
View author publications
You can also search for this author in PubMed Google Scholar
Pablo Gervás
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Alexander Gelbukh

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Francisco, V., Hervás, R., Gervás, P. (2007). Dependency Analysis and CBR to Bridge the Generation Gap in Template-Based NLG. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2007. Lecture Notes in Computer Science, vol 4394. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-70939-8_38

Download citation

DOI: https://doi.org/10.1007/978-3-540-70939-8_38
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-70938-1
Online ISBN: 978-3-540-70939-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Dependency Analysis and CBR to Bridge the Generation Gap in Template-Based NLG

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Data Extraction Using NLP Techniques and Its Transformation to Linked Data

Constructing of Semantically Dependent Patterns Based on SpaCy and StanfordNLP Libraries

Speeding up Natural Language Parsing by Reusing Partial Results

References

Author information

Authors and Affiliations

Editor information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Dependency Analysis and CBR to Bridge the Generation Gap in Template-Based NLG

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Data Extraction Using NLP Techniques and Its Transformation to Linked Data

Constructing of Semantically Dependent Patterns Based on SpaCy and StanfordNLP Libraries

Speeding up Natural Language Parsing by Reusing Partial Results

References

Author information

Authors and Affiliations

Editor information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation