Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1007/978-3-642-34478-7_9guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Obtaining single document summaries using latent dirichlet allocation

Published: 12 November 2012 Publication History

Abstract

In this paper, we present a novel approach that makes use of topic models based on Latent Dirichlet allocation(LDA) for generating single document summaries. Our approach is distinguished from other LDA based approaches in that we identify the summary topics which best describe a given document and only extract sentences from those paragraphs within the document which are highly correlated given the summary topics. This ensures that our summaries always highlight the crux of the document without paying any attention to the grammar and the structure of the documents. Finally, we evaluate our summaries on the DUC 2002 Single document summarization data corpus using ROUGE measures. Our summaries had higher ROUGE values and better semantic similarity with the documents than the DUC summaries.

References

[1]
Blei, D.: Probabilistic Topic Models. Commun. ACM 55(4), 77-84 (2012)
[2]
Changm, Y., Chien, J.: Latent Dirichlet Learning for Document Summarization. In: ICASSP, pp. 1689-1692 (2009)
[3]
Edmundson, H. P.: New Methods in Automatic Extracting. J. ACM 16(2), 264-285 (1969)
[4]
Luhn, H. P.: The Automatic Creation of Literature Abstracts. Presented at IRE National Convention (1958)
[5]
Conroy, J. M., O'leary, D. P.: Text Summarization via Hidden Markov Models. In: SIGIR, pp. 406-407 (2001)
[6]
Arora, R., Ravindran, B.: Latent Dirichlet Allocation Based Multi-Document Summarization. In: AND, pp. 91-97 (2008)
[7]
Hanna, W., David, M., Andrew, M.: Rethinking LDA: Why Priors Matter. In: NIPS, pp. 1973-1981 (2009)
[8]
Manning, C. D., Raghavan, P., Schütze, H.: Introduction to Information Retrieval, 1st edn. Cambridge University Press (2008)
[9]
Lin, C.Y.: Rouge: A package for Automatic Evaluation of Summaries. In: Proceedings of the Workshop on Text Summarization Branches Out, WAS 2004 (2004)
[10]
Simpson, J., Weiner, E.: Oxford English Dictionary, 2nd edn. Oxford University Press (1989)

Index Terms

  1. Obtaining single document summaries using latent dirichlet allocation
    Index terms have been assigned to the content through auto-classification.

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image Guide Proceedings
    ICONIP'12: Proceedings of the 19th international conference on Neural Information Processing - Volume Part IV
    November 2012
    698 pages
    ISBN:9783642344770
    • Editors:
    • Tingwen Huang,
    • Zhigang Zeng,
    • Chuandong Li,
    • Chi Sing Leung

    Sponsors

    • ExxonMobil
    • QAPCO: QAPCO
    • United Development: United Development Co.
    • Qatar Petrochemical Company: Qatar Petrochemical Company

    Publisher

    Springer-Verlag

    Berlin, Heidelberg

    Publication History

    Published: 12 November 2012

    Author Tags

    1. SVM
    2. latent dirichlet allocation
    3. naïve bayes classifier
    4. single document summaries

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 0
      Total Downloads
    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 04 Oct 2024

    Other Metrics

    Citations

    View Options

    View options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media