Towards a unified framework for opinion retrieval, mining and summarization

Elena Lloret¹,
Alexandra Balahur¹,
José M. Gómez¹,
Andrés Montoyo¹ &
…
Manuel Palomar¹

825 Accesses
10 Citations
Explore all metrics

Abstract

The exponential increase of subjective, user-generated content since the birth of the Social Web, has led to the necessity of developing automatic text processing systems able to extract, process and present relevant knowledge. In this paper, we tackle the Opinion Retrieval, Mining and Summarization task, by proposing a unified framework, composed of three crucial components (information retrieval, opinion mining and text summarization) that allow the retrieval, classification and summarization of subjective information. An extensive analysis is conducted, where different configurations of the framework are suggested and analyzed, in order to determine which is the best one, and under which conditions. The evaluation carried out and the results obtained show the appropriateness of the individual components, as well as the framework as a whole. By achieving an improvement over 10% compared to the state-of-the-art approaches in the context of blogs, we can conclude that subjective text can be efficiently dealt with by means of our proposed framework.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

An Opinion Summarization-Evaluation System Based on Pre-trained Models

An Unsupervised Technique to Generate Summaries from Opinionated Review Documents

Text Mining for News and Blogs Analysis

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Notes

www.swotti.com
http://www.nist.gov/tac
Document Understanding Conference http://duc.nist.gov (Last Access: 06/02/2012).
http://www.ciao.co.uk/ (Last Access: 06/02/2012).
http://www.swotti.com (Last Access: 06/02/2012).
http://www.nist.gov/tac/2008/summarization/op.summ.08.guidelines.html (Last Access: 06/02/2012).
http://alias-i.com/lingpipe/ (Last Access: 06/02/2012).
http://www.yahoo.com/ (Last Access: 06/02/2012).
http://infomap-nlp.sourceforge.net/ (Last Access: 06/02/2012).
http://duc.nist.gov/duc2004/software/duc2003.breakSent.tar.gz (Last Access: 06/02/2012).
http://cogcomp.cs.illinois.edu/page/tools_view/8 (Last Access: 06/02/2012).
http://tartarus.org/~martin/PorterStemmer/ (Last Access: 06/02/2012).
http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/ (Last Access: 06/02/2012).
http://jmlr.csail.mit.edu/papers/volume5/lewis04a/a11-smart-stop-list/english.stop (Last Access: 06/02/2012).
http://www.nist.gov/tac/data/past/2008/OpSummQA08.html (Last Access: 06/02/2012).
http://www.nist.gov/tac/data/past/2008/OpSummQA08.html (Last Access: 06/02/2012).
For specific detail of the different IR, OM and TS components, please refer to Section 3.
http://www.d.umn.edu/~tpederse/text-similarity.html (Last Access: 06/02/2012).
The cosine similarity was computed using Pedersen’s Text Similarity Package: http://www.d.umn.edu/~tpederse/text-similarity.html (Last Access: 06/02/2012).
We can consider opinion question answering as a specific type of information retrieval.
A t-test was carried out in order to account for the significance of the results.
We have used these snippets for building our QA-snippets baseline.

References

Aslandogan, Y. A., & Yu, C. T. (1999). Techniques and systems for image and video retrieval. IEEE Transactions on Knowledge and Data Engineering, 11(1), 56–63.
Article Google Scholar
Balahur, A., Boldrini, E., Montoyo, A., & Martínez-Barco, P. (2009a). A comparative study of open domain and opinion question answering systems for factual and opinionated queries. In Proceedings of the international conference RANLP-2009 (pp. 18–22). Borovets, Bulgaria: Association for Computational Linguistics.
Google Scholar
Balahur, A., Boldrini, E., Montoyo, A., & Martínez-Barco, P. (2009b). Opinion and generic question answering systems: A performance analysis. In Proceedings of the ACL-IJCNLP 2009 conference short papers (pp. 157–160). Stroudsburg, PA, USA: Association for Computational Linguistics.
Chapter Google Scholar
Balahur, A., Boldrini, E., Montoyo, A., & Martínez-Barco, P. (2010b). Going beyond traditional QA systems: Challenges and keys in opinion question answering. In Proceedings of the 23rd international conference on computational linguistics: Posters (pp. 27–35). Stroudsburg, PA, USA: Association for Computational Linguistics.
Google Scholar
Balahur, A., Boldrini, E., Montoyo, A., & Martínez-Barco, P. (2010c). Opinion question answering: Towards a unified approach. In Proceeding of the 2010 conference on ECAI 2010: 19th European conference on artificial intelligence (pp. 511–516). Amsterdam, The Netherlands, The Netherlands: IOS Press.
Google Scholar
Balahur, A., Kabadjov, M., & Steinberger, J. (2010a). Exploiting higher-level semantic information for the opinion-oriented summarization of blogs. International Journal of Computational Linguistics and Applications, 1(1–2), 45–59.
Google Scholar
Balahur, A., Kabadjov, M., Steinberger, J., Steinberger, R., & Montoyo, A. (2009d). Summarizing opinions in blog threads. In Proceedings of the 23rd Pacific Asia Conference on Language, Information and Computation (PACLIC) (pp. 606–613). City University of Hong Kong Press.
Balahur, A., Lloret, E., Ferrández, O., Montoyo, A., Palomar, M., & Muñoz, R. (2008). The DLSIUAES team’s participation in the TAC 2008 tracks. In Proceedings of the text analysis conference (TAC).
Balahur, A., Steinberger, R., Kabadjov, M. A., Zavarella, V., Van der Goot, E., Halkia, M., et al. (2010d). Sentiment analysis in the news. In Proceedings of the seventh conference on international Language Resources and Evaluation (LREC’10) (pp. 2216–2220).
Balahur, A., Steinberger, R., van der Goot, E., Pouliquen, B., & Kabadjov, M. (2009c). Opinion mining from newspaper quotations. In Proceedings of the workshop on intelligent analysis and processing of web news content at the IEEE/WIC/ACM international conferences on web intelligence and intelligent agent technology (WI-IAT) (pp. 523–526).
Beineke, P., Hastie, T., Manning, C., & Vaithyanathan, S. (2003). An exploration of sentiment summarization. In Proceedings of the AAAI spring symposium on exploring attitude and affect in text: Theories and applications (pp. 1–4).
Bossard, A., Généreux, M., & Poibeau, T. (2008). Description of the LIPN systems at TAC 2008: Summarizing information and opinions. In Proceedings of the text analysis conference (TAC).
Brin, S., & Page, L. (1998). The anatomy of a large-scale hypertextual web search engine. In Proceedings of the seventh international conference on world wide web 7. WWW7 (pp. 107–117). Amsterdam, The Netherlands, The Netherlands: Elsevier Science Publishers B.V.
Google Scholar
Buscaldi, D., Rosso, P., Gómez-Soriano, J. M., & Sanchis, E. (2010). Answering questions with an n-gram based passage retrieval engine. Journal of Intelligent Information Systems, 34, 113–134.
Article Google Scholar
Carenini, G., & Cheung, J. C. K. (2008). Extractive vs. NLG-based abstractive summarization of evaluative text: The effect of corpus controversiality. In Proceedings of the fifth international natural language generation conference, ACL 2008 (pp. 33–40).
Cerini, S., Compagnoni, V., Demontis, A., Formentelli, M., & Gandini, G. (2007). Micro-WNOp: A gold standard for the evaluation of automatically compiled lexical resources for opinion mining. In A. Sansó (Ed.), Language resources and linguistic theory: Typology, second language acquisition, english linguistics (pp. 1–4). Milano, IT: Franco Angeli.
Google Scholar
Cesarano, C., Mazzeo, A., & Picariello, A. (2007). A system for summary-document similarity in notary domain. In Proceedings of the international workshop on database and expert systems applications (pp. 254–258).
Chaovalit, P., & Zhou, L. (2005). Movie review mining: A comparison between supervised and unsupervised classification approaches. In Proceedings of HICSS-05, the 38th Hawaii international conference on system sciences.
Christensen, H. U., & Ortiz-Arroyo, D. (2007). Applying data fusion methods to passage retrieval in QAS. In Proceedings of the 7th international conference on Multiple Classifier Systems (MCS’07) (pp. 82–92). Berlin, Heidelberg: Springer.
Chapter Google Scholar
Conroy, J. M., & Schlesinger, J. D. (2008). CLASSY at TAC 2008 metrics. In: Proceedings of the text analysis conference (TAC).
Cruz, F., Troyano, J. A., Ortega, J., & Enríquez, F. (2008). The Italica system at TAC 2008 opinion summarization task. In Proceedings of the text analysis conference (TAC).
Dagan, I., Glickman, O., & Magnini, B. (2006). The PASCAL recognising textual entailment challenge. Machine Learning Challenges. Lecture Notes in Computer Science, 3944, 177–190.
Google Scholar
Dang, H. T., & Owczarzak, K. (2009). Overview of the TAC 2009 summarization track. In Proceedings of the text analysis conference (TAC).
Datta, R., Joshi, D., Li, J., & Wang, J. Z. (2008). Image retrieval: Ideas, influences, and trends of the new age. Acm Computing Surveys, 40(2), 1–60.
Article Google Scholar
Dave, K., Lawrence, S., & Pennock, D. M. (2003). Mining the peanut gallery: Opinion extraction and semantic classification of product reviews. In Proceedings of WWW-03.
Dunlavy, D. M., O’Leary, D. P., Conroy, J. M., & Schlesinger, J. D. (2007). QCS: A system for querying, clustering and summarizing documents. Information Processing & Management, 43(6), 1588–1605.
Article Google Scholar
Esuli, A., & Sebastiani, F. (2006). SentiWordNet: A publicly available resource for opinion mining. In Proceedings of the 6th international conference on Language Resources and Evaluation (LREC’06) (pp. 417–422).
Fellbaum, C. (1998). WordNet: An electronic lexical database. The MIT Press.
Ferrández, Ó., Micol, D., Muñoz, R., & Palomar, M. (2007). A perspective-based approach for solving textual entailment recognition. In Proceedings of the ACL-Pascal workshop on textual entailment and paraphrasing (pp. 66–71). Prague: Association for Computational Linguistics.
Google Scholar
Foote, J. (1999). An overview of audio information retrieval. Multimedia Systems—Special Issue on Audio and Multimedia, 7(1), 2–10.
Google Scholar
Giannakopoulos, G., Karkaletsis, V., & Vouros, G. (2008). Testing the use of n-gram graphs in summarization sub-tasks. In Proceedings of the text analysis conference (TAC).
Givón, T. (1990). Syntax: A functional-typological introduction, II. John Benjamins.
Gómez, J. M. (2007). Recuperación de Pasajes Multilingüe para la Búsqueda de Respuestas. Ph.D. thesis, Universidad Politécnica de Valencia, Valencia, Spain.
Gómez, J. M., Buscaldi, D., Rosso, P., & Sanchis, E. (2007). Jirs: Language-independent passage retrieval system: A comparative study. In: 5th international conference on natural language processing 2006.
Gómez Soriano, J. M., Buscaldi, D., Asensi, E. B., Rosso, P., & Arnal, E. S. (2005). QUASAR: The question answering system of the universidad Politcnica de Valencia. In C. Peters, F. C. Gey, J. Gonzalo, H. Mller, G. J. F. Jones, M. Kluck, et al. (Eds.) Accessing multilingual information repositories, 6th workshop of the cross-language evalution forum, CLEF 2005, Vienna, Austria, 21–23 September. Lecture notes in computer science (Vol. 4022, pp. 439–448). Springer.
Harabagiu, S., & Lacatusu, F. (2005). Topic themes for multi-document summarization. In Sigir ’05: Proceedings of the 28th annual international ACM SIGIR conference on research and development in information retrieval (pp. 202–209). New York, NY, USA: ACM.
Chapter Google Scholar
Hatzivassiloglou, V., & Wiebe, J. M. (2000). Effects of adjective orientation and gradability on sentence subjectivity. In Proceedings of the 18th conference on computational Linguistics (Vol. 1, pp. 299–305). Stroudsburg, PA, USA: Association for Computational Linguistics.
Chapter Google Scholar
He, T., Chen, J., Gui, Z., & Li, F. (2008). CCNU at TAC 2008: Proceeding on using semantic method for automated summarization. In Proceedings of the text analysis conference (TAC).
Hsin-Hsi, C., & Chuan-Jie, L. (2000). A multilingual news summarizer. In Proceedings of the 18th conference on computational linguistics (pp. 159–165). Morristown, NJ, USA: Association for Computational Linguistics.
Google Scholar
Hu, M., & Liu, B. (2004). Mining opinion features in customer reviews. In Proceedings of nineteenth national conference on artificial intellgience AAAI-2004.
Jin, F., Huang, M., & Zhu, X. (2009). A query-specific opinion summarization system. In Proceedings of 8th IEEE international conference on cognitive informatics (pp. 428–433).
Kabadjov, M., Balahur, A., & Boldrini, E. (2009). Sentiment intensity: Is it a good summary indicator? In Proceedings of the 4th language and technology conference (LTC) (pp. 380–384).
Kaisser, M., Hearst, M. A., & Lowe, J. B. (2008). Improving search results quality by customizing summary lengths. In Proceedings of the association for computational linguistics—human language technologies (ACL-08: HLT) (pp. 701–709). Columbus, Ohio: Association for Computational Linguistics.
Google Scholar
Kan, M.-Y., & Klavans, J. L. (2002). Using librarian techniques in automatic text summarization for information retrieval. In Proceedings of the 2nd ACM/IEEE-CS joint conference on digital libraries (JCDL’02) (pp. 36–45). New York, NY, USA: ACM.
Chapter Google Scholar
Kazantseva, A. (2006). An approach to summarizing short stories. In Proceedings of the student research workshop at the 11th conference of the European chapter of the association for computational linguistics (pp. 55–62). Morristown, NJ, USA: Association for Computational Linguistics.
Google Scholar
Kim, S.-M., & Hovy, E. (2004). Determining the sentiment of opinions. In Proceedings of the 20th international conference on computational linguistics (pp. 1367–1373). Stroudsburg, PA, USA: Association for Computational Linguistics.
Google Scholar
Koppel, M., & Shtrimberg, I. (2006). Good news or bad news? Let the market decide. In Computing attitude and affect in text: Theory and applications (pp. 297–301).
Kudo, T., & Matsumoto, Y. (2004). A boosting algorithm for classification of semi-structured text. EMNLP, 2004, 301–308.
Google Scholar
Kuo, J.-J., & Chen, H.-H. (2008). Multidocument summary generation: Using informative and event words. Acm Transactions on Asian Language Information Processing (TALIP), 7(1), 1–23.
Article Google Scholar
Lerman, K., Blair-Goldensohn, S., & McDonald, R. (2009). Sentiment summarization: Evaluating and learning user preferences. In Proceedings of the 12th conference of the European chapter of the ACL (EACL 2009) (pp. 514–522). Athens, Greece: Association for Computational Linguistics.
Google Scholar
Lerman, K., & McDonald, R. (2009). Contrastive summarization: An experiment with consumer reviews. In Proceedings of human language technologies: The 2009 annual conference of the North American chapter of the association for computational linguistics, companion volume: Short papers (pp. 113–116). Boulder, Colorado: Association for Computational Linguistics.
Google Scholar
Lin, C.-Y. (2004). ROUGE: A package for automatic evaluation of summaries. In Proceedings of ACL text summarization workshop (pp. 74–81). Barcelona, Spain: Association for Computational Linguistics.
Google Scholar
Lin, C.-Y., & Hovy, E. (2000). The automated acquisition of topic signatures for text summarization (pp. 495–501). In Proceedings of the 18th conference on computational linguistics. Morristown, NJ, USA: Association for Computational Linguistics.
Chapter Google Scholar
Liu, B. (2006). Web data mining. exploring hyperlinks, contents and usage data (1st ed.). Springer.
Lloret, E. (2011). Text summarisation based on human language technologies and its applications. Ph.D. thesis, University of Alicante.
Lloret, E., Balahur, A., Palomar, M., & Montoyo, A. (2009). Towards building a competitive opinion summarization system: Challenges and keys. In Proceedings of the North American chapter of the association for computational linguistics. Student research workshop and doctoral consortium (pp. 72–77).
Lloret, E., Ferrández, Ó., Muñoz, R., & Palomar, M. (2008a). Integración del reconocimiento de la implicación textual en tareas automáticas de resúmenes de textos. Procesamiento del Lenguaje Natural, 183–190.
Lloret, E., Ferrández, Ó., Muñoz, R., & Palomar, M. (2008b). A text summarization approach under the influence of textual entailment. In Proceedings of the 5th international workshop on natural language processing and cognitive science (NLPCS 2008) in conjunction with the 10th international conference on enterprise information systems (ICEIS 2008), 12–16 June, Barcelona, Spain (pp. 22–31).
Lloret, E., & Palomar, M. (2009). A gradual combination of features for building automatic summarisation systems. In Proceedings of the 12th international conference on text, speech and dialogue (TSD) (pp. 16–23). Berlin, Heidelberg: Springer.
Google Scholar
Lloret, E., Romá-Ferri, M. T., & Palomar, M. (2011). COMPENDIUM: A text summarization system for generating abstracts of research papers. In Proceedings of the 16th international conference on applications of natural language to information systems (NLDB).
Lloret, E., Saggion, H., & Palomar, M. (2010). Experiments on summary-based opinion classification. In Proceedings of the NAACL HLT 2010 workshop on computational approaches to analysis and generation of emotion in text (pp. 107–115). Los Angeles, CA: Association for Computational Linguistics.
Google Scholar
Luhn, H. P. (1958). The automatic creation of literature abstracts. In I. Mani & M. Maybury (Eds.), Advances in automatic text summarization (pp. 15–22). MIT Press.
Ma, L., He, T., Li, F., Gui, Z., & Chen, J. (2008). Query-focused multi-document summarization using keyword extraction. In Proceedings of the 2008 international conference on computer science and software engineering (Vol. 1, pp. 20–23).
Mani, I. (2001). Automatic summarization. John Benjamins Pub Co.
Mani, I., House, D., Klein, G., Hirschman, L., Firmin, T., & Sundheim, B. (1999). The TIPSTER SUMMAC text summarization evaluation. In Proceedings of the ninth conference on European chapter of the association for computational linguistics (pp. 77–85). Morristown, NJ, USA: Association for Computational Linguistics.
Chapter Google Scholar
Manning, C. D., Raghavan, P., & Schtze, H. (2008). Introduction to information retrieval. New York, NY, USA: Cambridge University Press.
Book MATH Google Scholar
McCargar, V. (2005). Statistical approaches to automatic text summarization. Bulletin of the American Society for Information Science and Technology, 30(4), 21–25.
Article Google Scholar
McKeown, K., & Radev, D. R. (1999). Generating summaries of multiple news articles. In I. Mani, & M. Maybury (Eds.), Advances in automatic text summarization (pp. 381–390). MIT Press.
Mihalcea, R., & Ceylan, H. (2007). Explorations in automatic book summarization. In Proceedings of the joint conference on empirical methods in natural language processing and computational natural language learning (EMNLP-CONLL) (pp. 380–389).
Mishne, G. (2006). Multiple ranking strategies for opinion retrieval in blogs. 2006 TREC blog track.
Montes-Gómez, M., Pineda, L. V., Pérez-Coutiño, M. A., Gómez Soriano, J. M., Arnal, E. S., & Rosso, P. (2005). A full data-driven system for multiple language question answering. In Accessing multilingual information repositories, 6th workshop of the cross-language evalution forum, CLEF 2005, Vienna, Austria, 21–23 September, 2005. Lecture Notes in Computer Science (Vol. 4022, pp. 420–428). Springer.
Najork, M., & Heydon, A. (2002). Handbook of massive data sets. Norwell, MA, USA: Kluwer Academic Publishers.
Google Scholar
Nenkova, A., Passonneau, R., & McKeown, K. (2007). The pyramid method: Incorporating human content selection variation in summarization evaluation. ACM Transactions on Speech and Language Processing, 4(2), 4.
Article Google Scholar
Ng, V., Dasgupta, S., & Arifin, S. M. N. (2006). Examining the role of linguistic knowledge sources in the automatic identification and classification of reviews. In Proceedings of the coling/acl on main conference poster sessions (pp. 611–618). Stroudsburg, PA, USA: Association for Computational Linguistics.
Chapter Google Scholar
Ou, S., Khoo, C. S. G., & Goh, D. H. (2007). Automatic multidocument summarization of research abstracts: Design and user evaluation. Journal of American Society for Information Science and Technology, 58(10), 1419–1435.
Article Google Scholar
Ounis, I., de Rijke, M., Macdonald, C., Mishne, G., & Soboroff, I. (2006). Overview of the TREC-2006 blog track. In Proceeddings of the 15th Text Retrieval Conference (TREC 2007).
Pang, B., & Lee, L. (2003). Seeing stars: Exploiting class relationships for sentiment categorization with respect to rating scales. In Proceedings of the 43rd annual meeting of the association for computational linguistics (pp. 115–124).
Pang, B., & Lee, L. (2004). A sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts. In Proceedings of the ACL 2004.
Pang, B., & Lee, L. (2008). Opinion mining and sentiment analysis. Foundations and trends in information retrieval, 2(1–2), 1–135.
Article Google Scholar
Pang, B., Lee, L., & Vaithyanathan, S. (2002). Thumbs up? Sentiment classification using machine learning techniques. In Proceedings of EMNLP-02, the conference on empirical methods in natural language processing (pp. 79–86). Stroudsburg, PA, USA: Association for Computational Linguistics.
Chapter Google Scholar
Plaza, L., Lloret, E., & Aker, A. (2010). Improving automatic image captioning using text summarization techniques. In Proceedings of the 13th international conference on text, speech and dialogue (TSD) (pp. 165–172). Berlin, Heidelberg: Springer.
Google Scholar
Radev, D. R., & Fan, W. (2000). Automatic summarization of search engine hit lists. In Proceedings of the ACL-2000 workshop on recent advances in natural language processing and information retrieval (pp. 99–109). Morristown, NJ, USA: Association for Computational Linguistics.
Google Scholar
Riloff, E., & Wiebe, J. (2003). Learning extraction patterns for subjective expressions. In Proceedings of the 2003 conference on empirical methods in natural language processing (pp. 105–112). Stroudsburg, PA, USA: Association for Computational Linguistics.
Chapter Google Scholar
Riloff, E., Wiebe, J., & Phillips, W. (2005). Exploiting subjectivity classification to improve information extraction. In Proceedings of the 20th national conference on artificial intelligence (Vol. 3, pp. 1106–1111). AAAI Press.
Saggion, H. (2009). A classification algorithm for predicting the structure of summaries. In Proceedings of the 2009 workshop on language generation and summarisation (ucnlg+sum 2009) (pp. 31–38). Suntec, Singapore: Association for Computational Linguistics.
Chapter Google Scholar
Sakai, T., & Sparck-Jones, K. (2001). Generic summaries for indexing in information retrieval. In Proceedings of the 24th annual international ACM SIGIR conference on research and development in information retrieval (SIGIR ’01) (pp. 190–198). New York, NY, USA: ACM.
Chapter Google Scholar
Sauper, C., & Barzilay, R. (2009). Automatically generating wikipedia articles: A structure-aware approach. In Proceedings of the joint conference of the 47th annual meeting of the ACL and the 4th international joint conference on natural language processing of the AFNLP (pp. 208–216). Suntec, Singapore: Association for Computational Linguistics.
Google Scholar
Scherer, K. (2005). What are emotions? And how can they be measured? Social Science Information, 44(4), 693–727.
Article Google Scholar
Shen, D., Yang, Q., & Chen, Z. (2007). Noise reduction through summarization for web-page classification. Information Processing and Management, 43(6), 1735–1747.
Article Google Scholar
Steinberger, J., Jezek, K., & Sloup, M. (2008). Web topic summarization. In Proceedings of the 12th international conference on electronic publishing, Elpub, Toronto, Canada, 25–27 June (pp. 322–334).
Stoyanov, V., & Cardie, C. (2006). Toward opinion summarization: Linking the sources. In Proceedings of the workshop on sentiment and subjectivity in text (pp. 9–14).
Strapparava, C., & Valitutti, A. (2004). WordNet-Affect: An affective extension of wordnet. In Proceedings of the 4th international conference on language resources and evaluation (pp. 1083–1086).
Teufel, S., & Moens, M. (2002). Summarizing scientific articles: Experiments with relevance and rhetorical status. Computational Linguistis, 28(4), 409–445.
Article Google Scholar
Titov, I., & McDonald, R. (2008). A joint model of text and aspect ratings for sentiment summarization. In Proceedings of association for computational linguistics—human language technologies (ACL-08: HLT) (pp. 308–316). Columbus, Ohio: Association for Computational Linguistics.
Google Scholar
Torres-Moreno, J.-M., St-Onge, P.-L., Gagnon, M., El-Bze, M., & Bellot, P. (2009). Automatic summarization system coupled with a question-answering system (QAAS). Nlp News. Computation and Language.
Turney, P. (2002). Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews. In Proceedings 40th annual meeting of the association for computational linguistics.
Wilson, T., Wiebe, J., & Hoffmann, P. (2005). Recognizing contextual polarity in phrase-level sentiment analysis. In Proceedings of the conference on human language technology and empirical methods in natural language processing (pp. 347–354). Stroudsburg, PA, USA: Association for Computational Linguistics.
Chapter Google Scholar
Wilson, T., Wiebe, J., & Hwa, R. (2004). Just how mad are you? Finding strong and weak opinion clauses. In Proceedings of the 19th national conference on artifical intelligence (pp. 761–767). AAAI Press.
Witten, I. H., Moffat, A., & Bell, T. C. (1999). Managing gigabytes: Compressing and indexing documents and images (2nd ed.). San Francisco, CA: Morgan Kaufmann.
Google Scholar
Yang, K. (2008). WIDIT in TREC 2008 blog track: Leveraging multiple sources of opinion evidence. In Proceedings of the 17th text retrieval conference.
Yang, X.-P., & Liu, X.-R. (2008). Personalized multi-document summarization in information retrieval. Proceedings of International Conference on Machine Learning and Cybernetics, 7, 4108–4112.
Google Scholar
Yu, D., & Hatzivassiloglou, V. (2003). Towards answering opinion questions: Separating facts from opinions and identifying the polarity of opinion sentences. In Proceedings of the 2003 conference on empirical methods in natural language processing (pp. 129–136). Stroudsburg, PA, USA: Association for Computational Linguistics.
Chapter Google Scholar
Zhang, W., Yu, C., & Meng, W. (2007). Opinion retrieval from blogs. In Proceedings of the sixteenth ACM conference on conference on information and knowledge management (pp. 831–840). New York, NY, USA: ACM.
Chapter Google Scholar
Zhao, L., Wu, L., & Huang, X. (2009). Using query expansion in graph-based approach for query-focused multi-document summarization. Information Processing and Management, 45(1), 35–41.
Article MathSciNet Google Scholar
Zhuang, L., Jing, F., & Zhu, X.-Y. (2006). Movie review mining and summarization. In Proceedings of the 15th ACM international conference on information and knowledge management (CIKM ’06) (pp. 43–50). New York, NY, USA: ACM.
Google Scholar

Download references

Acknowledgements

We would like to thank all the anonymous reviewers for their useful comments and suggestions. This research work has been funded by the Spanish Government through the project TEXT-MESS 2.0 (TIN2009-13391-C04) and by the Valencian Government through projects PROMETEO (PROMETEO/2009/199) and ACOMP/2011/001.

Author information

Authors and Affiliations

Department of Software and Computing Systems, University of Alicante, Apdo. de correos, 99, 03080, Alicante, Spain
Elena Lloret, Alexandra Balahur, José M. Gómez, Andrés Montoyo & Manuel Palomar

Authors

Elena Lloret
View author publications
You can also search for this author in PubMed Google Scholar
Alexandra Balahur
View author publications
You can also search for this author in PubMed Google Scholar
José M. Gómez
View author publications
You can also search for this author in PubMed Google Scholar
Andrés Montoyo
View author publications
You can also search for this author in PubMed Google Scholar
Manuel Palomar
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Elena Lloret.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Lloret, E., Balahur, A., Gómez, J.M. et al. Towards a unified framework for opinion retrieval, mining and summarization. J Intell Inf Syst 39, 711–747 (2012). https://doi.org/10.1007/s10844-012-0209-4

Download citation

Received: 07 June 2011
Revised: 16 May 2012
Accepted: 16 May 2012
Published: 31 May 2012
Issue Date: December 2012
DOI: https://doi.org/10.1007/s10844-012-0209-4

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

An Opinion Summarization-Evaluation System Based on Pre-trained Models

An Unsupervised Technique to Generate Summaries from Opinionated Review Documents

Text Mining for News and Blogs Analysis

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Towards a unified framework for opinion retrieval, mining and summarization

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

An Opinion Summarization-Evaluation System Based on Pre-trained Models

An Unsupervised Technique to Generate Summaries from Opinionated Review Documents

Text Mining for News and Blogs Analysis

Explore related subjects

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now