Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2494266.2494300acmconferencesArticle/Chapter ViewAbstractPublication PagesdocengConference Proceedingsconference-collections
short-paper

Supervised polarity classification of Spanish tweets based on linguistic knowledge

Published: 10 September 2013 Publication History

Abstract

We describe a system that classifies the polarity of Spanish tweets. We adopt a hybrid approach, which combines machine learning and linguistic knowledge acquired by means of NLP. We use part-of-speech tags, syntactic dependencies and semantic knowledge as features for a supervised classifier. Lexical particularities of the language used in Twitter are taken into account in a pre-processing step. Experimental results improve over those of pure machine learning approaches and confirm the practical utility of the proposal.

References

[1]
A. Agarwal, B. Xie, I. Vovsha, O. Rambow, and R. Passonneau. Sentiment analysis of Twitter data. In Proceedings of the Workshop on Languages in Social Media, LSM '11, pages 30--38, Stroudsburg, PA, USA, 2011. ACL.
[2]
F. Batista and R. Ribeiro. The L2F Strategy for Sentiment Analysis and Topic Classification. Procesamiento de Lenguaje Natural, 50:77--84, 2013.
[3]
J. Brooke, M. Tofiloski, and M. Taboada. Cross-Linguistic Sentiment Analysis: From English to Spanish. In Proceedings of the International Conference RANLP-2009, pages 50--54, Borovets, Bulgaria, 2009. ACL.
[4]
K. Gimpel, N. Schneider, B. O'Connor, D. Das, D. Mills, J. Eisenstein, M. Heilman, D. Yogatama, J. Flanigan, and N. Smith. Part-of-speech tagging for Twitter: annotation, features, and experiments. HLT '11 Proc. of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers, 2:42--47, 2011.
[5]
M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, and I. Witten. The WEKA data mining software: an update. SIGKDD Explor. Newsl., 11(1):10--18, Nov. 2009.
[6]
J. Nivre, J. Hall, J. Nilsson, A. Chanev, G. Eryigit, S. Kübler, S. Marinov, and E. Marsi. Maltparser: A language-independent system for data-driven dependency parsing. Natural Language Engineering, 13(2):95--135, 2007.
[7]
J. Oliva, J. I. Serrano, M. D. Del Castillo, and A. Igesias. A SMS normalization system integrating multiple grammatical resources. Natural Language Engineering, 19:121--141, 2013.
[8]
A. Pak and P. Paroubek. Twitter as a corpus for sentiment analysis and opinion mining. In Proceedings of the Seventh International conference on Language Resources and Evaluation (LREC'10), Valletta, Malta, May 2010. European Language Resources Association (ELRA).
[9]
B. Pang, L. Lee, and S. Vaithyanathan. Thumbs up sentiment classification using machine learning techniques. In Proceedings of EMNLP, pages 79--86, 2002.
[10]
J. C. Platt. Advances in kernel methods. chapter Fast training of support vector machines using sequential minimal optimization, pages 185--208. MIT Press, Cambridge, MA, USA, 1999.
[11]
X. Saralegi Urizar and I. San Vicente Roncal. Detecting Sentiments in Spanish Tweets. In TASS 2012 Working Notes, Castellón, Spain, 2012.
[12]
J. Spencer and G. Uchyigit. Sentimentor: Sentiment Analysis on Twitter Data. In The 1st International Workshop on Sentiment Discovery from Affective Data, Bristol, United Kingdom, 2012.
[13]
M. Taboada, J. Brooke, M. Tofiloski, K. Voll, and M. Stede. Lexicon-based methods for sentiment analysis. Computational Linguistics, 37(2):267--307, 2011.
[14]
M. Taulé, M. A. Martı, and M. Recasens. AnCora: Multilevel Annotated Corpora for Catalan and Spanish. In N. Calzolari, K. Choukri, B. Maegaard, J. Mariani, J. Odjik, S. Piperidis, and D. Tapias, editors, Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08), Marrakech, Morocco, 2008.
[15]
P. D. Turney. Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews. In Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, ACL '02, pages 417--424, Stroudsburg, PA, USA, 2002. ACL.
[16]
D. Vilares, M. A. Alonso, and C. Gómez-Rodrıguez. Clasificación de polaridad en textos con opiniones en espanol mediante análisis sintáctico de dependencias. Procesamiento de Lenguaje Natural, 50:13--20, 2013.
[17]
J. Villena-Román, S. Lana-Serrano, J. C. González Cristóbal, and E. Martınez-Cámara. TASS - Worshop on Sentiment Analysis at SEPLN. Procesamiento de Lenguaje Natural, 50:37--44, 2013.
[18]
L. Zhang, R. Ghosh, M. Dekhil, M. Hsu, and B. Liu. Combining lexicon-based and learning-based methods for Twitter sentiment analysis. Technical Report HPL-2011-89, HP Laboratories, Palo Alto, CA, 2011.

Cited By

View all
  • (2021)Systematic literature review of sentiment analysis in the Spanish languageData Technologies and Applications10.1108/DTA-09-2020-020055:4(461-479)Online publication date: 16-Feb-2021
  • (2020)Evolutionary Optimization of Ensemble Learning to Determine Sentiment Polarity in an Unbalanced Multiclass CorpusEntropy10.3390/e2209102022:9(1020)Online publication date: 12-Sep-2020
  • (2015)The megaphone of the people? Spanish SentiStrength for real-time analysis of political tweetsJournal of Information Science10.1177/016555151559892641:6(799-813)Online publication date: 1-Dec-2015
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
DocEng '13: Proceedings of the 2013 ACM symposium on Document engineering
September 2013
582 pages
ISBN:9781450317894
DOI:10.1145/2494266
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 10 September 2013

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. document analysis
  2. linguistic analysis
  3. machine learning
  4. opinion mining
  5. sentiment analysis
  6. twitter

Qualifiers

  • Short-paper

Conference

DocEng '13
Sponsor:
DocEng '13: ACM Symposium on Document Engineering 2013
September 10 - 13, 2013
Florence, Italy

Acceptance Rates

DocEng '13 Paper Acceptance Rate 16 of 50 submissions, 32%;
Overall Acceptance Rate 194 of 564 submissions, 34%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 24 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2021)Systematic literature review of sentiment analysis in the Spanish languageData Technologies and Applications10.1108/DTA-09-2020-020055:4(461-479)Online publication date: 16-Feb-2021
  • (2020)Evolutionary Optimization of Ensemble Learning to Determine Sentiment Polarity in an Unbalanced Multiclass CorpusEntropy10.3390/e2209102022:9(1020)Online publication date: 12-Sep-2020
  • (2015)The megaphone of the people? Spanish SentiStrength for real-time analysis of political tweetsJournal of Information Science10.1177/016555151559892641:6(799-813)Online publication date: 1-Dec-2015
  • (2015)ExpressionProceedings of the Latin American Conference on Human Computer Interaction10.1145/2824893.2824903(1-8)Online publication date: 18-Nov-2015
  • (2015)Sentiment Groups as Features of a Classification Model Using a Spanish Sentiment LexiconProceedings of the 7th Mexican Conference on Pattern Recognition - Volume 911610.1007/978-3-319-19264-2_25(258-268)Online publication date: 24-Jun-2015
  • (2015)On the usefulness of lexical and syntactic processing in polarity classification of Twitter messagesJournal of the Association for Information Science and Technology10.1002/asi.2328466:9(1799-1816)Online publication date: 1-Sep-2015
  • (2014)A linguistic approach for determining the topics of Spanish Twitter messagesJournal of Information Science10.1177/016555151456165241:2(127-145)Online publication date: 12-Dec-2014

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media