Abstract
This paper presents an innovative approach that explores the role of lexicalization for Arabic sentiment analysis. Sentiment Analysis in Arabic is hindered due to lack of resources, language in use with sentiment lexicons, pre-processing of dataset as a must and major concern is repeatedly following same approaches. One of the key solution found to resolve these problems include applying the extension of lexicon to include more words not restricted to Modern Standard Arabic. Secondly, avoiding pre-processing of dataset. Third, and the most important one, is investigating the development of an Arabic Sentiment Analysis system using a novel rule-based approach. This approach uses heuristics rules in a manner that accurately classifies tweets as positive or negative. The manner in which a series of abstraction occurs resulting in an end-to-end mechanism with rule-based chaining approach. For each lexicon, this chain specifically follows a chaining of rules, with appropriate positioning and prioritization of rules. Expensive rules in terms of time and effort thus resulted in outstanding results. The results with end-to-end rule chaining approach achieved 93.9 % accuracy when tested on baseline dataset and 85.6 % accuracy on OCA, the second dataset. A further comparison with the baseline showed huge increase in accuracy by 23.85 %.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Aldayel, H.K., Azmi, A.M.: Arabic tweets sentiment analysis–a hybrid scheme. J. Inf. Sci. (2015). doi:10.1177/0165551515610513
Muhammad, A.-M., Kübler, S., Diab, M.: Samar: a system for subjectivity and sentiment analysis of Arabic social media. In: Proceedings of the 3rd Workshop in Computational Approaches to Subjectivity and Sentiment Analysis, pp. 19–28. Association for Computational Linguistics (2012)
Abdulla, N.A., Ahmed, N.A., Shehab, M.A., Al-Ayyoub, M., Al-Kabi, M.N., Al-rifai, S.: Towards improving the lexicon-based approach for Arabic sentiment analysis. Int. J. Inf. Technol. Web Eng. (IJITWE) 9(3), 55–71 (2014)
Hammo, B., Abu-Salem, H., Lytinen, S., Evens, M.: QARAB: a question answering system to support the Arabic language. In: The Proceedings of Workshop on Computational Approaches to Semitic Languages, ACL 2002, Philadelphia, PA, pp. 55–65, July 2002
Beesley, K.R.: Arabic finite-state morphological analysis and generation. In: Proceedings of the 16th Conference on Computational Linguistics, vol. 1, pp. 89–94. Association for Computational Linguistics, August 1996
Darwish, K.: Building a shallow Arabic morphological analyzer in one day. In: Proceedings of the ACL-2002 Workshop on Computational Approaches to Semitic languages, pp. 1–8. Association for Computational Linguistics, July 2002
Alaa, E.-H.: Arabic opinion mining using combined classification approach (2011)
Farghaly, A., Shaalan, K.: Arabic natural language processing: challenges and solutions. ACM Trans. Asian Lang. Inf. Process. (TALIP) 8(4), 1–22 (2009). The Association for Computing Machinery (ACM)
Feldman, R.: Techniques and applications for sentiment analysis. Commun. ACM 56(4), 82–89 (2013)
Korayem, M., Crandall, D., Abdul-Mageed, M.: Subjectivity and sentiment analysis of arabic: a survey. In: Hassanien, A.E., Salem, A.-B.M., Ramadan, R., Kim, T.-h. (eds.) AMLTA 2012. CCIS, vol. 322, pp. 128–139. Springer, Heidelberg (2012)
Syiam, M., Fayed, Z., Habib, M.: An intelligent system for Arabic text categorization. Int. J. Intell. Comput. Inf. 6(1) (2006)
Rushdi-Saleh, M., Martín-Valdivia, M.T., Ureña-López, L.A., Perea-Ortega, J.M.: OCA: opinion corpus for Arabic. J. Am. Soc. Inf. Sci. Technol. 62(10), 2045–2054 (2011)
Siddiqui, S., Monem, A.A., Shaalan, K.: Sentiment analysis in Arabic. In: Métais, E., Meziane, F., Saraee, M., Sugumaran, V., Vadera, S. (eds.) Proceedings of the 21st International Conference on the Application of Natural Language to Information Systems (NLDB 2016). LNCS. Springer, Heidelberg (2016)
Khaled, S.: A survey of Arabic named entity recognition and classification. Comput. Linguist. 40(2), 469–510 (2014)
Shoukry, A., Rafea, A.: Sentence-level Arabic sentiment analysis. In: 2012 International Conference on Collaboration Technologies and Systems (CTS), pp. 546–550. IEEE (2012)
Shoukry, A., Rafea, A.: Preprocessing Egyptian dialect tweets for sentiment mining. In: The 4th Workshop on Computational Approaches to Arabic Script-Based Languages, pp. 47–56 (2012)
Taboada, M., Brooke, J., Tofiloski, M., Voll, K., Stede, M.: Lexicon-based methods for sentiment analysis. Comput. Linguist. 37(2), 9–27 (2011)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Siddiqui, S., Monem, A.A., Shaalan, K. (2017). Towards Improving Sentiment Analysis in Arabic. In: Hassanien, A., Shaalan, K., Gaber, T., Azar, A., Tolba, M. (eds) Proceedings of the International Conference on Advanced Intelligent Systems and Informatics 2016. AISI 2016. Advances in Intelligent Systems and Computing, vol 533. Springer, Cham. https://doi.org/10.1007/978-3-319-48308-5_12
Download citation
DOI: https://doi.org/10.1007/978-3-319-48308-5_12
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-48307-8
Online ISBN: 978-3-319-48308-5
eBook Packages: EngineeringEngineering (R0)