Abstract
A huge amount of social media data is generated daily as a result of the interactions between subscribers of the social networking platforms. Subscribers are characterized by diversity in background, geographical locations, opinions, etc. Further, the diversity and pervasiveness of the data together with its social characteristics make it an intriguing source for gauging the public opinion. Twitter is one of the most famous and popular networking platforms. It provides a micro-blogging environment in which subscribers can post and read short messages called Tweets. In this paper, we propose a method for finding groups of people who express similar ideas about a specific subject in their postings. We also introduce an algorithm to label each group of users based on their postings. The labels help us to get a more profound insight into the main opinion of each group. The work described in this paper has high significance and a variety of practical applications. Applicability and effectiveness of the proposed approach have been demonstrated in a case study on the Tweets related to Royal Bank of Canada. The reported results prove the viability and efficiency of our approach. Besides, the proposed approach is generic enough to be adapted and applied to any type of existing social media platforms to extract main ideas related to a specific subject.
Similar content being viewed by others
Notes
References
Adnan M, Alhajj R, Rokne J (2010) Identifying social communities by frequent pattern mining. In: Proceedings of the international conference on information visualisation
Agichtein E, Castillo C, Donato D, Gionis A, Mishne G (2008) Finding high-quality content in social media. In: Proceedings of the international conference on Web search and web data mining, pp 183–194
Asur S, Huberman BA (2010) Predicting the future with social media. CoRR, abs/1003.5699
Bansal N, Chiang F, Koudas N, Tompa FW (2007) Seeking stable clusters in the blogosphere. In: Proceedings of the international conference on Very large data bases, pp 806–817
Bian J, Liu Y, Agichtein E, Zha H (2008) Finding the right facts in the crowd: factoid question answering over social media. In: Proceedings of the international conference on World Wide Web, pp 467–476
Blondel VD, Guillaume J-L, Lambiotte R, Lefebvre E (2008) Fast unfolding of communities in large networks. J Stat Mech Theory Exp 2008:P10008. doi:10.1088/1742-5468/2008/10/P10008
Derényi I, Palla G, Vicsek T (2005) Clique percolation in random networks. Phys Rev Lett 94:160202
Fortunato S (2010) Community detection in graphs. Phys Rep 486(3-5):75–174
Gilbert E, Karahalios K (2009) Predicting tie strength with social media. In: Proceedings of the international conference on Human factors in computing systems, pp 211–220
Girvan M, Newman MEJ (2002) Community structure in social and biological networks. Proc Natl Acad Sci 99(12):7821–7826
Hansen DL, Rotman D, Bonsignore E, Milic’-frayling N, Rodrigues EM, Smith M, Shneiderman B, Capone T (2009) Do you know the way to sna ?: a process model for analyzing and visualizing social media data. Group, pp 1–10
Huberman BA, Romero DM, Wu F (2008) Social networks that matter: Twitter under the microscope. CoRR, abs/0812.1045
Jansen BJ, Zhang M, Sobel K, Chowdury A (2009) Twitter power: Tweets as electronic word of mouth. J Am Soc Inf Sci Technol 60(11):2169–2188
Java A, Song X, Finin T, Tseng B (2007) Why we twitter: understanding microblogging usage and communities. In: Proceedings of the 9th WebKDD and 1st SNA-KDD 2007 workshop on Web mining and social network analysis, pp 56–65
Kaplan AM, Haenlein M (2010) Users of the world, unite! the challenges and opportunities of social media. Bus Horizons 53(1):59–68
Kianmehr K, Alhajj R (2009) Calling communities analysis and identification using machine learning techniques. J Expert Systems Appl 36:6218–6226
Kumar S, Barbier G, Abbasi MA, Liu H (2011) Tweettracker: An analysis tool for humanitarian and disaster relief. In: Proceedings of the International Conference on Weblogs and Social Media
Kumar S, Morstatter F, Zafarani R, Liu H (2013) Whom should i follow? identifying relevant users during crises. In: Proceedings of ACM conference on Hypertext and social media
Li R, Lei KH, Khadiwala R, Chang K-C (2012) Tedas: a twitter-based event detection and analysis system. In: Proceedings of IEEE international conference on data engineering, pp 1273–1276
MacEachren A, Jaiswal A, Robinson A, Pezanowski S, Savelyev A, Mitra P, Zhang X, Blanford J (2011) Senseplace2: Geotwitter analytics support for situational awareness. In Proceedings of 2011 IEEE Conference on Visual Analytics Science and Technology (VAST), p 181190
Manning CD, Raghavan P, Schtze H (2008) Introduction to Information Retrieval. Cambridge University Press, Cambridge
Mathioudakis M, Koudas N (2010) Twittermonitor: Trend detection over the twitter stream. In: Proceedings of ACM SIGMOD International Conference on Management of data, pp 1155–1158
Mathioudakis M, Koudas N, Marbach P (2010) Early online identification of attention gathering items in social media. In: Proceedings of ACM international conference on Web search and data mining, pp 301–310
Mendoza M, Poblete B, Castillo C (2010) Twitter under crisis: Can we trust what we rt? In: Proceedings of the First Workshop on Social Media Analytics
Morstatter F, Kumar S, Liu H, Maciejewski R (2013) Understanding twitter data with tweetxplorer. In: Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
Naji G, Nagi M, Elsheikh AM, Gao S, Kianmehr K, Özyer T, Demetrick D, Alhajj R, Rokne J, Ridley M (2011) Effectiveness of social networks for studying biological agents and identifying cancer biomarkers. Counterterrorism and Open Source Intelligence
Newman M (2004) Detecting community structure in networks. Eur Phys J B Condens Matter Complex Systems 38:321–330
Newman MEJ, Girvan M (2004) Finding and evaluating community structure in networks. Phys Rev E 69:026113. doi:10.1103/PhysRevE.69.026113
O’Connor B, Balasubramanyan R, Routledge B, Smith N (2010) From tweets to polls: Linking text sentiment to public opinion time series
Palla G, Derenyi I, Farkas I, Vicsek T (2005) Uncovering the overlapping community structure of complex networks in nature and society. Nature 435(7043):814–818
Purohit H, Sheth A (2013) Twitris v3: From citizen sensing to analysis, coordination and action. In: Proceedings of the International Conference on Weblogs and Social Media
Shafiq O, Alhajj R, Rokne J (2009) Community aware personalized web search. In: Proceedings of the international conference on advaces in social network analysis and mining, pp 3510–355
Smith MA, Shneiderman B, Milic-Frayling N, Mendes Rodrigues E, Barash V, Dunne C, Capone T, Perer A, Gleave E (2009) Analyzing (social media) networks with nodexl. In: Proceedings of the international conference on Communities and technologies, pp 255–264
Tumasjan A, Sprenger T, Sandner P, Welpe I (2010) Predicting elections with twitter: What 140 characters reveal about political sentiment. In: Proceedings of AAAI international conference on weblogs and social media, pp 178–185
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Rahmani, A., Chen, A., Sarhan, A. et al. Social media analysis and summarization for opinion mining: a business case study. Soc. Netw. Anal. Min. 4, 171 (2014). https://doi.org/10.1007/s13278-014-0171-y
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s13278-014-0171-y