Abstract
As an important media for social interactions and information dissemination through the internet, Sina microblog contains emotional state and important opinion of participants. Dealing with microblog data belongs to big data areas, the premise of which is to obtain a large amount of microblog data for further analysis and data mining. For commercial interests as well as security considerations, the access to the data is becoming increasingly difficult and the API Sina microblog officially provided doesn’t support large amount of data mining. In this paper, we try to design a platform that is mainly based on the access mechanism of multi-strategy and existing resources to collect data stably from Sina microblog. The results demonstrate that a combination of API and web crawler allows efficient data mining. In such way, sentiment analysis and opinion mining are performed on the data obtained by the multi-strategy method, which proved that the proposed solutions will be allowed to build straightforward application of hot words searching, opinion mining and sentiment analysis.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Lian, J., Zhou, X., Cao, W., Liu, Y.: SINA microb log d a ta ret ri ev al. Journal of Tsinghua Un iversity( Sci & Tech) 20, 11(5),1(10),1300–1305
Zhang, Y.X.: Sina microblog dissemina-tion mechanism research. Southwest University (2011)
Xie, A.P.: Look from the network market-ing tools using vancl development. Discovering Value (4), 21 (2011)
Zhang, L.L.: Sina microblog public opinion analysis and research. East China Normal University (2011)
The 21st century economic report, to find the new world in 2013 on sina microblog (2013 ), http://jinhua.house.sina.com.cn/news/2013-02-23/10292163302.shtml,2013-02-23
Sina technology, Sina microblog 2012 annu-al inventory (2012) , http://tech.sina.com.cn/i/2012-12-19/13447902817.shtml,2012-12-19
Liu, Z., Chen, X., Ssun, M.: Mining the in-terests of Chinese microbloggers via keyword ex-traction. Frontiers of Computer Science 6(1), 76–87 (2012)
Cano, A.E., Mazumdar, S., Ciravegna, F.: Social influence analysis in microblogging platforms–A topic-sensitive based approach. Semantic Web.
Chen, S., Zhang, H., Lin, M., et al.: Com-parision of microblogging service between Sina Weibo and Twitter. In: 2011 International Conference on Computer Science and Network Technology (ICCSNT), vol. 4, pp. 2259–2263. IEEE (2011)
Wang, X.G.: Empirical analysis on behavior characteristics and relation characteristics of microblog users – take ”sina microblog ” for ex-ample. Library and Information Service 54(14), 66–70 (2010) (in chinese)
Fang, W.W., Li, J.Y., Liu, Y.: Re-search on twitter data collection. Journal of Shandong University(Natural Science) 47(5) (2012)
Microblog login with program (Python and RSA encode algorithm) (May 5, 2013), http://www.2cto.com/kf/201303/192970.html (in chinese )
Microblog open plate-form, http://open.weibo.com/,2013-3-15
Zhou, S.C., Zhai, W.T., Shi, Y.: Overview on sentiment analysis of chinese microblog. Computer Application and Software 30(3), 161–164 (2013)
Xie, L.X., Zhou, M., Sun, M.S.: Hierarchical structure based hybrid approach to sentiment analysis of chinese microblog and its feature extraction. Journal of Chinese Information Processing 26(1), 73–83 (2012)
Sina news report about Shanghai dead pig event (April 2013 ), http://roll.news.sina.com.cn/s_sizhu_all/index.shtml
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Sun, X., Ye, Jq., Ren, Fj. (2014). Multi-strategy Based Sina Microblog Data Acquisition for Opinion Mining. In: Huang, DS., Jo, KH., Wang, L. (eds) Intelligent Computing Methodologies. ICIC 2014. Lecture Notes in Computer Science(), vol 8589. Springer, Cham. https://doi.org/10.1007/978-3-319-09339-0_56
Download citation
DOI: https://doi.org/10.1007/978-3-319-09339-0_56
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-09338-3
Online ISBN: 978-3-319-09339-0
eBook Packages: Computer ScienceComputer Science (R0)