Abstract
Rumor detection is a recent and quite active topic of multidisciplinary research due to its evident impact on society, which can even result in physical harm to people. Despite its broad definition and scope, most existing research focuses on Twitter, as it makes the problem a bit more tractable from the point of view of information gathering, processing, and further retrieval of social network features. This paper presents an empirical study of novel machine learning ensembles for the rumor classification task on Twitter. As it has been observed that certain neural models perform better for specific veracity labels, we present a study on how the combination of such classifiers in different kinds of ensemble results in a new classifier that has better performance. Using benchmark data, we evaluate three groups of models (two groups of deep neural networks and a control group of classical machine learning methods). In addition, we study the performance of three ensemble strategies: Bagging, Stacking, and Simple Soft Voting. After varying several parameters of the models, such as the number of hidden units and dropout, among others experimental factors, our study shows that the LSTM, Stacked LSTM (S-LSTM), Recurrent Convolutional Neural Networks (RCNN), and Bidirectional Gated Recurrent Unit (Bi-GRU) yields the best results reaching an accuracy of 0.93 and an average precision of 0.97.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Aggarwal, A., Dixit, A.: Fake news detection: an ensemble learning approach. In: Proceedings of the International Conference on Intelligent Computing and Control Systems, ICICCS 2020, pp. 1178–1183 (2020)
Ahmad, I., Yousaf, M., Yousaf, S., Ahmad, M.: Fake news detection using machine learning ensemble methods. Complexity 2020, 8885861:1–8885861:11 (2020)
Akhter, M., Zheng, J., Afzal, F., Lin, H., Riaz, A., Mehmood, S.: Supervised ensemble learning methods towards automatically filtering Urdu fake news within social media. PeerJ Comput. Sci. 7, 1–24 (2021)
Al-Rakhami, M., Al-Amri, A.: Lies kill, facts save: detecting COVID-19 misinformation in Twitter. IEEE Access 8, 155961–155970 (2020)
Breiman, L.: Bagging predictors. Mach. Learn. 24(2), 123–140 (1996)
Castillo, C., Mendoza, M., Poblete, B.: Information credibility on Twitter. In: Proceedings of the 20th International Conference on World Wide Web, WWW 2011, Hyderabad, India, pp. 675–684. ACM (2011)
Dong, S., Qian, Z., Li, P., Zhu, X., Zhu, Q.: Rumor detection on hierarchical attention network with user and sentiment information. In: Zhu, X., Zhang, M., Hong, Yu., He, R. (eds.) NLPCC 2020. LNCS (LNAI), vol. 12431, pp. 366–377. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-60457-8_30
Hunt, K., Agarwal, P., Zhuang, J.: Monitoring misinformation on Twitter during crisis events: a machine learning approach. Risk Anal. 2020, (Early View) (2020)
Kaur, S., Kumar, P., Kumaraguru, P.: Automating fake news detection system using multi-level voting model. Soft. Comput. 24(12), 9049–9069 (2019). https://doi.org/10.1007/s00500-019-04436-y
Kim, Y., Kim, H., Kim, H., Hong, J.: Do many models make light work? Evaluating ensemble solutions for improved rumor detection. IEEE Access 8, 150709–150724 (2020)
Kotteti, C., Dong, X., Qian, L.: Ensemble deep learning on time-series representation of tweets for rumor detection in social media. Appl. Sci. (Switzerland) 10(21), 1–21 (2020)
Kumar, S., Asthana, R., Upadhyay, S., Upreti, N., Akbar, M.: Fake news detection using deep learning models: a novel approach. Trans. Emerg. Telecommun. Technol. 31(2), e3767 (2020)
Lin, H., Zhang, X., Fu, X.: A graph convolutional encoder and decoder model for rumor detection. In: 2020 IEEE 7th International Conference on Data Science and Advanced Analytics (DSAA), pp. 300–306 (2020)
Ma, J., Gao, W., Wong, K.-F.: Detect rumors in microblog posts using propagation structure via kernel learning. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 708–717, July 2017
Malhotra, B., Vishwakarma, D.: Classification of propagation path and tweets for rumor detection using graphical convolutional networks and transformer based encodings. In: Proceedings - 2020 IEEE 6th International Conference on Multimedia Big Data, BigMM 2020, pp. 183–190 (2020)
Mendoza, M.: A new term-weighting scheme for Naïve Bayes text categorization. Int. J. Web Inf. Syst. 8(1), 55–72 (2012)
Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a Meeting held Lake Tahoe, Nevada, United States, 5–8 December 2013, pp. 3111–3119 (2013)
Providel, E., Mendoza, M.: Using deep learning to detect rumors in Twitter. In: Meiselwitz, G. (ed.) HCII 2020. LNCS, vol. 12194, pp. 321–334. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-49570-1_22
Providel, E., Mendoza, M.: Misleading information in Spanish: a survey. Soc. Netw. Anal. Min. 11(1), 36 (2021)
Tian, L., Zhang, X., Wang, Y., Liu, H.: Early detection of rumours on Twitter via stance transfer learning. In: Jose, J.M., et al. (eds.) ECIR 2020. LNCS, vol. 12035, pp. 575–588. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-45439-5_38
Ting, K., Witten, I.: Stacking bagged and dagged models. In: Proceedings of the Fourteenth International Conference on Machine Learning, ICML 1997, San Francisco, CA, USA, pp. 367–375. Morgan Kaufmann Publishers Inc (1997)
Vijeev, A., Mahapatra, A., Shyamkrishna, A., Murthy, S.: A hybrid approach to rumour detection in microblogging platforms. In: 2018 International Conference on Advances in Computing, Communications and Informatics, ICACCI 2018, pp. 337–342 (2018)
Vu, D.T., Jung, J.J.: Rumor detection by propagation embedding based on graph convolutional network. Int. J. Comput. Intell. Syst. 14, 1053–1065 (2021)
Zhang, X., Ghorbani, A.: An overview of online fake news: characterization, detection, and discussion. Inf. Process. Manag. 57(2), 102025 (2019)
Zhou, Z.-H.: Ensemble Methods: Foundations and Algorithms, 1st edn. Chapman & Hall/CRC (2012)
Zubiaga, A., Aker, A., Bontcheva, K., Liakata, M., Procter, R.: Detection and resolution of rumours in social media: a survey. ACM Comput. Surv. 51(2), 1–36 (2018)
Zubiaga, A., Liakata, M., Procter, R., Hoi, G.W.S., Tolmie, P.: Analysing how people orient to and spread rumours in social media by looking at conversational threads. PLoS ONE 11(3), e0150989 (2016)
Acknowledgements
Mr. Mendoza acknowledges funding from the Millennium Institute for Foundational Research on Data. Mr. Mendoza was also funded by grants ANID BASAL FB210017 and ANID FONDECYT 1200211.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Zapata, A., Providel, E., Mendoza, M. (2022). Empirical Evaluation of Machine Learning Ensembles for Rumor Detection. In: Meiselwitz, G. (eds) Social Computing and Social Media: Design, User Experience and Impact. HCII 2022. Lecture Notes in Computer Science, vol 13315. Springer, Cham. https://doi.org/10.1007/978-3-031-05061-9_30
Download citation
DOI: https://doi.org/10.1007/978-3-031-05061-9_30
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-05060-2
Online ISBN: 978-3-031-05061-9
eBook Packages: Computer ScienceComputer Science (R0)