research-article

I-S$^{2}$FND: a novel interpretable self-ensembled semi-supervised model based on transformers for fake news detection

Authors:

Shivani Sri Varshini U,

Praneetha Sree R,

Subramanyam R.B.V.

Journal of Intelligent Information Systems, Volume 62, Issue 2

Pages 355 - 375

https://doi.org/10.1007/s10844-023-00821-0

Published: 19 October 2023

Abstract

One serious consequence of social media usage is the dissemination of fake information, which pushes society towards negativity. Existing solutions focus on supervised fake news detection models, which require extensive labelled data. In this paper, we address two problems in fake news detection: (1) detecting fake news with limited annotated data and (2) the interpretability of the proposed model. We address these issues by designing an Interpretable Self-Ensembled Semi-Supervised Fake News Detection Model (I-S$^{2}$FND). In I-S$^{2}$FND, the model learns enhanced representations of labelled and unlabelled fake news by incorporating an adaptive pseudo-labelling mechanism on unlabelled data. Moreover, gradient-based interpretation of the model on text improves the identification of essential words in the content of fake news. Experimental findings show that the proposed model outperforms existing state-of-the-art models by approximately 5% in accuracy when trained with only a limited amount of labelled data across different datasets.
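The abstract does not spell out how the adaptive pseudo-labelling or the self-ensembling works, so the following is only a generic sketch of the idea and not the authors' implementation. It assumes that class probabilities from several model snapshots are averaged, and that an unlabelled example receives a pseudo-label only when the averaged confidence clears a threshold. The function names, the linear threshold schedule, and the default values 0.7 and 0.95 are all hypothetical.

```python
from typing import List, Tuple

Probs = List[List[float]]  # one row of class probabilities per example


def ensemble_probs(snapshots: List[Probs]) -> Probs:
    """Average class probabilities over several model snapshots.

    Assumption: "self-ensembling" is approximated here by a plain mean.
    """
    n_models = len(snapshots)
    n_examples = len(snapshots[0])
    n_classes = len(snapshots[0][0])
    return [
        [sum(s[i][c] for s in snapshots) / n_models for c in range(n_classes)]
        for i in range(n_examples)
    ]


def adaptive_threshold(epoch: int, total_epochs: int,
                       start: float = 0.7, end: float = 0.95) -> float:
    """Hypothetical schedule: raise the confidence threshold linearly."""
    if total_epochs <= 1:
        return end
    frac = min(max(epoch / (total_epochs - 1), 0.0), 1.0)
    return start + frac * (end - start)


def select_pseudo_labels(probs: Probs, threshold: float) -> List[Tuple[int, int]]:
    """Return (example index, predicted class) for confident predictions only."""
    selected = []
    for idx, row in enumerate(probs):
        label = max(range(len(row)), key=row.__getitem__)
        if row[label] >= threshold:
            selected.append((idx, label))
    return selected


if __name__ == "__main__":
    # Two snapshots, two unlabelled examples, two classes (illustrative values).
    snapshots = [
        [[0.9, 0.1], [0.4, 0.6]],
        [[0.7, 0.3], [0.2, 0.8]],
    ]
    averaged = ensemble_probs(snapshots)
    threshold = adaptive_threshold(epoch=0, total_epochs=5)
    print(select_pseudo_labels(averaged, threshold))
```

A threshold that changes over training is one common way to trade pseudo-label coverage against label noise; whether the paper uses a schedule, a per-class rule, or something else cannot be determined from the abstract.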

Information & Contributors

Information

Published In

Journal of Intelligent Information Systems Volume 62, Issue 2

Apr 2024

300 pages

© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2023. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Publisher

Kluwer Academic Publishers

United States

Publication History

Published: 19 October 2023

Accepted: 02 October 2023

Revision received: 29 September 2023

Received: 10 May 2023
