Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

I-S2FND: a novel interpretable self-ensembled semi-supervised model based on transformers for fake news detection

Published: 19 October 2023 Publication History

Abstract

One of the serious consequences of social media usage is fake information dissemination that locomotes society towards negativity. Existing solutions focus on supervised fake news detection models, which requires extensive labelled data. In this paper, we deal with two different problems of fake news detection such as (1) Detecting fake news with limited annotated data and (2) Interpretability of the proposed model on fake news detection. We address these issues by designing an Interpretable Self Ensembled Semi-Supervised Fake News Detection Model (I-S2FND). In I-S2FND, the model learns the enhanced representations of labelled and unlabelled fake news by incorporating an adaptive pseudo-labelling mechanism on unlabelled data. Moreover, interpretation of the model on text using the gradients improves the identification of essential words in the content of fake news. Based on the experimental findings, it is evident that the proposed model outperforms existing state-of-the-art models by approximately 5% in terms of accuracy when trained with only a limited amount of labeled data across different datasets.

References

[1]
Allcott H and Gentzkow M Social media and fake news in the 2016 election Journal of Economic Perspectives 2017 31 211-236
[2]
Alrubaian M, Al-Qurishi M, Hassan MM, et al. A credibility analysis system for assessing information on twitter IEEE Transactions on Dependable and Secure Computing 2018 15 4 661-674
[3]
Bansal, R., Paka, W.S., Sengupta, S., et al. (2021) Combining exogenous and endogenous signals with a semi-supervised co-attention network for early detection of covid-19 fake tweets. Pacific-Asia conference on knowledge discovery and data mining pp 188–200
[4]
Chen, J., Yang, Z., & Yang, D. (2020). MixText: Linguistically-informed interpolation of hidden space for semi-supervised text classification. Proceedings of the 58th annual meeting of the association for computational linguistics.
[5]
Choraś, M., Demestichas., K., Giełczyk, A., et al. (2021). Advanced machine learning techniques for fake news (online disinformation) detection: A systematic mapping study. Applied Soft Computing,101,.
[6]
Clark, K., Khandelwal, U., Levy, O., et al. (2019). What does BERT look at? an analysis of BERT’s attention. In: Proceedings of the 2019 ACL Workshop BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP. Association for Computational Linguistics, Florence, Italy, pp 276–286. https://aclanthology.org/W19-4828
[7]
Croce, D., Castellucci, G., & Basili, R. (2020). GAN-BERT: Generative adversarial learning for robust text classification with a bunch of labeled examples. Proceedings of the 58th annual meeting of the association for computational linguistics.
[8]
De Souza, M., Nogueira, B., & Rossi, R. (2021). A network-based positive and unlabeled learning approach for fake news detection. Machine Learning.
[9]
Devlin, J., Chang, M.W., Lee, K., et al. (2019). Bert: Pre-training of deep bidirectional transformers for language understanding. Proceedings of the conference of the north american chapter of the association for computational linguistics: human language technologies 1.
[10]
Dong, X., & Qian, L. (2022). Semi-supervised bidirectional rnn for misinformation detection. Machine Learning with Applications,10(100), 428. https://www.sciencedirect.com/science/article/pii/S2666827022001037
[11]
Dong X, Victor U, and Qian L Two-path deep semisupervised learning for timely fake news detection IEEE Transactions on Computational Social Systems 2020 7 6 1386-1398
[12]
Engelen, V., Hoos, J. E., et al. (2020). A survey on semi-supervised learning. Mach Learn,109,.
[15]
Gadek, G., & Guélorget, P. (2020). An interpretable model to measure fakeness and emotion in news. Procedia Computer Science, 176, 78–87., knowledge-Based and Intelligent Information & Engineering Systems: Proceedings of the 24th International Conference KES2020
[16]
Galli, A., Masciari, E., Moscato, V., et al. (2022). A comprehensive benchmark for fake news detection. Journal of Intelligent Information Systems,59,.
[17]
[18]
Guacho, G.B., Abdali, S., Shah, N., et al. (2018). Semi-supervised content-based detection of misinformation via tensor embeddings. 2018 IEEE/ACM international conference on advances in social networks analysis and mining (ASONAM) pp 322–325.
[19]
Jin, Z., Cao, J., Zhang, Y., et al. (2016). News verification by exploiting conflicting social viewpoints in microblogs. Proceedings of the thirtieth AAAI conference on artificial intelligence p 2972–2978
[20]
Karisani, P., Karisani, N. (2021). Semi-Supervised Text Classification via Self-Pretraining, Association for Computing Machinery
[21]
Li, X., Lu, P., Hu, L., et al. (2021). A novel self-learning semi-supervised deep learning network to detect fake news on social media. Multimedia Tools and Applications.
[22]
Li, Y., & Ye, J. (2018). Learning adversarial networks for semi-supervised text classification via policy gradient. Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining, p 1715–1723.
[23]
Liu, C. L., Hsaio, W. H., Lee, C. H., et al. (2016). Semi-supervised text classification with universum learning. IEEE Transactions on Cybernetics,46(2), 462–473.
[24]
Liu, Y., & Wu, Y.F.B. (2018). Early detection of fake news on social media through propagation path classification with recurrent and convolutional networks. Proceedings of the Thirty-Second AAAI conference on artificial intelligence
[25]
Lundberg, S.M., & Lee, S.I. (2017). A unified approach to interpreting model predictions. Advances in neural information processing systems 30
[26]
Meel, P., & Vishwakarma, D.K. (2021a). Fake news detection using semi-supervised graph convolutional network.
[27]
Meel, P., & Vishwakarma, D. K. (2021). A temporal ensembling based semi-supervised convnet for the detection of fake news articles. Expert Systems with Applications,177,.
[28]
Mohseni, S., & Ragan, E. (2018). Combating fake news with interpretable news feed algorithms.
[29]
Mohseni S, Ragan E, & Hu X (2019). Open issues in combating fake news: Interpretability as an opportunity.
[30]
Paka, W. S., Bansal, R., Kaushik, A., et al. (2021). Cross-sean: A cross-stitch semi-supervised neural attention model for covid-19 fake news detection. Applied Soft Computing,107,.
[31]
[32]
Qiao, Y., Wiechmann, D., & Kerz, E. (2020). A language-based approach to fake news detection through interpretable features and BRNN. Proceedings of the 3rd international workshop on rumours and deception in social media (RDSM)
[33]
Ramnath, S., Nema, P., Sahni, D., et al. (2020). Towards interpreting BERT for reading comprehension based QA. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). Association for Computational Linguistics, Online, pp 3236–3242. 10.18653/v1/2020.emnlp-main.261. https://aclanthology.org/2020.emnlp-main.261
[34]
Reis JCS, Correia A, Murai F, et al. Supervised learning for fake news detection IEEE Intelligent Systems 2019 34 2 76-81
[35]
Ribeiro, M.T., Singh, S., & Guestrin, C. (2016). " why should i trust you?" explaining the predictions of any classifier. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pp 1135–1144
[36]
Sachan, D. S., Zaheer, M., & Salakhutdinov, R. (2019). Revisiting lstm networks for semi-supervised text classification via mixed objective function. Proceedings of the AAAI conference on artificial intelligence,33,.
[37]
Sharma, K., Qian, F., Jiang, H., et al. (2019). Combating fake news: A survey on identification and mitigation techniques 10.
[38]
Shu, K., Sliva, A., Wang, S., et al. (2017). Fake news detection on social media: A data mining perspective. SIGKDD Explor Newsl,19,.
[39]
Shu, K., Wang, S., Liu, H. (2018). Understanding user profiles on social media for fake news detection. 2018 IEEE conference on multimedia information processing and retrieval (MIPR).
[40]
Shu, K., Cui, L., Wang, S., et al. (2019a). Defend: Explainable fake news detection. Proceedings of the 25th ACM SIGKDD International conference on knowledge discovery & data mining p 395–405.
[41]
Shu, K., Zhou, X., Wang, S., et al. (2019b). The role of user profiles for fake news detection. Proceedings of the 2019 IEEE/ACM international conference on advances in social networks analysis and mining.
[42]
Shu, K., Mahudeswaran, D., Wang, S., et al. (2020). Fakenewsnet: A data repository with news content, social context, and spatiotemporal information for studying fake news on social media. Big Data,8, 171–188.
[43]
Varshini, U. S. S., Sree, R. P., Srinivas, M., et al. (2023). Rdgt-gan: Robust distribution generalization of transformers for covid-19 fake news detection. IEEE Transactions on Computational Social Systems, 1–15.
[44]
Vosoughi S, Roy D, and Aral S The spread of true and false news online Science 2018 359 6380 1146-1151
[45]
Wynne, H.E., Wint, Z.Z. (2019). Content based fake news detection using n-gram models. Proceedings of the 21st international conference on information integration and web-based applications & services.
[46]
Yang, X., Song, Z., King, I., et al. (2021). A survey on deep semi-supervised learning.
[47]
Zhang, D., Xu, J., Zadorozhny, V., et al. (2022). Fake news detection based on statement conflict. Journal of Intelligent Information Systems, 59.
[48]
Zhou, X., & Zafarani, R. (2019). Network-based fake news detection: A pattern-driven approach. SIGKDD Explor Newsl,21, 48–60.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Journal of Intelligent Information Systems
Journal of Intelligent Information Systems  Volume 62, Issue 2
Apr 2024
300 pages

Publisher

Kluwer Academic Publishers

United States

Publication History

Published: 19 October 2023
Accepted: 02 October 2023
Revision received: 29 September 2023
Received: 10 May 2023

Author Tags

  1. Fake news
  2. Social media
  3. Semi-supervised
  4. Text classification
  5. Transformers
  6. Deep learning

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 0
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 26 Sep 2024

Other Metrics

Citations

View Options

View options

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media