Abstract
The rise of social media enables people to disseminate information. However, when false but appealing information is presented as news, it becomes a cause for serious concern as it might lead to a harmful influence on communities of innocent believers. To address this issue, we propose I-FLASH, an interpretable fake news detector that not only detects fake news but also explains why it considers some content fake or genuine. Moreover, recent research evaluated their models for fake news detection on domain-specific datasets. Therefore, in this paper, two new tiny datasets, FactCheck and FactCheck2, were culled from the official Twitter accounts/websites of various well-known media outlets, covering a variety of other societal domains such as education, crime, and technology. We also compared the performance of the machine learning model (logistic regression with TF-IDF), deep learning model (bidirectional LSTM with GloVe word embeddings), and the pre-trained Bidirectional Encoder Representations from Transformers (BERT) model on curated datasets along with two other popular datasets, namely, LIAR and COVID-19. The stratified 10-fold cross-validation accuracy of 94.41 ± 0.38% on the COVID-19 dataset, 61.18 ± 0.55% on the LIAR dataset, 87.25 ± 2.45% on FactCheck, and 92.91 ± 2.07% on FactCheck2, attained at 95% confidence interval, establishes the efficacy of models. On cross-dataset validation, we observe that the model trained on a generalized dataset like FactCheck2 can perform well on domain-specific datasets like COVID-19 and LIAR with a validation accuracy of 64.25% and 54.22%, respectively. Finally, using XAI methods—LIME and SHAP revealed important terms while predicting the news class (fake/real).
Similar content being viewed by others
Data Availability
All data generated or analyzed during this study are included in this article.
References
“Fake news in India.” https://en.wikipedia.org/wiki/Fake_news_in_India. Accessed from 10 Mar 2022.
Albahar, M. (2021). A hybrid model for fake news detection: Leveraging news content and user comments in fake news. IET Information Security, 15(2), 169–177.
Zhou, X., & Zafarani, R. (2020). A survey of fake news: Fundamental theories, detection methods, and opportunities. ACM Computing Surveys (CSUR), 53(5), 1–40.
Zhang, X., & Ghorbani, A. A. (2020). An overview of online fake news: Characterization, detection, and discussion. Information Processing & Management, 57(2), 102025.
Apuke, O. D., & Omar, B. (2021). Fake news and COVID-19: Modelling the predictors of fake news sharing among social media users. Telematics and Informatics, 56, 101475.
“COVID-19 and WhatsApp Fuel Surge of Fake News in India.” https://thewire.in/tech/covid-19-and-whatsapp-fuel-surge-of-fake-news-in-india. Accessed from 10 Mar 2022.
Pathwar, P., & Gill, S. (2022). Tackling COVID-19 infodemic using deep learning. In Proceedings of international conference on computational intelligence and data engineering (pp. 319–335). Springer.
Ayoub, J., Yang, X. J., & Zhou, F. (2021). Combat COVID-19 infodemic using explainable natural language processing models. Information Processing & Management, 58(4), 102569.
Bovet, A., & Makse, H. A. (2019). Influence of fake news in Twitter during the 2016 US presidential election. Nature Communications, 10(1), 1–14.
Zhou, X., Jain, A., Phoha, V. V., & Zafarani, R. (2020). Fake news early detection: A theory-driven model. Digital Threats: Research and Practice, 1(2), 1–25.
Saleh, H., Alharbi, A., & Alsamhi, S. H. (2021). OPCNN-FAKE: Optimized convolutional neural network for fake news detection. IEEE Access, 9, 129471–129489.
Shahid, W., Li, Y., Staples, D., Amin, G., Hakak, S., & Ghorbani, A. (2022). Are you a cyborg, bot or human? A survey on detecting fake news spreaders. IEEE Access, 10, 27069–27083.
Shao, C., Ciampaglia, G. L., Varol, O., Yang, K.-C., Flammini, A., & Menczer, F. (2018). The spread of low-credibility content by social bots. Nature Communications, 9(1), 1–9.
Kaliyar, R. K., Goswami, A., & Narang, P. (2021). FakeBERT: Fake news detection in social media with a BERT-based deep learning approach. Multimedia Tools and Applications, 80(8), 11765–11788.
Ribeiro, M. T., Singh, S., & Guestrin, C. (2016).“" Why should i trust you?" Explaining the predictions of any classifier”. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining (pp. 1135–1144).
Zhang, C., Gupta, A., Kauten, C., Deokar, A. V., & Qin, X. (2019). Detecting fake news for reducing misinformation risks using analytics approaches. European Journal of Operational Research, 279(3), 1036–1052. https://doi.org/10.1016/j.ejor.2019.06.022
Conroy, N. K., Rubin, V. L., & Chen, Y. (2015). Automatic deception detection: Methods for finding fake news. In Proceedings of the association for information science and technology (vol. 52, pp. 1–4).
Shahzad, K., Khan, S. A., Ahmad, S., & Iqbal, A. (2022). A Scoping review of the relationship of big data analytics with context-based fake news detection on digital media in data age. Sustainability, 14(21), 14365. https://doi.org/10.3390/su142114365
Davoudi, M., Moosavi, M. R., & Sadreddini, M. H. (2022). DSS: A hybrid deep model for fake news detection using propagation tree and stance network. Expert Systems with Applications, 198, 116635.
Zhou, X., & Zafarani, R. (2019). Network-based fake news detection: A pattern-driven approach. ACM SIGKDD Explorations Newsletter, 21(2), 48–60.
Raza, S., & Ding, C. (2022). Fake news detection based on news content and social contexts: A transformer-based approach. International Journal of Data Science and Analytics, 13(4), 335–362.
Szczepański, M., Pawlicki, M., Kozik, R., & Choraś, M. (2021). New explainability method for BERT-based model in fake news detection. Scientific Reports, 11(1), 1–13.
Bhutani, B., Rastogi, N., Sehgal, P., & Purwar, A. (2019). “Fake news detection using sentiment analysis,” In 2019 twelfth international conference on contemporary computing (IC3) (pp. 1–5). IEEE.
Aldwairi, M., & Alwahedi, A. (2018). Detecting fake news in social media networks. Procedia Computer Science, 141, 215–222.
Trueman, T. E., Kumar, A., Narayanasamy, P., & Vidya, J. (2021). Attention-based C-BiLSTM for fake news detection. Applied Soft Computing, 110, 107600.
Nasir, J. A., Khan, O. S., & Varlamis, I. (2021). Fake news detection: A hybrid CNN-RNN based deep learning approach. International Journal of Information Management Data Insights, 1(1), 100007.
Sastrawan, I. K., Bayupati, I., & Arsa, D. M. S. (2021). “Detection of fake news using deep learning CNN–RNN based methods.” ICT Express.
Khan, J. Y., Khondaker, M. T. I., Afroz, S., Uddin, G., & Iqbal, A. (2021). A benchmark study of machine learning models for online fake news detection. Machine Learning with Applications, 4, 100032.
Flores, L. J. Y., & Hao, Y. (2022). “An Adversarial Benchmark for Fake News Detection Models,” arXiv preprintarXiv:2201.00912
Xu, F., Sheng, V. S., & Wang, M. (2020). Near real-time topic-driven rumor detection in source microblogs. Knowledge-Based Systems, 207, 106391.
Jwa, H., Oh, D., Park, K., Kang, J., & Lim, H. (2019). exBAKE: Automatic fake news detection model based on bidirectional encoder representations from transformers (BERT). Applied Sciences, 9(19), 4062. https://doi.org/10.3390/app9194062
Reis, J. C. S., Correia, A., Murai, F., Veloso, A., & Benevenuto, F. (2019). Supervised learning for fake news detection. IEEE Intelligent Systems, 34(2), 76–81. https://doi.org/10.1109/MIS.2019.2899143
“Coronavirus: The human cost of fake news in India.” https://www.bbc.com/news/world-asia-india-53165436. Accessed from 10 Mar 2022.
“Fake news, rumours on social media hit Indian firms.” https://www.livemint.com/Companies/Cqbmv2eOniYHzEqLYkxFyO/Fake-news-rumours-on-social-media-hit-Indian-firms.html. Accessed from 10 Mar 2022.
Wang, W. Y. (2017). “Liar, liar pants on fire": A new benchmark dataset for fake news detection,” arXiv preprintarXiv:1705.00648
Patwa, P., Sharma, S., Pykl, S., Guptha, V., Kumari, G., Akhtar, M. S., Ekbal, A., Das, A., & Chakraborty, T. (2021). Fighting an Infodemic: COVID-19 fake news dataset. In International Workshop on Combating Online Hostile Posts in Regional Languages during Emergency Situation, (pp. 21–29). Springer.
“tf–idf.” https://en.wikipedia.org/wiki/Tf%E2%80%93idf. Accessed from 12 Mar 2022.
Setiawan, R., Ponnam, V. S., Sengan, S., Anam, M., Subbiah, C., Phasinam, K., Vairaven, M., & Ponnusamy, S. (2022). Certain investigation of fake news detection from facebook and twitter using artificial intelligence approach. Wireless Personal Communications, 127(2), 1737–1762. https://doi.org/10.1007/s11277-021-08720-9
Pennington, J., Socher, R., Manning, C. D. (2014). “Glove: Global vectors for word representation”. In Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP) (pp. 1532–1543).
“GloVe.” https://en.wikipedia.org/wiki/GloVe. Accessed from 12 Mar 2022.
Graves, A., Schmidhuber, J. (2005). Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural Networks 18(56), 602–610.
Schuster, M., & Paliwal, K. K. (1997). Bidirectional recurrent neural networks. IEEE transactions on Signal Processing, 45(11), 2673–2681.
“BERT Explained: State of the art language model for NLP.” https://towardsdatascience.com/bert-explained-state-of-the-art-language-model-for-nlp-f8b21a9b6270. Accessed from 12 Mar 2022.
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, Ł., Polosukhin, I. (2017). “Attention is all you need”. Advances in Neural Information Processing Systems, 30.
Devlin, J., Chang, M.-W., Lee, K., Toutanova, K. (2018). “Bert: Pre-training of deep bidirectional transformers for language understanding,” arXiv preprintarXiv:1810.04805
“Bert-base-uncased.” https://huggingface.co/bert-base-uncased. Accessed from 12 Mar 2022.
Miller, T. (2019). Explanation in artificial intelligence: Insights from the social sciences. Artificial Intelligence, 267, 1–38. https://doi.org/10.1016/j.artint.2018.07.007
Kim, B., Khanna, R., Koyejo, O. O. (2016). “Examples are not enough, learn to criticize! criticism for interpretability”. Advances in Neural Information Processing Systems, 29.
Selvaraju, R. R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., & Batra, D. (2017). “Grad-cam: Visual explanations from deep networks via gradient-based localization”. In Proceedings of the IEEE international conference on computer vision (pp. 618–626).
Molnar, C. (2022). Interpretable machine learning (2 ed.) Lulu.com
Doran, D., Schulz, S., & Besold, T. R. (2017). “What does explainable AI really mean? A new conceptualization of perspectives”. arXiv preprint arXiv:1710.00794
Samek, W., Montavon, G., Lapuschkin, S., Anders, C. J., & Muller, K-R. (2021). Explaining deep neural networks and beyond: A review of methods and applications. Proceedings of the IEEE, 109(3), 247–278. https://doi.org/10.1109/JPROC.2021.3060483
Danilevsky, M., Qian, K., Aharonov, R., Katsis, Y., Kawas, B., & Sen, P. (2020). “A survey of the state of explainable AI for natural language processing”. arXiv preprint arXiv:2010.00711
Alhindi, T., Petridis, S., & Muresan, S. (2018). “Where is your evidence: Improving fact-checking by justification modeling”. In Proceedings of the first workshop on fact extraction and verification (FEVER) (pp. 85–90).
Funding
This research did not receive any grant from any of the funding agencies. All data generated or analyzed during this study are included in this article.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no conflicts of interest. This research did not receive any grant from any of the funding agencies.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Dua, V., Rajpal, A., Rajpal, S. et al. I-FLASH: Interpretable Fake News Detector Using LIME and SHAP. Wireless Pers Commun 131, 2841–2874 (2023). https://doi.org/10.1007/s11277-023-10582-2
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11277-023-10582-2