research-article

Multi-modal Fake News Detection on Social Media via Multi-grained Information Fusion

Authors:

Xinpeng ZhangAuthors Info & Claims

ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia Retrieval

Pages 343 - 352

https://doi.org/10.1145/3591106.3592271

Published: 12 June 2023 Publication History

Abstract

The easy sharing of multimedia content on social media has caused a rapid dissemination of fake news, which threatens society’s stability and security. Therefore, fake news detection has garnered extensive research interest in the field of social forensics. Current methods primarily concentrate on the integration of textual and visual features but fail to effectively exploit multi-modal information at both fine-grained and coarse-grained levels. Furthermore, they suffer from an ambiguity problem due to a lack of correlation between modalities or a contradiction between the decisions made by each modality. To overcome these challenges, we present a Multi-grained Multi-modal Fusion Network (MMFN) for fake news detection. Inspired by the multi-grained process of human assessment of news authenticity, we respectively employ two Transformer-based pre-trained models to encode token-level features from text and images. The multi-modal module fuses fine-grained features, taking into account coarse-grained features encoded by the CLIP encoder. To address the ambiguity problem, we design uni-modal branches with similarity-based weighting to adaptively adjust the use of multi-modal features. Experimental results demonstrate that the proposed framework outperforms state-of-the-art methods on three prevalent datasets.

References

[1]

Sahar Abdelnabi, Rakibul Hasan, and Mario Fritz. 2022. Open-Domain, Content-based, Multi-modal Fact-checking of Out-of-Context Images via Online Resources. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 14940–14949.

[2]

Liesbeth Allein, Marie-Francine Moens, and Domenico Perrotta. 2021. Like Article, Like Audience: Enforcing Multimodal Correlations for Disinformation Detection. arXiv preprint arXiv:2108.13892 (2021).

[3]

Tadas Baltrušaitis, Chaitanya Ahuja, and Louis-Philippe Morency. 2018. Multimodal machine learning: A survey and taxonomy. IEEE Transactions on Pattern Analysis and Machine Intelligence 41, 2 (2018), 423–443.

Digital Library

[4]

Gaurav Bhatt, Aman Sharma, Shivam Sharma, Ankush Nagpal, Balasubramanian Raman, and Ankush Mittal. 2018. Combining neural, statistical and external features for fake news stance identification. In Proceedings of The Web Conference. 1353–1357.

Digital Library

[5]

Tian Bian, Xi Xiao, Tingyang Xu, Peilin Zhao, Wenbing Huang, Yu Rong, and Junzhou Huang. 2020. Rumor detection on social media with bi-directional graph convolutional networks. In Proceedings of the AAAI conference on artificial intelligence, Vol. 34. 549–556.

[6]

Christina Boididou, Symeon Papadopoulos, Markos Zampoglou, Lazaros Apostolidis, Olga Papadopoulou, and Yiannis Kompatsiaris. 2018. Detection and visualization of misleading content on Twitter. International Journal of Multimedia Information Retrieval 7, 1 (2018), 71–86.

[7]

Yixuan Chen, Dongsheng Li, Peng Zhang, Jie Sui, Qin Lv, Lu Tun, and Li Shang. 2022. Cross-modal Ambiguity Learning for Multimodal Fake News Detection. In Proceedings of The Web Conference. 2897–2905.

Digital Library

[8]

Marcos V Conde and Kerem Turgutlu. 2021. CLIP-Art: contrastive pre-training for fine-grained art classification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 3956–3960.

[9]

Nadia K Conroy, Victoria L Rubin, and Yimin Chen. 2015. Automatic deception detection: Methods for finding fake news. Proceedings of the Association for Information science and Technology 52, 1 (2015), 1–4.

[10]

Corentin Dancette, Remi Cadene, Damien Teney, and Matthieu Cord. 2021. Beyond question-based biases: Assessing multimodal shortcut learning in visual question answering. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 1574–1583.

[11]

Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018).

[12]

Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, 2020. An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020).

[13]

Xiuye Gu, Tsung-Yi Lin, Weicheng Kuo, and Yin Cui. 2021. Open-vocabulary object detection via vision and language knowledge distillation. arXiv preprint arXiv:2104.13921 (2021).

[14]

Zhiwei Jin, Juan Cao, Han Guo, Yongdong Zhang, and Jiebo Luo. 2017. Multimodal fusion with recurrent neural networks for rumor detection on microblogs. In Proceedings of the ACM international conference on Multimedia. 795–816.

Digital Library

[15]

Gregory Johnson. 2012. Google Translate http://translate. google. com. Technical Services Quarterly 29, 2 (2012), 165–165.

[16]

Dhruv Khattar, Jaipal Singh Goud, Manish Gupta, and Vasudeva Varma. 2019. Mvae: Multimodal variational autoencoder for fake news detection. In Proceedings of the The Web Conference. 2915–2921.

Digital Library

[17]

Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).

[18]

Boyi Li, Kilian Q Weinberger, Serge Belongie, Vladlen Koltun, and René Ranftl. 2022. Language-driven semantic segmentation. arXiv preprint arXiv:2201.03546 (2022).

[19]

Ze Liu, Yutong Lin, Yue Cao, Han Hu, Yixuan Wei, Zheng Zhang, Stephen Lin, and Baining Guo. 2021. Swin transformer: Hierarchical vision transformer using shifted windows. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 10012–10022.

[20]

Jiasen Lu, Dhruv Batra, Devi Parikh, and Stefan Lee. 2019. Vilbert: Pretraining task-agnostic visiolinguistic representations for vision-and-language tasks. Advances in Neural Information Processing Systems 32 (2019).

[21]

Jing Ma, Wei Gao, Prasenjit Mitra, Sejeong Kwon, Bernard J Jansen, Kam-Fai Wong, and Meeyoung Cha. 2016. Detecting rumors from microblogs with recurrent neural networks. In Proceedings of the International Joint Conference on Artificial Intelligence. 3818–3824.

[22]

Eric Müller-Budack, Jonas Theiner, Sebastian Diering, Maximilian Idahl, and Ralph Ewerth. 2020. Multimodal analytics for real-world news using measures of cross-modal entity consistency. In Proceedings of the International Conference on Multimedia Retrieval. 16–25.

Digital Library

[23]

Qiong Nan, Juan Cao, Yongchun Zhu, Yanyan Wang, and Jintao Li. 2021. MDFEND: Multi-domain Fake News Detection. In Proceedings of the ACM International Conference on Information & Knowledge Management. 3343–3347.

Digital Library

[24]

Alex Nichol, Prafulla Dhariwal, Aditya Ramesh, Pranav Shyam, Pamela Mishkin, Bob McGrew, Ilya Sutskever, and Mark Chen. 2021. Glide: Towards photorealistic image generation and editing with text-guided diffusion models. arXiv preprint arXiv:2112.10741 (2021).

[25]

Peng Qi, Juan Cao, Xirong Li, Huan Liu, Qiang Sheng, Xiaoyue Mi, Qin He, Yongbiao Lv, Chenyang Guo, and Yingchao Yu. 2021. Improving Fake News Detection by Using an Entity-enhanced Framework to Fuse Diverse Multimodal Clues. In Proceedings of the ACM International Conference on Multimedia. 1212–1220.

Digital Library

[26]

Peng Qi, Juan Cao, Tianyun Yang, Junbo Guo, and Jintao Li. 2019. Exploiting multi-domain visual information for fake news detection. In Proceedings of the IEEE International Conference on Data Mining. IEEE, 518–527.

[27]

Shengsheng Qian, Jinguang Wang, Jun Hu, Quan Fang, and Changsheng Xu. 2021. Hierarchical multi-modal contextual attention network for fake news detection. In Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval. 153–162.

Digital Library

[28]

Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, 2021. Learning transferable visual models from natural language supervision. In Proceedings of the International Conference on Machine Learning. 8748–8763.

[29]

Colin Raffel, Noam Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi Zhou, Wei Li, and Peter J Liu. 2019. Exploring the limits of transfer learning with a unified text-to-text transformer. arXiv preprint arXiv:1910.10683 (2019).

[30]

Kai Shu, Deepak Mahudeswaran, Suhang Wang, Dongwon Lee, and Huan Liu. 2020. Fakenewsnet: A data repository with news content, social context, and spatiotemporal information for studying fake news on social media. Big Data 8, 3 (2020), 171–188.

[31]

Shivangi Singhal, Anubha Kabra, Mohit Sharma, Rajiv Ratn Shah, Tanmoy Chakraborty, and Ponnurangam Kumaraguru. 2020. Spotfake+: A multimodal framework for fake news detection via transfer learning (student abstract). In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 34. 13915–13916.

[32]

Shivangi Singhal, Tanisha Pandey, Saksham Mrig, Rajiv Ratn Shah, and Ponnurangam Kumaraguru. 2022. Leveraging Intra and Inter Modality Relationship for Multimodal Fake News Detection. In Companion Proceedings of The Web Conference. 726–734.

[33]

Shivangi Singhal, Rajiv Ratn Shah, Tanmoy Chakraborty, Ponnurangam Kumaraguru, and Shin’ichi Satoh. 2019. Spotfake: A multi-modal framework for fake news detection. In Proceedings of the IEEE international conference on multimedia big data. 39–47.

[34]

Laurens Van der Maaten and Geoffrey Hinton. 2008. Visualizing data using t-SNE.Journal of Machine Learning Research 9, 11 (2008).

[35]

Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 (2017).

[36]

Yaqing Wang, Fenglong Ma, Zhiwei Jin, Ye Yuan, Guangxu Xun, Kishlay Jha, Lu Su, and Jing Gao. 2018. Eann: Event adversarial neural networks for multi-modal fake news detection. In Proceedings of the ACM international conference on knowledge discovery & data mining. 849–857.

Digital Library

[37]

Youze Wang, Shengsheng Qian, Jun Hu, Quan Fang, and Changsheng Xu. 2020. Fake news detection via knowledge-driven multimodal graph convolutional networks. In Proceedings of the International Conference on Multimedia Retrieval. 540–547.

Digital Library

[38]

Tianyi Wei, Dongdong Chen, Wenbo Zhou, Jing Liao, Zhentao Tan, Lu Yuan, Weiming Zhang, and Nenghai Yu. 2021. Hairclip: Design your hair by text and reference image. arXiv preprint arXiv:2112.05142 (2021).

[39]

Yang Wu, Pengwei Zhan, Yunjian Zhang, Liming Wang, and Zhen Xu. 2021. Multimodal Fusion with Co-Attention Networks for Fake News Detection. In Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021. 2560–2569.

[40]

Junxiao Xue, Yabo Wang, Yichen Tian, Yafei Li, Lei Shi, and Lin Wei. 2021. Detecting fake news by exploring the consistency of multimodal data. Information Processing & Management 58, 5 (2021), 102610.

Digital Library

[41]

Keren Ye and Adriana Kovashka. 2021. A case study of the shortcut effects in visual commonsense reasoning. In Proceedings of the AAAI conference on Artificial Intelligence, Vol. 35. 3181–3189.

[42]

Qichao Ying, Xiaoxiao Hu, Yangming Zhou, Zhenxing Qian, Dan Zeng, and Shiming Ge. 2023. Bootstrapping Multi-view Representations for Fake News Detection. In Proceedings of the AAAI conference on Artificial Intelligence.

Digital Library

[43]

Feng Yu, Qiang Liu, Shu Wu, Liang Wang, Tieniu Tan, 2017. A Convolutional Approach for Misinformation Identification. In IJCAI. 3901–3907.

[44]

Xueyao Zhang, Juan Cao, Xirong Li, Qiang Sheng, Lei Zhong, and Kai Shu. 2021. Mining dual emotion for fake news detection. In Proceedings of The Web Conference. 3465–3476.

Digital Library

[45]

Xinyi Zhou, Jindi Wu, and Reza Zafarani. 2020. SAFE: Similarity-Aware Multi-modal Fake News Detection. In Pacific-Asia Conference on Knowledge Discovery and Data Mining. Springer, 354–367.

[46]

Yangming Zhou, Yuzhou Yang, Qichao Ying, Zhenxing Qian, and Xinpeng Zhang. 2022. Multimodal fake news detection via CLIP-guided learning. arXiv preprint arXiv:2205.14304 (2022).

[47]

Yongchun Zhu, Qiang Sheng, Juan Cao, Qiong Nan, Kai Shu, Minghui Wu, Jindong Wang, and Fuzhen Zhuang. 2022. Memory-Guided Multi-View Multi-Domain Fake News Detection. IEEE Transactions on Knowledge and Data Engineering (2022).

Digital Library

[48]

Arkaitz Zubiaga, Ahmet Aker, Kalina Bontcheva, Maria Liakata, and Rob Procter. 2018. Detection and resolution of rumours in social media: A survey. ACM Computing Surveys (CSUR) 51, 2 (2018), 1–36.

Digital Library

Cited By

Jing CGao HZhang XGao TZhou C(2025)DPSGInformation Fusion10.1016/j.inffus.2024.102595113:COnline publication date: 1-Jan-2025
https://dl.acm.org/doi/10.1016/j.inffus.2024.102595
Sharma RArya A(2024)MMHFND: Fusing Modalities for Multimodal Multiclass Hindi Fake News Detection via Contrastive LearningACM Transactions on Asian and Low-Resource Language Information Processing10.1145/368679723:11(1-25)Online publication date: 21-Nov-2024
https://dl.acm.org/doi/10.1145/3686797
Liu YRen YSui J(2024)PMMC: Prompt-based Multi-Modal Rumor Detection Model with Modality Conversion2024 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN60899.2024.10650555(1-6)Online publication date: 30-Jun-2024
https://doi.org/10.1109/IJCNN60899.2024.10650555
Show More Cited By

Index Terms

Multi-modal Fake News Detection on Social Media via Multi-grained Information Fusion
1. Computing methodologies
  1. Artificial intelligence
2. Information systems
  1. Information systems applications
    1. Data mining

Recommendations

Hierarchical Multi-modal Contextual Attention Network for Fake News Detection
SIGIR '21: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval

Nowadays, detecting fake news on social media platforms has become a top priority since the widespread dissemination of fake news may mislead readers and have negative effects. To date, many algorithms have been proposed to facilitate the detection of ...
Cross-modal Ambiguity Learning for Multimodal Fake News Detection
WWW '22: Proceedings of the ACM Web Conference 2022

Cross-modal learning is essential to enable accurate fake news detection due to the fast-growing multimodal contents in online social communities. A fundamental challenge of multimodal fake news detection lies in the inherent ambiguity across different ...
Exploring Cross-Modal Inconsistency in Entities and Emotions for Multimodal Fake News Detection
Pattern Recognition and Computer Vision
Abstract
The automatic detection of multimodal fake news has attracted significant attention recently. Numerous existing methods focus on the fusion of unimodal features to generate multimodal news representations. However, it is possible that these ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences

ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia Retrieval

June 2023

694 pages

ISBN:9798400701788

DOI:10.1145/3591106

Editors:
Ioannis (Yiannis) Kompatsiaris
Centre for Research and Technology Hellas, Greece
,
Jiebo Luo
University of Rochester,USA
,
Nicu Sebe
University of Trento, Italy
,
Angela Yao
National University of Singapore, Singapore
,
Vasileios Mezaris
Centre for Research and Technology Hellas, Greece
,
Symeon Papadopoulos
Centre for Research and Technology Hellas, Greece
,
Adrian Popescu
CEA LIST, France
,
Zi (Helen) Huang
University of Queensland, Australia

Copyright © 2023 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 12 June 2023

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Conference

ICMR '23

Sponsor:

SIGMM

ICMR '23: International Conference on Multimedia Retrieval

June 12 - 15, 2023

Thessaloniki, Greece

Acceptance Rates

Overall Acceptance Rate 254 of 830 submissions, 31%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

13
Total Citations
View Citations
539
Total Downloads

Downloads (Last 12 months)319
Downloads (Last 6 weeks)34

Reflects downloads up to 20 Nov 2024

Other Metrics

View Author Metrics

Citations

Cited By

Jing CGao HZhang XGao TZhou C(2025)DPSGInformation Fusion10.1016/j.inffus.2024.102595113:COnline publication date: 1-Jan-2025
https://dl.acm.org/doi/10.1016/j.inffus.2024.102595
Sharma RArya A(2024)MMHFND: Fusing Modalities for Multimodal Multiclass Hindi Fake News Detection via Contrastive LearningACM Transactions on Asian and Low-Resource Language Information Processing10.1145/368679723:11(1-25)Online publication date: 21-Nov-2024
https://dl.acm.org/doi/10.1145/3686797
Liu YRen YSui J(2024)PMMC: Prompt-based Multi-Modal Rumor Detection Model with Modality Conversion2024 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN60899.2024.10650555(1-6)Online publication date: 30-Jun-2024
https://doi.org/10.1109/IJCNN60899.2024.10650555
Ma ZLiu HZeng ZGuo HZhao XLuo M(2024)Learning Multimodal Attention Mixed with Frequency Domain Information as Detector for Fake News Detection2024 IEEE International Conference on Multimedia and Expo (ICME)10.1109/ICME57554.2024.10687912(1-6)Online publication date: 15-Jul-2024
https://doi.org/10.1109/ICME57554.2024.10687912
Festus Ayetiran EÖzgöbek Ö(2024)A Review of Deep Learning Techniques for Multimodal Fake News and Harmful Languages DetectionIEEE Access10.1109/ACCESS.2024.340625812(76133-76153)Online publication date: 2024
https://doi.org/10.1109/ACCESS.2024.3406258
Liu YBing WRen SMa H(2024)BC-FND: An Approach Based on Hierarchical Bilinear Fusion and Multimodal Consistency for Fake News DetectionIEEE Access10.1109/ACCESS.2024.339240912(62738-62749)Online publication date: 2024
https://doi.org/10.1109/ACCESS.2024.3392409
Yan FZhang MWei BRen KJiang W(2024)SARD: Fake news detection based on CLIP contrastive learning and multimodal semantic alignmentJournal of King Saud University - Computer and Information Sciences10.1016/j.jksuci.2024.10216036:8(102160)Online publication date: Oct-2024
https://doi.org/10.1016/j.jksuci.2024.102160
Yan FZhang MWei BRen KJiang W(2024)FMC: Multimodal fake news detection based on multi-granularity feature fusion and contrastive learningAlexandria Engineering Journal10.1016/j.aej.2024.08.103109(376-393)Online publication date: Dec-2024
https://doi.org/10.1016/j.aej.2024.08.103
Dhawan MSharma SKadam ASharma RKumaraguru P(2024)Game-on: graph attention network based multimodal fusion for fake news detectionSocial Network Analysis and Mining10.1007/s13278-024-01271-414:1Online publication date: 11-Jun-2024
https://doi.org/10.1007/s13278-024-01271-4
Liu HTan ZChen QWei YZhao YWang J(2024)Unified Frequency-Assisted Transformer Framework for Detecting and Grounding Multi-modal ManipulationInternational Journal of Computer Vision10.1007/s11263-024-02245-xOnline publication date: 7-Oct-2024
https://doi.org/10.1007/s11263-024-02245-x
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

HTML Format

View this article in HTML Format.

Media

Figures

Other

Tables

View Table of Contents