A Scoping Review of the Relationship of Big Data Analytics with Context-Based Fake News Detection on Digital Media in Data Age
<p>Diagram of the search process.</p> "> Figure 2
<p>The search process.</p> "> Figure 3
<p>Geographical distribution of the studies (<span class="html-italic">n</span> = 42).</p> "> Figure 4
<p>Comparison between numbers of publications in the periods from 2015 to 2018 with the period from 2019 to 2022.</p> "> Figure 5
<p>Research methodologies of the previous studies.</p> "> Figure 6
<p>Trending approaches to detect fake news.</p> ">
Abstract
:1. Introduction
Research Questions
2. Methodology
- A.
- Phase 1: Planning
- (1)
- Focused research questions
- (2)
- Search strategy
- B.
- Phase 2: Selection
- (1)
- Search process
- (2)
- Scrutiny and filtering
- C.
- Phase 3: Extraction
- D.
- Phase 4: Execution
3. Results
3.1. An Overview of the Selected Studies
3.2. Geographical Distribution of the Studies
3.3. Years Trends of the Selected Studies
3.4. Research Methodologies of the Previous Studies
3.5. Relationship between Big Data Analytics with Context-Based Fake News Detection
3.6. Trending Approaches to Detect Fake News on Digital Media
3.7. Artificial Intelligence
3.8. Fact-Checking Sites
3.9. Neural Networks
3.10. New Media Literacy
3.11. Miscellaneous Trends
3.12. Challenges for Constructing Quality Big Data to Detect Misinformation on Social Media
3.13. Hidden Agenda
3.14. Volume of Fake Information on Digital Media
3.15. Massive Unstructured Data
3.16. Fast Speed of Fake News on Digital Media
3.17. Fake User Accounts
4. Discussion and Implications
5. Conclusions and Recommendations
- An innovative course on big data, covering diverse dimensions, should be taught in library schools for spreading awareness and necessary skills to identify contextual fake news on digital media platforms. There should be a strong positive liaison between library schools and the industry to develop need-based content for imparting creative learning and to provide skilled workers in the market.
- Digital media generators should take strict measures against all those users who post hidden agendas to prevail over irrational practices to shake foundations of the society.
- Adequate steps should be executed to control heterogeneity, volume, and pace of unstructured data for stopping fake news diffusion on digital media.
- Fake accounts should be banned permanently from digital media sites so the amount of posted content may be minimized.
- Quality big data and social media metadata should be developed for detecting context-based fake news.
- New media literacy skills should be infused in web users so that they may verify the originality of the news before posting on digital media applications.
- Artificial intelligence-powered tools should be applied for automatically detecting fake online news effectively and efficiently.
- Government and higher education bodies should plan and execute all necessary steps for implementing, maintaining, and sustaining quality big digital media content for the immediate detection of context-based fake news on social media applications.
6. Limitations and Future Research Directions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Appendix A
S.N. | Author | Year | Country | Journal | Relation of Big Data Analytics with Fake News Detection | Trending Approaches to Detect Fake News on Digital Media | Challenges for Constructing Quality Big Data to Detect Misinformation on Social Media |
---|---|---|---|---|---|---|---|
1. | Vargo and Amazeen | 2018 | USA | New media & society | Fact checking | Fake news spreads on social media and is perhaps more popular than ever. | |
2. | Guo and Vargo | 2017 | USA | Journal of Communication | Correlation between big data analytics and fake news detection is significant. |
| |
3. | Baur et al. | 2020 | Germany | Historical Social Research/Historische Sozialforschung |
| ||
4. | Golbeck et al. | 2018 | Netherlands | WebSci | Big dataset is useful to the research community and on understanding the nature of fake news and ways of fighting it. | Automated system for fake news detection. | |
5. | Nakamura et al. | 2020 | USA | arXiv preprint arXiv | Big data analytics can be used to advance efforts to combat the ever-growing rampant spread of disinformation in today’s society. |
|
|
6. | Khan et al. | 2019 | Bangladesh | Machine Learning with Applications | Big data detects fake information. |
| |
7. | Supriyanto et al. | 2021 | Indonesia | Paedagoria: Jurnal Kajian, Penelitian dan Pengembangan Kependidikan | With big data we can use the correct and fast data from anywhere safely and conveniently. | ||
8. | Murayama | 2021 | Japan | arXiv preprint arXiv | Big dataset assesses the truthfulness of a certain piece of news from news content |
|
|
9. | Darwiesh et al. | 2022 | Egypt | Journal of Healthcare Engineering |
|
| |
10. | Torabi and Taboada | 2019 | Canada | Big Data & Society | Large data sets confirm news credibility. |
|
|
11. | Mahabub | 2020 | Bangladesh | SN Applied Sciences | Authentic big data is positively associated with fake news detection. |
|
|
12. | Ianni et al. | 2020 | Italy | Journal of Intelligent Information Systems | Big data analytics assist in analyzing the social networks data. |
|
|
13. | Jo et al. | 2022 | Korea | Telematics and Informatics |
|
| |
14. | Ebadi et al. | 2020 | United States | IEEE Transactions on Big Data |
|
| |
15. | Zrnec et al. | 2022 | Slovenia | Information Processing and Management |
|
| |
16. | Al-Rawi et al. | 2018 | Canada | Online Information Review |
| ||
17. | Qayyum et al. | 2019 | Pakistan | Cryptography and Security |
| Digitization of human life via social networking applications | |
18. | Jung et al. | 2020 | Germany | Big Data and Society | Big data analysis assists in uncovering digital fake news. |
|
|
19. | Kozik et al. | 2022 | Poland | Journal of Computational Science |
| ||
20. | Meesad | 2021 | Singapore | SN Computer Science |
| ||
21. | Liu | 2019 | USA | Journal of Services Marketing | Artificial intelligence tools |
| |
22. | Lewis and Westlund | 2015 | USA | Digital Journalism | Big data analytics and fake news detection are positively correlated with each other. | ||
23. | Veglis and Maniou | 2018 | Greece | KOME − An International Journal of Pure Communication Inquiry |
| ||
24. | Huckle and White | 2017 | United Kingdom | Big Data | Blockchain-based applications |
| |
25. | Marquez et al. | 2019 | Spain | International Journal of Information Management |
| The industry 4.0 is generating more data than ever before in the history of humanity. | |
26. | Bates et al. | 2018 | United States | Health Policy and Technology | Big data improves accuracy in health-related information. | ||
27. | Olmedilla et al. | 2016 | Spain | Computer Standards and Interfaces | Big data assists in detecting accurate information from online user-generated content. | Effective web-crawler | Huge amount of contextual data |
28. | Shu et al. | 2020 | United States | Big Data | Computational solutions |
| |
29. | Awan et al. | 2021 | Pakistan | Int. J. Computer Applications in Technology |
| ||
30. | Raza and Ding | 2022 | Canada | International Journal of Data Science and Analytics | Big data sets prove useful in fake news identification. | Social contexts to detect fake news |
|
31. | Kauffmann et al. | 2020 | Spain | Industrial Marketing Management | Big data transformed into valuable information detects fake news. |
| |
32. | King and Wang | 2021 | United States | International Journal of Information Management | Big data-driven approach finds out validity of online posted news. | ||
33. | Hassani et al. | 2020 | Iran | Big Data and Cognitive Computing | Text mining in big data analytics is a powerful tool against fake news on digital media. | ||
34. | Thota et al. | 2018 | United States | SMU Data Science Review |
| ||
35. | Ahmad et al. | 2020 | Pakistan | Complexity | Machine learning ensemble approach | Rapid adoption of social media platforms | |
36. | Monti et al. | 2019 | United Kingdom | Social and Information Networks | Forming propagation patterns could be harnessed for the automatic fake news detection. | ||
37. | Sahoo and Gupta | 2021 | India | Applied Soft Computing Journal |
| ||
38. | Sharma et al. | 2020 | India | International Journal of Engineering Research & Technology |
| Biased opinions | |
39. | Aslam et al. | 2021 | Saudi Arabia | Complexity | Ensemble-based deep learning model to classify news as fake or real using LIAR dataset | Diffusion of low-quality news in social media | |
40. | Chauhan and Palivela | 2021 | India | International Journal of Information Management Data Insights |
| ||
41. | Jiang et al. | 2022 | China | Information Processing and Management | Machine learning and deep learning methods | ||
42. | Galli et al. | 2022 | Italy | Journal of Intelligent Information Systems |
| Huge volumes of fake news posted by malicious users |
References
- Allcott, H.; Gentzkow, M. Social media and fake news in 2016 election. J. Econ. Perspect. 2017, 31, 211–236. [Google Scholar] [CrossRef] [Green Version]
- Golbeck, J.; Mauriello, M.; Auxier, B.; Bhanushali, K.H.; Bonk, C.; Bouzaghrane, M.A.; Visnansky, G. Fake news vs satire: A dataset and analysis. In Proceedings of the 10th ACM Conference on Web Science, Amsterdam, The Netherlands, 27–30 May 2018; pp. 17–21. [Google Scholar]
- Al-Rawi, A.; Groshek, J.; Zhang, L. What the fake? Assessing the extent of networked political spamming and bots in the propagation of fakenews on Twitter. Online Inf. Rev. 2018, 43, 53–71. [Google Scholar] [CrossRef] [Green Version]
- Veglis, A.; Maniou, T.A. The mediated data model of communication flow: Big data and data journalism. KOME Int. J. Pure Commun. Inq. 2018, 6, 32–43. [Google Scholar] [CrossRef]
- Huckle, S.; White, M. Fake news: A technological approach to proving the origins of content, using blockchains. Big Data 2017, 5, 356–371. [Google Scholar] [CrossRef]
- Khan, J.Y.; Khondaker, M.d.T.I.; Afroz, S.; Uddin, G.; Iqbal, A. A benchmark study of machine learning models for online fake news detection. Mach. Learn. Appl. 2021, 4, 100032. [Google Scholar] [CrossRef]
- Qayyum, A.; Qadir, J.; Janjua, M.U.; Sher, F. Using blockchain to rein in the new post-truth world and check the spread of fake news. IT Prof. 2019, 21, 16–24. [Google Scholar]
- Jung, A.; Ross, B.; Stieglitz, S. Caution: Rumors ahead—A case study on the debunking of false information on twitter. Big Data Soc. 2020, 7, 1–15. [Google Scholar] [CrossRef]
- Mahabub, A. A robust technique of fake news detection using ensemble voting classifier and comparison with other classifiers. SN Appl. Sci. 2020, 2, 525. [Google Scholar] [CrossRef] [Green Version]
- Mayer-Schönberger, V.; Cukier, K. Big Data: A Revolution That Will Transform How We Live, Work, and Think; Houghton Mifflin Harcourt: Boston, MA, USA, 2013. [Google Scholar]
- Tan, W.; Blake, M.B.; Saleh, I.; Dustdar, S. Social-network-sourced big data analytics. IEEE Internet Comput. 2013, 17, 62–69. [Google Scholar]
- Olmedilla, M.; Martínez-Torres, M.R.; Toral, S.L. Harvesting big data in social science: A methodological approach for collecting online user-generated content. Comput. Stand. Interfaces 2016, 46, 79–87. [Google Scholar] [CrossRef]
- Marquez, J.L.J.; Gonzalez-Carrasco, I.; Lopez-Cuadrado, J.L.; Ruiz-Mezcua, B. Towards a big data framework for analyzing social media content. Int. J. Inf. Manag. 2019, 44, 1–12. [Google Scholar] [CrossRef] [Green Version]
- Hassani, H.; Beneki, C.; Unger, S.; Mazinani, M.T.; Yeganegi, M.R. Text mining in big data analytics. Big Data Cogn. Comput. 2020, 4, 1–34. [Google Scholar] [CrossRef]
- Bates, D.W.; Heitmueller, A.; Kakad, M.; Saria, S. Why policymakers should care about big data in healthcare. Health Policy Technol. 2018, 7, 211–216. [Google Scholar] [CrossRef]
- Torabi, A.F.; Taboada, M. Big data and quality data for fake news and misinformation detection. Big Data Soc. 2019, 6, 1–14. [Google Scholar] [CrossRef] [Green Version]
- Kauffmann, E.; Peral, J.; Gil, D.; Ferrández, A.; Sellers, R.; Mora, H. A framework for big data analytics in commercial social networks: A case study on sentiment analysis and fake review detection for marketing decision-making. Ind. Mark. Manag. 2020, 90, 523–537. [Google Scholar] [CrossRef]
- King, K.K.; Wang, B. Diffusion of real versus misinformation during a crisis event: A big data-driven approach. Int. J. Inf. Manag. 2021, in press. [CrossRef]
- Ebadi, N.; Jozani, M.; Choo, K.R.; Rad, P. A memory network information retrieval model for identification of news misinformation. IEEE Trans. Big Data 2020, 8, 1358–1370. [Google Scholar] [CrossRef]
- Zrnec, A.; Pozenel, M.; Lavbic, D. Users’ ability to perceive misinformation: An information quality assessment approach. Inf. Process. Manag. 2022, 59, 102739. [Google Scholar] [CrossRef]
- Supriyanto, E.E.; Bakti, I.S.; Furqon, M. The role of big data in the implementation of distance learning. Paedagoria 2021, 12, 61–68. [Google Scholar]
- Darwiesh, A.; Alghamdi, M.; El-Baz, A.H.; Elhoseny, M. Social media big data analysis: Towards enhancing competitiveness of firms in a post-pandemic world. J. Healthc. Eng. 2022, 2022, 6967158. [Google Scholar] [CrossRef]
- Thota, A.; Tilak, P.; Ahluwalia, S.; Lohia, N. Fake news detection: A deep learning approach. SMU Data Sci. Rev. 2018, 1, 10. [Google Scholar]
- Ahmad, I.; Yousaf, M.; Yousaf, S.; Ahmad, M.O. Fake news detection using machine learning ensemble methods. Complexity 2020, 1, 8885861. [Google Scholar] [CrossRef]
- Monti, F.; Frasca, F.; Eynard, D.; Mannion, D.; Bronstein, M.M. Fake news detection on social media using geometric deep learning. Soc. Inf. Netw. 2019, 1, 1–15. [Google Scholar]
- Sahoo, S.R.; Gupta, B.B. Multiple features based approach for automatic fake news detection on social networks using deep learning. Appl. Soft Comput. 2021, 100, 106983. [Google Scholar] [CrossRef]
- Sharma, U.; Saran, S.; Patil, S.M. Fake news detection using machine learning algorithms. Int. J. Creat. Res. Thoughts (IJCRT) 2020, 8, 509–518. [Google Scholar]
- Aslam, N.; Ullah Khan, I.; Alotaibi, F.S.; Aldaej, L.A.; Aldubaikil, A.K. Fake detect: A deep learning ensemble model for fake news detection. Complexity 2021, 1, 5557784. [Google Scholar] [CrossRef]
- Chauhan, T.; Palivela, H. Optimization and improvement of fake news detection using deep learning approaches for societal benefit. Int. J. Inf. Manag. Data Insights 2021, 1, 100051. [Google Scholar] [CrossRef]
- Vyas, P.; Liu, J.; El-Gayar, O.F. Fake News Detection on the Web: An LSTM-based Approach. In Proceedings of the AMCIS 2021, Digital Innovation and Entrepreneurship, Virtual, 9–13 August 2021; Volume 5. [Google Scholar]
- Jiang, G.; Liu, S.; Zhao, Y.; Sun, Y.; Zhang, M. Fake news detection via knowledgeable prompt learning. Inf. Processing Manag. 2022, 59, 103029. [Google Scholar] [CrossRef]
- Galli, A.; Masciari, E.; Moscato, V.; Sperlí, G. A comprehensive benchmark for fake news detection. J. Intell. Inf. Syst. 2022, 59, 237–261. [Google Scholar] [CrossRef]
- Nakamura, K.; Levy, S.; Wang, W.Y. A new multimodal benchmark dataset for fine-grained fake news detection. Comput. Lang. 2020, 1, 1–9. [Google Scholar]
- Ianni, M.; Masciari, E.; Sperli, G. A survey of big data dimensions vs social networks analysis. J. Intell. Inf. Syst. 2020, 57, 73–100. [Google Scholar] [CrossRef]
- Jo, H.; Park, S.; Shin, D.; Shin, J.; Lee, C. Estimating cost of fighting against fake news during catastrophic situations. Telemat. Inform. 2022, 66, 101734. [Google Scholar] [CrossRef]
- Liu, X. A big data approach to examining social bots on twitter. J. Serv. Mark. 2019, 33, 369–379. [Google Scholar] [CrossRef]
- Kozik, R.; Kula, S.; Choras, M.; Woźniak, M. Technical solution to counter potential crime: Text analysis to detect fake news and disinformation. J. Comput. Sci. 2022, 60, 101576. [Google Scholar] [CrossRef]
- Meesad, P. Thai fake news detection based on information retrieval, natural language processing and machine learning. SN Comput. Sci. 2021, 2, 425. [Google Scholar] [CrossRef]
- Raza, S.; Ding, C. Fake news detection based on news content and social contexts: A transformer-based approach. Int. J. Data Sci. Anal. 2022, 13, 335–362. [Google Scholar] [CrossRef]
- Moher, D.; Shamseer, L.; Clarke, M.; Ghersi, D.; Liberati, A.; Petticrew, M.; Shekelle, P.; Stewart, L.A. Preferred reporting items for systematic review and meta-analysis protocols (PRISMA-P) 2015 statement. Syst. Rev. 2015, 4, 1. [Google Scholar] [CrossRef]
- Shahzad, K.; Khan, S.A. Factors affecting the adoption of integrated semantic digital libraries (SDLs): A systematic review. Library Hi Tech 2022. ahead of print. [Google Scholar] [CrossRef]
- Lewis, S.C.; Westlund, O. Big data and journalism: Epistemology, expertise, economics, and ethics. Digit. J. 2015, 3, 447–466. [Google Scholar] [CrossRef]
- Guo, L.; Vargo, C.J. Global intermedia agenda setting: A big data analysis of international news flow. J. Commun. 2017, 67, 499–520. [Google Scholar] [CrossRef]
- Murayama, T. Dataset of fake news detection and fact verification: A survey. ACM Comput. Surv. 2021, 1, 1–33. [Google Scholar]
- Vargo, C.J.; Guo, L.; Amazeen, M.A. The agenda-setting power of fake news: A big data analysis of the online media landscape from 2014 to 2016. New Media Soc. 2018, 20, 2028–2049. [Google Scholar] [CrossRef] [Green Version]
- Shu, K.; Mahudeswaran, D.; Wang, S.; Lee, D.; Liu, H. Fakenewsnet: A data repository with news content, social context, and spatiotemporal information for studying fake news on social media. Big Data 2020, 8, 171–188. [Google Scholar] [CrossRef]
- Baur, N.; Graeff, P.; Braunisch, L.; Schweia, M. The quality of big data: Development, problems, and possibilities of use of process-generated data in the digital age. Hist. Soc. Res. 2020, 45, 209–243. [Google Scholar]
- Awan, M.J.; Khan, M.A.; Ansari, Z.K.; Yasin, A.; Shehzad, H.M.F. Fake profile recognition using big data analytics in social media platforms. Int. J. Comput. Appl. Technol. 2021, 68, 215–222. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Shahzad, K.; Khan, S.A.; Ahmad, S.; Iqbal, A. A Scoping Review of the Relationship of Big Data Analytics with Context-Based Fake News Detection on Digital Media in Data Age. Sustainability 2022, 14, 14365. https://doi.org/10.3390/su142114365
Shahzad K, Khan SA, Ahmad S, Iqbal A. A Scoping Review of the Relationship of Big Data Analytics with Context-Based Fake News Detection on Digital Media in Data Age. Sustainability. 2022; 14(21):14365. https://doi.org/10.3390/su142114365
Chicago/Turabian StyleShahzad, Khurram, Shakeel Ahmad Khan, Shakil Ahmad, and Abid Iqbal. 2022. "A Scoping Review of the Relationship of Big Data Analytics with Context-Based Fake News Detection on Digital Media in Data Age" Sustainability 14, no. 21: 14365. https://doi.org/10.3390/su142114365
APA StyleShahzad, K., Khan, S. A., Ahmad, S., & Iqbal, A. (2022). A Scoping Review of the Relationship of Big Data Analytics with Context-Based Fake News Detection on Digital Media in Data Age. Sustainability, 14(21), 14365. https://doi.org/10.3390/su142114365