Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3323503.3361698acmotherconferencesArticle/Chapter ViewAbstractPublication PageswebmediaConference Proceedingsconference-collections
short-paper

FACTCK.BR: a new dataset to study fake news

Published: 29 October 2019 Publication History

Abstract

Machine learning algorithms can be used to combat fake news propagation. For the news classification, labeled datasets are required, however, among the existing datasets, few separate verified false from skewed ones with a good variety of sources. This work presents FACTCK.BR, a new dataset to study Fake News in Portuguese, presenting a supposedly false News along with their respective fact check and classification. The data is collected from the ClaimReview, a structured data schema used by fact check agencies to share their results in search engines, enabling data collect in real time.

References

[1]
Lucas Graves. 2018. Understanding the promise and limits of automated fact-checking. Factsheet 2 (2018), 2018--02.
[2]
Naeemul Hassan, Bill Adair, James T Hamilton, Chengkai Li, Mark Tremayne, Jun Yang, and Cong Yu. 2015. The quest to automate fact-checking. In Proceedings of the 2015 Computation+ Journalism Symposium.
[3]
Naeemul Hassan, Gensheng Zhang, Fatma Arslan, Josue Caraballo, Damian Jimenez, Siddhant Gawsane, Shohedul Hasan, Minumol Joseph, Aaditya Kulkarni, Anil Kumar Nayak, et al. 2017. Claimbuster: The first-ever end-to-end fact-checking system. Proceedings of the VLDB Endowment 10, 12 (2017), 1945--1948.
[4]
Rafael A. Monteiro, Roney L. S. Santos, Thiago A. S. Pardo, Tiago A. de Almeida, Evandro E. S. Ruiz, and Oto A. Vale. 2018. Contributions to the Study of Fake News in Portuguese: New Corpus and Automatic Detection Results. In Computational Processing of the Portuguese Language. Springer International Publishing, 324--334.
[5]
Marcio Moretto Ribeiro Pablo Ortellado. 2018. Polarização e desinformação online no Brasil (44 ed.).
[6]
Victoria L Rubin, Yimin Chen, and Niall J Conroy. 2015. Deception detection for news: three types of fakes. In Proceedings of the 78th ASIS&T Annual Meeting: Information Science with Impact: Research in and for the Community. American Society for Information Science, 83.
[7]
William Yang Wang. 2017. "liar, liar pants on fire": A new benchmark dataset for fake news detection. arXiv preprint arXiv:1705.00648 (2017).

Cited By

View all
  • (2023)Exploring Digital Misinformation as a Sociotechnical Phenomenon: Insights from a Small-scale StudyProceedings of the XXII Brazilian Symposium on Human Factors in Computing Systems10.1145/3638067.3638098(1-12)Online publication date: 16-Oct-2023
  • (2022)A Systematic Literature Mapping on Profile Trustworthiness in Fake News Spread2022 IEEE 25th International Conference on Computer Supported Cooperative Work in Design (CSCWD)10.1109/CSCWD54268.2022.9776232(275-279)Online publication date: 4-May-2022
  • (2022)FakeRecogna: A New Brazilian Corpus for Fake News DetectionComputational Processing of the Portuguese Language10.1007/978-3-030-98305-5_6(57-67)Online publication date: 16-Mar-2022
  • Show More Cited By

Index Terms

  1. FACTCK.BR: a new dataset to study fake news

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    WebMedia '19: Proceedings of the 25th Brazillian Symposium on Multimedia and the Web
    October 2019
    537 pages
    ISBN:9781450367639
    DOI:10.1145/3323503
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 29 October 2019

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. dataset
    2. fact check
    3. fake news
    4. information extraction

    Qualifiers

    • Short-paper

    Conference

    WebMedia '19
    WebMedia '19: Brazilian Symposium on Multimedia and the Web
    October 29 - November 1, 2019
    Rio de Janeiro, Brazil

    Acceptance Rates

    Overall Acceptance Rate 270 of 873 submissions, 31%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)22
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 18 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Exploring Digital Misinformation as a Sociotechnical Phenomenon: Insights from a Small-scale StudyProceedings of the XXII Brazilian Symposium on Human Factors in Computing Systems10.1145/3638067.3638098(1-12)Online publication date: 16-Oct-2023
    • (2022)A Systematic Literature Mapping on Profile Trustworthiness in Fake News Spread2022 IEEE 25th International Conference on Computer Supported Cooperative Work in Design (CSCWD)10.1109/CSCWD54268.2022.9776232(275-279)Online publication date: 4-May-2022
    • (2022)FakeRecogna: A New Brazilian Corpus for Fake News DetectionComputational Processing of the Portuguese Language10.1007/978-3-030-98305-5_6(57-67)Online publication date: 16-Mar-2022
    • (2021)Using NER + ML to Automatically Detect Fake NewsIntelligent Systems Design and Applications10.1007/978-3-030-71187-0_109(1176-1187)Online publication date: 3-Jun-2021
    • (2021)Technological Approaches to Detecting Online Disinformation and ManipulationChallenging Online Propaganda and Disinformation in the 21st Century10.1007/978-3-030-58624-9_5(139-166)Online publication date: 10-Mar-2021
    • (2020)A Linguistic-Based Method that Combines Polarity, Emotion and Grammatical Characteristics to Detect Fake News in PortugueseProceedings of the Brazilian Symposium on Multimedia and the Web10.1145/3428658.3430975(217-224)Online publication date: 30-Nov-2020
    • (2020)FakeNewsSetGenProceedings of the Brazilian Symposium on Multimedia and the Web10.1145/3428658.3430965(241-248)Online publication date: 30-Nov-2020
    • (2020)Early Detection of Social Media Hoaxes at ScaleACM Transactions on the Web10.1145/340719414:4(1-23)Online publication date: 18-Aug-2020

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media