Abstract
This paper presents online hate speech as a societal and computational challenge. Offensive content detection in social media is considered as a multilingual, multi-level, multi-class classification problem for three Indo-European languages. This research problem is offered to the community through the HASOC shared task. HASOC intends to stimulate research and development in hate speech recognition across different languages. Three datasets (in English, German, and Hindi) were developed from Twitter and Facebook, and made available. This paper describes the creation of the multilingual datasets and the annotation method. We will present the numerous approaches based on traditional classifiers, deep neural models, and transfer learning models, along with features used for the classification. Results show that the best classifier for the binary classification might not perform best in the multi-class classification, and the performance of the same classifier varies across the languages. Overall, transfer learning models such as BERT, and deep neural models based on LSTMs and CNNs perform similar but better than traditional classifiers such as SVM. We will conclude the discussion with a list of issues that needs to be addressed for future datasets.
Similar content being viewed by others
References
Al-Hassan A, Al-Dossari H. Detection of hate speech in social networks: a survey on multilingual corpus. Comput Sci Inf Technol (CS & IT). 2019;9(2):83.
Saroj A, Mundotiya RK, Pal S. IRLab@ IITBHU at hasoc 2019: traditional machine learning for hate speech and offensive content identification. In: Proceedings of the 11th annual meeting of the Forum for Information Retrieval Evaluation. 2019.
Baruah A, Barbhuiya F, Dey K. IIITG-ADBU at HASOC 2019: automated hate speech and offensive content detection in english and code-mixed hindi text. In: Proceedings of the 11th annual meeting of the Forum for Information Retrieval Evaluation. 2019.
Bashar MA, Nayak R. QutNocturnal@HASOC’19: CNN for hate speech and offensive content identification in hindi language. In: Proceedings of the 11th annual meeting of the Forum for Information Retrieval Evaluation. 2019.
Basile V, Bosco C, Fersini E, Nozza D, Patti V, Pardo FMR, Rosso P, Sanguinetti M. Semeval-2019 task 5: Multilingual detection of hate speech against immigrants and women in twitter. In: Proceedings of the 13th international workshop on semantic evaluation. 2019. p. 54–63.
Wang B, Yunxia Ding SL, Zhou X. YNU Wb at HASOC 2019: ordered neurons LSTM with attention for identifying hate speech and offensive language. In: Proceedings of the 11th annual meeting of the Forum for Information Retrieval Evaluation. 2019.
Burnap P, Williams ML. Cyber hate speech on twitter: an application of machine classification and statistical modeling for policy and decision making. Policy Internet. 2015;7(2):223–42.
Casavantes M, López R, González LC, Montes-y Gómez M. UACh-INAOE at HASOC 2019: detecting aggressive tweets by incorporating authors’ traits as descriptors. In: Proceedings of the 11th annual meeting of the Forum for Information Retrieval Evaluation. 2019.
Conover MD, Ratkiewicz J, Francisco M, Gonçalves B, Menczer F, Flammini A. Political polarization on twitter. In: Fifth international AAAI conference on weblogs and social media. 2011.
Dana Ruiter MAR, Klakow D. LSV-UdS at HASOC 2019: the problem of defining hate? In: Proceedings of the 11th annual meeting of the Forum for Information Retrieval Evaluation. 2019.
Davidson T, Warmsley D, Macy M, Weber I. Automated hate speech detection and the problem of offensive language. In: Proceedings of ICWSM. 2017.
De Smedt T, Jaki S, Kotzé E, Saoud L, Gwóźdź M, De Pauw G, Daelemans W. Multilingual cross-domain perspectives on online hate speech. 2018. https://repository.uantwerpen.be/docman/irua/e092ae/156589.pdf. Accessed 7 Mar 2020.
Devlin J, Chang MW, Lee K, Toutanova K. Bert: pre-training of deep bidirectional transformers for language understanding. 2018. arXiv:1810.04805.
Djuric N, Zhou J, Morris R, Grbovic M, Radosavljevic V, Bhamidipati N. Hate speech detection with comment embeddings. In: Proceedings of the 24th international conference on world wide web companion. International World Wide Web conferences steering committee. 2015. p. 29–30.
Fersini E, Rosso P, Anzovino M. Overview of the task on automatic misogyny identification at ibereval 2018. 2018.
Fišer D, Erjavec T, Ljubešić N. Legal framework, dataset and annotation schema for socially unacceptable on-line discourse practices in Slovene. In: Proceedings of the workshop on abusive language online (ALW). Canada: Vancouver; 2017.
Fortuna P, Nunes S. A survey on automatic detection of hate speech in text. ACM Comput Surv (CSUR). 2018;51(4):85.
Habermas J. Vorstudien und Ergänzungen zur Theorie des kommunikativen Handelns. Frankfurt: Suhrkamp; 1984.
Mensonides J-C, Jean, P-A, Tchechmedjiev A, Harispe S. IMT mines ales at HASOC 2019: automatic hate speech detection. In: Proceedings of the 11th annual meeting of the Forum for Information Retrieval Evaluation. 2019.
Jhaver S, Ghoshal S, Bruckman A, Gilbert E. Online harassment and content moderation: the case of blocklists. ACM Trans Comput Hum Interact (TOCHI). 2018;25(2):12.
Jiang A. QMUL-NLP at HASOC 2019: offensive content detection and classification in social media. In: Proceedings of the 11th annual meeting of the Forum for Information Retrieval Evaluation. 2019.
Das KA, Barbhuiya FA. FalsePostive at HASOC 2019: transfer-learning for detection and classification of hate speech. In: Proceedings of the 11th annual meeting of the Forum for Information Retrieval Evaluation. 2019.
Kumari K, Singh JP. AI ML NIT Patna at HASOC 2019: deep learning approach for identification of abusive content. In: Proceedings of the 11th annual meeting of the Forum for Information Retrieval Evaluation. 2019.
Kwok I, Wang Y. Locate the hate: detecting tweets against blacks. In: Twenty-seventh AAAI conference on artificial intelligence. 2013.
Lu Z, Nie JY. RALIGRAPH at HASOC 2019: VGCN-BERT: augmenting BERT with graph embedding for offensive language detection. In: Proceedings of the 11th annual meeting of the Forum for Information Retrieval Evaluation. 2019.
Mishra A, Pal S. IIT Varanasi at HASOC 2019: hate speech and offensive content identification in Indo-European languages. In: Proceedings of the 11th annual meeting of the Forum for Information Retrieval Evaluation. 2019.
Mishra S, Mishra S. 3Idiots at HASOC 2019: Fine-tuning transformer neural networks for hate speech identification in Indo-European languages. In: Proceedings of the 11th annual meeting of the Forum for Information Retrieval Evaluation. 2019.
Mondal M, Silva LA, Benevenuto F. A measurement study of hate speech in social media. In: Proceedings of the 28th ACM conference on hypertext and social media. ACM; 2017. p. 85–94.
Mubarak H, Darwish K, Magdy W. Abusive language detection on Arabic social media. In: Proceedings of the first workshop on abusive language online. 2017. p. 52–6.
Mubarak H, Kareem D, Walid M. Abusive language detection on Arabic social media. In: Proceedings of the workshop on abusive language online (ALW). Vancouver, Canada; 2017.
Mujadia V, Mishra P, Sharma DM. IIIT-Hyderabad at HASOC 2019: hate speech detection. In: Proceedings of the 11th annual meeting of the Forum for Information Retrieval Evaluation. 2019.
Nagle A. Kill all normies: online culture wars from 4chan and Tumblr to Trump and the alt-right. London: John Hunt Publishing; 2017.
Nayel HA, Shashirekha HL. DEEP at HASOC2019: a machine learning framework for hate speech and offensive language detection. In: Proceedings of the 11th annual meeting of the Forum for Information Retrieval Evaluation. 2019.
Nina-Alcocer V. Vito at HASOC 2019: detecting hate speech and offensive content through ensembles. In: Proceedings of the 11th annual meeting of the Forum for Information Retrieval Evaluation. 2019.
Nobata C, Tetreault J, Thomas A, Mehdad Y, Chang Y. Abusive language detection in online user content. In: Proceedings of the 25th international conference on world wide web. International world wide web conferences steering committee. 2016. p. 145–53.
Nockleby JT. Hate speech. Encycl Am Const. 2000;3(2):1277–9.
Parikh A, Desai H, Bisht AS. DA Master at HASOC 2019: identification of hate speech using machine learning and deep learning approaches for social media post. In: Proceedings of the 11th annual meeting of the Forum for Information Retrieval Evaluation. 2019.
Alonso P, Saini R, Kovács G. TheNorth at HASOC 2019: hate speech detection in social media data. In: Proceedings of the 11th annual meeting of the Forum for Information Retrieval Evaluation. 2019.
Pereira-Kohatsu JC, Quijano-Sánchez L, Liberatore F, Camacho-Collados M. Detecting and monitoring hate speech in twitter. Sensors. 2019;19(21):4654.
Peters ME, Neumann M, Iyyer M, Gardner M, Clark C, Lee K, Zettlemoyer L. Deep contextualized word representations. 2018. arXiv:1802.05365
Rajalakshmi, R, Reddy BY. YR: DLRG@HASOC 2019—an enhanced ensemble classifier for hate and offensive content identification. In: Proceedings of the 11th annual meeting of the Forum for Information Retrieval Evaluation. 2019.
Ranasinghe T, Zampieri M, Hettiarachchi H. BRUMS at HASOC 2019: deep learning models for multilingual hate speech and offensive language identification. In: Proceedings of the 11th annual meeting of the Forum for Information Retrieval Evaluation. 2019.
Kumar R, Reganti AN, Bhatia A, Maheshwari T. Aggression-annotated corpus of hindi–english code-mixed data. In: Proceedings of the 11th language resources and evaluation conference (LREC). Miyazaki: Japan; 2018. p. 1–11.
Kumar R, Ojha AK. KMI-Panlingua at HASOC 2019: SVM vs BERT for hate speech and offensive content detection. In: Proceedings of the 11th annual meeting of the Forum for Information Retrieval Evaluation. 2019.
Ross B, Rist M, Carbonell G, Cabrera B, Kurowsky N, Wojatzki M. Measuring the reliability of hate speech annotations: the case of the european refugee crisis. In: Proceedings of the workshop on natural language processing for computer-mediated communication (NLP4CMC). Germany: Bochum; 2016.
Ross B, Rist M, Carbonell G, Cabrera B, Kurowsky N, Wojatzki M. Measuring the reliability of hate speech annotations: the case of the european refugee crisis. 2017. arXiv:1701.08118
Saha BN, Senapati A. CIT Kokrajhar team: LSTM based deep RNN architecture for hate speech and offensive content (HASOC) identification in Indo-European languages. In: Proceedings of the 11th annual meeting of the Forum for Information Retrieval Evaluation. 2019.
Saha P, Mathew B, Goyal P, Mukherjee A. HateMonitors at HASOC 2019: language agnostic online abuse detection. In: Proceedings of the 11th annual meeting of the Forum for Information Retrieval Evaluation. 2019.
Salminen J, Almerekhi H, Kamel AM, Jung SQ, Jansen BJ. Online hate ratings vary by extremes: a statistical analysis. In: Proceedings of the 2019 conference on human information interaction and retrieval. ACM; 2019. p. 213–7.
Sap M, Card D, Gabriel S, Choi Y, Smith NA. The risk of racial bias in hate speech detection. In: Proceedings of the 57th annual meeting of the association for computational linguistics. 2019. p. 1668–78.
Schmidt A, Wiegand M. A survey on hate speech detection using natural language processing. In: Proceedings of the fifth international workshop on natural language processing for social media. Association for computational linguistics. Valencia, Spain; 2017. p. 1–10.
Sharon T, John NA. Unpacking (the) secret: anonymous social media and the impossibility of networked anonymity. New Media Soc. 2018;20(11):4177–94.
Silva L, Mondal M, Correa D, Benevenuto F, Weber I. Analyzing the targets of hate in online social media. In: Tenth international AAAI conference on web and social media. 2016.
Sreelakshmi KP. AmritaCEN at HASOC 2019: hate speech detection in Roman and Devanagiri scripted text. In: Proceedings of the 11th annual meeting of the Forum for Information Retrieval Evaluation. 2019.
Struß JM, Siegel M, Ruppenhofer J, Wiegand M, Klenner M. Overview of germeval task 2, 2019 shared task on the identification of offensive language. 2019.
Su HP, Huang CJ, Chang HT, Lin CJ. Rephrasing profanity in Chinese Text. In: Proceedings of the workshop on abusive language online (ALW). Vancouver, Canada; 2017.
Swinger N, De-Arteaga M, Heffernan IV NT, Leiserson MD, Kalai AT. What are the biases in my word embedding? In: Proceedings of the 2019 AAAI/ACM conference on AI, ethics, and society. ACM; 2019. p. 305–11.
Tulkens S, Hilte L, Lodewyckx E, Verhoeven B, Daelemans W. A dictionary-based approach to racism detection in dutch social media. 2016. arXiv:1608.08738.
Urmi Saha AD, Bhattacharyya P. IIT Bombay at HASOC 2019: supervised hate speech and offensive content detection in Indo-European languages. In: Proceedings of the 11th annual meeting of the Forum for Information Retrieval Evaluation. 2019.
Wagner K, Bumann C. Challenges in annotating a corpus for automatic hate speech detection. In: BOBCATSSS Paris. January 2020.
Wallach HM. Topic modeling: beyond bag-of-words. In: Proceedings of the 23rd international conference on machine learning. ACM; 2006. p. 977–84.
Warner W, Hirschberg J. Detecting hate speech on the world wide web. In: Proceedings of the second workshop on language in social media. Association for Computational Linguistics. 2012. p. 19–26.
Waseem Z, Hovy D. Hateful symbols or hateful people? Predictive features for hate speech detection on twitter. In: Proceedings of the NAACL student research workshop. 2016. p. 88–93.
Wiegand M, Ruppenhofer J, Kleinbauer T. Detection of abusive language: the problem of biased datasets. In: Proceedings of the 2019 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Vol. 1 (long and short papers). 2019. p. 602–8.
Wiegand M, Siegel M, Ruppenhofer J. Overview of the germeval 2018 shared task on the identification of offensive language. 2018.
Zafar MB, Valera I, Gomez Rodriguez M, Gummadi KP. Fairness beyond disparate treatment & disparate impact: learning classification without disparate mistreatment. In: Proceedings of the 26th international conference on world wide web. International world wide web conferences steering committee. 2017. p. 1171–80.
Zampieri M, Malmasi S, Nakov P, Rosenthal S, Farra N, Kumar R. Predicting the type and target of offensive posts in social media. In: Proceedings of NAACL. 2019.
Zampieri M, Malmasi S, Nakov P, Rosenthal S, Farra N, Kumar R. Predicting the type and target of offensive posts in social media. 2019. arXiv:1902.09666.
Zampieri M, Malmasi S, Nakov P, Rosenthal S, Farra N, Kumar R. Semeval-2019 task 6: identifying and categorizing offensive language in social media (offenseval). 2019. arXiv:1903.08983.
Acknowledgements
We would like to acknowledge Mr. Chintak Mandlia, Mohana Dave, Aditya Patel for the help in managing the HASOC track. We are also thankful to the college junior students who helped us to annotate the dataset.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This article is part of the topical collection “Forum for Information Retrieval Evaluation” guest edited by Mandar Mitra and Prasenjit Majumder.
Rights and permissions
About this article
Cite this article
Modha, S., Mandl, T., Majumder, P. et al. Tracking Hate in Social Media: Evaluation, Challenges and Approaches. SN COMPUT. SCI. 1, 105 (2020). https://doi.org/10.1007/s42979-020-0082-0
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s42979-020-0082-0