Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3531146.3533144acmotherconferencesArticle/Chapter ViewAbstractPublication PagesfacctConference Proceedingsconference-collections
research-article
Open access

Exploring the Role of Grammar and Word Choice in Bias Toward African American English (AAE) in Hate Speech Classification

Published: 20 June 2022 Publication History

Abstract

Language usage on social media varies widely even within the context of American English. Despite this, the majority of natural language processing systems are trained only on “Standard American English,” or SAE, the construction of English most prominent among white Americans. For hate speech classification, prior work has shown that African American English (AAE) is more likely to be misclassified as hate speech. This has harmful implications for Black social media users as it reinforces and exacerbates existing notions of anti-Black racism. While past work has highlighted the relationship between AAE and hate speech classification, no work has explored the linguistic characteristics of AAE that lead to misclassification. Our work uses Twitter datasets for AAE dialect and hate speech classifiers to explore the fine-grained relationship between specific characteristics of AAE such as word choice and grammatical features and hate speech predictions. We further investigate these biases by removing profanity and examining the influence of four aspects of AAE grammar that are distinct from SAE. Results show that removing profanity accounts for a roughly 20 to 30% reduction in the percentage of samples classified as ’hate’ ’abusive’ or ’offensive,’ and that similar classification patterns are observed regardless of grammar categories.

References

[1]
Pinkesh Badjatiya, Shashank Gupta, Manish Gupta, and Vasudeva Varma. 2017. Deep learning for hate speech detection in tweets. In Proceedings of the 26th international conference on World Wide Web companion. 759–760.
[2]
Su Lin Blodgett and Brendan O’Connor. 2017. Racial disparity in natural language processing: A case study of social media african-american english. arXiv preprint arXiv:1707.00061(2017).
[3]
Su Lin Blodgett, Johnny Wei, and Brendan O’Connor. 2018. Twitter Universal Dependency Parsing for African-American and Mainstream American English. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Melbourne, Australia, 1415–1425. https://doi.org/10.18653/v1/P18-1131
[4]
Pete Burnap and Matthew L Williams. 2016. Us and them: identifying cyber hate on Twitter across multiple protected characteristics. EPJ Data science 5(2016), 1–15.
[5]
Ying Chen, Yilu Zhou, Sencun Zhu, and Heng Xu. 2012. Detecting offensive language in social media to protect adolescent online safety. In 2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Confernece on Social Computing. IEEE, 71–80.
[6]
Chris Collins, Simanique Moody, and Paul M. Postal. 2008. An AAE Camouflage Construction. Language 84, 1 (2008), 29–68. http://www.jstor.org/stable/40071011
[7]
Patricia Cukor-Avila and Guy Bailey. 1995. Grammaticalization in AAVE. In Annual Meeting of the Berkeley Linguistics Society, Vol. 21. 401–413.
[8]
Thomas Davidson, Debasmita Bhattacharya, and Ingmar Weber. 2019. Racial bias in hate speech and abusive language detection datasets. arXiv preprint arXiv:1905.12516(2019).
[9]
Thomas Davidson, Dana Warmsley, Michael Macy, and Ingmar Weber. 2017. Automated hate speech detection and the problem of offensive language. In Proceedings of the International AAAI Conference on Web and Social Media, Vol. 11.
[10]
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805(2018).
[11]
Nemanja Djuric, Jing Zhou, Robin Morris, Mihajlo Grbovic, Vladan Radosavljevic, and Narayan Bhamidipati. 2015. Hate speech detection with comment embeddings. In Proceedings of the 24th international conference on world wide web. 29–30.
[12]
Antigoni Maria Founta, Despoina Chatzakou, Nicolas Kourtellis, Jeremy Blackburn, Athena Vakali, and Ilias Leontiadis. 2019. A unified deep learning architecture for abuse detection. In Proceedings of the 10th ACM conference on web science. 105–114.
[13]
Antigoni Maria Founta, Constantinos Djouvas, Despoina Chatzakou, Ilias Leontiadis, Jeremy Blackburn, Gianluca Stringhini, Athena Vakali, Michael Sirivianos, and Nicolas Kourtellis. 2018. Large scale crowdsourcing and characterization of twitter abusive behavior. In Twelfth International AAAI Conference on Web and Social Media.
[14]
Kishonna L Gray. 2018. Gaming out online: Black lesbian identity development and community building in Xbox Live. Journal of lesbian studies 22, 3 (2018), 282–296.
[15]
Lisa J Green. 2002. African American English: a linguistic introduction. Cambridge University Press.
[16]
Sophie Groenwold, Lily Ou, Aesha Parekh, Samhita Honnavalli, Sharon Levy, Diba Mirza, and William Yang Wang. 2020. Investigating African-American Vernacular English in Transformer-Based Text Generation. arXiv preprint arXiv:2010.02510(2020).
[17]
Matan Halevy, Camille Harris, Amy Bruckman, Diyi Yang, and Ayanna Howard. 2021. Mitigating Racial Biases in Toxic Language Detection with an Equity-Based Ensemble Framework. In Equity and Access in Algorithms, Mechanisms, and Optimization. 1–11.
[18]
Jacob Hoffman-Andrews. 2015. BlockTogether. https://blocktogether.org/
[19]
Shagun Jhaver, Sucheta Ghoshal, Amy Bruckman, and Eric Gilbert. 2018. Online harassment and content moderation: The case of blocklists. ACM Transactions on Computer-Human Interaction (TOCHI) 25, 2(2018), 1–33.
[20]
Taylor Jones. 2015. Toward a Description of African American Vernacular English Dialect Regions Using “Black Twitter”. American Speech 90, 4 (2015), 403–440.
[21]
Tiffany Marquise’ Jones. 2008. ” You Done Lost Yo’Mind Ain’t No Such Thang as AAVE”: Exploring African American Resistance to AAVE. (2008).
[22]
Svetlana Kiritchenko and Saif M Mohammad. 2018. Examining gender and race bias in two hundred sentiment analysis systems. arXiv preprint arXiv:1805.04508(2018).
[23]
Deepak Kumar, Patrick Gage Kelley, Sunny Consolvo, Joshua Mason, Elie Bursztein, Zakir Durumeric, Kurt Thomas, and Michael Bailey. 2021. Designing Toxic Content Classification for a Diversity of Perspectives. arXiv preprint arXiv:2106.04511(2021).
[24]
Austin Lane. 2014. “You Tryna Grammaticalize?” An Analysis of “Tryna” as a Grammaticalized Semi-Auxiliary. Eagle Feather 11, 2014 (2014).
[25]
Stefan Martin, Walt Wolfram, 1998. The sentence in African-American vernacular English. African American English: structure, history, and use (1998), 11–36.
[26]
Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781(2013).
[27]
Chikashi Nobata, Joel Tetreault, Achint Thomas, Yashar Mehdad, and Yi Chang. 2016. Abusive language detection in online user content. In Proceedings of the 25th international conference on world wide web. 145–153.
[28]
James W Pennebaker, Ryan L Boyd, Kayla Jordan, and Kate Blackburn. 2015. The development and psychometric properties of LIWC2015. Technical Report.
[29]
James W Pennebaker, Martha E Francis, and Roger J Booth. 2001. Linguistic inquiry and word count: LIWC 2001. Mahway: Lawrence Erlbaum Associates 71, 2001 (2001), 2001.
[30]
Georgios K Pitsilis, Heri Ramampiaro, and Helge Langseth. 2018. Effective hate-speech detection in Twitter data using recurrent neural networks. Applied Intelligence 48, 12 (2018), 4730–4742.
[31]
Daniel Preoţiuc-Pietro and Lyle Ungar. 2018. User-Level Race and Ethnicity Predictors from Twitter Text. In Proceedings of the 27th International Conference on Computational Linguistics. Association for Computational Linguistics, Santa Fe, New Mexico, USA, 1534–1545. https://www.aclweb.org/anthology/C18-1130
[32]
Thomas Purnell, William Idsardi, and John Baugh. 1999. Perceptual and phonetic experiments on American English dialect identification. Journal of language and social psychology 18, 1 (1999), 10–30.
[33]
Jing Qian, Mai ElSherief, Elizabeth M Belding, and William Yang Wang. 2018. Leveraging intra-user and inter-user representation learning for automated hate speech detection. arXiv preprint arXiv:1804.03124(2018).
[34]
Jacquelyn Rahman. 2012. The N word: Its history and use in the African American community. Journal of English Linguistics 40, 2 (2012), 137–171.
[35]
Maarten Sap, Dallas Card, Saadia Gabriel, Yejin Choi, and Noah A Smith. 2019. The risk of racial bias in hate speech detection. In Proceedings of the 57th annual meeting of the association for computational linguistics. 1668–1678.
[36]
Anna Schmidt and Michael Wiegand. 2017. A survey on hate speech detection using natural language processing. In Proceedings of the fifth international workshop on natural language processing for social media. 1–10.
[37]
Umme Aymun Siddiqua, Abu Nowshed Chy, and Masaki Aono. 2019. KDEHatEval at SemEval-2019 Task 5: A Neural Network Model for Detecting Hate Speech in Twitter. In Proceedings of the 13th International Workshop on Semantic Evaluation. Association for Computational Linguistics, Minneapolis, Minnesota, USA, 365–370. https://doi.org/10.18653/v1/S19-2064
[38]
Deanna Thompson. 2016. The Morpho-syntax of Aspectual Stay in AAVE. (2016).
[39]
Christopher Tuckwood. 2017. Hatebase: Online database of hate speech. The Sentinal Project. Available at: https://www. hatebase. org (2017).
[40]
Cynthia Van Hee, Els Lefever, Ben Verhoeven, Julie Mennes, Bart Desmet, Guy De Pauw, Walter Daelemans, and Véronique Hoste. 2015. Detection and fine-grained classification of cyberbullying events. In International conference recent advances in natural language processing (RANLP). 672–680.
[41]
Zeerak Waseem. 2016. Are you a racist or am i seeing things? annotator influence on hate speech detection on twitter. In Proceedings of the first workshop on NLP and computational social science. 138–142.
[42]
Zeerak Waseem and Dirk Hovy. 2016. Hateful Symbols or Hateful People? Predictive Features for Hate Speech Detection on Twitter. In Proceedings of the NAACL Student Research Workshop. Association for Computational Linguistics, San Diego, California, 88–93. https://doi.org/10.18653/v1/N16-2013
[43]
Robert L Williams. 1997. The ebonics controversy. Journal of Black Psychology 23, 3 (1997), 208–214.
[44]
Xuhui Zhou, Maarten Sap, Swabha Swayamdipta, Noah A Smith, and Yejin Choi. 2021. Challenges in automated debiasing for toxic language detection. arXiv preprint arXiv:2102.00086(2021).

Cited By

View all
  • (2024)NLPGuard: A Framework for Mitigating the Use of Protected Attributes by NLP ClassifiersProceedings of the ACM on Human-Computer Interaction10.1145/36869248:CSCW2(1-25)Online publication date: 8-Nov-2024
  • (2024)Algorithmic SubjectivitiesACM Transactions on Computer-Human Interaction10.1145/366034431:3(1-34)Online publication date: 27-Apr-2024
  • (2024)"We're Not in That Circle of Misinformation": Understanding Community-Based Trusted Messengers Through Cultural Code-SwitchingProceedings of the ACM on Human-Computer Interaction10.1145/36374298:CSCW1(1-36)Online publication date: 26-Apr-2024
  • Show More Cited By

Index Terms

  1. Exploring the Role of Grammar and Word Choice in Bias Toward African American English (AAE) in Hate Speech Classification
    Index terms have been assigned to the content through auto-classification.

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    FAccT '22: Proceedings of the 2022 ACM Conference on Fairness, Accountability, and Transparency
    June 2022
    2351 pages
    ISBN:9781450393522
    DOI:10.1145/3531146
    This work is licensed under a Creative Commons Attribution International 4.0 License.

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 20 June 2022

    Check for updates

    Author Tags

    1. African American English
    2. Fairness
    3. Hate Speech
    4. Linguistics
    5. Natural Language Processing
    6. Social Media

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Funding Sources

    • Cisco Systems Inc.

    Conference

    FAccT '22
    Sponsor:

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)798
    • Downloads (Last 6 weeks)86
    Reflects downloads up to 13 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)NLPGuard: A Framework for Mitigating the Use of Protected Attributes by NLP ClassifiersProceedings of the ACM on Human-Computer Interaction10.1145/36869248:CSCW2(1-25)Online publication date: 8-Nov-2024
    • (2024)Algorithmic SubjectivitiesACM Transactions on Computer-Human Interaction10.1145/366034431:3(1-34)Online publication date: 27-Apr-2024
    • (2024)"We're Not in That Circle of Misinformation": Understanding Community-Based Trusted Messengers Through Cultural Code-SwitchingProceedings of the ACM on Human-Computer Interaction10.1145/36374298:CSCW1(1-36)Online publication date: 26-Apr-2024
    • (2024)AI generates covertly racist decisions about people based on their dialectNature10.1038/s41586-024-07856-5633:8028(147-154)Online publication date: 28-Aug-2024
    • (2024)Policy advice and best practices on bias and fairness in AIEthics and Information Technology10.1007/s10676-024-09746-w26:2Online publication date: 29-Apr-2024
    • (2024)Fairness Certification for Natural Language Processing and Large Language ModelsIntelligent Systems and Applications10.1007/978-3-031-66329-1_39(606-624)Online publication date: 31-Jul-2024
    • (2023)A Multidisciplinary Lens of Bias in Hate SpeechProceedings of the 2023 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining10.1145/3625007.3627491(121-125)Online publication date: 6-Nov-2023
    • (2023)"Honestly, I Think TikTok has a Vendetta Against Black Creators": Understanding Black Content Creator Experiences on TikTokProceedings of the ACM on Human-Computer Interaction10.1145/36101697:CSCW2(1-31)Online publication date: 4-Oct-2023
    • (2023)Reclaiming the Digital Commons: A Public Data Trust for Training DataProceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society10.1145/3600211.3604658(855-868)Online publication date: 8-Aug-2023
    • (2023)Hate Speech Detection and Bias in Supervised Text ClassificationThe Oxford Handbook of the Sociology of Machine Learning10.1093/oxfordhb/9780197653609.013.16Online publication date: 18-Dec-2023
    • Show More Cited By

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media