Using Explainable AI (XAI) for Identification of Subjectivity in Hate Speech Annotations for Low-Resource Languages

Madhuri Sawant, Arjumand Younus, Simon Caton, Muhammad Atif Qureshi

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The proliferation of hate speech on digital platforms has become a significant issue, and automated content moderation systems built on machine learning are a proposed solution. However, they face challenges in multilingual and low-resource settings due to the need for extensive labelled data. This paper introduces an explainable AI framework designed to identify annotation discrepancies in low-resource languages, focusing on Hindi, the third most-spoken language worldwide, which lacks comprehensive research in hate speech detection. By examining the labelling quality of the Hate speech and Offensive Content Identification in English and Indo-Aryan Languages (HASOC) challenge, we use unsupervised learning methods to extract topical variations and annotation behavior and apply these features in an explainable AI-based classification model, TabNet. We release a relabelled Hindi hate speech benchmark dataset with label-flipping information and related metadata to facilitate research in this area. The source code has also been released for reproducibility purposes. Please be advised that this work contains examples of toxic content
Original languageEnglish
Title of host publicationOASIS 2024 - Proceedings of the 2024 Workshop on Open Challenges in Online Social Media, Held in conjunction with the 35th ACM Conference on Hypertext and Social Media, HT 2024
Pages10-17
Number of pages8
ISBN (Electronic)9798400710827
DOIs
Publication statusPublished - 10 Sep 2024

Publication series

NameOASIS 2024 - Proceedings of the 2024 Workshop on Open Challenges in Online Social Media, Held in conjunction with the 35th ACM Conference on Hypertext and Social Media, HT 2024

Keywords

  • Hate Speech Classification
  • Transformers
  • Under-resourced languages
  • XAI
  • multilingual

Fingerprint

Dive into the research topics of 'Using Explainable AI (XAI) for Identification of Subjectivity in Hate Speech Annotations for Low-Resource Languages'. Together they form a unique fingerprint.

Cite this