TY - JOUR
T1 - Hate speech classification for Sinhalese and Gujarati
AU - Qureshi, Muhammad Deedahwar Mazhar
AU - Sawant, Madhuri
AU - Qureshi, M. Atif
AU - Rashwan, Wael
AU - Younus, Arjumand
AU - Caton, Simon
N1 - Publisher Copyright:
© 2023 Copyright for this paper by its authors.
PY - 2023
Y1 - 2023
N2 - We, representing Team”XAG-TUD,” participated in HASOC 2023, focusing on Task 1, which comprises subtasks 1A and 1B. Task 1A revolves around coarse-grained binary classification, specifically discriminating between content falling into the categories of HOF (Hateful or Offensive) and NOT for Sinhalese, a low-resource language. Similarly, Task 1B involves a similar classification for Gujarati, another low-resource language. In this paper, we provide detailed insights into our solutions for both sub-tasks within Task 1. Notably, our observations reveal that the LaBSE (Language-agnostic BERT Sentence Embedding) model consistently outperformed the XLM-R model for both sub-tasks, demonstrating its effectiveness in addressing hate speech classification challenges in these languages.
AB - We, representing Team”XAG-TUD,” participated in HASOC 2023, focusing on Task 1, which comprises subtasks 1A and 1B. Task 1A revolves around coarse-grained binary classification, specifically discriminating between content falling into the categories of HOF (Hateful or Offensive) and NOT for Sinhalese, a low-resource language. Similarly, Task 1B involves a similar classification for Gujarati, another low-resource language. In this paper, we provide detailed insights into our solutions for both sub-tasks within Task 1. Notably, our observations reveal that the LaBSE (Language-agnostic BERT Sentence Embedding) model consistently outperformed the XLM-R model for both sub-tasks, demonstrating its effectiveness in addressing hate speech classification challenges in these languages.
UR - http://www.scopus.com/inward/record.url?scp=85193954181&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85193954181
SN - 1613-0073
VL - 3681
SP - 501
EP - 515
JO - CEUR Workshop Proceedings
JF - CEUR Workshop Proceedings
T2 - 15th Forum for Information Retrieval Evaluation, FIRE 2023
Y2 - 15 December 2023 through 18 December 2023
ER -