Evaluating ChatGPT against Functionality Tests for Hate Speech Detection

Mithun Das; Saurabh Kumar Pandey; Animesh Mukherjee

Evaluating ChatGPT against Functionality Tests for Hate Speech Detection

Mithun Das, Saurabh Kumar Pandey, Animesh Mukherjee

Abstract

Large language models like ChatGPT have recently shown a great promise in performing several tasks, including hate speech detection. However, it is crucial to comprehend the limitations of these models to build robust hate speech detection systems. To bridge this gap, our study aims to evaluate the strengths and weaknesses of the ChatGPT model in detecting hate speech at a granular level across 11 languages. Our evaluation employs a series of functionality tests that reveals various intricate failures of the model which the aggregate metrics like macro F1 or accuracy are not able to unfold. In addition, we investigate the influence of complex emotions, such as the use of emojis in hate speech, on the performance of the ChatGPT model. Our analysis highlights the shortcomings of the generative models in detecting certain types of hate speech and highlighting the need for further research and improvements in the workings of these models.

Anthology ID:: 2024.lrec-main.564
Volume:: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
Month:: May
Year:: 2024
Address:: Torino, Italia
Editors:: Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, Nianwen Xue
Venues:: LREC | COLING
SIG:
Publisher:: ELRA and ICCL
Note:
Pages:: 6370–6380
Language:
URL:: https://aclanthology.org/2024.lrec-main.564
DOI:
Bibkey:
Cite (ACL):: Mithun Das, Saurabh Kumar Pandey, and Animesh Mukherjee. 2024. Evaluating ChatGPT against Functionality Tests for Hate Speech Detection. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 6370–6380, Torino, Italia. ELRA and ICCL.
Cite (Informal):: Evaluating ChatGPT against Functionality Tests for Hate Speech Detection (Das et al., LREC-COLING 2024)
Copy Citation:
PDF:: https://aclanthology.org/2024.lrec-main.564.pdf

PDF Cite Search