full_albanian_dataset.csv (769.21 kB)

SHAJ: Albanian hate speech & abusive language

dataset
posted on 09.03.2022, 20:19, authored by Leon Derczynski
SHAJ is an annotated Albanian dataset for hate speech and offensive speech that has been constructed from user-generated content on various social media platforms. Its annotation follows the hierarchical schema introduced in OffensEval.

Paper here: https://arxiv.org/abs/2107.13592
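
A minimal sketch of how the label distribution in the CSV file might be inspected. The record does not list the file's column names, so the helper `label_counts` (a hypothetical name) takes them as an argument and reports the header it actually found if a requested column is absent. UTF-8 encoding is also an assumption.

```python
import csv
from collections import Counter


def label_counts(path, columns, encoding="utf-8"):
    """Count the values in each requested column of a CSV file.

    `columns` must name columns present in the file's header row;
    the real column names of the dataset are not given in this record.
    """
    counts = {name: Counter() for name in columns}
    with open(path, newline="", encoding=encoding) as handle:
        reader = csv.DictReader(handle)
        header = reader.fieldnames or []
        missing = [name for name in columns if name not in header]
        if missing:
            raise KeyError(f"columns not in file: {missing}; header is {header}")
        for row in reader:
            for name in columns:
                counts[name][row[name]] += 1
    return counts


# Example call (the column name below is a placeholder, not a known field):
#   label_counts("full_albanian_dataset.csv", ["some_label_column"])
```

Passing a column that does not exist raises a `KeyError` whose message includes the real header, which is a quick way to discover the annotation columns before counting them.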
