SHAJ: Albanian hate speech & abusive language
Dataset posted on 09.03.2022, 20:19, authored by Leon Derczynski
SHAJ is an annotated Albanian dataset for hate speech and offensive language, built from user-generated content on various social media platforms. Its annotation follows the hierarchical schema introduced in OffensEval.
Paper here: https://arxiv.org/abs/2107.13592
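For readers unfamiliar with the OffensEval schema, here is a minimal sketch of how its three-level hierarchy constrains a single annotation. The label strings (OFF/NOT, TIN/UNT, IND/GRP/OTH) are the ones used in OffensEval; that the SHAJ files use exactly these strings is an assumption, and the function name `is_valid_annotation` is hypothetical, so check both against the released data before relying on this.

```python
from typing import Optional

# OffensEval-style label sets (assumed to match the strings in the SHAJ files).
LEVEL_A = {"OFF", "NOT"}          # offensive vs. not offensive
LEVEL_B = {"TIN", "UNT"}          # targeted insult/threat vs. untargeted
LEVEL_C = {"IND", "GRP", "OTH"}   # target: individual, group, other


def is_valid_annotation(a: str,
                        b: Optional[str] = None,
                        c: Optional[str] = None) -> bool:
    """Check that labels respect the hierarchy: B only applies to
    offensive posts, and C only applies to targeted offensive posts."""
    if a not in LEVEL_A:
        return False
    if a == "NOT":
        # Non-offensive posts carry no lower-level labels.
        return b is None and c is None
    if b not in LEVEL_B:
        return False
    if b == "UNT":
        # Untargeted offence has no target type.
        return c is None
    return c in LEVEL_C


# Hypothetical usage on hand-written label triples (not real SHAJ rows).
print(is_valid_annotation("OFF", "TIN", "GRP"))
print(is_valid_annotation("NOT", "TIN"))
```

A check like this can help catch rows where a lower-level label was filled in for a post that the higher level marks as non-offensive or untargeted.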