%0 Generic
%A Wulczyn, Ellery
%A Thain, Nithum
%A Dixon, Lucas
%D 2017
%T Wikipedia Talk Labels: Aggression
%U https://figshare.com/articles/dataset/Wikipedia_Talk_Labels_Aggression/4267550
%R 10.6084/m9.figshare.4267550.v5
%2 https://ndownloader.figshare.com/files/7038038
%2 https://ndownloader.figshare.com/files/7394506
%2 https://ndownloader.figshare.com/files/7640644
%K WIkipedia
%K Online Comments
%K Natural Language Processing
%K Knowledge Representation and Machine Learning
%X This
data set includes over 100k labeled discussion comments from English
Wikipedia. Each comment was labeled by multiple annotators via
Crowdflower on whether it has aggressive tone. We also include some demographic data for each crowd-worker. See our wiki for documentation of the schema of each file and our research paper for
documentation on the data collection and modeling methodology. For a
quick demo of how to use the data for model building and analysis, check
out this ipython notebook.
%I figshare