Dataset_comments.xlsx (1.12 MB)
SAM dataset
Version 2 2020-05-28, 06:53
Version 1 2020-05-28, 06:50
dataset
posted on 2020-05-28, 06:53 authored by Mads Guldborg Kjeldgaard Kongsbak, Steffan Eybye Christensen, Lucas Høyberg Puvis de Chavannes, Peter Due JensenSentiment Analysis Multitool. Danish Sentiment annotated dataset. This annotated data is meant to be used with supervised machine learning
algorithms. The dataset was created specifically to classify sentences
to the root comments of Danish political articles on social media.
The dataset consists of 9008 sentences that are labelled with
fine-grained polarity in the range from -2 to 2 (negative to postive).
The quality of the fine-grained is not cross validated and is therefore
subject to uncertainties; however, the simple polarity has been cross
validated and therefore is considered to be more correct. Reference: Mads Guldborg Kjeldgaard Kongsbak, Steffan Eybye
Christensen, Lucas Høyberg Puvis de Chavannes, Peter Due Jensen.
"Sentiment Analysis Multitool, SAM". 2019. Bachelor disseratation, IT
University of Copenhagen. |