Dataset_comments.xlsx (1.12 MB)

SAM dataset

Download (1.12 MB)
dataset
posted on 28.05.2020, 06:53 by Mads Guldborg Kjeldgaard Kongsbak, Steffan Eybye Christensen, Lucas Høyberg Puvis de Chavannes, Peter Due Jensen
Sentiment Analysis Multitool. Danish Sentiment annotated dataset. This annotated data is meant to be used with supervised machine learning algorithms. The dataset was created specifically to classify sentences to the root comments of Danish political articles on social media. The dataset consists of 9008 sentences that are labelled with fine-grained polarity in the range from -2 to 2 (negative to postive). The quality of the fine-grained is not cross validated and is therefore subject to uncertainties; however, the simple polarity has been cross validated and therefore is considered to be more correct.

Reference: Mads Guldborg Kjeldgaard Kongsbak, Steffan Eybye Christensen, Lucas Høyberg Puvis de Chavannes, Peter Due Jensen. "Sentiment Analysis Multitool, SAM". 2019. Bachelor disseratation, IT University of Copenhagen.

History

Usage metrics

Licence

Exports