figshare
Browse
Dataset_comments.xlsx (1.12 MB)

SAM dataset

Download (1.12 MB)
Version 2 2020-05-28, 06:53
Version 1 2020-05-28, 06:50
dataset
posted on 2020-05-28, 06:53 authored by Mads Guldborg Kjeldgaard Kongsbak, Steffan Eybye Christensen, Lucas Høyberg Puvis de Chavannes, Peter Due Jensen
Sentiment Analysis Multitool. Danish Sentiment annotated dataset. This annotated data is meant to be used with supervised machine learning algorithms. The dataset was created specifically to classify sentences to the root comments of Danish political articles on social media. The dataset consists of 9008 sentences that are labelled with fine-grained polarity in the range from -2 to 2 (negative to postive). The quality of the fine-grained is not cross validated and is therefore subject to uncertainties; however, the simple polarity has been cross validated and therefore is considered to be more correct.

Reference: Mads Guldborg Kjeldgaard Kongsbak, Steffan Eybye Christensen, Lucas Høyberg Puvis de Chavannes, Peter Due Jensen. "Sentiment Analysis Multitool, SAM". 2019. Bachelor disseratation, IT University of Copenhagen.

History

Usage metrics

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC