figshare
Browse
TripAdvisor UK reviews gendered data sets with equal numbers of all five ratings.zip (94.42 MB)

TripAdvisor reviews of hotels and restaurants by gender

Download (94.42 MB)
dataset
posted on 2018-05-11, 07:24 authored by Mike ThelwallMike Thelwall
Datasets of Tripadvisor reviews by UK residents of UK hotels and restaurants, together with the user's rating of the hotel.
Datasets are split by:
Hotel star level (2, 3, 4 or all[mixed]) or Restaurant;
Reviewer gender (M=male-authored reviews; F=female-authored reviews; MF=equal numbers of male and female authored reviews for each rating level);
Number of texts (1k, 2k, 4k, 8k, 16k, or all available)

Each dataset contains equal numbers of reviews at each rating level.
The reviews were selected at random from TripAdvisor.

This data is from this paper:
Thelwall, M. (2018). Gender bias in machine learning for sentiment analysis. Online Information Review, 42(3), 343-354. doi: 10.1108/OIR-05-2017-0152

History

Usage metrics

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC