TripAdvisor UK reviews gendered data sets with equal numbers of all five ratings.zip (94.42 MB)

TripAdvisor reviews of hotels and restaurants by gender

Download (94.42 MB)
dataset
posted on 11.05.2018 by Mike Thelwall
Datasets of Tripadvisor reviews by UK residents of UK hotels and restaurants, together with the user's rating of the hotel.
Datasets are split by:
Hotel star level (2, 3, 4 or all[mixed]) or Restaurant;
Reviewer gender (M=male-authored reviews; F=female-authored reviews; MF=equal numbers of male and female authored reviews for each rating level);
Number of texts (1k, 2k, 4k, 8k, 16k, or all available)

Each dataset contains equal numbers of reviews at each rating level.
The reviews were selected at random from TripAdvisor.

This data is from this paper:
Thelwall, M. (2018). Gender bias in machine learning for sentiment analysis. Online Information Review, 42(3), 343-354. doi: 10.1108/OIR-05-2017-0152

History

Licence

Exports

Licence

Exports