TripAdvisor UK reviews gendered data sets with equal numbers of all five ratings.zip (94.42 MB)
TripAdvisor reviews of hotels and restaurants by gender
Dataset posted on 2018-05-11, 07:24, authored by Mike Thelwall
Datasets of TripAdvisor reviews by UK residents of UK hotels and restaurants, together with each reviewer's rating of the hotel or restaurant.
Datasets are split by:
Hotel star level (2, 3, 4, or all [mixed]) or Restaurant;
Reviewer gender (M = male-authored reviews; F = female-authored reviews; MF = equal numbers of male- and female-authored reviews at each rating level);
Number of texts (1k, 2k, 4k, 8k, 16k, or all available).
Each dataset contains equal numbers of reviews at each rating level.
The reviews were selected at random from TripAdvisor.
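As an illustration only, a balanced set of the MF kind (equal numbers of male- and female-authored reviews at each rating level) could be drawn along the following lines. This is a minimal sketch and not the procedure used for these datasets: the function `balanced_sample`, the field names `rating`, `gender`, and `text`, and the toy records are all hypothetical, and the real files may be organised differently.

```python
import random
from collections import defaultdict


def balanced_sample(reviews, per_group, key, seed=0):
    """Draw the same number of reviews, at random, from every group.

    `key` maps a review to its group, e.g. (rating, gender).
    """
    rng = random.Random(seed)  # fixed seed so the draw is repeatable
    groups = defaultdict(list)
    for review in reviews:
        groups[key(review)].append(review)

    sample = []
    for group_key in sorted(groups):
        members = groups[group_key]
        if len(members) < per_group:
            raise ValueError(f"group {group_key} has only {len(members)} reviews")
        # Sample without replacement within the group.
        sample.extend(rng.sample(members, per_group))
    return sample


# Hypothetical toy records: 10 reviews per (rating, gender) combination.
reviews = [
    {"rating": rating, "gender": gender, "text": f"review {i}"}
    for rating in range(1, 6)
    for gender in ("M", "F")
    for i in range(10)
]

# Equal numbers of M and F reviews at each of the five rating levels.
mf_sample = balanced_sample(
    reviews, per_group=4, key=lambda r: (r["rating"], r["gender"])
)
```

Grouping by `r["rating"]` alone would give a set balanced by rating only, which corresponds to the M or F datasets when the input holds reviews from a single gender.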
The data are from the following paper:
Thelwall, M. (2018). Gender bias in machine learning for sentiment analysis. Online Information Review, 42(3), 343-354. doi: 10.1108/OIR-05-2017-0152