TripAdvisor reviews of hotels and restaurants by gender
Dataset posted on 11.05.2018 by Mike Thelwall
Datasets of TripAdvisor reviews by UK residents of UK hotels and restaurants, each with the reviewer's rating of the hotel or restaurant.
Datasets are split by:
Hotel star level (2, 3, 4 or all [mixed]) or Restaurant;
Reviewer gender (M = male-authored reviews; F = female-authored reviews; MF = equal numbers of male-authored and female-authored reviews at each rating level);
Number of texts (1k, 2k, 4k, 8k, 16k, or all available).
Each dataset contains equal numbers of reviews at each rating level.
The reviews were selected at random from TripAdvisor.
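The balancing described above (equal numbers of reviews at each rating level and, for the MF sets, equal numbers per gender within each level) can be illustrated with a short sketch. This is not the author's actual procedure, only one plausible way to draw such a sample. The function name `balanced_sample`, the record fields `rating` and `gender`, and the 1 to 5 rating scale are all assumptions made for illustration; the dataset files may be organised differently.

```python
import random
from collections import defaultdict


def balanced_sample(reviews, n_total, genders=("M", "F"),
                    ratings=(1, 2, 3, 4, 5), seed=0):
    """Randomly draw reviews with equal counts per (rating, gender) cell.

    `reviews` is a list of dicts with hypothetical keys "rating" and
    "gender". The 1-5 rating scale is an assumption, not taken from
    the dataset description.
    """
    # Group the candidate reviews by rating level and reviewer gender.
    cells = defaultdict(list)
    for review in reviews:
        if review["gender"] in genders and review["rating"] in ratings:
            cells[(review["rating"], review["gender"])].append(review)

    # Equal share for every cell; any remainder from the integer
    # division is dropped, so the result may be slightly under n_total.
    per_cell = n_total // (len(ratings) * len(genders))

    rng = random.Random(seed)
    sample = []
    for rating in ratings:
        for gender in genders:
            pool = cells[(rating, gender)]
            if len(pool) < per_cell:
                raise ValueError(
                    f"only {len(pool)} reviews for rating={rating}, "
                    f"gender={gender}; need {per_cell}"
                )
            # Sample without replacement within the cell.
            sample.extend(rng.sample(pool, per_cell))
    return sample
```

Under these assumptions, calling `balanced_sample(reviews, 1000)` would correspond to a 1k MF set, and passing `genders=("M",)` or `genders=("F",)` would correspond to a single-gender set that is still balanced across rating levels.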
The data are from the following paper:
Thelwall, M. (2018). Gender bias in machine learning for sentiment analysis. Online Information Review, 42(3), 343-354. doi: 10.1108/OIR-05-2017-0152