Twitter event datasets (2012-2016)
datasetposted on 12.10.2018 by Arkaitz Zubiaga
Datasets usually provide raw data for analysis. This raw data often comes in spreadsheet form, but can be any collection of data, on which analysis can be performed.
This collection includes data for 30 different Twitter datasets associated with real world events. The datasets were collected between 2012 and 2016, always using the streaming API with a set of keywords.
These datasets are released in accordance with Twitter's TOS, which allows sharing of tweet IDs and are intended for non-commercial research.
Note: Twitter's developer policy doesn't allow sharing more than 1,500,000 tweet IDs (https://dev.twitter.com/overview/terms/policy#updated-policy), unless the author is affiliated with an academic institution (which is my case) and tweet IDs are solely used for non-commercial purposes (https://twittercommunity.com/t/policy-update-clarification-research-use-cases/87566). Hence, by downloading these datasets you agree that you will not use it for commercial purposes.
Please cite the following paper if you make use of these datasets for your research: https://onlinelibrary.wiley.com/doi/full/10.1002/asi.24026
See README file for more details.