Between December 1, 2019 and February 27, 2020, Weiboscope collected 11,362,502 posts. Of these, 1,230,353 contain at least one outbreak-related keyword (please refer to the paper for the keyword list), and 2,104 of those keyword posts (1.7 per 1,000) have been censored.
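As a quick check of the rate quoted above, the short Python sketch below recomputes it from the three counts. The variable names are illustrative only; the point is that the 1.7 per 1,000 figure uses the keyword-matching posts as its denominator, not the full collection.

```python
# Counts as stated in the description above.
total_posts = 11_362_502      # all posts collected
keyword_posts = 1_230_353     # posts with at least one outbreak-related keyword
censored_posts = 2_104        # censored posts among the keyword posts

# Censorship rate per 1,000 keyword posts.
rate_per_1000 = censored_posts / keyword_posts * 1000

# Share of all collected posts that match a keyword.
share_keyword = keyword_posts / total_posts

print(f"{rate_per_1000:.1f} censored per 1,000 keyword posts")
print(f"{share_keyword:.1%} of collected posts match a keyword")
```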
Data fields:
Column 1: "created_at": date of publication
Column 2: "censorship_type": whether the post was directly censored (the request returned "permission_denied") or is a retweet of a censored post ("retweet of a “permission denied” post")
Column 3: "id_hashed": hashed post ID
Column 4: "retweeted_status_hashed": hashed retweet status
Column 5: "text_cleaned": text body with all @XXX mentions removed
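The five columns above could be read along the following lines. This is a minimal sketch, not the authors' code: it assumes the data is a comma-separated file with one header row and the columns in the order listed, none of which is confirmed by this description. The helper names read_posts and count_by_type are hypothetical, and the sample rows are made up for illustration.

```python
import csv
import io
from collections import Counter

# Column names in the order given in the field list (assumed file order).
FIELDS = [
    "created_at",
    "censorship_type",
    "id_hashed",
    "retweeted_status_hashed",
    "text_cleaned",
]

def read_posts(handle, has_header=True):
    """Read rows from an open text handle into a list of dicts keyed by FIELDS."""
    reader = csv.reader(handle)
    if has_header:
        next(reader, None)  # skip the header row (assumed to be present)
    return [dict(zip(FIELDS, row)) for row in reader]

def count_by_type(posts):
    """Count posts per value of the censorship_type column."""
    return Counter(post["censorship_type"] for post in posts)

# Synthetic sample (NOT real data), used only to show the expected shape.
sample = io.StringIO(
    "created_at,censorship_type,id_hashed,retweeted_status_hashed,text_cleaned\n"
    "2020-01-01,permission_denied,abc123,,example text one\n"
    "2020-01-02,permission_denied,def456,abc123,example text two\n"
)
posts = read_posts(sample)
print(count_by_type(posts))
```

For a file on disk, open it with open(path, newline="", encoding="utf-8") and pass the handle to read_posts; the encoding is also an assumption and may need adjusting.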
Please cite the following reference:
King-wa Fu & Yuner Zhu (2020) Did the world overlook the media’s early warning of COVID-19?, Journal of Risk Research, DOI: 10.1080/13669877.2020.1756380