Dataset: X/Twitter Discourse on Ukrainian War Refugees in Poland
This repository contains the data used to analyse posts from the X/Twitter platform.
Unstructured data was extracted from X/Twitter using R scripts through the Application Programming Interface (API) v2 for Academic Research, which enabled researchers to retrieve posts from the entire X/Twitter archive. At the time of data collection, access to the Twitter API for Academic Research was still available; it was restricted after the company changed its policy in February 2023. The post selection criteria were: (i) posts published in the Polish language; (ii) posts containing the keywords “Ukraińcy” (“Ukrainians”), “w Polsce” (“in Poland”); and (iii) posts published between 22 February 2022 (12:00 a.m. CET) and 31 December 2022 (11:59 p.m. CET). The time frame reflects the date when the Russian Federation invaded Ukraine and the closing date of the first calendar year of the conflict. The X/Twitter users included in the data analysis were those who sent posts meeting the above criteria during the pre-defined period. Unverified users were also included, as one of the objectives of the study was to analyse message dissemination. A total of 55,035 posts (original content), reposts (forwarded content), and replies (discussions among users) were collected. These were then extracted and imported into NodeXL, a professional social media analysis tool used in many research projects.
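The three selection criteria above can be illustrated with a minimal Python sketch. It is not the R code used for the original collection, which is not reproduced here. The record field names (lang, text, created_at) are assumed to follow the X/Twitter API v2 post object, and the sketch further assumes that both keywords must appear in a post and that the final minute of the window is included in full. The function names are hypothetical.

```python
from datetime import datetime, timedelta, timezone

# CET is UTC+1. Both window boundaries fall in winter, so no
# daylight-saving offset applies and a fixed offset is sufficient.
CET = timezone(timedelta(hours=1), name="CET")

# Criterion (iii): 22 February 2022, 12:00 a.m. CET to 31 December 2022, 11:59 p.m. CET.
WINDOW_START = datetime(2022, 2, 22, 0, 0, tzinfo=CET)
# Assumption: the 11:59 p.m. minute is included in full, so the upper
# bound is written as an exclusive limit at midnight on 1 January 2023.
WINDOW_END_EXCLUSIVE = datetime(2023, 1, 1, 0, 0, tzinfo=CET)

# Criterion (ii). Assumption: both keywords must be present (case-insensitive).
KEYWORDS = ("ukraińcy", "w polsce")


def to_utc_timestamp(moment):
    """Format a timezone-aware datetime as an ISO 8601 UTC string."""
    return moment.astimezone(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")


def matches_criteria(post):
    """Return True if a post record meets criteria (i) to (iii)."""
    # API timestamps are assumed to end in "Z"; rewrite that suffix so
    # datetime.fromisoformat can parse it on older Python versions.
    created = datetime.fromisoformat(post["created_at"].replace("Z", "+00:00"))
    text = post["text"].lower()
    return (
        post["lang"] == "pl"                               # (i) Polish language
        and all(keyword in text for keyword in KEYWORDS)   # (ii) keywords
        and WINDOW_START <= created < WINDOW_END_EXCLUSIVE  # (iii) time frame
    )
```

Because the window is stated in CET while API timestamps are in UTC, the conversion matters at the boundaries: to_utc_timestamp(WINDOW_START) gives "2022-02-21T23:00:00Z", so a post stamped 22:59 UTC on 21 February 2022 falls outside the window.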