Version 3 2019-11-01, 18:05Version 3 2019-11-01, 18:05
Version 2 2019-09-24, 08:03Version 2 2019-09-24, 08:03
Version 1 2019-07-01, 09:50Version 1 2019-07-01, 09:50
dataset
posted on 2019-11-01, 18:05authored byKristian Lundby Gjerde
Document collection scraped from the Russian governmental website kremlin.ru, where all content is licensed under Creative Commons Attribution 4.0.
Downloaded on 17 March 2019. Includes all items listed at http://kremlin.ru/events/president/transcripts up to the end of February 2019 (10221 documents).
Format:
1) Kremlin_transcripts_ru_corpus.rds: a 'corporaexplorerobject' intended to be used with the corporaexplorer R package (https://github.com/kgjerde/corporaexplorer).
2) Kremlin_transcripts_ru_df.rds: A regular R data frame with the documents and some metadata.
Version 3. Edited 1 November 2019: utf8 encoding fix.