figshare
Browse
1/2
30 files

Wiki-Reliability: A Large Scale Dataset for Content Reliability on Wikipedia

Version 4 2021-03-02, 15:21
Version 3 2021-02-28, 16:03
Version 2 2021-02-26, 03:15
Version 1 2021-02-26, 03:10
dataset
posted on 2021-03-02, 15:21 authored by KayYen WongKayYen Wong, Diego Saez-TrumperDiego Saez-Trumper, Miriam RediMiriam Redi
Wiki-Reliability: Machine Learning datasets for measuring content reliability on Wikipedia

Consists of metadata features and content text datasets, with the formats:
- {template_name}_features.csv
- {template_name}_difftxt.csv.gz
- {template_name}_fulltxt.csv.gz

For more details on the project, dataset schema, and links to data usage and benchmarking:
https://meta.wikimedia.org/wiki/Research:Wiki-Reliability:_A_Large_Scale_Dataset_for_Content_Reliability_on_Wikipedia

History