Wiki-Reliability: A Large Scale Dataset for Content Reliability on Wikipedia
Version 4 2021-03-02, 15:21Version 4 2021-03-02, 15:21
Version 3 2021-02-28, 16:03Version 3 2021-02-28, 16:03
Version 2 2021-02-26, 03:15Version 2 2021-02-26, 03:15
Version 1 2021-02-26, 03:10Version 1 2021-02-26, 03:10
dataset
posted on 2021-03-02, 15:21 authored by KayYen WongKayYen Wong, Diego Saez-TrumperDiego Saez-Trumper, Miriam RediMiriam RediWiki-Reliability: Machine Learning datasets for measuring content reliability on Wikipedia
Consists of metadata features and content text datasets, with the formats:
- {template_name}_features.csv
- {template_name}_difftxt.csv.gz
- {template_name}_fulltxt.csv.gz
For more details on the project, dataset schema, and links to data usage and benchmarking:
https://meta.wikimedia.org/wiki/Research:Wiki-Reliability:_A_Large_Scale_Dataset_for_Content_Reliability_on_Wikipedia
History
Usage metrics
Categories
Keywords
Licence
Exports
RefWorksRefWorks
BibTeXBibTeX
Ref. managerRef. manager
EndnoteEndnote
DataCiteDataCite
NLMNLM
DCDC