wikidatawiki.labelings.5k.json (1008.89 kB)
Wikidata item quality labels
dataset
posted on 2019-12-17, 07:58 authored by Glorian YapinusGlorian Yapinus, Amir SarabadaniAmir Sarabadani, Aaron HalfakerAaron HalfakerThis dataset contains quality labels for 5000 Wikidata items applied by Wikidata editors. The labels correspond to the quality scale described at https://www.wikidata.org/wiki/Wikidata:Item_quality Each line is a JSON blob with the following fields:
- item_quality: The labeled quality class (A-E)
- rev_id: the revision identifier of the version of the item that was labeled
- strata: The size of the item in bytes at the time it was sampled
- page_len: The actual size of the item in bytes
- page_title: The Qid of the item
- claims: A dictionary including P31 "instance-of" values for filtering out certain types of items
The # of observations by class is:
- A class: 322
- B class: 438
- C class: 1773
- D class: 997
- E class: 1470