sorry, we can't preview this file
wp_2017-10-24. (821.63 kB)
WikiProjects Machine Readable Dataset
Version 3 2017-10-24, 17:39
Version 2 2017-10-20, 17:36
Version 1 2017-10-16, 19:23
dataset
posted on 2017-10-24, 17:39 authored by Sumit AsthanaSumit Asthana, Aaron HalfakerAaron HalfakerMachine readable format of WikiProjects listed at https://en.wikipedia.org/wiki/Wikipedia:WikiProject_Council/Directory
The dataset is generated using the code at - https://github.com/wiki-ai/drafttopic/
The dataset is modeled in the form of a nested tree structure after the original hierarchical mappings on the WikiProejcts home page and its child pages.
* Each non-leaf entry represents a sub-category with a name and some associated information like the level in the page it was parsed at and the root url of the page it was parsed from.
* Each non-leaf node has a mandatory key "topics" which leads to further sub-categories within it.
* Each leaf node is a WikiProject entry, with actual WikiProject name and its active status.