figshare
Browse

sorry, we can't preview this file

wp_2017-10-24. (821.63 kB)

WikiProjects Machine Readable Dataset

Download (1.77 MB)
Version 3 2017-10-24, 17:39
Version 2 2017-10-20, 17:36
Version 1 2017-10-16, 19:23
dataset
posted on 2017-10-24, 17:39 authored by Sumit AsthanaSumit Asthana, Aaron HalfakerAaron Halfaker
Machine readable format of WikiProjects listed at https://en.wikipedia.org/wiki/Wikipedia:WikiProject_Council/Directory

The dataset is generated using the code at - https://github.com/wiki-ai/drafttopic/

The dataset is modeled in the form of a nested tree structure after the original hierarchical mappings on the WikiProejcts home page and its child pages.

* Each non-leaf entry represents a sub-category with a name and some associated information like the level in the page it was parsed at and the root url of the page it was parsed from.
* Each non-leaf node has a mandatory key "topics" which leads to further sub-categories within it.
* Each leaf node is a WikiProject entry, with actual WikiProject name and its active status.

History

Usage metrics

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC