11 files

Wikipedia Clickstream

Posted on 2017-02-11, 04:13. Authored by Ellery Wulczyn, Dario Taraborelli

This project contains datasets of counts of (referer, resource) pairs extracted from the request logs of Wikipedia. A referer is an HTTP header field that identifies the address of the webpage that linked to the resource being requested. The data show how people reach a Wikipedia article and which links they click on from it. In other words, the data form a weighted network of articles, where each edge weight is the number of times people navigated from one page to another. For more information and documentation, see the link in the references section below.
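
As a rough illustration of the weighted-network view described above, the following Python sketch loads (referer, resource, count) rows into a nested dictionary and lists the most-followed links out of one page. The tab-separated layout, the column names `referer`, `resource`, and `count`, the helper names `build_graph` and `top_outgoing`, and the sample rows are all assumptions made for this example; they are invented placeholders, not the documented file format or real values from the datasets.

```python
import csv
import io
from collections import defaultdict

# Invented sample rows for illustration only. The column names and the
# tab-separated layout are assumptions, not the documented file format.
SAMPLE = (
    "referer\tresource\tcount\n"
    "London\tEngland\t120\n"
    "London\tRiver_Thames\t80\n"
    "England\tLondon\t200\n"
    "London\tEngland\t30\n"
)


def build_graph(lines):
    """Build {referer: {resource: count}} from tab-separated rows with a header.

    Repeated (referer, resource) pairs are summed into a single edge weight.
    """
    graph = defaultdict(dict)
    for row in csv.DictReader(lines, delimiter="\t"):
        edges = graph[row["referer"]]
        edges[row["resource"]] = edges.get(row["resource"], 0) + int(row["count"])
    return graph


def top_outgoing(graph, referer, k=3):
    """Return up to k (resource, count) pairs for a referer, heaviest edge first."""
    edges = graph.get(referer, {})
    return sorted(edges.items(), key=lambda item: item[1], reverse=True)[:k]


graph = build_graph(io.StringIO(SAMPLE))
for resource, count in top_outgoing(graph, "London"):
    print(f"London -> {resource}: {count}")
```

To try this on a downloaded file, an open file handle could be passed to `build_graph` in place of the `io.StringIO` object, after adjusting the column names to match the headers documented for that file.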