Download file
Download file
Download file
Download file
4 files

OpenCitations Corpus provenance data of all the responsible agents, archived on 2017-09-25

Download all (1.99 GB)
posted on 2017-10-02, 08:08 authored by OpenCitations ​OpenCitations ​
This archive contains the dump of the OpenCitations Corpus (OCC, provenance data about responsible agents, created regularly every month.

After unzipping the archive, Disk ARchive (DAR,, a multi-platform archive tool for managing huge amount of data) is needed for recreating the whole structure. For extracting the DAR archive, please run the command

dar -x [archive-name]

Where "[archive-name"] is the name of the DAR file without final package number and extension. E.g.:

dar -x 2016-09-23-corpus_re

For further questions, comments, and suggestions please don't hesitate to contact Silvio Peroni at


Usage metrics