8 files

COCI CSV dataset of the provenance information of all the citation data

Download all (14 GB)
posted on 29.07.2021, 16:36 authored by OpenCitations ​OpenCitations ​
This dataset contains the provenance information (in CSV format) of all the citation data included in COCI, released on 29 July 2021. In particular, each line of the CSV file defines a citation, and includes the following information:
  • [field "oci"] the Open Citation Identifier (OCI) for the citation;
  • [field "agent"] the name of the agent that have created the citation data;
  • [field "source"] the URL of the source dataset from where the citation data have been extracted;
  • [field "datetime"] the creation time of the citation data.
The size of the zipped archive is 14.1 GB, while the size of the unzipped CSV file is 246.5 GB.

Additional information about COCI can be retrieved in the official webpage.


Wellcome Trust 'Open Biomedical Citations in Context Corpus' - Open Research Fund 2018, https://wellcome.ac.uk/funding/people-and-projects/grants-awarded/open-biomedical-citations-context-corpus


Usage metrics