COCI CSV dataset of the provenance information of all the citation data
datasetposted on 07.09.2020 by OpenCitations
Datasets usually provide raw data for analysis. This raw data often comes in spreadsheet form, but can be any collection of data, on which analysis can be performed.
This dataset contains the provenance information (in CSV format) of all the citation data included in COCI, released on 6 September 2020. In particular, each line of the CSV file defines a citation, and includes the following information:
The size of the zipped archive is 9.1 GB, while the size of the unzipped CSV file is 162.7 GB.
- [field "oci"] the Open Citation Identifier (OCI) for the citation;
- [field "agent"] the name of the agent that have created the citation data;
- [field "source"] the URL of the source dataset from where the citation data have been extracted;
- [field "datetime"] the creation time of the citation data.
Additional information about COCI can be retrieved in the official webpage.