figshare
Browse
Download file
Download file
Download file
Download file
Download file
Download file
Download file
Download file
Download file
Download file
Download file
Download file
Download file
Download file
Download file
Download file
Download file
Download file
Download file
Download file
1/2
26 files

COCI N-Triples dataset of the provenance information of all the citation data

dataset
posted on 2023-02-07, 07:11 authored by OpenCitations ​OpenCitations ​

This dataset contains the provenance information (in N-Triples format) of all the citation data included in COCI, released on 23 January 2023. In particular, any citation in the dataset includes the following provenance information:

  • [citation IRI] the Open Citation Identifier (OCI) for the citation, defined in the final part of the URL identifying the citation (https://w3id.org/oc/index/coci/ci/[OCI]);;
  • [property "prov:wasAttributedTo"] the IRI of the agent that have created the citation data;
  • [property "prov:hadPrimarySource"] the IRI of the source dataset from where the citation data have been extracted;
  • [property "prov:generatedAtTime"] the creation time of the citation data.
  • [propert "prov:invalidatedAtTime"] the start of the destruction, cessation, or expiry of an existing entity by an activity.
  • [property "oco:hasUpdateQuery"] the UPDATE SPARQL query that keeps track of which metadata have been modified.

The size of the zipped archive is 78 GB, while the size of the unzipped N-Triples file is 3.3 TB.Additional information about COCI are available at official webpage.

History

Usage metrics

    Licence

    Exports