figshare
Browse

OpenCitations Index N-Triples dataset of the provenance information of all the citation data

Version 5 2025-07-15, 15:07
Version 4 2025-03-27, 17:18
Version 3 2024-07-01, 09:13
Version 2 2023-12-11, 10:11
Version 1 2023-10-25, 07:47
dataset
posted on 2024-07-01, 09:13 authored by OpenCitations ​OpenCitations ​
<p dir="ltr">This dataset contains the provenance information (in N-Triples format) of all the citation data included in the OpenCitation Index, released on July 1, 2024. In particular, any citation in the dataset includes the following provenance information:</p><ul><li><b>[citation IRI]</b> the Open Citation Identifier (OCI) for the citation, defined in the final part of the URL identifying the citation (https://w3id.org/oc/index/ci/[OCI]);</li><li><b>[property "prov:wasAttributedTo"]</b> the IRI of the agent that has created the citation data;</li><li><b>[property "prov:hadPrimarySource"]</b> the IRI of the source dataset from where the citation data have been extracted;</li><li><b>[property "prov:generatedAtTime"]</b> the creation time of the citation data.</li><li><b>[propert "prov:invalidatedAtTime"]</b> the start of the destruction, cessation, or expiry of an existing entity by an activity.</li><li><b>[property "oco:hasUpdateQuery"]</b> the UPDATE SPARQL query that keeps track of which metadata have been modified.</li></ul><p dir="ltr">The size of the zipped archive is 83 GB, while the size of the unzipped N-Triples files is 2.6 TB.</p>

History

Usage metrics

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC