18 files

COCI N-Triples dataset of the provenance information of all the citation data

Version 19 2023-02-07, 07:11
Version 18 2022-11-01, 14:16
Version 17 2022-09-16, 15:57
Version 16 2022-08-31, 08:06
Version 15 2022-06-18, 13:40
Version 14 2022-03-26, 17:21
Version 13 2022-01-29, 13:28
Version 12 2021-11-25, 08:33
Version 11 2021-09-04, 13:37
Version 10 2021-07-29, 21:15
Version 9 2020-12-07, 12:33
Version 8 2020-09-07, 08:25
Version 7 2020-07-04, 11:21
Version 6 2020-05-13, 19:55
Version 5 2020-03-23, 16:04
Version 4 2020-01-21, 06:34
Version 3 2018-11-19, 13:24
Version 2 2018-07-13, 22:09
Version 1 2018-07-07, 17:26
dataset
posted on 2021-07-29, 21:15, authored by OpenCitations
This dataset contains the provenance information (in N-Triples format) of all the citation data included in COCI, released on 29 July 2021. In particular, each citation in the dataset includes the following provenance information:
  • [citation IRI] the Open Citation Identifier (OCI) for the citation, defined in the final part of the URL identifying the citation (https://w3id.org/oc/index/coci/ci/[OCI]);
  • [property "prov:wasAttributedTo"] the IRI of the agent that created the citation data;
  • [property "prov:hadPrimarySource"] the IRI of the source dataset from which the citation data were extracted;
  • [property "prov:generatedAtTime"] the creation time of the citation data.
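As an illustration of how the fields listed above could be read back, here is a minimal Python sketch that groups N-Triples lines by OCI and collects the three PROV properties. It makes several assumptions: the `prov:` prefix is taken to be the standard W3C namespace `http://www.w3.org/ns/prov#`, the sample OCI, agent IRI, source IRI and timestamp are invented for the example, and the regular expression only handles the simple "IRI, IRI, IRI-or-literal" shape described here rather than the full N-Triples grammar.

```python
import re
from collections import defaultdict

# Assumption: "prov:" is the standard W3C PROV namespace.
PROV = "http://www.w3.org/ns/prov#"
# Prefix of the citation IRIs, as given in the dataset description.
COCI_PREFIX = "https://w3id.org/oc/index/coci/ci/"

# Simplified pattern: <subject> <predicate> object .
TRIPLE_RE = re.compile(r'^<([^>]*)>\s+<([^>]*)>\s+(.+?)\s*\.\s*$')


def parse_object(term):
    """Return the IRI, or the lexical value of a literal, from an object term."""
    if term.startswith("<") and term.endswith(">"):
        return term[1:-1]
    match = re.match(r'^"((?:[^"\\]|\\.)*)"', term)
    return match.group(1) if match else term


def read_provenance(lines):
    """Group PROV properties by OCI; lines that do not match are skipped."""
    records = defaultdict(dict)
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        match = TRIPLE_RE.match(line)
        if match is None:
            continue
        subject, predicate, obj = match.groups()
        if not predicate.startswith(PROV):
            continue
        # The OCI is the final part of the citation IRI.
        oci = subject[len(COCI_PREFIX):] if subject.startswith(COCI_PREFIX) else subject
        records[oci][predicate[len(PROV):]] = parse_object(obj)
    return dict(records)


# Hypothetical sample data, invented for illustration only.
SAMPLE = """\
<https://w3id.org/oc/index/coci/ci/0200101-0200102> <http://www.w3.org/ns/prov#wasAttributedTo> <https://example.org/agent/1> .
<https://w3id.org/oc/index/coci/ci/0200101-0200102> <http://www.w3.org/ns/prov#hadPrimarySource> <https://example.org/source/10.1000/example> .
<https://w3id.org/oc/index/coci/ci/0200101-0200102> <http://www.w3.org/ns/prov#generatedAtTime> "2021-07-29T00:00:00"^^<http://www.w3.org/2001/XMLSchema#dateTime> .
"""

records = read_provenance(SAMPLE.splitlines())
for oci, props in records.items():
    print(oci, props)
```

For anything beyond a quick inspection, a dedicated RDF parser is likely a safer choice than a hand-written pattern, since it also covers escapes, language tags and blank nodes.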
The size of the zipped archive is 59.1 GB, while the size of the unzipped N-Triples file is 2.45 TB.
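Given the gap between the zipped and unzipped sizes, it may be more practical to read the data directly from an archive than to extract it first. The following Python sketch streams lines out of a zip file one at a time using only the standard library. The internal layout of the archives is not described here, so the assumption that they contain members ending in `.nt`, and the function name `iter_nt_lines`, are hypothetical.

```python
import io
import zipfile


def iter_nt_lines(archive):
    """Yield N-Triples lines from every .nt member of a zip archive.

    `archive` may be a path or a binary file-like object. Members are
    decompressed incrementally, so the unzipped data never has to fit
    in memory or on disk.
    """
    with zipfile.ZipFile(archive) as zf:
        for name in zf.namelist():
            # Assumption: the triples live in members with a .nt suffix.
            if not name.endswith(".nt"):
                continue
            with zf.open(name) as raw:
                for line in io.TextIOWrapper(raw, encoding="utf-8"):
                    yield line.rstrip("\n")
```

A caller would loop over `iter_nt_lines("path/to/archive.zip")` (a placeholder path) and process each line as it arrives, for example by counting triples or filtering on a single property.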

Additional information about COCI can be found on the official webpage.

Funding

Wellcome Trust 'Open Biomedical Citations in Context Corpus' - Open Research Fund 2018, https://wellcome.ac.uk/funding/people-and-projects/grants-awarded/open-biomedical-citations-context-corpus
