figshare
Browse
1/1
7 files

COCI N-Triples dataset of all the citation data

Version 19 2023-02-07, 07:09
Version 18 2022-11-01, 13:41
Version 17 2022-09-16, 13:09
Version 16 2022-08-31, 08:00
Version 15 2022-06-18, 13:28
Version 14 2022-03-26, 16:59
Version 13 2022-01-29, 13:23
Version 12 2021-11-25, 08:29
Version 11 2021-09-04, 13:33
Version 10 2021-07-29, 21:14
Version 9 2020-12-07, 12:28
Version 8 2020-09-07, 08:23
Version 7 2020-07-04, 11:20
Version 6 2020-05-13, 19:55
Version 5 2020-03-23, 15:46
Version 4 2020-01-21, 06:34
Version 3 2018-11-19, 13:21
Version 2 2018-07-13, 22:10
Version 1 2018-07-07, 14:36
dataset
posted on 2020-01-21, 06:34 authored by OpenCitations ​OpenCitations ​
This dataset contains all the citation data (in N-Triples format) included in COCI, released on the 21st of January 2020. In particular, any citation in the dataset, defined as an individual of the class cito:Citation, includes the following information:
  • [citation IRI] the Open Citation Identifier (OCI) for the citation, defined in the final part of the URL identifying the citation (https://w3id.org/oc/index/coci/ci/[OCI]);
  • [property "cito:hasCitingEntity"] the citing entity identified by its DOI URL (http://dx.doi.org/[DOI]);
  • [property "cito:hasCitedEntity"] the cited entity identified by its DOI URL (http://dx.doi.org/[DOI]);
  • [property "cito:hasCitationCreationDate"] the creation date of the citation (i.e. the publication date of the citing entity);
  • [property "cito:hasCitationTimeSpan"] the time span of the citation (i.e. the interval between the publication date of the cited entity and the publication date of the citing entity);
  • [type "cito:JournalSelfCitation"] it records whether the citation is a journal self-citations (i.e. the citing and the cited entities are published in the same journal);
  • [type "cito:AuthorSelfCitation"] it records whether the citation is an author self-citation (i.e. the citing and the cited entities have at least one author in common).
This version of the dataset contains:
  • 624,183,532 citations;
  • 53,464,457 bibliographic resources.
The size of the zipped archive is 29.6 GB, while the size of the unzipped N-Triples file is 665 GB.

Additional information about COCI can be retrieved in the official webpage and in the introductory blog post in the OpenCitations blog.

Funding

Wellcome Trust 'Open Biomedical Citations in Context Corpus' - Open Research Fund 2018, https://wellcome.ac.uk/funding/people-and-projects/grants-awarded/open-biomedical-citations-context-corpus

History

Usage metrics

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC