COCI N-Triples dataset of all the citation data

Version 19 2023-02-07, 07:09
Version 18 2022-11-01, 13:41
Version 17 2022-09-16, 13:09
Version 16 2022-08-31, 08:00
Version 15 2022-06-18, 13:28
Version 14 2022-03-26, 16:59
Version 13 2022-01-29, 13:23
Version 12 2021-11-25, 08:29
Version 11 2021-09-04, 13:33
Version 10 2021-07-29, 21:14
Version 9 2020-12-07, 12:28
Version 8 2020-09-07, 08:23
Version 7 2020-07-04, 11:20
Version 6 2020-05-13, 19:55
Version 5 2020-03-23, 15:46
Version 4 2020-01-21, 06:34
Version 3 2018-11-19, 13:21
Version 2 2018-07-13, 22:10
Version 1 2018-07-07, 14:36
dataset
posted on 2023-02-07, 07:09 authored by OpenCitations

This dataset contains all the citation data (in N-Triples format) included in COCI, released on 23 January 2023. In particular, any citation in the dataset, defined as an individual of the class cito:Citation, includes the following information:

  • [citation IRI] the Open Citation Identifier (OCI) for the citation, defined in the final part of the URL identifying the citation (https://w3id.org/oc/index/coci/ci/[OCI]);
  • [property "cito:hasCitingEntity"] the citing entity identified by its DOI URL (http://dx.doi.org/[DOI]);
  • [property "cito:hasCitedEntity"] the cited entity identified by its DOI URL (http://dx.doi.org/[DOI]);
  • [property "cito:hasCitationCreationDate"] the creation date of the citation (i.e. the publication date of the citing entity);
  • [property "cito:hasCitationTimeSpan"] the time span of the citation (i.e. the interval between the publication date of the cited entity and the publication date of the citing entity);
  • [type "cito:JournalSelfCitation"] it records whether the citation is a journal self-citation (i.e. the citing and the cited entities are published in the same journal);
  • [type "cito:AuthorSelfCitation"] it records whether the citation is an author self-citation (i.e. the citing and the cited entities have at least one author in common).
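As a rough illustration of how the fields listed above could be read back from the N-Triples serialization, the following Python sketch groups triples by citation IRI and strips the OCI and DOI URL prefixes. It makes several assumptions: the cito: prefix expands to http://purl.org/spar/cito/, literals carry no escaped quotes, and the sample triples (OCI, DOIs, date and time-span values, datatypes) are hypothetical and not taken from the dataset. The helper names are illustrative only.

```python
import re
from collections import defaultdict

# Assumed namespace expansions; only the OCI and DOI URL patterns come from the description.
CITO = "http://purl.org/spar/cito/"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
OCI_PREFIX = "https://w3id.org/oc/index/coci/ci/"
DOI_PREFIX = "http://dx.doi.org/"

# Handles only simple triples: <s> <p> <o> .  or  <s> <p> "literal"^^<datatype> .
# Literals with escaped quotes, language tags and blank nodes are not supported.
TRIPLE = re.compile(
    r'^<([^>]*)>\s+<([^>]*)>\s+(?:<([^>]*)>|"([^"]*)"(?:\^\^<[^>]*>)?)\s*\.\s*$'
)


def strip_prefix(text, prefix):
    """Return text without prefix if it starts with it, else text unchanged."""
    return text[len(prefix):] if text.startswith(prefix) else text


def parse_citations(lines):
    """Group triples by OCI into one dict per citation."""
    records = defaultdict(lambda: {"types": set()})
    for line in lines:
        match = TRIPLE.match(line.strip())
        if match is None:
            continue  # skip blank lines, comments and unsupported syntax
        subj, pred, obj_iri, obj_lit = match.groups()
        if not subj.startswith(OCI_PREFIX):
            continue  # not a citation IRI of the expected form
        record = records[strip_prefix(subj, OCI_PREFIX)]
        value = obj_iri if obj_iri is not None else obj_lit
        if pred == RDF_TYPE:
            record["types"].add(strip_prefix(value, CITO))
        elif pred.startswith(CITO):
            # Citing and cited entities are DOI URLs; keep the bare DOI.
            record[strip_prefix(pred, CITO)] = strip_prefix(value, DOI_PREFIX)
    return dict(records)


# Hypothetical sample triples, for illustration only.
S = "<https://w3id.org/oc/index/coci/ci/0200-0100>"
SAMPLE = [
    S + " <" + RDF_TYPE + "> <" + CITO + "Citation> .",
    S + " <" + RDF_TYPE + "> <" + CITO + "JournalSelfCitation> .",
    S + " <" + CITO + "hasCitingEntity> <http://dx.doi.org/10.1234/citing> .",
    S + " <" + CITO + "hasCitedEntity> <http://dx.doi.org/10.1234/cited> .",
    S + " <" + CITO + 'hasCitationCreationDate> "2020-01-01"'
    "^^<http://www.w3.org/2001/XMLSchema#date> .",
    S + " <" + CITO + 'hasCitationTimeSpan> "P1Y"'
    "^^<http://www.w3.org/2001/XMLSchema#duration> .",
]

records = parse_citations(SAMPLE)
```

For real workloads a full N-Triples parser is the safer choice; the regular expression here is only meant to make the structure of a citation record visible.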

This version of the dataset contains:

  • 1,463,920,523 citations;
  • 77,045,952 bibliographic resources.

The size of the zipped archive is 73.1 GB, while the size of the unzipped N-Triples file is 1.6 TB.
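Given these sizes, it may be preferable to stream the data straight from the zipped archive rather than unzip 1.6 TB to disk. The Python sketch below counts citations by scanning each archive member line by line. It assumes that members of interest end in ".nt", that each citation has exactly one rdf:type cito:Citation triple, and that predicate and object are separated by a single space; none of this is confirmed by the description, and count_citations is a hypothetical helper name.

```python
import io
import zipfile

# Assumed serialization of the "is a cito:Citation" statement (single space separator).
CITATION_TYPE_FRAGMENT = (
    "<http://www.w3.org/1999/02/22-rdf-syntax-ns#type> "
    "<http://purl.org/spar/cito/Citation>"
)


def count_citations(archive, suffix=".nt"):
    """Count citation-typing triples in a zip archive without extracting it.

    archive may be a filesystem path or a binary file-like object.
    """
    total = 0
    with zipfile.ZipFile(archive) as zf:
        for name in zf.namelist():
            if not name.endswith(suffix):
                continue  # skip members that are not N-Triples files
            with zf.open(name) as raw:
                # Decode lazily so only one line is held in memory at a time.
                for line in io.TextIOWrapper(raw, encoding="utf-8"):
                    if CITATION_TYPE_FRAGMENT in line:
                        total += 1
    return total
```

A call such as count_citations("coci.zip") (hypothetical file name) would then walk the archive once, using memory proportional to the longest line rather than to the file size.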


Additional information about COCI is available on the official webpage.
