figshare
Browse

COCI CSV dataset of all the citation data

Download (8.13 GB)
Version 19 2023-02-07, 07:08
Version 18 2022-11-01, 13:40
Version 17 2022-09-16, 10:41
Version 16 2022-08-31, 08:03
Version 15 2022-06-18, 13:33
Version 14 2022-03-26, 16:58
Version 13 2022-01-29, 13:22
Version 12 2021-11-25, 08:27
Version 11 2021-09-04, 13:32
Version 10 2021-07-29, 21:14
Version 9 2020-12-07, 12:27
Version 8 2020-09-07, 08:22
Version 7 2020-07-04, 11:20
Version 6 2020-05-13, 19:54
Version 5 2020-03-23, 15:45
Version 4 2020-01-21, 06:35
Version 3 2018-11-12, 21:42
Version 2 2018-07-13, 22:10
Version 1 2018-07-05, 12:03
dataset
posted on 2018-07-13, 22:10 authored by OpenCitations ​OpenCitations ​
This dataset contains all the citation data (in CSV format) included in COCI, released on the 4th of July 2018. In particular, each line of the CSV file defines a citation, and includes the following information:
  • [field "oci"] the Open Citation Identifier (OCI) for the citation;
  • [field "citing"] the DOI of the citing entity;
  • [field "cited"] the DOI of the cited entity;
  • [field "creation"] the creation date of the citation (i.e. the publication date of the citing entity);
  • [field "timespan"] the time span of the citation (i.e. the interval between the publication date of the cited entity and the publication date of the citing entity).
This version of the dataset contains:
  • 316,243,802 citations;
  • 45,145,889 bibliographic resources.
The size of the zipped archive is 8.2 GB, while the size of the unzipped CSV file is 49 GB.

Additional information about COCI can be retrieved in the official webpage and in the introductory blog post in the OpenCitations blog.

Funding

Alfred P. Sloan Foundation grant number G‐2017‐9800

History

Usage metrics

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC