sorry, we can't preview this file

...but you can still download 2019-10-21T22:41:20_1-63.zip
2019-10-21T22:41:20_1-63.zip (14.98 GB)

COCI CSV dataset of all the citation data

Download (14.98 GB)
dataset
posted on 21.01.2020, 06:35 by OpenCitations ​
This dataset contains all the citation data (in CSV format) included in COCI, released on the 21st of January 2020. In particular, each line of the CSV file defines a citation, and includes the following information:
  • [field "oci"] the Open Citation Identifier (OCI) for the citation;
  • [field "citing"] the DOI of the citing entity;
  • [field "cited"] the DOI of the cited entity;
  • [field "creation"] the creation date of the citation (i.e. the publication date of the citing entity);
  • [field "timespan"] the time span of the citation (i.e. the interval between the publication date of the cited entity and the publication date of the citing entity);
  • [field "journal_sc"] it records whether the citation is a journal self-citations (i.e. the citing and the cited entities are published in the same journal);
  • [field "author_sc"] it records whether the citation is an author self-citation (i.e. the citing and the cited entities have at least one author in common).
This version of the dataset contains:
  • 624,183,532 citations;
  • 53,464,457 bibliographic resources.
The size of the zipped archive is 15 GB, while the size of the unzipped CSV file is 98 GB.

Additional information about COCI can be retrieved in the official webpage.

Funding

Wellcome Trust 'Open Biomedical Citations in Context Corpus' - Open Research Fund 2018, https://wellcome.ac.uk/funding/people-and-projects/grants-awarded/open-biomedical-citations-context-corpus

History

Licence

Exports

Licence

Exports