16 files

COCI CSV dataset of the provenance information of all the citation data

Version 19 2023-02-07, 07:10
Version 18 2022-11-01, 13:44
Version 17 2022-09-16, 15:30
Version 16 2022-08-31, 08:04
Version 15 2022-06-18, 13:35
Version 14 2022-03-26, 17:20
Version 13 2022-01-29, 13:27
Version 12 2021-11-25, 10:27
Version 11 2021-09-04, 13:36
Version 10 2021-07-29, 16:36
Version 9 2020-12-07, 12:31
Version 8 2020-09-07, 08:24
Version 7 2020-07-04, 11:20
Version 6 2020-05-13, 19:55
Version 5 2020-03-23, 16:03
Version 4 2020-01-21, 06:35
Version 3 2018-11-19, 13:22
Version 2 2018-07-13, 22:10
Version 1 2018-07-05, 13:50
dataset
posted on 2023-02-07, 07:10, authored by OpenCitations

This dataset contains the provenance information (in CSV format) of all the citation data included in COCI, released on 23 January 2023. In particular, each line of the CSV file defines a citation, and includes the following information:

  • [field "oci"] the Open Citation Identifier (OCI) for the citation;
  • [field "snapshot"] the identifier of the snapshot;
  • [field "agent"] the name of the agent that has created the citation data;
  • [field "source"] the URL of the source dataset from which the citation data have been extracted;
  • [field "created"] the creation time of the citation data;
  • [field "invalidated"] the start of the destruction, cessation, or expiry of an existing entity by an activity;
  • [field "description"] a textual description of the activity performed;
  • [field "update"] the SPARQL UPDATE query that keeps track of which metadata have been modified.

The size of the zipped archive is 20 GB, while the size of the unzipped CSV file is 330 GB. Additional information about COCI is available on the official webpage.
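
Because the unzipped CSV is far too large to load into memory at once, it is easiest to read it one row at a time. The following is a minimal sketch of that approach in Python, not an official OpenCitations tool. It assumes that the file starts with a header row carrying the eight field names listed above, in that order; the function names and the sample rows are made up for illustration and are not taken from the dataset.

```python
import csv
import io
from collections import Counter

# Column names as listed in the dataset description.
FIELDS = ["oci", "snapshot", "agent", "source", "created",
          "invalidated", "description", "update"]


def iter_provenance(lines):
    """Yield one dict per citation row from an iterable of CSV lines.

    Assumption: the first line is a header containing the names in FIELDS.
    """
    reader = csv.DictReader(lines)
    missing = [f for f in FIELDS if f not in (reader.fieldnames or [])]
    if missing:
        raise ValueError(f"missing expected columns: {missing}")
    for row in reader:
        yield row


def count_by_source(lines):
    """Count citations per source URL without holding the rows in memory."""
    counts = Counter()
    for row in iter_provenance(lines):
        counts[row["source"]] += 1
    return counts


# Made-up sample rows, for illustration only (not real COCI data).
SAMPLE = """oci,snapshot,agent,source,created,invalidated,description,update
oci-1,1,agent-a,https://example.org/src,2023-01-23T00:00:00,,created,
oci-2,1,agent-a,https://example.org/src,2023-01-23T00:00:00,,created,
"""

if __name__ == "__main__":
    print(count_by_source(io.StringIO(SAMPLE)))
```

For a real file, the same functions accept an open file object in place of the in-memory sample, for example one returned by open(path, newline="", encoding="utf-8"), where path is a placeholder and the UTF-8 encoding is an assumption.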
