OpenCitations Index N-Triples dataset of all the citation data
This dataset contains all the citation data (in N-Triples format) included in the OpenCitations Index, released on July 1, 2024. Each citation in the dataset is defined as an individual of the class cito:Citation and includes the following information:
- [citation IRI] the Open Citation Identifier (OCI) for the citation, defined in the final part of the URL identifying the citation (https://w3id.org/oc/index/ci/[OCI]);
- [property "cito:hasCitingEntity"] the citing entity identified by its OMID URL (https://opencitations.net/meta/[OMID]);
- [property "cito:hasCitedEntity"] the cited entity identified by its OMID URL (https://opencitations.net/meta/[OMID]);
- [property "cito:hasCitationCreationDate"] the creation date of the citation (i.e. the publication date of the citing entity);
- [property "cito:hasCitationTimeSpan"] the time span of the citation (i.e. the interval between the publication date of the cited entity and the publication date of the citing entity);
- [type "cito:JournalSelfCitation"] it records whether the citation is a journal self-citation (i.e. the citing and the cited entities are published in the same journal);
- [type "cito:AuthorSelfCitation"] it records whether the citation is an author self-citation (i.e. the citing and the cited entities have at least one author in common).
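As an illustration of how the fields listed above might be read back from the N-Triples files, the following is a minimal Python sketch that groups triples by citation and keys each record by its OCI. It is not an official OpenCitations tool. The identifiers in SAMPLE are invented, the datatypes on the literals (xsd:date, xsd:duration) are assumptions, and the regular expression handles only simple statements with IRI subjects and predicates. The function names parse_line and collect_citations are hypothetical.

```python
import re
from collections import defaultdict

CITO = "http://purl.org/spar/cito/"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
CI_PREFIX = "https://w3id.org/oc/index/ci/"

# Simple N-Triples statement: <subject> <predicate> then <IRI> or "literal"
TRIPLE_RE = re.compile(
    r'^<([^>]*)>\s+<([^>]*)>\s+'
    r'(?:<([^>]*)>|"((?:[^"\\]|\\.)*)"(?:\^\^<[^>]*>|@[A-Za-z0-9-]+)?)'
    r'\s*\.\s*$'
)


def parse_line(line):
    """Return (subject, predicate, object) or None if the line is not a simple triple."""
    match = TRIPLE_RE.match(line.strip())
    if match is None:
        return None
    subj, pred, obj_iri, obj_literal = match.groups()
    return subj, pred, obj_iri if obj_iri is not None else obj_literal


def collect_citations(lines):
    """Group triples by citation; the key is the OCI taken from the citation IRI."""
    citations = defaultdict(lambda: {"types": set()})
    for line in lines:
        parsed = parse_line(line)
        if parsed is None:
            continue
        subj, pred, obj = parsed
        if not subj.startswith(CI_PREFIX):
            continue
        record = citations[subj[len(CI_PREFIX):]]
        if pred == RDF_TYPE:
            if obj.startswith(CITO):
                record["types"].add(obj[len(CITO):])
        elif pred.startswith(CITO):
            record[pred[len(CITO):]] = obj
    return dict(citations)


# Invented example data (assumption): not taken from the actual dump.
CI = "<https://w3id.org/oc/index/ci/06101-06102>"
XSD = "http://www.w3.org/2001/XMLSchema#"
SAMPLE = [
    f"{CI} <{RDF_TYPE}> <{CITO}Citation> .",
    f"{CI} <{RDF_TYPE}> <{CITO}JournalSelfCitation> .",
    f"{CI} <{CITO}hasCitingEntity> <https://opencitations.net/meta/br/06101> .",
    f"{CI} <{CITO}hasCitedEntity> <https://opencitations.net/meta/br/06102> .",
    f'{CI} <{CITO}hasCitationCreationDate> "2020-05-01"^^<{XSD}date> .',
    f'{CI} <{CITO}hasCitationTimeSpan> "P1Y2M"^^<{XSD}duration> .',
]
```

Calling collect_citations(SAMPLE) yields one record keyed by the OCI, with the property values stored under their local names and the self-citation flags gathered in the "types" set. For production use, a full N-Triples parser (for example, from an RDF library) would be a safer choice than a regular expression.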
Note: the information for each citation is sourced from OpenCitations Meta (https://opencitations.net/meta), a database that stores and delivers bibliographic metadata for all bibliographic resources included in the OpenCitations Indexes. The data provided in this dump is therefore based on the state of OpenCitations Meta at the time this collection was generated.
This version of the dataset contains:
- 2,012,939,079 citations
The size of the zipped archive is 65.6 GB, while the size of the unzipped N-Triples files is 1.5 TB.
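Given the difference between the zipped and unzipped sizes, it may be preferable to stream the N-Triples lines directly from the archive rather than extract everything to disk. The following is a minimal sketch using Python's standard zipfile module. It assumes that the archive directly contains files with the ".nt" extension; the actual internal layout of the dump is not described here and may differ (for example, nested archives). The function name iter_ntriples_lines and the path in the usage note are hypothetical.

```python
import io
import zipfile


def iter_ntriples_lines(archive_path, suffix=".nt"):
    """Yield non-empty lines from every member of the ZIP whose name ends with `suffix`.

    Members are decompressed on the fly, so nothing is written to disk.
    """
    with zipfile.ZipFile(archive_path) as archive:
        for info in archive.infolist():
            if info.is_dir() or not info.filename.endswith(suffix):
                continue
            with archive.open(info) as raw:
                # N-Triples files are UTF-8 encoded text
                for line in io.TextIOWrapper(raw, encoding="utf-8"):
                    line = line.rstrip("\n")
                    if line:
                        yield line
```

A loop such as `for line in iter_ntriples_lines("dump.zip"): ...` (with a placeholder path) would then process one statement at a time with memory use that does not grow with the archive size.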