Compared to the previous version, this release includes metadata related to citing and cited bibliographic resources added in the March 2024 version of Crossref.
This dataset contains all the bibliographic metadata (in CSV format) included in OpenCitations Meta. In particular, each line of the CSV file defines a bibliographic resource, and includes the following information:
[field "id"] the IDs for the document described within the line;
[field "title"] the document's title;
[field "author"] the authors of the document;
[field "pub_date"] the date of publication;
[field "venue"] information about the venue, i.e. the bibliographical resource to which the document belongs;
[field "volume"] the volume sequence identifier (e.g. a number) to which the entity belongs;
[field "issue"] the issuesequence identifier (e.g. a number) to which the entity belongs;
[field "page"] the page range of the resource described in the row;
[field "type"] the type of resource described in the row;
[field "publisher"] the entity responsible for making the resource available;
[field "editor"] the editors of the document.
This version of the dataset contains:
116,605,079 bibliographic entities
348,844,164 authors and 2,561,339 editors (counted by their roles, without disambiguating individual
724,563 publication venues
242,362 publishers
The compressed dataset weighs 11G, while, when extracted, it weighs 47G on an ext4 filesystem.
Additional information about OpenCitations Meta at official webpage.
Funding
OpenAIRE-Nexus Scholarly Communication Services for EOSC users