Data deposition for "The history of publishing delays." While most data files are available from the project GitHub, some large files are exclusively hosted here on figshare.
The contents are as follows:
1. `esearch_journal-articles_1960-2015.tsv.gz`—a list of 22,499,113 PubMed IDs retrieved from an esearch for `journal article[pt] AND 1960:2015[pdat]`.
2. `esummary_journal-articles_1960-2015.xml.bz2`—the combined XML output of esummary queries for the articles from 1. **Generating this file was a time intensive process. Sharing this 2 GB file was the motivation behind this deposition.**
3. `history-dates.tsv.bz2`—history dates extracted from the XML from 2.
4. `delays.tsv.gz`—acceptance and online publication delays extracted from 3.
5. `pubmed-journals.tsv`—a tabular version of the NLM Catalog with the journals in PubMed
Refer to the [project GitHub](https://github.com/dhimmel/delays/tree/history-blog-post) for more information.