figshare
Browse
1/1
3 files

Article PDF Filesizes

Version 2 2013-07-19, 10:32
Version 1 2013-07-19, 10:33
dataset
posted on 2013-07-19, 10:33 authored by Ross MounceRoss Mounce

A small bit of my thesis.

Why are BMC PDFs so significantly larger on average than PLOS or Zootaxa PDFs?

 

data sources:

 

A) 'Zootaxa' the entire set of articles published in the journal Zootaxa from 2001 to 2012 inclusive, consisting of 11563 pdf files downloaded direct from the publisher website : http://mapress.com/zootaxa/

B) 'PLOS' the entire set of articles published across 7 different PLOS journals: PLOS ONE, PLOS Biology, PLOS Computational Biology, PLOS Genetics, PLOS Medicine, PLOS Neglected Tropical Diseases, and PLOS Pathogens from 2003 to 2010-06-04, consisting of 20694 articles obtained via BioTorrents (Langille & Eisen, 2010).

C) 'BMC' a subsample of 7948 open access articles containing the stemword 'phylogen*' at least once in the fulltext from the wide range of journals that BioMedCentral publish (the OA subset of this selection of papers: http://www.citeulike.org/user/testtest87)

History