TY - DATA T1 - A survey of Academic Publisher PDF metadata PY - 2012/12/30 AU - Ross Mounce UR - https://figshare.com/articles/dataset/A_survey_of_Academic_Publisher_PDF_metadata/105633 DO - 10.6084/m9.figshare.105633.v1 L4 - https://ndownloader.figshare.com/files/234170 KW - metadata KW - pdf KW - publishers KW - Science Policy KW - Applied Computer Science N2 - This is the corresponding dataset to a blogpost at http://rossmounce.co.uk/2012/12/31/pdf-metadata-why-so-poor/It's a simple survey of PDF metadata, across a variety of different academic publishers sampling mostly from PDFs published in the year 2011, or what I could gain access to. All are from the publisher-provided Version of Record PDFs not self-archived pre-prints or other such. I used the CLI tool pdfinfo to extract this metadata.   Columns A to K are identifying metadata I supply about each PDF (some fields not complete!). Whilst columns L to V provide the interesting metadata about each PDF.   Many of the PDFs sampled are not Open Access so (sadly) I cannot provide you with copies to replicate these results. ER -