Likely percentage of duplicate material in the OCR word count of the average issue of the Caledonian Mercury for a given year, with associated data
This file set contains a bar chart (<b>BW_ProbableDuplicateMaterialPercentages.png</b>) showing the likely percentages of duplicated news, advertising, miscellany and commentary, and numerical content in the average issue of <i>The Caledonian Mercury</i> (Edinburgh, Scotland) for a given year, 1825-1835. <br><br>It also includes a data table (<b>Data_Wordcounts_CaledonianMercury_1820_1840.tsv</b>) giving the OCR-calculated word count for each issue, the minimum duplicate material percentage for each issue, and the extrapolated word counts and percentages for each content type. <br><br>The data set was derived from the British Library 19th Century Newspapers, Part 1 digital collection (<b>http://gale.cengage.co.uk/british-library-newspapers/19th-century-british-library-newspapers-part-i.aspx</b>) using the Scissors-and-Paste Console v.0.4.2 (<b>https://doi.org/10.5281/zenodo.1207283</b>). <br><br>Further details are available in the included documentation file (<b>readme.docx</b>) and on the websites listed below.<br>