Assessing cross-cut shredded document assembly

Dataset posted on 2014-10-31, 11:22 (version 4: 2014-10-31, 13:21), authored by Priscila Saboia and Siome Goldenstein

 

This material describes, and links to, the scripts and dataset used in the paper "Assessing cross-cut shredded document assembly".

 

Abstract

In this paper we address the problem of quantitatively evaluating the reconstruction of cross-cut shredded documents. We propose metrics, based on graph theory and classic information retrieval concepts, that compare the neighborhood connectivity graph of a reassembled document shredded by a cross-cut machine against the neighborhood graph of the ground truth. These metrics focus entirely on the proper relative positioning of the shredded pieces. To support the evaluation, we shredded 12 documents with diverse content, such as handwriting, printed text, images, and photographs. We then scanned the shreds, extracted the pieces, and reassembled them into the ground truth. This dataset is available to readers, together with the original documents, the digital representation of the shreds, and the scripts that quantitatively evaluate a user's reconstructions.
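To make the idea of comparing neighborhood graphs concrete, the following is a minimal sketch, not the scripts distributed with the dataset. It assumes a reconstruction is given as a rectangular grid of piece identifiers, treats each "right of" and "below" adjacency as a directed edge, and scores the reconstruction with precision, recall, and F1 over those edges. The function names, the grid representation, and the choice of these three scores are illustrative assumptions; the paper's actual metrics may be defined differently.

```python
from typing import Dict, List, Set, Tuple

Grid = List[List[int]]        # assumed layout: rows of piece ids
Edge = Tuple[int, int, str]   # (piece, neighbor, relation)


def neighborhood_edges(grid: Grid) -> Set[Edge]:
    """Collect the right-of and below adjacencies of a grid of pieces."""
    edges: Set[Edge] = set()
    for r, row in enumerate(grid):
        for c, piece in enumerate(row):
            if c + 1 < len(row):
                edges.add((piece, row[c + 1], "right"))
            if r + 1 < len(grid) and c < len(grid[r + 1]):
                edges.add((piece, grid[r + 1][c], "below"))
    return edges


def score(reconstruction: Grid, ground_truth: Grid) -> Dict[str, float]:
    """Precision, recall, and F1 of the reconstruction's adjacency edges."""
    rec = neighborhood_edges(reconstruction)
    gt = neighborhood_edges(ground_truth)
    correct = len(rec & gt)  # adjacencies present in both graphs
    precision = correct / len(rec) if rec else 0.0
    recall = correct / len(gt) if gt else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return {"precision": precision, "recall": recall, "f1": f1}


if __name__ == "__main__":
    # Hypothetical 2x3 document with the last two pieces swapped.
    truth = [[0, 1, 2], [3, 4, 5]]
    attempt = [[0, 1, 2], [3, 5, 4]]
    print(score(attempt, truth))
```

Because only relative adjacencies are compared, a reconstruction is not penalized for where the assembled page sits on the canvas, which matches the abstract's focus on the relative positioning of the pieces.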
