figshare
Browse

Zip file extraction for Preservica

Download (6.89 kB)
software
posted on 2025-04-17, 00:35 authored by Lars AlvikLars Alvik

One of the issues we've been having with Preservica is that ZIP files and other container files function as a 'black box' for the system, the files inside are not visible to the system and we have no information on important digital preservation data such as filetypes and metadata on the individual files. We have created these scripts and workflow to try to remedy this. The scripts collectively downloads, assesses and re-uploads the contained files into a sub folder in the same folder as the original ZIP file. Since Preservica uses ZIP files as a SIP format this reupload process triggers a new ingest process for the files themselves. We also add a PREMIS event to the original ZIP file to provide a paper trail.

History

Usage metrics

    University of Melbourne

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC