Data documenting redactions extracted from ASIO surveillance records in National Archives of Australia Series A6119
datasetposted on 30.10.2016 by Tim Sherratt
Datasets usually provide raw data for analysis. This raw data often comes in spreadsheet form, but can be any collection of data, on which analysis can be performed.
This dataset documents a collection of 239,571 redactions (blacked out words and phrases) extracted from surveillance files created by the Australian Security Intelligence Organisation (ASIO) and held in Series A6119 at the National Archives of Australia.
The image files themselves are available in the linked fileset.
The fields in the CSV file are:
image -- file name of the extracted image
barcode -- barcode (identifier) of the file from which the redactions was extracted
page -- number of the page from which the redaction was extracted
index -- sequential index identifying an individual redaction on a page
series -- series containing the file in the NAA
control_symbol -- control symbol for the file in the NAA
title -- title of the file in the NAA
width -- width of the image in pixels
height -- height of the image in pixels
position -- coordinates of a bounding box around the image relative to the parent page (x, y, width, height)
area -- area in pixels of the original redaction contour (as calculated by OpenCV)