Transform a folder or group of folders into AIPs that meet the UGA Libraries AIP Definition, in preparation for ingest into our digital preservation system (ARCHive).
Each folder with the AIPs directory (specified via a script argument) will be transformed into a separate AIP.
The Digital Archivist (Digital Stewardship) creates AIPs with this workflow for Hargrett and Russell born-digital archives. The Head of Digital Stewardship creates AIPs with this workflow for UGA content in Archive-It. Both share responsibility for maintaining the script.
-
Preparation
- Copy FITS to the same letter drive as the files and update configuration.py if needed.
- Organize the contents of each AIP into one folder per AIP.
- An AIP many contain files of any format and additional folders.
- Limit AIP size and the number of formats for more sustainable ongoing management.
- Copy the AIP folders into a single folder (aips_directory). Use a copy in case there is a problem with the script.
- Make the metadata.csv file for this batch (see metadata.csv example) and save it to the aips_directory.
-
Run the script. The script will:
- Extract the AIP metadata from the metadata.csv.
- Delete temporary files from anywhere within the AIP folder because they cause errors with validating bags.
- Create the AIP directory structure: folder named with the AIP ID that contains metadata and objects folders.
- Extract technical metadata from each file in the objects folder with FITS and save it to the metadata folder.
- Combine each FITS xml file into one file named combined-fits.xml and save it to the fits-xml folder.
- Make the preservation.xml file (Dublin Core and PREMIS) from the combined-fits xml and save it to the metadata folder.
- Validate that the preservation.xml file meets UGA standards.
- Use bagit to bag each AIP folder in place, with md5 and sha256 manifests.
- Validate the bag.
- Tar and zip a copy of the bag and save it to the aips-to-ingest folder.
- Calculate the MD5 for the tarred and zipped AIP and add it to the department manifest in the aips-to-ingest folder.
-
Quality Control
- Review the log and errors folder and address any problems.
- Zip the contents of the preservation-xml folder and batch validate with the ARCHive application.
- Follow department procedures to review a sample of the AIPs for accuracy.
-
Schedule the AIPs for ingest into ARCHive.
-
Once AIPs are ingested without errors, delete local copies of the AIPs.
This workflow has been in use since 2020 by the Hargrett and Russell libraries.