I'm trying to develop a process for long-term preservation of the files we're creating through our digitization projects. My current plan is to bag groups of files using Bagger. Each bag would include all versions of the file (generally TIFF, JPEG, PDF and .txt transcript), a file of technical metadata (generated using exiftool), and .xml and MARC files of descriptive metadata. Bagger will generate the checksums and create a file manifest. Our IT department is providing 8TB of Amazon S3 storage and has set up an AWS Storage Gateway. The storage will be dedicated to these files and access will be strictly limited. I'm planning to regularly audit what's been stored but haven't decided on a tool to do that. Any recommendations? Is there anything else I should consider doing?
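To make the audit idea concrete, here is a rough sketch of the kind of fixity check I have in mind, not a finished tool. It assumes each bag has a manifest named manifest-sha256.txt in the BagIt layout (one "checksum path" pair per line, with paths relative to the bag root). The function names and the choice of SHA-256 are placeholders of mine, and the algorithm would need to match whatever Bagger was set to use.

```python
import hashlib
from pathlib import Path


def file_digest(path, algorithm, chunk_size=1024 * 1024):
    """Hash a file in chunks so large TIFFs are not read into memory at once."""
    digest = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def audit_bag(bag_dir, algorithm="sha256"):
    """Compare each file listed in manifest-<algorithm>.txt with what is on disk.

    Assumption: the manifest follows the BagIt "checksum path" line format.
    Returns a dict with 'ok', 'changed', and 'missing' lists of relative paths.
    """
    bag_dir = Path(bag_dir)
    manifest = bag_dir / f"manifest-{algorithm}.txt"
    report = {"ok": [], "changed": [], "missing": []}
    for line in manifest.read_text(encoding="utf-8").splitlines():
        if not line.strip():
            continue
        # Split only on the first run of whitespace so paths may contain spaces.
        expected, rel_path = line.split(None, 1)
        rel_path = rel_path.strip()
        target = bag_dir / rel_path
        if not target.is_file():
            report["missing"].append(rel_path)
        elif file_digest(target, algorithm) != expected.lower():
            report["changed"].append(rel_path)
        else:
            report["ok"].append(rel_path)
    return report
```

Calling audit_bag on a bag directory and logging anything that lands in "changed" or "missing" would cover the basic case. This sketch does not notice extra files that were added to a bag without being listed in the manifest, and it does not check the tag manifest, so a dedicated tool may still be the better choice.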
Thanks in advance for any advice!