As an archivist I would suggest that rather than thinking up all the possible requirements, check with your archives staff, your institutional records policy, and your archives collections policy to find out what their actual requirements are. Having the full digital content as it was displayed is important for preservation. As archivists part of our job is to represent in description what the content is, how is was in context of the time it was created and used, and what has been done it to present it to users (over time.)
Ad layout may be different from what the specific ads were. Taking snapshots for the particular ads may be different than having full dynamic reconstructions of websites. Providing non-dynamic PDFs of webpages may not be the same as following the navigation pathways through a website.
From: Code for Libraries [mailto:[log in to unmask]] On Behalf Of Wilhelmina Randtke
Sent: Wednesday, January 15, 2014 10:29 AM
To: [log in to unmask]
Subject: Re: [CODE4LIB] archiving web pages
Agreed, don't focus too much on preserving the presentation for an online newspaper. The text and images are important, but the layout isn't so important.
On Tue, Jan 14, 2014 at 10:59 AM, Kyle Banerjee <[log in to unmask]>wrote:
> IMO, there are many web archiving situations where it is more
> appropriate to just focus on the content rather than the manifestation of the content.
> Just as you wouldn't expect a 1995 article from the NYT to be
> displayed as the website was in 1995 or an article in an online
> database to actually appear like it originally appeared online, it's
> the content rather than the skin that's relevant in the case of a
> newspaper. If you make sure it's in a format that can be migrated
> forward and added to standalone or union systems that provide access to this sort of stuff, you'll be fine.
> On Tue, Jan 14, 2014 at 8:48 AM, Kathryn Frederick (Library) <
> [log in to unmask]> wrote:
> > Hi,
> > I'm trying to develop a strategy for preserving issues our school's
> > newspaper. Creating a WARC file of the content seems
> > straightforward, but how will that content fair long-term? Also, how
> > is the WARC served to an end-user? Is there some other method I should look at?
> > Thanks in advance for any advice!
> > Kathryn