Interesting project! But not I had in mind. I’m looking to archive the actual pages, so I can refer to them (and possibly extract information from them).
Alex
On 13 January 2017 at 15:25:43, Schmitz Fuhrig, Lynda ([log in to unmask]) wrote:
Check out https://webrecorder.io/
Lynda Schmitz Fuhrig
Electronic Records Archivist
Digital Services Division
Smithsonian Institution Archives
Capital Gallery Building
600 Maryland Ave SW
Suite 3000
MRC 507
Washington, DC 20024-2520
siarchives.si.edu <http://siarchives.si.edu/> | @SmithsonianArch
<https://twitter.com/smithsonianarch> | Facebook
<https://www.facebook.com/SmithsonianInstitutionArchives> | e-newsletter
<http://visitor.r20.constantcontact.com/manage/optin/ea?v=0010Oqxbncv4Wpyhe
Eee3Q9DHdF_192SxMMIWgsXuMG1qJ5yKPErzu0TI5d4qyMxK4iLMccSoQG5ck%3D>
A gift
<http://siarchives.si.edu/about/donate-smithsonian-institution-archives>
in support of the Archives will help make more of our collections
accessible!
On 1/13/17, 2:43 AM, "Code for Libraries on behalf of Alex Armstrong"
<[log in to unmask] on behalf of [log in to unmask]> wrote:
>Has anyone had to archive selected pages from a login-protected site? How
>did you do it?
>
>I've used the CLI tool httrack in the past for archiving sites. But in
>this
>case, accessing the pages require logging in. There's some vague
>documentation about how to do this with httrack, but I haven't cracked it
>yet. (The instructions are better for the Windows version of the
>application, but I only have ready access to a Mac.)
>
>Before I go on a wild goose chase, any help would be much appreciated.
>
>Alex
>
>--
>Alex Armstrong
>Web Developer & Digital Strategist, AMICAL Consortium
>[log in to unmask]
|