Web Archive Engineer - 60432

Application link:


This position is double-posted at the 4P3 and 4P4 levels.
This is a four-year fixed-term position with the possibility of an extension.
Job Objective:
Stanford University Libraries (SUL) is seeking a talented software engineer to support the Web Archiving Service. This is a four year fixed-term position with the possibility of an extension.
The position is a key element in the implementation and ongoing support of SUL's Web Archiving Service. The Service will enable the archiving of web content into the Stanford Digital Repository (SDR) on behalf of Stanford librarians, faculty, and researchers and in support of the University's needs for research, teaching, library collection building, and regulatory compliance.
The Web Archiving Engineer will primarily develop and maintain software to facilitate web archiving workflows and use cases: harvesting, data management, quality assurance, discovery, indexing, access and analysis. This will entail deployment, local optimization and possible enhancement of community-developed open source web archiving tools and best practices.
Reporting to the Manager for Application Development and working closely with the Web Archiving Service Manager, the successful candidate will be responsible for developing, configuring and/or managing web archiving systems and related digital library components; pioneering tools and techniques for the collection, replay and preservation of the next generation of web technologies; troubleshooting and resolving technical issues related to Service operation; and streamlining the processing of archived web content through the entire lifecycle.
Primary Responsibilities:
Systems Analysis, Architecture Design, Implementation and Administration (50%)
Provide technical analysis and software engineering support for web archiving and related digital preservation activities at SUL. Install, configure and manage Heritrix, Wayback Machine and other components necessary to build an end-to-end service. Streamline the ingest of harvested and other target content and associated metadata into repository, discovery and access environments.
Operational Support (25%)
Collaborate with the Web Archiving Service Manager to troubleshoot and resolve technical issues affecting harvest, replay and web archiving workflows. Generate Wayback Machine and Lucene indexes to enable web archive replay, full-text searching and metadata analysis.
Harvest Engineering (15%)
Develop tools and techniques to enable archival capture and replay of rich media, streaming content, social media as well as traditional web page content. Administer web crawls to maximize data capture quality and efficient use of limited resources.
Community Engagement (10%)
Play an active role in the cultural heritage web archiving community. Stay abreast of evolving best practices and tools for web archiving and make appropriate recommendations for local service enhancement.


Minimum Qualifications
Preferred Qualifications

Please apply via the link at

Bess Sadler
[log in to unmask]
Manager, Application Development
Digital Library Systems & Services
Stanford University Library

To unsubscribe from the DLF-ANNOUNCE list, click the following link: