This position is double-posted at the 4P3 and 4P4 levels.
This is a four-year fixed-term position with the possibility of an extension.
Job Objective:
Stanford University Libraries (SUL) is seeking a talented software engineer to
support the Web Archiving Service. This is a four-year fixed-term
position with the possibility of an extension.
The position is a key element in the implementation and ongoing support of
SUL's Web Archiving Service. The Service will enable the archiving of
web content into the Stanford Digital Repository (SDR) on behalf of
Stanford librarians, faculty, and researchers and in support of the
University's needs for research, teaching, library collection building,
and regulatory compliance.
The Web Archiving Engineer will primarily develop and maintain software to
facilitate web archiving workflows and use cases: harvesting, data
management, quality assurance, discovery, indexing, access and analysis.
This will entail deployment, local optimization and possible
enhancement of community-developed open source web archiving tools and
best practices.
Reporting to the Manager for Application Development and working closely with the
Web Archiving Service Manager, the successful candidate will be
responsible for developing, configuring and/or managing web archiving
systems and related digital library components; pioneering tools and
techniques for the collection, replay and preservation of the next
generation of web technologies; troubleshooting and resolving technical
issues related to Service operation; and streamlining the processing of
archived web content through the entire lifecycle.
Primary Responsibilities:
Systems Analysis, Architecture Design, Implementation and Administration (50%)
Provide technical analysis and software engineering support for web archiving
and related digital preservation activities at SUL. Install, configure
and manage Heritrix, Wayback Machine and other components necessary to
build an end-to-end service. Streamline the ingest of harvested and
other target content and associated metadata into repository, discovery
and access environments.
Operational Support (25%)
Collaborate with the Web Archiving Service Manager to troubleshoot and resolve
technical issues affecting harvest, replay and web archiving workflows.
Generate Wayback Machine and Lucene indexes to enable web archive
replay, full-text searching and metadata analysis.
Harvest Engineering (15%)
Develop tools and techniques to enable archival capture and replay of rich
media, streaming content and social media, as well as traditional web page
content. Administer web crawls to maximize data capture quality and
make efficient use of limited resources.
Community Engagement (10%)
Play an active role in the cultural heritage web archiving community. Stay
abreast of evolving best practices and tools for web archiving and make
appropriate recommendations for local service enhancement.