The J. Paul Getty Trust is looking for an enthusiastic Senior Data Engineer, with the experience and passion to lead the execution of technical projects to support, enrich and ensure the persistence of the institution's cultural heritage knowledge bases. Our aim is to provide a deeply connected, consistent experience for scholars, researchers, and art enthusiasts as they explore the complex information held across the organization, and your participation is crucial for that to be successful.
You will report to the Enterprise Semantic Architect, and interact equally with both software engineers and content specialists. Your work will improve the quality, reliability, connectedness, and consistency of our data by engineering data pipelines and validation tools, configuring Linked Open Usable Data (LOUD) platforms, and assisting with the design and implementation of our, and the community's, overall data model.
The Getty is among the most prestigious cultural heritage organizations in the world, dedicated to furthering the study of the history of art. You will work on an amazing campus amongst fabulous art, architecture, and information systems, collaborating with world-class scientists, curators, librarians, archivists, and academics. We offer 4 weeks of vacation a year plus every other Friday off, excellent benefits, and a very strong commitment to balancing work and personal life.
Major Job Responsibilities
Work with software engineering, systems and content stakeholders to understand data-oriented technical requirements
Design and implement robust data transformation pipelines to migrate legacy datasets into Linked Open Usable Data (LOUD)
Design and implement validation services for LOUD models
Design and implement agile data management practices, with testing, continuous integration and deployment paradigms
Design and implement LOUD data cleaning processes and pipelines, in collaboration with content stakeholders
Assist with configuration of institutional LOUD platform instances
Assist software engineering teams by translating stakeholder requirements into feature requests for platform development
Assist the Semantic Architect in data model and API design and documentation
Integrate external content services to enrich and reconcile our data, as time allows
Bachelor's degree in Computer Science, Information Science, or equivalent experience; Master's or PhD preferred
5+ years software development experience; data engineering experience preferred
Knowledge, Skills and Abilities
Interest in the digital work of cultural heritage organizations
Attention to detail combined with a focus on data and usability
Excellent verbal and written communication skills, especially when interacting with non-technical stakeholders and senior management
Expertise in Linked Open Data standards and technologies
Proficiency in Python, or willingness to translate experience in equivalent language
Proficiency in SQL and document oriented databases
Familiarity with cultural heritage data standards, such as IIIF and CIDOC-CRM
Familiarity with tools such as git, docker, AWS, Elastic.
Familiarity with test driven and agile software development methodologies
Familiarity with machine learning techniques, parallel processing
Brought to you by code4lib jobs: https://jobs.code4lib.org/jobs/31771-senior-data-engineer