data.world was founded in early 2016 and has raised $33 million from leading investors. The company was started by serial entrepreneur Brett Hurt and an experienced co-founding team that previously held leadership roles at HomeAway and Bazaarvoice. data.world is a B Corp with a mission to build the world’s most collaborative, abundant, and meaningful data resource. We provide a platform that improves community, connectivity, and collaboration around data. We improve the “first mile” of data work, where 80% of a data project's time is spent and which is riddled with inefficiencies. We’ve been named to the Austin Business Journal’s Best Places to Work list every year since the company was founded, and we were named to the Austin Chamber of Commerce’s A-List of Hottest Startups in 2017.
We value transparency – we explain our decisions and how we arrived at them, and we share our Board updates with the team. We take pride in what we do – we commit, and we deliver. We are not afraid of mistakes – we take calculated risks and strategically push boundaries. Our focus is fierce, but it’s not all hard work. You might see us bouldering, at an American Ninja Warrior taping, or at the Drafthouse.
As the Data Librarian, you will be responsible for data.world’s data catalog, take part in developing certain aspects of the data architecture, and serve as a resource between our community and data. You will manage data.world’s dataset catalog, composed of over 100K datasets, by contributing more data to the catalog, maintaining the current datasets, and engaging the data.world community with data projects curated and developed by you. You will also build upon your existing experience with developing and managing taxonomies while working closely with linked data technology. As someone who will work intimately with the data, you will also be available to assist our community and clients with data-related needs. Depending on the task, you may find yourself working closely with Data Scientists, our Head of Community, Product Managers, Software Engineers, Account Managers, or the Executive Team.
As an ideal candidate, you’re open-minded and motivated, and you’ve had experience working in a bustling startup or have a strong desire to do so. You have a passion for making data easier to use and a strong desire to be part of a team that is exploring uncharted territories in semantic web technology. An understanding of developing schemas and implementing ontologies, a love for data, and an understanding of what makes a positive user experience will make you a great candidate for this role.
Maintaining the data catalog
Assisting with data partnerships and projects
Responding to clients’ data-related needs
Data modeling and organization
Understanding of content management and data preservation
Developing knowledge organization systems like controlled vocabularies, metadata schemas, and taxonomies
Working with a variety of technical stakeholders, including customers
Comfortable with public speaking
Experience working in a collaborative workspace
Building SQL or SPARQL queries (Python/R experience a substitute)
Comfortable at the command line
Familiarity with Git/GitHub
Experience using Python/R for data analysis and cleanup
Familiarity with semantic editing tools and implementing semantic models using RDF, RDFS, OWL, SKOS
Familiarity with the open data movement
Familiarity with the data.world platform
Master’s Degree in Library and Information Sciences (MLIS), Information Management, or a related field, OR a Certificate in Data Science or Data Engineering in addition to pertinent professional experience
Brought to you by code4lib jobs: https://jobs.code4lib.org/jobs/30199-data-librarian