The Code4Lib Journal editors are excited to bring you this latest issue with
ten articles.  You can find it at; titles and abstracts


Editorial Introduction:  How Things Change

Terry Reese




Crafting Linked Open Data for Cultural Heritage: Mapping and Curation Tools
for the Linked Jazz Project

M. Cristina Pattuelli, Matt Miller, Leanora Lange, Sean Fitzell, and Carolyn


This paper describes tools and methods developed as part of Linked Jazz, a
project that uses Linked Open Data (LOD) to reveal personal and professional
relationships among jazz musicians based on interviews from jazz archives.
The overarching aim of Linked Jazz is to explore the possibilities offered
by LOD to enhance the visibility of cultural heritage materials and enrich
the semantics that describe them. While the full Linked Jazz dataset is
still under development, this paper presents two applications that have laid
the foundation for the creation of this dataset: the Mapping and Curator
Tool, and the Transcript Analyzer. These applications have served primarily
for data preparation, analysis, and curation and are representative of the
types of tools and methods needed to craft linked data from digital content
available on the web. This paper discusses these two domain-agnostic tools
developed to create LOD from digital textual documents and offers insight
into the process behind the creation of LOD in general.



Integrating Linked Data into Discovery

Götz Hatop


Although the Linked Data paradigm has evolved from a research idea to a
practical approach for publishing structured data on the web, the
performance gap between currently available RDF data stores and the somewhat
older search technologies could not be closed. The combination of Linked
Data with a search engine can help to improve ad-hoc retrieval. This article
presents and documents the process of building a search index for the Solr
search engine from bibliographic records published as linked open data.


Actions Speak Louder than Words: Analyzing large-scale query logs to improve
the research experience

Ted Diamond, Susan Price, Raman Chandrasekar


Analyzing anonymized query and click-through logs leads to a better
understanding of user behaviors and intentions, and provides opportunities
to create an improved search experience. As a large-scale provider of SaaS
services that returns search results against a single unified index, Serials
Solutions is uniquely positioned to learn from the dataset of queries issued
to its Summon® service by millions of users at hundreds of libraries around
the world.


In this paper, we describe the Relevance Metrics Framework that we use to
analyze our query logs and provide examples of insights we have gained
during development and implementation. We also highlight the ways our
analysis is inspiring changes to the Summon® service to improve the academic
research experience.


Batch metadata assignment to archival photograph collections using facial
recognition software

Kyle Banerjee and Maija Anderson


Useful metadata is essential to giving individual meaning and value within
the context of a greater image collection as well as making them more
discoverable. However, often little information is available about the
photos themselves, so adding consistent metadata to large collections of
digital and digitized photographs is a time consuming process requiring
highly experienced staff.


By using facial recognition software, staff can identify individuals more
quickly and reliably. Knowledge of individuals in photos helps staff
determine when and where photos are taken and also improves understanding of
the subject matter.


This article demonstrates simple techniques for using facial recognition
software and command line tools to assign, modify, and read metadata for
large archival photograph collections.


Using a Raspberry Pi as a Versatile and Inexpensive Display Device

Edward Iglesias and Arianna Schlegel


This article covers the process by which a library took some unused
equipment and added a cheap computing device to produce very inexpensive but
effective electronic signage. Hardware and software issues as well as a
step-by-step guide through the process are included.


The Moab Design for Digital Object Versioning

Richard Anderson


The Stanford Digital Repository has adopted the “Moab” design for versioned
archiving of digital objects–a locally developed approach that optimizes
data transfer, storage, and replication while providing efficient single
file retrieval or full object reconstruction for any version of an object.
This paper includes a review of various versioning strategies including
forward-delta, reverse-delta and content-addressable mechanisms, the pro’s
and cons of each, and highlights the relative advantages of the Moab design.
In our approach, the fixity information of a file manifestation is used as
its primary identifier and the filename is treated as metadata. Storage and
retrieval of an object’s files is faciliated by mapping between a virtual
version inventory and the physical location via a file signature catalog.


Out From Behind the Firewall: Towards Better Library IT Communications

Lisa Gayhart


Traditionally, IT departments lack a strong focus on communications and
promotions. Numerous exciting projects and services are created by library
IT departments and web development teams daily, but resources for promotion
are typically unavailable or deemed low priority. This article examines
IT-specific communications within the library context, offers a model of
user-focused communications useful to libraries of any size, and discusses
University of Toronto Libraries’ Information Technology Services
department’s efforts to increase library technology communications.


Renewing UPEI’s Institutional Repository: New Features for an
Islandora-based Environment

Donald Moses, Kirsta Stapelfeldt


In October of 2012, the University of Prince Edward Island (UPEI) launched
an updated version of IslandScholar, UPEI’s Institutional repository. The
repository, available from, is built on
Islandora 6 ( The repository includes a number of new
features, including: CSL integration for ingest, site display, and export of
user-specific bibliographies; MADS-based Authority integration for
Departments and Authors (with authorities created automatically using LDAP);
batch ingest from Refworks (crosswalked to MODS for storage in the
repository); embargo and statistics functions. Features from the first
version of IslandScholar were also migrated to the new site, including
Sherpa/Romeo integration (which provides just-in-time information about open
access policies).


Relevance and Phrase Searching in Summon: looking under the hood

Thomas Hodge & James RW MacDonald


This article briefly examines the mechanisms behind seemingly
counter-intuitive phrase search results in Serials Solutions’ discovery
platform Summon. The authors use the platform’s search API to explain why
users sometimes encounter greater numbers of results when typically they
would expect fewer. The article explores the reasons behind the search
results and the implications for library instruction.


Comparing the LibraryThing, OCLC, and Open Library ISBN APIs

David Fiander


LibraryThing, OCLC, and the Open Library project all provide ISBN services
that take an ISBN and return information about related works for that ISBN.
This article compares the license terms, quality of the data returned,
documentation available, and details of the APIs for the three products. The
article focuses on ease of use for developers, restrictions on types of use
(and assumptions about types of use built into the license or terms of use),
and cost of the services.


On behalf of the Code4Lib Journal Editorial Committee,



Terry Reese,

Code4Lib Journal Coordinating Editor for Issue #21




Terry Reese

Head, Digital Initiatives

Associate Professor

The Ohio State University

320B 18th Avenue Library

Columbus, OH  43210

Phone: 614.292.8263

Email:  <mailto:[log in to unmask]> [log in to unmask]

Personal: [log in to unmask] <mailto:[log in to unmask]>