You had me with the compelling illustration. :) I haven't implemented every piece in the puzzle, but it seems like a viable setup. On 6/25/07, Eric Lease Morgan <[log in to unmask]> wrote: > > > Below is some text I wrote outlining the technical infrastructure for > at thing we colloquially call the "catholic > portal" (www.catholicresearch.net). Does the infrastructure make > sense to y'all? If it doesn't make sense to you, then it won't make > sense to non-technoweenies. > > Catholic Research Resources Initiative and its technical infrastructure > > This text outlines the proposed technical infrastructure for the > Catholic Research Resources Initiative (CRRI). > > The infrastructure begins with two assumptions. First, from the > user's point of view, the system provides a searchable/browsable > interface to sets of EAD (Encoded Archival Document) files. Second, > the system makes every effort to provide this interface through well- > established Web-based protocols thus making the underlying components > more modular. > > Figure 1 illustrates the proposal. Starting on the far left are sets > of EAD files. These files will be created remotely at partner > institutions and sent to a central location. Once received metadata > will be extracted and stored in a relational database along with the > entire EAD files. This metadata, in combination with a simple faceted > classification system, will provide a way to maintain and logically > organize the CRRI content. We propose to use MySQL as the relational > database and a set of object-oriented Perl modules called MyLibrary > to facilitate input/output against the database. [1, 2] > > To facilitate search, a report will be written against the database > and given to an indexing program. The indexer/search engine is > expected to support fielded, free-text, and full-text searching, as > well as relevancy ranking. More importantly, the search engine is > expected to be accessible through a Web Services-based protocol > called SRU (Search-Retrieve via URL). [3] This will enable other > information services to search the CRRI without using the CRRI > website. Examples of other information services include metasearch > interfaces now common in libraries. The use of SRU will also enable > the CRRI to exchange its underlying indexing program without changing > the user interface. We plan to use either Zebra, Kinosearch, or > Lucene as our indexing program. [4, 5, 6] > > To facilitate browse the increasingly popular "faceted navigation" > technique will be employed. Using the metadata contained in the EAD > files, very broad "facets" will be created. Examples include > subjects, formats, people, institutions, themes, and maybe dates. > Each facet will have associated with it sets of "terms" such as > African Americans, letters, Dorothy Day, Seton Hall University, or > Catholic Social Action. Through a second set of reports, these facet/ > term combinations will be displayed in a user's browser, and by > selecting them relevant content will be returned. > > To broaden access to the CRRI's content, a third set of reports will > be written against the database to enable OAI-PMH (Open Archives > Initiative - Protocol for Metadata Harvesting). [7] These reports > will result in the creation of sets of XML files saved to the > computer's file system. An OAI "data repository" application will > provide access to the files and enable OAI "service providers" to > read the metadata and use it in other applications. We plan to use > XMLFile for the data repository. [8] An example of a service provider > is OAIster. [9] > > > Links > > 1. MySQL - http://mysql.com > 2. MyLibrary - http://dewey.library.nd.edu/mylibrary > 3. SRU - http://loc.gov/standards/sru > 4. Zebra - http://indexdata.dk/zebra > 5. Kinosearch - http://rectangular.com/kinosearch > 6. Lucene - http://lucene.apache.org > 7. OAI-PMH - http://openarchives.org > 8. XMLFile - http://www.dlib.vt.edu/projects/OAI/software/xmlfile > 9. OAIster - http://oaister.org > > -- > Eric Lease Morgan > University Libraries of Notre Dame >