Thanks for the input so far.
Ben, another problem with digestibility of the search results is that it's
not XHTML, and therefore not well-formed XML, making it impossible to
process with XPath.
I think I'll experiment with the Solr solution, but like the AutoSuggester
being developed at OCLC, the index would be fairly static unless there was a
way to pull updates from LOC into it.
Ethan
On Mon, Dec 7, 2009 at 11:43 AM, LeVan,Ralph <[log in to unmask]> wrote:
> Here in OCLC Research we've been experimenting with AutoSuggester
> services. The folks in charge of our copy of LCSH are considering
> putting up an AutoSuggester for that. I'll let you know in the next
> couple of days how that goes.
>
> To make the service work at keystroke speeds, we've had to precalculate
> the responses and load them into a database of their own. We walk
> through the database that we're building the AutoSuggester for, pulling
> out 4-tuples of data: the term to be suggested, the recordID associated
> with the term (in case multiple terms might be suggested for the same
> record), a weight for the term and a string of other arbitrary data to
> send along with the recommendation (in the case of our VIAF
> AutoSuggester, that's a list of authority control numbers). Those
> tuples are then evaluated to generate a list of the 10 best terms that
> match each keystroke combination and that list is turned into a record
> and the keystroke combination is the key to that record. We then load
> those records into a simple text database indexing on the keystroke
> combination. We front-end that database with a simple service and we're
> done.
>
> The only downside to this scheme is that the AutoSuggester database is
> relatively static.
>
> Ralph
>
> > -----Original Message-----
> > From: Code for Libraries [mailto:[log in to unmask]] On Behalf
> Of
> > Ethan Gruber
> > Sent: Monday, December 07, 2009 11:14 AM
> > To: [log in to unmask]
> > Subject: Re: [CODE4LIB] Auto-suggest and the id.loc.gov LCSH web
> service
> >
> > It doesn't seem very efficient. It is taking me at least 30 seconds
> to load
> > a page of 'a*' in http://id.loc.gov/authorities/search/
> >
> > On Mon, Dec 7, 2009 at 11:05 AM, Houghton,Andrew <[log in to unmask]>
> > wrote:
> >
> > > > From: Code for Libraries [mailto:[log in to unmask]] On
> > Behalf Of
> > > > Winona Salesky
> > > > Sent: Monday, December 07, 2009 11:00 AM
> > > > To: [log in to unmask]
> > > > Subject: Re: [CODE4LIB] Auto-suggest and the id.loc.gov LCSH web
> > > > service
> > > >
> > > > Quoting Ethan Gruber <[log in to unmask]>:
> > > >
> > > > > I have a need to integrate the LCSH terms into a web form that
> uses
> > > > > auto-suggest to control the vocabulary. Is this technically
> possible
> > > > with
> > > > > the id.loc.gov service?
> > >
> > > Why can't you just add a "*" to the end of the data in your search
> form
> > > and send the request to the id.loc.gov search, per:
> > >
> > > <http://id.loc.gov/authorities/techcenter/searching.html>
> > >
> > > then parse the response?
> > >
> > >
> > > Andy.
> > >
>
|