Hi Ya-aqov,
Many of the indexes we've built over the last three years or so at the UVa
Library have been Lucene/Solr. What I am working on is actually in no way
connected to blacklight. I'm interested in using Solr with the subjects for
autosuggest because I can generate XForms instances dynamically from
TermsComponent.
Ethan
On Fri, Dec 11, 2009 at 10:12 AM, Ya'aqov Ziso <[log in to unmask]> wrote:
> Ethan, kind thanks, and three questions to you (and I presume UVirginiašs
> Solr refers to Blacklight, thus am CC-ing Vufind-ers):
>
> 1. I asked myself if WorldCat indexes and upkeeps FAST, why do you need to
> index FAST in your local Solr?š, and I answered: because you want to link
> FAST to your local bibs. Is this the case?
> 2. If it is the case, are you going to load FAST on a weekly basis, cronjob
> the procedure?
> 3. If you are, what do you need FAST autusoggester for?
>
> Yašaqov
>
>
>
>
> On 12/10/09 4:35 PM, "Ethan Gruber" <[log in to unmask]> wrote:
>
> > Nice work, Ralph. That's really slick. I have all the subject terms in
> the
> > solr index, but I would like to eventually integrate the Worldcat data to
> > make the results more relevant (rather than just sorted alphabetically
> since
> > each subject occurs only once).
> >
> > I have yet to adapt my Orbeon forms to handle dynamic querying of the
> terms
> > in Solr, but would like to have that done in the next week or two (I
> hope!).
> >
> > Here's a sample of the data:
> >
> >
> http://beta.scholarslab.org:9080/solr-1.4/terms?terms.fl=subject&terms.limit=2
> > 5&terms.prefix=Egy
> >
> > The terms.prefix parameter displays terms that start with the letters
> > inputted. It's extremely fast, so I have no doubt I can use it for
> > autosuggest per keystroke.
> >
> > Ethan
> >
> > On Thu, Dec 10, 2009 at 4:07 PM, LeVan,Ralph <[log in to unmask]> wrote:
> >
> >> > In a rare demonstration of doing as promised, I have a FAST
> >> > AutoSuggester running based on that data. An HTML demonstration of
> the
> >> > service can be found at http://orlabs.oclc.org/FAST/autosuggest.htmland
> >> > the underlying AutoSuggester is running at
> >> > http://orlabs.oclc.org/FAST/AutoSuggest. Searches from the HTML go
> to
> >> > WorldCat.
> >> >
> >> > An example of a suggestion request would be
> >> > http://orlabs.oclc.org/FAST/AutoSuggest?query=0
> >> >
> >> > It returns JSON, which the above HTML link demonstrates the use of.
> >> >
> >> > My AutoSuggester is just a thin interface to an SRU database. The
> query
> >> > that comes to the AutoSuggester is turned into an SRU query and the
> >> > resulting record is dropped into the AutoSuggester response. The link
> >> > to the SRU database for the above query for "0" would be
> >> >
> http://orlabs.oclc.org/identities/search/AutoSuggestFAST?query=fragment+
> >> >
> >> exact+%220%22<
> http://orlabs.oclc.org/identities/search/AutoSuggestFAST?query=
> >> fragment+%0Aexact+%220%22>
> >> >
> >> > I've put up a file containing all the records I loaded into my
> database.
> >> > The records contain 2 tab delimited fields. The first field is the
> key
> >> > to the record and corresponds to the user's keystrokes (e.g. '0'
> above).
> >> > The second field contains the JSON response which is an ordered array
> of
> >> > terms. I load that into the moral equivalent of Lucene and you see
> the
> >> > results in that SRU search. The file can be found at
> >> > http://orlabs.oclc.org/FAST/AutoSuggestFAST.zip
> >> >
> >> > Let me know if you have any problems or questions.
> >> >
> >> > Ralph
> >> >
> >> >
> >>> > > -----Original Message-----
> >>> > > From: LeVan,Ralph
> >>> > > Sent: Thursday, December 10, 2009 11:12 AM
> >>> > > To: [log in to unmask]
> >>> > > Subject: RE: [CODE4LIB] Auto-suggest and the id.loc.gov LCSH web
> >> > service
> >>> > >
> >>> > > We've made some progress on this topic.
> >>> > >
> >>> > > I have available a list of our FAST subject headings. They are
> >> > derived from
> >>> > > LCSH and may be of some use. The folks that produced this file are
> >> > working on
> >>> > > producing a similar file for LCSH.
> >>> > >
> >>> > > The file can be found at
> http://orlabs.oclc.org/FAST/fastOutput.zip.
> >>> > >
> >>> > > The file contains tab delimited records. The first column is the
> ID
> >> > number of the
> >>> > > FAST record that the term comes from. The second column is the
> MARC
> >>> > > Authorities field that the term came from. The third column is the
> >> > term itself. The
> >>> > > 4th column is the count of records in WorldCat that are retrieved
> when
> >> > that term
> >>> > > is used in the Keyword index. The 5th column is the count of
> records
> >> > in
> >>> > > WorldCat that are retrieved when that term is used in the Subject
> >> > index.
> >>> > >
> >>> > > I expect to have an AutoSuggester built on this today.
> >>> > >
> >>> > > Ralph
> >> >
>
|