I haven't had time to look at it yet, but someone at Code4Lib conference
proposed a more sophisticated approach to spell checking that sounded
really interesting to me, and said he was going to share the code. I
hope to have time to investigate at some point.
Let's see if I can find it on the conference page.... yeah, it was
Martin Haye. You can watch his presentation here:
http://video.google.com/videoplay?docid=4028600349627496246&hl=en
Looks like he's *martin*.*haye*[at]gmail.com. During the lightning
talk, he said he didn't want to distribute the code seperately but
wanted to include it in Lucene if possible---but later in the
conference, he said he had been convinced by the interest in it to
distriburte the code as it's own standalone thing, and planned to do
that presently.
If anyone does or has explored using martin's code, please let us know
about your experience.
Jonathan
Kevin Kierans wrote:
> Has anyone created their own "dictionaries"
> for aspell? We've created blank delimited
> lists of words from our opac. One for title,
> one for subjects, and one for authors. (We're thinking
> of a series one as well)
>
> We would like to use
> one of these word lists to offer suggestions
> depending on which search the patron is making.
> We're assuming we can make better suggestions
> if the words come from our actual opac.
>
> We've got it working with the dictionary that
> comes with aspell, but having problems (we can't do it!)
> substituting our own "dictionaries."
>
> Does anyone have any experience/knowledge/hints/pointers
> they can share with us?
>
> We are using linux, php 5, aspell 0.50.5, and
> php -> pspell functions.
>
> Thanks,
> Kevin
> TNRD Library System, Kamloops, British Columbia, Canada
>
>
--
Jonathan Rochkind
Sr. Programmer/Analyst
The Sheridan Libraries
Johns Hopkins University
410.516.8886
rochkind (at) jhu.edu
|