The nice thing about nutch is that it exposes an OpenSearch interface.
So you can write your search-y webapps in any language that can speak
HTTP and XML, which both Java and PHP should be able to handle. In
fact, I'd be surprised if both languages didn't already have
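
For what it's worth, here is a rough sketch of what consuming such a feed from Java could look like, assuming the response is RSS-style XML with one <item> per hit. The sample XML, the class name OpenSearchSketch, and the example.org URLs are all made up for illustration; a real Nutch response will carry more fields and may differ in detail. In practice you would fetch the XML over HTTP from wherever your Nutch webapp exposes its OpenSearch endpoint, rather than parsing a hard-coded string.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

class OpenSearchSketch {

    // Hypothetical sample response (an assumption, not real Nutch output).
    static final String SAMPLE =
        "<?xml version=\"1.0\" encoding=\"UTF-8\"?>"
        + "<rss version=\"2.0\">"
        + "<channel>"
        + "<title>Search results: staff handbook</title>"
        + "<item><title>Staff Handbook</title>"
        + "<link>http://intranet.example.org/handbook.html</link></item>"
        + "<item><title>Holiday Schedule</title>"
        + "<link>http://intranet.example.org/holidays.html</link></item>"
        + "</channel></rss>";

    // Returns one {title, link} pair per <item> element in the feed.
    static List<String[]> parseResults(String xml) throws Exception {
        DocumentBuilder builder =
            DocumentBuilderFactory.newInstance().newDocumentBuilder();
        Document doc = builder.parse(
            new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));

        NodeList items = doc.getElementsByTagName("item");
        List<String[]> results = new ArrayList<>();
        for (int i = 0; i < items.getLength(); i++) {
            Element item = (Element) items.item(i);
            // Look up title and link inside this item only, so the
            // channel-level <title> is not picked up by mistake.
            String title = item.getElementsByTagName("title").item(0).getTextContent();
            String link = item.getElementsByTagName("link").item(0).getTextContent();
            results.add(new String[] { title, link });
        }
        return results;
    }

    public static void main(String[] args) throws Exception {
        for (String[] hit : parseResults(SAMPLE)) {
            System.out.println(hit[0] + " -> " + hit[1]);
        }
    }
}
```

This uses only the XML parser that ships with the JDK, so there is nothing extra to install. PHP's built-in XML support should allow much the same approach.
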
On Tue, Aug 5, 2008 at 7:29 PM, Cloutman, David wrote:
> Thanks to both Roy and Bess, and anyone else who posts after I write
> this. I'll definitely have to look into nutch. Just to state my needs a
> little more clearly, I'm trying to keep our applications contained to
> Java and PHP solutions, if possible, as our machines are already
> configured to utilize those platforms.
> David Cloutman
> Electronic Services Librarian
> Marin County Free Library
> -----Original Message-----
> From: Code for Libraries On Behalf Of
> Bess Sadler
> Sent: Tuesday, August 05, 2008 4:19 PM
> Subject: Re: [CODE4LIB] Solr for Internal Searching
> Hi, David.
> I think solr is great, and I use it all the time and can highly
> recommend it. However, if what you have is mostly HTML pages, you
> might want to consider nutch (http://lucene.apache.org/nutch) instead.
> Both solr and nutch are based on lucene, but nutch will give you more
> built-in tools for crawling your website. Use the right tool for the
> job and all that. :)
> On 5-Aug-08, at 7:03 PM, Cloutman, David wrote:
>> Today my boss asked me to come up with a solution that would let us
>> index and search our intranet. I was already thinking of using Solr on
>> our public Web site we are building, and thought this might be a good
>> opportunity to knock two items off the to-do list with the same
>> technology. I know there was a preconference session on Solr this year,
>> and I have the sense that this is gaining traction in the library
>> community. Is there any reason why I shouldn't do this?
>> - David
>> David Cloutman
>> Electronic Services Librarian
>> Marin County Free Library