Print

Print


Don't forget Sphinx.

http://sphinxsearch.com/
> Sphinx is a full-text search engine, distributed under GPL version
> 2. Commercial license is also available for embedded use.
>
> Generally, it's a standalone search engine, meant to provide fast,
> size-efficient and relevant fulltext search functions to other
> applications. Sphinx was specially designed to integrate well with
> SQL databases and scripting languages. Currently built-in data
> sources support fetching data either via direct connection to
> MySQL, or from an XML pipe.
>
>

Fast. Builds easy/everywhere.

--Casey


On Jul 18, 2007, at 7:45 AM, Alberto Accomazzi wrote:

> Our project is looking to transition to a new search engine to handle
> our bibliographic databases (5.5M records of bibliographic article
> metadata + 0.6M fulltext articles).  What we are looking for is
> something easily tweakable, which offers fielded searches,
> boolean/simple search logic, customizable relevance ranking,
> proximity,
> highlighting, synonym/stemming matching.  Needs to run on a linux
> 64-bit
> box.  The packages I am aware of are:
>
> 1. lucene/clucene/lucy
> 2. kinosearch
> 3. xapian
> 4. zebra
> 5. invenio
>
> Am I missing any from the list?  Are any of these to be excluded based
> on our requirements?  I'd like to hear experiences from people who are
> using or have used these packages.
>
> TIA
>
> -- Alberto
>
> ********************************************************************
> Dr. Alberto Accomazzi                  aaccomazzi(at)cfa harvard edu
> NASA Astrophysics Data System                        ads.harvard.edu
> Harvard-Smithsonian Center for Astrophysics      www.cfa.harvard.edu
> 60 Garden St, MS 67, Cambridge, MA 02138, USA
> ********************************************************************