On Nov 27, 2006, at 5:19 PM, Andrew Nagy wrote:
> BTW, have you played with Hadoop?  I guess it's something like the
> open-source attempt to google's search algorithm.  I would be curious
> about implementing hadoop across a few servers to store the marcxml
> records.

Hadoop is not a search engine.  It is a distributed file system and
MapReduce framework designed for massive data partitioning and
manipulation.

I'd be surprised if you have so many marcxml records that you'd need
more than one system to store and index them all into a basic Lucene
index.  If so, I'm curious to know more about the numbers you're
dealing with.
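
For a rough sense of those numbers, something like the following
Python sketch could count the records in a MARCXML dump.  The function
name and the inline sample data are made up for illustration; the only
thing taken from the standard is the MARC21 "slim" namespace.  It
streams the file, so it should not need to hold the whole collection in
memory.

```python
import io
import xml.etree.ElementTree as ET

# Namespace used by MARCXML ("MARC21 slim") documents.
MARC_NS = "http://www.loc.gov/MARC21/slim"


def count_marcxml_records(source):
    """Stream through a MARCXML file (path or file object) and count <record> elements."""
    count = 0
    for _event, elem in ET.iterparse(source, events=("end",)):
        if elem.tag == f"{{{MARC_NS}}}record":
            count += 1
            elem.clear()  # drop the record's children once it has been counted
    return count


# Hypothetical two-record collection, only to show the call.
sample = io.BytesIO(
    b'<collection xmlns="http://www.loc.gov/MARC21/slim">'
    b"<record><leader>00000nam a2200000 a 4500</leader></record>"
    b"<record><leader>00000nam a2200000 a 4500</leader></record>"
    b"</collection>"
)
print(count_marcxml_records(sample))
```

Pointing the same function at the real export file, together with its
size on disk, would give the record count and average record size.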

        Erik