Hello,

Have you tried cmecab-java? This library provides tokenizers that use
MeCab (a Japanese morphological analyzer).
http://code.google.com/p/cmecab-java/

The wiki has a sample schema.xml:
http://code.google.com/p/cmecab-java/wiki/HowToUse

On 2011/04/04, at 20:16, Erik Hatcher wrote:

> I'm trying to cull together the best practices for indexing/searching Japanese text.
> 
> For those of you using Solr, what analyzer/field-type definition do you have for Japanese?
> 
> Thanks for sharing!
> 	Erik


-- 
Kosuke Tanabe