Hello,
Have you tried cmecab-java? This library provides tokenizers using
MeCab (Japanese dependency structure analyzer).
http://code.google.com/p/cmecab-java/
Here is a sample schema.xml.
http://code.google.com/p/cmecab-java/wiki/HowToUse
On 2011/04/04, at 20:16, Erik Hatcher wrote:
> I'm trying to cull together the best practices for indexing/searching Japanese text.
>
> For those of you using Solr, what analyzer/field-type definition do you have for Japanese?
>
> Thanks for sharing!
> Erik
--
Kosuke Tanabe
|