FTS lucene and english + german mails

Timo Sirainen tss at iki.fi
Wed Jun 17 13:43:17 UTC 2015


On 15 Jun 2015, at 21:22, Felix Zielcke <fzielcke at z-51.de> wrote:
> 
> Hi,
> 
> I'm currently looking over the FTS pages to enable it in my Dovecot.
> But I'm unsure what the best settings for the lucene plugin are if you
> receive German and English mails.
> The wiki says:
> 
> textcat_conf=<path> textcat_dir=<path>: If specified, enable guessing
> the stemming language for emails and search keywords. This is a little
> bit problematic in practice, since indexing and searching languages may
> differ and may not find even exact words because they stem differently.
> 
> On Debian libstemmer is included in the debian-lucene package.
> 
> So which settings are best to avoid the problem that exact words
> can't be found?

The textcat support in fts-lucene works very badly and shouldn't be used. New lib-fts code is being developed that supports multiple languages better. It's already more or less usable in v2.2.18, but it would be better to wait for v2.2.19.
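
For illustration, an fts-lucene setup that simply leaves out textcat_conf and textcat_dir might look roughly like the sketch below. The file placement and the whitespace_chars value are assumptions taken from the commonly documented example, not a tested recommendation for mixed German and English mail.

```
# Sketch only: where this goes (dovecot.conf or a conf.d file) is an assumption.
mail_plugins = $mail_plugins fts fts_lucene

plugin {
  fts = lucene
  # No textcat_conf= / textcat_dir= here, so no language guessing is attempted.
  # whitespace_chars=@. is an assumed, commonly used value; adjust as needed.
  fts_lucene = whitespace_chars=@.
}
```

Without textcat, indexing and searching use the same stemming rules, so the mismatch described in the quoted wiki text should not arise, although German words would not be stemmed with German rules.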


