FTS lucene and english + german mails
tss at iki.fi
Wed Jun 17 13:43:17 UTC 2015
On 15 Jun 2015, at 21:22, Felix Zielcke <fzielcke at z-51.de> wrote:
> I'm currently looking over the FTS pages to enable it in my dovecot.
> But I'm unsure what the best settings of the lucene plugin are, if you
> receive german and english mails.
> Wiki says:
> textcat_conf=<path> textcat_dir=<path>: If specified, enable guessing
> the stemming language for emails and search keywords. This is a little
> bit problematic in practice, since indexing and searching languages may
> differ and may not find even exact words because they stem differently.
> On Debian libstemmer is included in the debian-lucene package.
> So what settings are the best to have not the problem that exact words
> can't be found?
The textcat support in fts-lucene works very badly and shouldn't be used. There's new lib-fts code being developed that supports multiple languages better. It's already kind of usable in v2.2.18, but would be better to wait for v2.2.19.
More information about the dovecot