[Dovecot] fts squat non-english search for 2 words

Timo Sirainen tss at iki.fi
Wed Nov 18 18:14:50 EET 2009


I'll try to look into this when I have a bit more time..

On Wed, 2009-11-18 at 16:19 +0700, vuser1 at test123.ru wrote:
> 
> Maybe I asked the wrong question. OK, does anybody use fts_squat for non-English emails? Can you find emails with a 2-WORD query such as "planet Earth"? On my system it works only when both words are in the Latin alphabet; otherwise it returns nothing. For Latin queries, it even finds emails containing both Latin and Russian letters (UTF-8 encoding). For non-Latin queries, the query must consist of 1 word only.
> 
> Thanks for any ideas.
> > It looks like I encountered a bug or misconfiguration. fts_squat search for
> > subject and body works very well for English mails. For non-English
> > (in particular, Russian) it works only when the query consists of 1 word.
> > Phrases of 2 or more words always return nothing. Example: a search
> > for "planet" ("планета") returns results, a search for "Earth"
> > ("Земля") also returns results, but "planet Earth" ("планета Земля")
> > returns nothing, even though there are emails containing the exact
> > phrase "planet Earth". This problem occurs only for non-English queries,
> > both for search in subject and in email body.
> > I tried the Horde 3.2 webmail client and Thunderbird.
> > I *turned the fts plugin off* and it correctly finds phrases with 2 or
> > more Russian words! So the problem is squat. Is it a bug or a known
> > config issue?
> >
> > dovecot -n
> > --------------


