[Dovecot] fts squat non-english search for 2 words
vuser1 at test123.ru
Sun Nov 22 15:35:58 EET 2009
Timo, thank you for the answer. Meanwhile I have been trying to set up Horde + Dovecot + search. The next step was Dovecot 1.2.4 + Solr 1.4. It works! It can now find 2 non-Latin words.
1) I cannot search by substring: neither "plane" nor "plane*" finds "planet".
2) Solr itself can use "plane*" to find "planet", so I think Dovecot internally strips or escapes the metacharacters.
I see on the wiki that you plan to implement an IMAP extension for this. Have you considered allowing users to use "*" wildcards with the Solr backend? If Dovecot already "breaks" IMAP search semantics, why not let people use "plane* Ear*" to find "planet Earth"?
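To make the request concrete, here is a rough sketch (in Python, not Dovecot code) of the translation I have in mind: a trailing "*" on a word is passed through to Solr as a prefix wildcard, and every other query metacharacter is escaped. The field name "body", the function names, and the base URL in the usage note are assumptions for illustration only; I have not checked how the fts_solr plugin actually builds its queries.

```python
import re
from urllib.parse import urlencode

# Characters that have special meaning in the Lucene/Solr query syntax.
_SPECIAL = re.compile(r'([+\-!(){}\[\]^"~*?:\\/]|&&|\|\|)')


def escape_term(term):
    """Backslash-escape query metacharacters so they are matched literally."""
    return _SPECIAL.sub(r'\\\1', term)


def build_query(user_input, field="body"):
    """Turn 'plane* Ear*' into 'body:plane* AND body:Ear*'.

    Only a trailing '*' is kept as a wildcard; the rest of each word
    is escaped. The field name is a hypothetical default.
    """
    clauses = []
    for word in user_input.split():
        is_prefix = word.endswith("*") and len(word) > 1
        stem = word[:-1] if is_prefix else word
        clauses.append(f"{field}:{escape_term(stem)}" + ("*" if is_prefix else ""))
    return " AND ".join(clauses)


def select_url(base, user_input):
    """Build a URL for Solr's select handler with the query URL-encoded."""
    return base + "/select?" + urlencode({"q": build_query(user_input)})
```

For example, select_url("http://localhost:8983/solr", "plane* Ear*") yields a select URL whose q parameter decodes to body:plane* AND body:Ear*. Depending on the analyzers in the Solr schema, a prefix term might also need lower-casing before it matches indexed text; I have left that out.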
----- Message from tss at iki.fi ---------
> I'll try to look into this when I have a bit more time..
>
> On Wed, 2009-11-18 at 16:19 +0700, vuser1 at test123.ru wrote:
>>
>> Maybe I asked the wrong question. OK, does anybody use fts_squat for
>> non-English emails? Can you find emails with a query of 2 WORDS -
>> "planet Earth"? On my system it works only when both words are in the
>> Latin alphabet; otherwise it returns nothing. For Latin queries, it even
>> finds emails containing both Latin and Russian letters (UTF-8 encoding).
>> For non-Latin, the query must consist of 1 word only.
>>
>> Thanks for any ideas.
>> > It looks like I encountered a bug or a misconfiguration. fts_squat
>> > search of subject and body works excellently for English mails. For
>> > non-English (in particular, Russian) it works only when the query
>> > consists of 1 word. Phrases of 2 or more words always return nothing.
>> > Example: search for "planet" ("планета") returns results, search for
>> > "Earth" ("Земля") also returns results, but "planet Earth"
>> > ("планета Земля") returns nothing, even though there are emails
>> > containing the exact phrase "planet Earth". This problem occurs only
>> > for non-English queries, both for search in the subject and in the
>> > email body.
>> > I tried the Horde 3.2 webmail and Thunderbird.
>> > I *turned the fts plugin off* and it correctly finds phrases with 2 or
>> > more Russian words! So the problem is squat. Is it a bug or a known
>> > config issue?
>> >
>> > dovecot -n
>> > --------------
>
>
----- End message from tss at iki.fi -----