Timo, thank you for the answer. Meanwhile, I was trying to set up Horde + Dovecot + search. The next step was Dovecot 1.2.4 + Solr 1.4. It works! Now it can find two non-Latin words.
- I cannot search by substring: neither "plane" nor "plane*" finds "planet".
- Solr itself can use "plane*" to find "planet", so I think Dovecot internally strips or escapes the metacharacters.
I see on the wiki that you plan to implement an IMAP extension for this. Have you considered allowing users to use "*" wildcards with the Solr backend? If Dovecot already "breaks" IMAP search, why not let people use "plane* Ear*" to find "planet Earth"?
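To make the wildcard idea concrete, here is a rough sketch in Python (not Dovecot code) of how a user query such as "plane* Ear*" could be turned into a Solr query string that keeps a trailing "*" as a prefix wildcard and escapes every other metacharacter. The function names, the `body` field name, and the choice to join terms with AND are assumptions made for illustration only; the real field names depend on the Solr schema in use.

```python
import re

# Characters (and the two-character operators && and ||) that have a
# special meaning in the Lucene/Solr query syntax and must be escaped.
_SPECIAL = re.compile(r'([+\-!(){}\[\]^"~*?:\\/]|&&|\|\|)')


def escape_term(term):
    """Backslash-escape query metacharacters inside a single term."""
    return _SPECIAL.sub(r'\\\1', term)


def to_solr_query(user_query, field="body"):
    """Build a Solr query from whitespace-separated words.

    A word ending in '*' is treated as a prefix search; the trailing '*'
    is kept unescaped. `field` is a hypothetical schema field name.
    """
    clauses = []
    for word in user_query.split():
        is_prefix = word.endswith("*") and len(word) > 1
        stem = word[:-1] if is_prefix else word
        clause = escape_term(stem) + ("*" if is_prefix else "")
        clauses.append(f"{field}:{clause}")
    return " AND ".join(clauses)


if __name__ == "__main__":
    print(to_solr_query("plane* Ear*"))
    print(to_solr_query("планета Земля"))
```

Only a trailing "*" is passed through, so a user cannot inject other query operators by accident.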
----- Message from tss@iki.fi ---------
I'll try to look into this when I have a bit more time..
On Wed, 2009-11-18 at 16:19 +0700, vuser1@test123.ru wrote:
Maybe I asked the wrong question. OK, does anybody use fts_squat for non-English emails? Can you find emails with a two-word query such as "planet Earth"? On my system it works only when both words are in the Latin alphabet; otherwise it returns nothing. With Latin words, it even finds emails containing both Latin and Russian letters (UTF-8 encoding). For non-Latin text, the query must consist of one word only.
Thanks for any ideas.
It looks like I encountered a bug or a misconfiguration. fts_squat search in subject and body works excellently for English mails. For non-English mail (in particular, Russian) it works only when the query consists of one word. Phrases of two or more words always return nothing. Example: a search for "planet" ("планета") returns results, a search for "Earth" ("Земля") also returns results, but "planet Earth" ("планета Земля") returns nothing. But there are emails containing the exact phrase "planet Earth". This problem occurs only for non-English queries, both when searching the subject and when searching the body. I tried the Horde 3.2 webmail and Thunderbird. I *turned the fts plugin off* and it correctly finds phrases with two or more Russian words! So the problem is squat. Is it a bug or a known config issue?
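To rule out the mail clients, the same search can be sent straight to the server. Below is a minimal sketch using Python's standard imaplib; it assumes the client sends the whole phrase as one BODY search key, which may not be exactly what Horde or Thunderbird do. The helper name is hypothetical, and the host, user and password in the usage comment are placeholders.

```python
import imaplib


def search_body_phrase(conn, phrase, charset="UTF-8"):
    """Run SEARCH CHARSET <charset> BODY <phrase> on an open mailbox.

    The phrase is sent as an IMAP literal, which is how imaplib lets a
    non-ASCII search string be transmitted. Returns the matching
    message sequence numbers as a list of bytes objects.
    """
    # imaplib appends the pending literal to the next command it sends.
    conn.literal = phrase.encode(charset)
    typ, data = conn.search(charset, "BODY")
    if typ != "OK":
        raise RuntimeError("SEARCH failed: %r" % (data,))
    return data[0].split() if data and data[0] else []


# Usage (placeholders, adjust to your own server and account):
#   conn = imaplib.IMAP4("imap.example.org")
#   conn.login("user", "password")
#   conn.select("INBOX", readonly=True)
#   print(search_body_phrase(conn, "планета"))
#   print(search_body_phrase(conn, "планета Земля"))
```

Running it once with the fts plugin enabled and once with it disabled should show whether the difference comes from the server or from the client.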
dovecot -n
----- End message from tss@iki.fi -----