dovecot-2.2: lib-fts: Update comment on tr29 rules.
dovecot at dovecot.org
Mon Aug 17 10:46:07 UTC 2015
details: http://hg.dovecot.org/dovecot-2.2/rev/0381695e303f
changeset: 18942:0381695e303f
user: Teemu Huovila <teemu.huovila at dovecot.fi>
date: Mon Aug 17 13:14:44 2015 +0300
description:
lib-fts: Update comment on tr29 rules.
diffstat:
src/lib-fts/fts-tokenizer-generic.c | 4 ++++
1 files changed, 4 insertions(+), 0 deletions(-)
diffs (14 lines):
diff -r 622ce8582a8e -r 0381695e303f src/lib-fts/fts-tokenizer-generic.c
--- a/src/lib-fts/fts-tokenizer-generic.c Mon Aug 17 12:56:25 2015 +0300
+++ b/src/lib-fts/fts-tokenizer-generic.c Mon Aug 17 13:14:44 2015 +0300
@@ -594,6 +594,10 @@
#29, but tailored for FTS purposes.
http://www.unicode.org/reports/tr29/
+ Note: The text of tr29 is a living standard, so it keeps
+ changing. In newer specs some characters are combined, like AHLetter
+ (ALetter | Hebrew_Letter) and MidNumLetQ (MidNumLet | Single_Quote).
+
Adaptions:
* No word boundary at Start-Of-Text or End-of-Text (Wb1 and WB2).
* Break just once, not before and after.
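
The combined classes named in the new comment can be read as simple unions of the older ones. Below is a minimal sketch of that reading, not the code in fts-tokenizer-generic.c: the enum wb_class, its members, and both helper functions are hypothetical names chosen for illustration, and only the class names in the comments come from the commit.

```c
#include <stdbool.h>

/* Hypothetical subset of TR29 Word_Break property values.
 * These identifiers are illustrative and are not taken from lib-fts. */
enum wb_class {
	WB_OTHER,
	WB_ALETTER,
	WB_HEBREW_LETTER,
	WB_MIDNUMLET,
	WB_SINGLE_QUOTE
};

/* AHLetter = (ALetter | Hebrew_Letter) */
static bool wb_is_ahletter(enum wb_class c)
{
	return c == WB_ALETTER || c == WB_HEBREW_LETTER;
}

/* MidNumLetQ = (MidNumLet | Single_Quote) */
static bool wb_is_midnumletq(enum wb_class c)
{
	return c == WB_MIDNUMLET || c == WB_SINGLE_QUOTE;
}
```

A rule written against a newer revision of the spec in terms of AHLetter or MidNumLetQ could then be checked with one helper call instead of two separate comparisons, which may make it easier to compare the code with whichever revision of tr29 is being read.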