dovecot-2.2: lib-fts: Update comment on tr29 rules.

dovecot at dovecot.org dovecot at dovecot.org
Mon Aug 17 10:46:07 UTC 2015


details:   http://hg.dovecot.org/dovecot-2.2/rev/0381695e303f
changeset: 18942:0381695e303f
user:      Teemu Huovila <teemu.huovila at dovecot.fi>
date:      Mon Aug 17 13:14:44 2015 +0300
description:
lib-fts: Update comment on tr29 rules.

diffstat:

 src/lib-fts/fts-tokenizer-generic.c |  4 ++++
 1 files changed, 4 insertions(+), 0 deletions(-)

diffs (14 lines):

diff -r 622ce8582a8e -r 0381695e303f src/lib-fts/fts-tokenizer-generic.c
--- a/src/lib-fts/fts-tokenizer-generic.c	Mon Aug 17 12:56:25 2015 +0300
+++ b/src/lib-fts/fts-tokenizer-generic.c	Mon Aug 17 13:14:44 2015 +0300
@@ -594,6 +594,10 @@
   #29, but tailored for FTS purposes.
   http://www.unicode.org/reports/tr29/
 
+  Note: The text of tr29 is a living standard, so it keeps
+  changing. In newer specs some characters are combined, like AHLetter
+  (ALetter | Hebrew_Letter) and MidNumLetQ (MidNumLet | Single_Quote).
+
   Adaptions:
   * No word boundary at Start-Of-Text or End-of-Text (Wb1 and WB2).
   * Break just once, not before and after.


More information about the dovecot-cvs mailing list