v2.3.11.3 solr plugin search via MUA fails to match accented ascii characters; cmd line exec of `doveadm fts lookup` PANICs (assertion failed)

PGNet Dev pgnet.dev at gmail.com
Mon Oct 19 18:00:20 EEST 2020


On 10/19/20 1:18 AM, John Fawcett wrote:
> I would recommend you to redo the tests after correcting the
> configuration. To be doubly sure you can include accented and unique non
> accented text in the same email and search for both. If the non accented
> text is found you know you've searching against the updated index and
> the fact that accented text is not found is not simply because the index
> updates are not visible.

temp changing,

	fts_solr = url=https://solr.example.com:8984/solr/dovecot/ use_libfts soft_commit=yes batch_size=250
-	fts_enforced = body
+	fts_enforced = yes
-	fts_filters = normalizer-icu snowball stopwords
+	fts_filters = lowercase snowball stopwords

now,

	doveadm fts rescan -u testuser at example.com
	doveadm index      -u testuser at example.com -q '*'

, no errors.

on receipt of a test email with

	subject: aausdfrhyétdwgyatrdf aausdfrhyetdwgyatrdf
	body:    aausdfrhyétdwgyatrdf aausdfrhyetdwgyatrdf

i see a solr auto-scan,

	2020-10-19 14:41:50.628 INFO  (searcherExecutor-15-thread-1-processing-x:dovecot) [   x:dovecot] o.a.s.c.SolrCore [dovecot]  Registered new searcher autowarm time: 1 ms
	2020-10-19 14:41:50.634 INFO  (qtp1533985074-18) [   x:dovecot] o.a.s.u.p.LogUpdateProcessorFactory [dovecot]  webapp=/solr path=/update params={}{commit=} 0 155
	2020-10-19 14:41:51.571 INFO  (qtp1533985074-24) [   x:dovecot] o.a.s.u.p.LogUpdateProcessorFactory [dovecot]  webapp=/solr path=/update params={}{add=[135785/c92f64f79f0d1ed01e6d5b314f04886c/testuser at example.com (1680991596381732864)]} 0 9

	==> /var/log/solr/solr_gc.log <==
	[2020-10-19T07:41:51.612-0700][146823.946s] GC(343) Pause Young (Normal) (G1 Evacuation Pause)
	[2020-10-19T07:41:51.613-0700][146823.947s] GC(343) Using 2 workers of 2 for evacuation
	[2020-10-19T07:41:51.635-0700][146823.969s] GC(343)   Pre Evacuate Collection Set: 0.3ms
	[2020-10-19T07:41:51.636-0700][146823.970s] GC(343)   Merge Heap Roots: 0.2ms
	[2020-10-19T07:41:51.636-0700][146823.970s] GC(343)   Evacuate Collection Set: 18.9ms
	[2020-10-19T07:41:51.637-0700][146823.971s] GC(343)   Post Evacuate Collection Set: 2.0ms
	[2020-10-19T07:41:51.637-0700][146823.971s] GC(343)   Other: 1.5ms
	[2020-10-19T07:41:51.637-0700][146823.971s] GC(343) Eden regions: 238->0(244)
	[2020-10-19T07:41:51.638-0700][146823.972s] GC(343) Survivor regions: 4->2(31)
	[2020-10-19T07:41:51.638-0700][146823.972s] GC(343) Old regions: 189->189
	[2020-10-19T07:41:51.638-0700][146823.973s] GC(343) Archive regions: 2->2
	[2020-10-19T07:41:51.639-0700][146823.973s] GC(343) Humongous regions: 10->9
	[2020-10-19T07:41:51.639-0700][146823.973s] GC(343) Metaspace: 61564K(78028K)->61564K(78028K) NonClass: 55348K(65024K)->55348K(65024K) Class: 6216K(13004K)->6216K(13004K)
	[2020-10-19T07:41:51.640-0700][146823.974s] GC(343) Pause Young (Normal) (G1 Evacuation Pause) 441M->200M(512M) 27.372ms
	[2020-10-19T07:41:51.640-0700][146823.974s] GC(343) User=0.01s Sys=0.01s Real=0.03s

	==> /var/log/solr/solr.log <==
	2020-10-19 14:41:51.702 INFO  (searcherExecutor-15-thread-1-processing-x:dovecot) [   x:dovecot] o.a.s.c.SolrCore [dovecot]  Registered new searcher autowarm time: 0 ms
	2020-10-19 14:41:51.705 INFO  (qtp1533985074-18) [   x:dovecot] o.a.s.u.p.LogUpdateProcessorFactory [dovecot]  webapp=/solr path=/update params={}{commit=} 0 127


search in TBird

	subject: aausdfrhyetdwgyatrdf  => FOUND
	body:    aausdfrhyétdwgyatrdf  => FOUND

	subject: aausdfrhyetdwgyatrdf  => FOUND
	body:    aausdfrhyétdwgyatrdf  => (emtpy)

on header search, I'm _not_ seeing any additional activity in solr.log

so, either i'm looking in the wrong place, haven't turned on appropriate logging, or i'm still not searching via solr ...

separately,

	doveadm fts lookup ...

still panics; Aki's bug will hopefully deal with that


More information about the dovecot mailing list