Dovecot - FTS Solr: disk usage & position information?
Alessio Cecchi
alessio at skye.it
Thu Sep 2 17:26:40 EEST 2021
Hi Vincent,
thanks for your investigations!
On 01/09/21 11:27, Vincent Brillault wrote:
> Dear all,
>
> Just a status update, in case this can help others.
>
> We went forward and disabled position information indexing, then
> re-indexed our mail data (over a couple of days to avoid
> overloading the systems). Before the re-indexing we had 1.33 TiB in
> our Solr indexes. After re-indexing, we had only 542 GiB: that's a
> 60% reduction in the storage requirements for our FTS indexes :)
Does this optimization also reduce the RAM requirements on the Solr server?
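
For reference, the numbers quoted above can be checked with a few lines of Python. This is only a sketch of the arithmetic, assuming binary units (1 TiB = 1024 GiB); the variable names are mine, not taken from the thread.

```python
# Sanity check of the index sizes quoted above (binary units assumed).
GIB_PER_TIB = 1024

before_gib = 1.33 * GIB_PER_TIB   # index size before re-indexing, in GiB
after_gib = 542.0                 # index size after re-indexing, in GiB

remaining = after_gib / before_gib   # fraction of the old size still used
reduction = 1.0 - remaining          # fraction of disk space saved

print(f"before: {before_gib:.0f} GiB, after: {after_gib:.0f} GiB")
print(f"remaining: {remaining:.1%}, reduction: {reduction:.1%}")
```

So the new indexes take roughly 40% of the old size, i.e. a saving of about 60%.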
>
> So far, our users haven't reported any issues or measurable differences
> concerning the quality of the FTS. From further
> debugging, as discussed on the solr-user mailing list
> (https://lists.apache.org/thread.html/rcdf8bb97be0839e57928ad5fa34501ec8a73392c11248db91206bc33%40%3Cusers.solr.apache.org%3E),
> I've come to the conclusion that, with the current integration between
> Dovecot and Solr (especially the fact that `"` is escaped), it's impossible
> to trigger phrase queries from user queries as long as
> autoGeneratePhraseQueries is false.
>
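
To make the point about escaping concrete, here is a rough Python sketch of my understanding. It is not Dovecot's code: `escape_term` and `count_unescaped_quotes` are hypothetical helpers, and the list of special characters is an assumption based on the usual Lucene/Solr query syntax. The idea is that a phrase query needs a pair of unescaped double quotes, and once the quotes typed by the user are backslash-escaped, none are left.

```python
# Hypothetical illustration only; this is NOT how Dovecot is implemented.
# Characters assumed to be special in the Lucene/Solr query syntax.
SPECIAL = r'+-&|!(){}[]^"~*?:\/'


def escape_term(text: str) -> str:
    """Backslash-escape every special character in the user input."""
    return "".join("\\" + ch if ch in SPECIAL else ch for ch in text)


def count_unescaped_quotes(query: str) -> int:
    """Count double quotes that are not preceded by an escaping backslash."""
    count = 0
    i = 0
    while i < len(query):
        if query[i] == "\\":
            i += 2          # skip the backslash and the character it escapes
            continue
        if query[i] == '"':
            count += 1
        i += 1
    return count


user_input = '"hello world"'        # what a user might type in a mail client
escaped = escape_term(user_input)   # what would be sent after escaping

print(escaped)
print(count_unescaped_quotes(user_input), count_unescaped_quotes(escaped))
```

With the quotes escaped, the query parser sees only literal characters, so an explicit phrase can never be requested, and with autoGeneratePhraseQueries set to false no phrase query is generated implicitly either.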
> I've attached the schema.xml and solrconfig.xml we are now using with
> Solr 8.6.0, in case there is any interest from others. Let me know if
> you prefer an MR to update the XML files present in
> https://github.com/dovecot/core/tree/master/doc.
Do the attached schema and config files also work with Solr 7.7.0? Since
Dovecot provides a schema and config for 7.7.0, a patch based on those
would be useful for many of us.
Thanks
--
Alessio Cecchi
Postmaster @ http://www.qboxmail.it
https://www.linkedin.com/in/alessice