fts_encoder

John Fawcett john at voipsupport.it
Mon Feb 8 03:03:00 EET 2021


On 07/02/2021 18:51, Joan Moreau wrote:
>
> more info: the function fts_parser_script_more in
> plugins/fts/fts-parser.c properly reads the output of the script
>
> still, the data is not sent to the FTS plugins (xapian or any other)
>
>
>
> On 2021-02-07 17:37, Joan Moreau wrote:
>
>> more info: I am running the dovecot git version
>>
>>
>> On 2021-02-07 17:15, Joan Moreau wrote:
>>
>>     a bit more on this: after adding logging to decode2text.sh, I can
>>     see that pdftotext outputs the right data, but that data is /not/
>>     transmitted to the fts plugin for indexing (only the original pdf
>>     code is)
>>
>>
>>
>>     On 2021-02-07 17:00, Joan Moreau wrote:
>>
>>         Hello,
>>
>>         I am trying to deal properly with email attachments in
>>         the fts-xapian plugin.
>>
>>         I tried the default script with a PDF file.
>>
>>         The data I receive in the fts plugin part ("xxx_build_more")
>>         is the original document, not the output of pdftotext.
>>
>>         Is there anything I am missing?
>>
>>         Here is my config:
>>
>>
>>         plugin {
>>                 plugin = fts_xapian managesieve sieve
>>
>>                 fts = xapian
>>                 fts_xapian = partial=2 full=20 verbose=1 attachments=1
>>
>>                 fts_autoindex = yes
>>                 fts_enforced = yes
>>                 fts_autoindex_exclude = \Trash
>>                 fts_autoindex_exclude2 = \Drafts
>>
>>                 fts_decoder = decode2text
>>
>>                 sieve = /data/mail/%d/%n/local.sieve
>>                 sieve_after = /data/mail/after.sieve
>>                 sieve_before = /data/mail/before.sieve
>>                 sieve_dir = /data/mail/%d/%n/sieve
>>                 sieve_global_dir = /data/mail
>>                 sieve_global_path = /data/mail/global.sieve
>>         }
>>
>>         ...
>>
>>         service decode2text {
>>            executable = script /usr/libexec/dovecot/decode2text.sh
>>            user = dovecot
>>            unix_listener decode2text {
>>              mode = 0666
>>            }
>>         }
>>
>>
>>         Thank you
>>
>>
Joan

I'm not sure I can be much use for xapian, but looking at your
configuration I did notice some differences with the documentation. I
don't know if they are relevant to the issue you're seeing.

First of all I don't see the

    mail_plugins = fts
    plugin = fts

settings, which are both mentioned in the xapian documentation.

Also, the documentation states that attachments=1 can only index text
attachments. Maybe you should be using attachments=0 and let fts_decoder
handle the attachments.
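
As a rough sketch of what I mean (untested on my side): the setting
names below are the ones I understand the fts-xapian README to use, and
the partial/full/verbose values are simply copied from your post. That
attachments=0 hands attachments over to the decoder script is my
assumption, not something I have verified.

```
# Sketch only, not a verified working configuration.
# Load the generic fts plugin as well as the xapian backend.
mail_plugins = $mail_plugins fts fts_xapian

plugin {
  fts = xapian
  # attachments=0: assumption that the plugin then leaves attachment
  # handling to the fts_decoder script instead of indexing the raw part.
  fts_xapian = partial=2 full=20 verbose=1 attachments=0

  # Must match the name of the "service decode2text" block and its
  # unix_listener, as already present in your config.
  fts_decoder = decode2text
}
```

Your existing service decode2text block would stay as it is.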

Failing that, I can only advise turning on some debugging and seeing
what that brings.
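
For the debugging, a minimal starting point would be Dovecot's general
mail debug setting. This is only a sketch, assuming you can tolerate the
extra log volume while testing. I don't know whether it will show the
decoder output specifically.

```
# Sketch only: enables verbose debug logging for mail processes,
# including plugin loading. Switch it off again after testing.
mail_debug = yes
```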

best regards

John

