Problem with tika
Arjen Heidinga
dexter at beetjevreemd.nl
Fri Mar 6 10:52:48 EET 2020
Hello all,
For some time now we've been using solr as a search engine (working
great). I have added tika for searching inside documents, but it
keeps crashing when indexing. Indexing mails as they arrive works
fine, but a reindex of all mail consistently crashes with the
stacktrace below.
When I observe the packets with wireshark I see an HTTP flow going to
tika and suddenly, midway through a document, dovecot (or the server)
sends a
RST. Sometimes after thousands of documents, sometimes after a few.
Usually after a few hundred.
There appear to be fewer errors using http than https.
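One way to narrow down which side drops the connection would be to take
dovecot out of the picture and send documents to tika directly. Below is
a minimal sketch in Python, assuming the tika server accepts a document
via HTTP PUT and returns the extracted text (as the stock tika-server
does on its /tika endpoint). The function names and the example URL in
the usage note are hypothetical placeholders, not taken from this setup.

```python
import http.client
import urllib.error
import urllib.request


def build_tika_request(url, payload, content_type="application/octet-stream"):
    """Build a PUT request carrying one document for text extraction."""
    return urllib.request.Request(
        url,
        data=payload,
        method="PUT",
        headers={"Content-Type": content_type, "Accept": "text/plain"},
    )


def probe(url, payloads, timeout=30):
    """Send each payload in turn and return (successes, failures).

    failures is a list of (index, error string), so a reset midway
    through a run is reported with the position of the document.
    """
    ok, failed = 0, []
    for i, payload in enumerate(payloads):
        try:
            req = build_tika_request(url, payload)
            with urllib.request.urlopen(req, timeout=timeout) as resp:
                resp.read()  # drain the extracted text
            ok += 1
        except (urllib.error.URLError, http.client.HTTPException, OSError) as exc:
            failed.append((i, str(exc)))
    return ok, failed
```

Calling something like probe("http://localhost:9998/tika", docs) with a
few hundred attachments, once over http and once over https, might show
whether the resets also occur without dovecot in between.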
Relevant config:
OS: CentOS6, fully updated
plugin {
fts = solr
batch_size = 1
fts_solr = url=https://username:password@solr-01.vevida.net:443/solr/dovecot/
#fts_tika = https://solr-01.vevida.net:443/tika/
batch_size = 1000
fts_autoindex=yes
soft_commit=no
}
# dovecot --version
2.3.9.3 (9f41b88fa)
# Configure options:
--docdir=%{_docdir}/dovecot \
--disable-static \
--with-nss \
--with-shadow \
--with-pam \
--with-gssapi=plugin \
--with-ldap=plugin \
--with-sql=plugin \
--with-pgsql \
--with-sqlite \
--with-zlib \
--with-bzlib \
--with-lzma \
--with-libcap \
--with-ssl=openssl \
--with-ssldir=%{ssldir} \
--with-solr \
--with-docs
# It is compiled against the latest openssl
# Tika and Solr: Both latest versions.
# Stacktrace:
doveadm(info at samenmetrenske.nl): Info: Sent: Caching mails seq=1..161
doveadm(info at samenmetrenske.nl): Panic: file http-client-request.c: line
1173 (http_client_request_send_more): assertion failed:
(req->payload_input != NULL)
doveadm(info at xxxxxxxxxxxxxxxxxxx.x): Error: Raw backtrace:
/usr/lib64/dovecot/libdovecot.so.0(backtrace_append+0x2f)
[0x7f95d805acbf] ->
/usr/lib64/dovecot/libdovecot.so.0(backtrace_get+0x26) [0x7f95d805add6]
-> /usr/lib64/dovecot/libdovecot.so.0(+0xe90ba) [0x7f95d80660ba] ->
/usr/lib64/dovecot/libdovecot.so.0(+0xe9161) [0x7f95d8066161] ->
/usr/lib64/dovecot/libdovecot.so.0(+0x41158) [0x7f95d7fbe158] ->
/usr/lib64/dovecot/libdovecot.so.0(http_client_request_send_more+0x424)
[0x7f95d8005094] ->
/usr/lib64/dovecot/libdovecot.so.0(http_client_connection_output+0x11a)
[0x7f95d800a24a] ->
/usr/lib64/dovecot/libssl_iostream_openssl.so(+0x8f6a) [0x7f95d57a2f6a]
-> /usr/lib64/dovecot/libdovecot.so.0(+0x114483) [0x7f95d8091483] ->
/usr/lib64/dovecot/libdovecot.so.0(io_loop_call_io+0x61)
[0x7f95d807e581] ->
/usr/lib64/dovecot/libdovecot.so.0(io_loop_handler_run_internal+0xdc)
[0x7f95d808076c] ->
/usr/lib64/dovecot/libdovecot.so.0(io_loop_handler_run+0x5c)
[0x7f95d807e67c] -> /usr/lib64/dovecot/libdovecot.so.0(io_loop_run+0x38)
[0x7f95d807e8c8] -> /usr/lib64/dovecot/libdovecot.so.0(+0x89105)
[0x7f95d8006105] ->
/usr/lib64/dovecot/libdovecot.so.0(http_client_request_send_payload+0x1f)
[0x7f95d80063cf] -> /usr/lib64/dovecot/lib20_fts_plugin.so(+0xd31d)
[0x7f95d6ad931d] ->
/usr/lib64/dovecot/lib20_fts_plugin.so(fts_parser_more+0x1a)
[0x7f95d6ad83ca] ->
/usr/lib64/dovecot/lib20_fts_plugin.so(fts_build_mail+0x761)
[0x7f95d6ad6401] -> /usr/lib64/dovecot/lib20_fts_plugin.so(+0x114ca)
[0x7f95d6add4ca] ->
/usr/lib64/dovecot/libdovecot-storage.so.0(mail_precache+0x2a)
[0x7f95d835ab4a] -> doveadm(+0x31e75) [0x55e7c1052e75] ->
doveadm(+0x321fb) [0x55e7c10531fb] -> doveadm(+0x2c321) [0x55e7c104d321]
-> doveadm(+0x2c577) [0x55e7c104d577] ->
doveadm(doveadm_cmd_ver2_to_mail_cmd_wrapper+0x1e8) [0x55e7c104ec38] ->
doveadm(doveadm_cmd_run_ver2+0x52e) [0x55e7c105fafe] ->
doveadm(doveadm_cmd_try_run_ver2+0x37) [0x55e7c105fb97] ->
doveadm(main+0x21a) [0x55e7c1062aca] ->
/lib64/libc.so.6(__libc_start_main+0x100) [0x7f95d7983d20] ->
doveadm(+0x1c479) [0x55e7c103d479]
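
The raw backtrace is hard to read once the mail client has wrapped it.
Below is a minimal sketch in Python for splitting such a trace into
frames, assuming frames are separated by " -> " and each one has the form
object(symbol+0xoffset) [0xaddress], as in the trace above. The function
name is illustrative only.

```python
import re

# One frame: object(symbol+0xoffset) [0xaddress]; symbol may be empty.
FRAME_RE = re.compile(
    r"(?P<object>\S+?)\((?P<symbol>[^+()]*)\+(?P<offset>0x[0-9a-f]+)\)\s*"
    r"\[(?P<address>0x[0-9a-f]+)\]"
)


def parse_backtrace(text):
    """Return one dict per frame, in the order they appear in the trace."""
    joined = " ".join(text.split())  # undo the line wrapping
    frames = []
    for part in joined.split(" -> "):
        match = FRAME_RE.search(part)
        if match:
            frames.append(match.groupdict())
    return frames
```

Frames with an empty symbol (for example doveadm(+0x31e75)) are the ones
that would need debug symbols to resolve.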