Hi,
I played around a bit and tried to get tika to run with dovecot. In the end I was at least a bit successful. However, when I tried to index my inbox with "doveadm index -A '*'"
I get: doveadm(infoomatic): Error: fts_tika: PUT http://localhost:8081/tika failed: 500 Server Error doveadm(infoomatic): Warning: I/O leak: 0x7f4f697bb170 (line 127, fd 24) doveadm(infoomatic): Panic: file ioloop-iolist.c: line 22: unreached doveadm(infoomatic): Error: Raw backtrace: /usr/lib/dovecot/libdovecot.so.0(+0x67f30) [0x7f4f697e8f30] -> /usr/lib/dovecot/libdovecot.so.0(default_fatal_handler+0x2a) [0x7f4f697e8f9a] -> /usr/lib/dovecot/libdovecot.so.0(i_fatal+0) [0x7f4f6979c515] -> /usr/lib/dovecot/libdovecot.so.0(+0x79b0d) [0x7f4f697fab0d] -> /usr/lib/dovecot/libdovecot.so.0(io_loop_handle_add+0x3b) [0x7f4f697fb2ab] -> /usr/lib/dovecot/libdovecot.so.0(+0x78cf1) [0x7f4f697f9cf1] -> /usr/lib/dovecot/libdovecot.so.0(io_loop_move_io+0x3e) [0x7f4f697fa93e] -> /usr/lib/dovecot/libdovecot.so.0(connection_switch_ioloop+0x14) [0x7f4f697e6df4] -> /usr/lib/dovecot/libdovecot.so.0(http_client_switch_ioloop+0x28) [0x7f4f697bf8e8] -> /usr/lib/dovecot/libdovecot.so.0(+0x36a86) [0x7f4f697b7a86] -> /usr/lib/dovecot/modules/lib21_fts_solr_plugin.so(solr_connection_post_more+0x29) [0x7f4f68d9f069] -> /usr/lib/dovecot/modules/lib21_fts_solr_plugin.so(+0x3547) [0x7f4f68d9b547] -> /usr/lib/dovecot/modules/lib20_fts_plugin.so(fts_build_mail+0x35c) [0x7f4f68faaffc] -> /usr/lib/dovecot/modules/lib20_fts_plugin.so(+0xd3ee) [0x7f4f68fb03ee] -> /usr/lib/dovecot/libdovecot-storage.so.0(mail_precache+0x19) [0x7f4f69ac3c39] -> doveadm() [0x419501] -> doveadm() [0x4153c3] -> doveadm(doveadm_mail_try_run+0x48c) [0x4162ac] -> doveadm(main+0x3d8) [0x412588] -> /lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xf5) [0x7f4f693dcec5] -> doveadm() [0x4127a4] Aborted (core dumped)
The last messages of the tika-server: Nov 04, 2014 10:18:17 PM org.apache.tika.server.TikaResource logRequest INFO: tika (video/x-ms-wmv) Nov 04, 2014 10:18:17 PM org.apache.tika.server.TikaResource$3 write WARNING: tika: Text extraction failed org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.server.TikaResource$1@44742f73 Nov 04, 2014 10:18:17 PM org.apache.cxf.jaxrs.utils.JAXRSUtils logMessageHandlerProblem SEVERE: Problem with writing the data, class org.apache.tika.server.TikaResource$3, ContentType: text/plain Nov 04, 2014 10:18:17 PM org.apache.cxf.jaxrs.impl.WebApplicationExceptionMapper toResponse WARNING: javax.ws.rs.WebApplicationException: HTTP 415 Unsupported Media Type
dovecot -n: # 2.2.15: /etc/dovecot/dovecot.conf # OS: Linux 3.13.0-39-generic x86_64 Ubuntu 14.04.1 LTS ext4 auth_username_format = %n disable_plaintext_auth = no mail_gid = vmail mail_home = /var/lib/vmail/home/%u mail_location = mdbox:/var/lib/vmail/mail/%u mail_plugins = fts fts_solr mail_uid = vmail managesieve_notify_capability = mailto managesieve_sieve_capability = fileinto reject envelope encoded-character vacation subaddress comparator-i;ascii-numeric relational regex imap4flags copy include variables body enotify environment mailbox date ihave duplicate imapflags mdbox_rotate_size = 32 M namespace inbox { inbox = yes location = mailbox Drafts { special_use = \Drafts } mailbox Junk { special_use = \Junk } mailbox Sent { special_use = \Sent } mailbox "Sent Messages" { special_use = \Sent } mailbox Trash { special_use = \Trash } prefix = } passdb { args = /etc/dovecot/dovecot-ldap.conf.ext driver = ldap } plugin { fts = solr fts_autoindex = yes fts_solr = url=http://localhost:8080/solr/ break-imap-search fts_tika = http://localhost:8081/tika sieve = ~/.dovecot.sieve sieve_dir = ~/sieve sieve_extensions = +imapflags } protocols = " imap sieve" service auth { unix_listener /var/spool/postfix/private/auth { group = postfix mode = 0660 user = postfix } unix_listener auth-userdb { group = vmail user = vmail } } service managesieve-login { inet_listener sieve_deprecated { port = 2000 } } ssl_cert =
Has anyone hints? Or do the devs want some more information for debugging?
best regards,
infoomatic