On 5/23/22 8:16 PM, Michael Slusarz wrote:
Unfortunately, Tika has not yet been converted to events/categories with the ability to more granularly enable debugging just for this component.
Aha! Thx.
It's probably easier to just look at tika's debugging logs. The default log level (at least in Tika 2.3) will output an INFO line for every attachment indexed:> INFO [qtp235162442-22] 16:15:19,905 org.apache.tika.server.core.resource.TikaResource /tika (text/calendar)
I've been poking at slf4j; thought that's what tika 2x uses. So far, haven't been able to get a peep out of it. :-/ Need to go a'googlin'
OTOH, with mail_debug = on
, I see these,
... 2022-05-23 20:49:37 indexer-worker(myuser@example.com)dOdUCaArjGIKlwEA+IOfAw:mFdtKKErjGIMlwEA+IOfAw: Debug: http-client: request [Req1: PUT http://127.0.0.1/tika/]: Submitted (requests left=1) 2022-05-23 20:49:37 indexer-worker(myuser@example.com)dOdUCaArjGIKlwEA+IOfAw:mFdtKKErjGIMlwEA+IOfAw: Debug: http-client: request [Req1: PUT http://127.0.0.1/tika/]: Waiting for request to finish 2022-05-23 20:49:37 indexer-worker(myuser@example.com)dOdUCaArjGIKlwEA+IOfAw:mFdtKKErjGIMlwEA+IOfAw: Debug: http-client: queue http://127.0.0.1:9998: Connection to peer 127.0.0.1:9998 claimed request [Req1: PUT http://127.0.0.1:9998/tika/] 2022-05-23 20:49:37 indexer-worker(myuser@example.com)dOdUCaArjGIKlwEA+IOfAw:mFdtKKErjGIMlwEA+IOfAw: Debug: http-client: conn 127.0.0.1:9998 [1]: Claimed request [Req1: PUT http://127.0.0.1:9998/tika/] 2022-05-23 20:49:37 indexer-worker(myuser@example.com)dOdUCaArjGIKlwEA+IOfAw:mFdtKKErjGIMlwEA+IOfAw: Debug: http-client: request [Req1: PUT http://127.0.0.1/tika/]: Sent header 2022-05-23 20:49:37 indexer-worker(myuser@example.com)dOdUCaArjGIKlwEA+IOfAw:mFdtKKErjGIMlwEA+IOfAw: Debug: http-client: request [Req1: PUT http://127.0.0.1/tika/]: Send more (sent 5562, buffered=5570) 2022-05-23 20:49:37 indexer-worker(myuser@example.com)dOdUCaArjGIKlwEA+IOfAw:mFdtKKErjGIMlwEA+IOfAw: Debug: http-client: request [Req1: PUT http://127.0.0.1/tika/]: Waiting for request to finish 2022-05-23 20:49:37 indexer-worker(myuser@example.com)dOdUCaArjGIKlwEA+IOfAw:mFdtKKErjGIMlwEA+IOfAw: ...
, which looks promising.
But, so far, body Search exec, from within TBird, is not returning anything that I know is in that PDF. Which is the 'problem' I'm trying to log in order to debug ...