soon-to-be-next-release of Tika,
i saw that was coming
you can turn off throwing zero-byte file exceptions via the config
can you point to the config toggle, or docs, in https://github.com/apache/tika ?
The exceptions should be harmless and you can safely ignore them.
including the SEVERE notice?
Some users need to know that there's a zero-byte file, hence the default behavior. It can also be useful during parser development, for finding files whose embedded files come out as zero-byte files, since sometimes things go wrong in the container parser.
iiuc, the exception's thrown WHEN input's a zero-byte file.
in this dovecot <-> tika case, that only occurs when the attachment sent is a .eml, not with any other attachment type (so far)
is current-release tika known/verified to handle .eml correctly (iirc, there were some issues a while ago ...), i.e. without mistakenly munging the input size to zero?
if it's demonstrated OK, then it's likely Dovecot mistakenly sending no input in the .eml-attachment case, no?
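
one way to separate the two cases is to look at what actually arrives on the wire before Tika touches it. below is a minimal sketch, not a Tika or Dovecot tool: a stand-in HTTP listener that records the body size of each PUT it receives. it assumes Dovecot submits attachments as HTTP PUT requests to the configured Tika URL, and that port 9998 is the one in use; `BodySizeLogger` and `start_logger` are hypothetical names made up for this example.

```python
import http.server
import threading


class BodySizeLogger(http.server.BaseHTTPRequestHandler):
    """Stand-in endpoint that records how many body bytes each PUT carries."""

    protocol_version = "HTTP/1.1"  # lets the base class answer Expect: 100-continue
    seen = []  # (path, content_type, body_bytes) per request

    def _read_body(self):
        # Handle chunked uploads as well as plain Content-Length ones,
        # so a chunked request is not misreported as empty.
        if self.headers.get("Transfer-Encoding", "").lower() == "chunked":
            body = b""
            while True:
                size_line = self.rfile.readline().strip()
                chunk_size = int(size_line.split(b";")[0], 16)
                if chunk_size == 0:
                    # Skip optional trailer lines up to the blank line.
                    while self.rfile.readline().strip():
                        pass
                    return body
                body += self.rfile.read(chunk_size)
                self.rfile.readline()  # CRLF that terminates the chunk
        length = int(self.headers.get("Content-Length", 0))
        return self.rfile.read(length)

    def do_PUT(self):
        body = self._read_body()
        record = (self.path, self.headers.get("Content-Type"), len(body))
        self.seen.append(record)
        print("PUT %s type=%s bytes=%d" % record)
        # Reply with an empty 200 so the client does not hang or retry.
        self.send_response(200)
        self.send_header("Content-Type", "text/plain")
        self.send_header("Content-Length", "0")
        self.end_headers()

    def log_message(self, fmt, *args):
        pass  # silence the default per-request access log


def start_logger(host="127.0.0.1", port=9998):
    """Start the listener in a background thread and return the server."""
    server = http.server.ThreadingHTTPServer((host, port), BodySizeLogger)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    return server
```

to use it, stop Tika, call `start_logger()` from an interactive Python session on the same host/port, and re-send the message with the .eml attachment. if the logged line shows `bytes=0`, the empty input is coming from the Dovecot side; if it shows a non-zero size, the problem is more likely in how Tika handles the .eml.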