[Dovecot] FTS Plugin design
Timo Sirainen
tss at iki.fi
Tue Apr 21 19:32:32 EEST 2009
On Apr 21, 2009, at 6:25 AM, Rui Carneiro wrote:
> Anyone know some good libraries to handle the content of files like
> pdf,
> ppt, doc, etc? I am already indexing attachments all I need now is
> extract
> the text of them.
I've no idea, but you could at least look at some of the other full
text search engines. I remember them advertising indexing support for
all kinds of formats. Maybe they're using some specific library or
maybe it would be easy to extract their parsing code.
More information about the dovecot
mailing list