[Dovecot] Architecture for large Dovecot cluster

Joseph Tam jtam.home at gmail.com
Tue Jan 28 03:50:40 EET 2014


Sven Hartge <sven at svenhartge.de> wrote:

> Interesting datapoint: NetApp Deduplication did only recover about 1% of
> storage space with mdbox-based mail storage, while on an maildir-based
> mail storage, the rate was about 15%. (This was tested with a copy of
> real user data, so is accurate for my workload.)

Just a guess, but I expect the difference is because NetApp de-dupes by
checksumming blocks and mark whole blocks as duplicates if they have
the same checksum.

The message body has the same block offset in maildir (i.e. the start of
a message is at byte 0), whereas mdbox might align message body anywhere
in a block, so you might have 512 different block configurations for
the same message.

I don't know whether message alignment would be a worthwhile optimization
for mdbox.

Joseph Tam <jtam.home at gmail.com>


More information about the dovecot mailing list