OT - Finding/removing duplicate emails - WAS: Re: [Dovecot] dovecot/lmtp munmap()-ing a lot

Steffen Kaiser skdovecot at smail.inf.fh-brs.de
Tue Jun 10 13:17:25 UTC 2014


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

On Tue, 10 Jun 2014, Charles Marcus wrote:
> On 6/9/2014 5:44 PM, Ralf Hildebrandt <r at sys4.de> wrote:
>> That's probably the problem here. The user had LOTS of (duplicate!)
>> mails in his inbox.
>
> Anyone ever found a reliable way to do this?
>
> It sure would be nice if dovecot could perform this on a per account and/or 
> per maildir/mailbox case with a simple doveadm command...

The basic question is: what is a duplicate?

I spot 100% duplicates within the same Maildir mailbox with a script 
similiar to "fdupes" http://linux.die.net/man/1/fdupes .
Because an user may copy messages around, I scan one mailbox at a time.

For some rare cases, where I merge two accounts, I use a script, that 
looks for the message id in one account and removes all messages with the 
same id in the other account. Than I merge the Maildirs.

However, neither script I would call general enough for automatic 
processing.

- -- 
Steffen Kaiser
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.11 (GNU/Linux)

iQEVAwUBU5cFZXz1H7kL/d9rAQK/ogf/YWmoJBc7tg5Wsnnz2FPHcxIrnC3YZD2b
FXSFsCm60Gc2eyqW2zti7bNLOzZShcIUsYeRteV4lyC0iIcDD6QV13hc50O3LlPx
L31kffgtmzIi1P0nQMkiIepbm75e0Rj+4XaaYSaEY1GKSMP6MDUhoBHLPwXS/qaK
IahX1ALPVt9gE4SBf9iZirMfHssLJAQvUoGHf6PJLPzWPMUgUV1bY+2U07pgEBh3
4Iaq518zDIKxPp3hWj8d0AuTuErC3xh5Abdcs7x60cUFIoLIIfC7DOszDpA0OkUv
Tbc7cOS2sPbP5B0n8k4J28H9h6tlLlUxIT144TUDy9bjbuKVP0FxhA==
=CwgK
-----END PGP SIGNATURE-----


More information about the dovecot mailing list