[Dovecot] Removing Duplicates

Sabahattin Gucukoglu mail at sabahattin-gucukoglu.com
Tue Mar 23 15:24:51 EET 2010


On 14 Mar 2010, at 11:41, Leonardo Rodrigues wrote:
Em 14/03/2010 08:21, Sabahattin Gucukoglu escreveu:
>> I am starting fresh with a local repository of mails, which almost certainly have duplicates in them.  I am going to use maildirs, and ensure all mails are input with CRLFs.
>> 
>> The question is: does anybody know how I can find and remove duplicates, either while injecting mail with IMAP, or afterward?  I can use tools to find duplicate Message-IDs, but don't know of a way to remove duplicates in mailboxes that are already imported as opposed to incoming mail.  Perhaps there is a way to use the IMAP protocol for this?

>>  i've used console tool named fdupes to find duplicate messages on Maildirs. That's done directly on the filesystem, there's no IMAP or dovecot involved.
> 
Saved about 200M in one particularly large mailbox.  Thanks!

Thanks to others for their suggestions, now working with delIMAPdups since I have mails (not many, but a few) which have identical content and are only different in their Content-Type header lines.  One copy will have the declaration on one line, the other has its declarations folded across multiple lines for each parameter.  Any idea why *that* might be?

Cheers,
Sabahattin

-------------- next part --------------
A non-text attachment was scrubbed...
Name: smime.p7s
Type: application/pkcs7-signature
Size: 2655 bytes
Desc: not available
Url : http://dovecot.org/pipermail/dovecot/attachments/20100323/65dc9e5c/attachment.bin 


More information about the dovecot mailing list