On Tue, 2010-11-16 at 12:01 +0200, Axel Thimm wrote:
I checked the files and the mentioned offsets are one line off the next from_ line. It looks like a content-length mismatch.
They also seem to mix CR+LF and simple LF endings within the same mail. For example an otherwise CR+LF encoded mail would have a few headers w/o CR at the bottom inserted (by dovecot?). Maybe the content-length computation was therefore a few lines off.
I can probably salvage these mboxes by grepping out the content-length header, but I wonder why the content-length header are off.
Oh, didn't read this message before replying / checking the files :) Yeah, you're right. It's the combination of CRLF + wrong Content-Length: headers. Normally broken Content-Length: value is detected and fixed, but I guess there's a bug with CRLF line feeds.