2.3.1 Replication is throwing scary errors

Remko Lodder remko at FreeBSD.org
Thu Apr 5 23:54:35 EEST 2018



> On 4 Apr 2018, at 01:34, Reuben Farrelly <reuben-dovecot at reub.net> wrote:
> 
> Hi,
> 
>> ------------------------------
>> Message: 2
>> Date: Mon, 2 Apr 2018 22:06:07 +0200
>> From: Michael Grimm <trashcan at ellael.org>
>> To: Dovecot Mailing List <dovecot at dovecot.org>
>> Subject: 2.3.1 Replication is throwing scary errors
>> Message-ID: <29998016-D62F-4348-93D1-613B13DA90DB at ellael.org>
>> Content-Type: text/plain;	charset=utf-8
>> Hi
>> [This is Dovecot 2.3.1 at FreeBSD STABLE-11.1 running in two jails at distinct servers.]
>> I did upgrade from 2.2.35 to 2.3.1 today, and I do become pounded by error messages at server1 (and vice versa at server2) as follows:
>> 	| Apr  2 17:12:18 <mail.err> server1.lan dovecot: doveadm: Error: dsync(server2.lan): I/O has stalled, \
>> 		no activity for 600 seconds (last sent=mail_change, last recv=mail_change (EOL))
>> 	| Apr  2 17:12:18 <mail.err> server1.lan dovecot: doveadm: Error: Timeout during state=sync_mails \
>> 		(send=changes recv=mail_requests)
>> 	[?]
>> 	| Apr  2 18:59:03 <mail.err> server1.lan dovecot: doveadm: Error: dsync(server2.lan): I/O has stalled, \
>> 		no activity for 600 seconds (last sent=mail, last recv=mail (EOL))
>> 	| Apr  2 18:59:03 <mail.err> server1.lan dovecot: doveadm: Error: Timeout during state=sync_mails \
>> 		(send=mails recv=recv_last_common)
>> I cannot see in my personal account any missing replications, *but* I haven't tested this thoroughly enough. I do have customers being serviced at these productive servers, *thus* I'm back to 2.2.35 until I do understand or have learned what is going on.
>> Any ideas/feedback?
>> FYI: I haven't seen such errors before. Replication has been working for years now, without any glitches at all.
>> Regards,
>> Michael
> 
> It's not just you.  This issue hit me recently, and it was impacting replication noticeably.  I am following git master-2.3 .
> 
> 

I am seeing the same as Michael Grimm also on FreeBSD-11.
You’ll also notice in doveadm replicator status ‘*’ that the failed flag is raised for those users and that
there are processes just hanging forever when those logs start to appear:

<user>   45949   0.0  0.0    47888  13276  -  I    20:20        0:00.10 doveadm-server: [<user> Verwijderde items send:mail_requests recv:changes] (doveadm-server)
<user2> 45964   0.0  0.0    49860  11608  -  I    20:20        0:00.05 doveadm-server: [IP6 <user2> INBOX import:1/3] (doveadm-server)
<user3> 45965   0.0  0.1    58256  19820  -  I    20:20        0:00.11 doveadm-server: [IP6 <user3> INBOX import:16/18] (doveadm-server)
<user4> 46480   0.0  0.0    53536  16288  -  I    20:22        0:00.08 doveadm-server: [IP6 <user4> INBOX import:4/6] (doveadm-server)
<user5> 46745   0.0  0.0    51496  14184  -  I    20:22        0:00.07 doveadm-server: [IP6 <user5> INBOX import:5/6] (doveadm-server)

I also reverted to 2.2.35 because I started to get complaints from my users that mail was missing.

Cheers
Remko

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 833 bytes
Desc: Message signed with OpenPGP
URL: <https://dovecot.org/pipermail/dovecot/attachments/20180405/4b5c633e/attachment.sig>


More information about the dovecot mailing list