I should probably also post the messages leading up for reference. Note that I did not see any stale NFS messages this time, but did get the usual duplicate file messages:
Dec 24 10:43:07 gehenna11 dovecot: IMAP(user): /rci/nqu/rci/u1/user/dovecot/.INBOX/dovecot-uidlist: Duplicate file entry at line 4: 1231949492.M325236P5922V03E80009I01B6DCA5_0.gehenna9.rutgers.edu,S=4398:2,S (uid 514 -> 721) Dec 24 10:43:08 gehenna11 dovecot: IMAP(user): Maildir /rci/u1/user/Maildir: Expunged message reappeared, giving a new UID (old uid=516, file=1231952516.M164875P24494V03E80009I01B6DCAC_0.gehenna9.rutgers.edu,S=4355:2,S) Dec 24 10:44:38 gehenna11 dovecot: IMAP(user): /rci/nqu/rci/u1/user/dovecot/.INBOX/dovecot-uidlist: Duplicate file entry at line 6: 1233524267.M97899P9543V04240006I01B6E908_0.gehenna5.rutgers.edu,S=3039:2,S (uid 517 -> 723) Dec 24 10:44:38 gehenna11 dovecot: IMAP(user): Panic: file maildir-uidlist.c: line 405 (maildir_uidlist_records_array_delete): assertion failed: (pos != NULL) Dec 24 10:44:38 gehenna11 dovecot: IMAP(user): Raw backtrace: /usr/libexec/dovecot/imap [0x49edd0] -> /usr/libexec/dovecot/imap [0x49ee33] -> /usr/libexec/dovecot/imap [0x49e496] -> /usr/libexec/dovecot/imap [0x43f064] -> /usr/libexec/dovecot/imap(maildir_uidlist_refresh+0x2d0) [0x43fbf0] -> /usr/libexec/dovecot/imap [0x43c91f] -> /usr/libexec/dovecot/imap(maildir_storage_sync_init+0x14e) [0x43cefe] -> /usr/libexec/dovecot/imap(imap_sync_init+0x67) [0x4283d7] -> /usr/libexec/dovecot/imap [0x41bf5e] -> /usr/libexec/dovecot/imap [0x458ecd] -> /usr/libexec/dovecot/imap(io_loop_handle_timeouts+0x8b) [0x4a5f6b] -> /usr/libexec/dovecot/imap(io_loop_handler_run+0x73) [0x4a6bd3] -> /usr/libexec/dovecot/imap(io_loop_run+0x1d) [0x4a5d6d] -> /usr/libexec/dovecot/imap(main+0x620) [0x428f20] -> /lib64/libc.so.6(__libc_start_main+0xf4) [0x354301d994] -> /usr/libexec/dovecot/imap [0x419ac9] Dec 24 10:44:38 gehenna11 dovecot: dovecot: child 19032 (imap) killed with signal 6 (core dumped)
On 12/24/2009 11:26 AM, David Halik wrote:
Dec 23 11:05:44 gehenna18.rutgers.edu dovecot: IMAP(user1): Panic: file maildir-uidlist.c: line 403 (maildir_uidlist_records_array_delete): assertion failed: (pos != NULL) This is hopefully fixed by: http://hg.dovecot.org/dovecot-1.2/rev/d00abdaf9df2
Timo, I came into work this morning and found three new core dumps from maildir_uidlist_records_array_delete while running the patch you gave me yesterday, so I guess it's not fixed after all. I double checked that it is in fact running your patch, and it is. You can tell also that maildir-uidlist.c: line 403 changed to 405.
Here's the short backtrace:
#0 0x0000003543030265 in raise (sig=<value optimized out>) at ../nptl/sysdeps/unix/sysv/linux/raise.c:64 #1 0x0000003543031d10 in abort () at abort.c:88 #2 0x000000000049eddd in default_fatal_finish (type=<value optimized out>, status=0) at failures.c:160 #3 0x000000000049ee33 in i_internal_fatal_handler (type=LOG_TYPE_PANIC, status=0, fmt=<value optimized out>, args=<value optimized out>) at failures.c:443 #4 0x000000000049e496 in i_panic (format=0x6 <Address 0x6 out of bounds>) at failures.c:207 #5 0x000000000043f064 in maildir_uidlist_records_array_delete (uidlist=<value optimized out>, rec=0x16dbd48) at maildir-uidlist.c:405 #6 0x000000000043fbf0 in maildir_uidlist_refresh (uidlist=0x16c75b0) at maildir-uidlist.c:552 #7 0x000000000043c91f in maildir_sync_context (ctx=0x16a7188, forced=false, find_uid=0x0, lost_files_r=0x7fff86f83d87) at maildir-sync.c:659 #8 0x000000000043cefe in maildir_storage_sync_init (box=0x16b64f8, flags=0) at maildir-sync.c:924 #9 0x00000000004283d7 in imap_sync_init (client=0x16b27a0, box=0x16b64f8, imap_flags=0, flags=0) at imap-sync.c:146 #10 0x000000000041bf5e in idle_sync_now (box=0x4a58, ctx=0x16b3db8) at cmd-idle.c:127 #11 0x0000000000458ecd in check_timeout (ibox=0x16b64f8) at index-mailbox-check.c:43 #12 0x00000000004a5f6b in io_loop_handle_timeouts (ioloop=0x16afb60) at ioloop.c:313 #13 0x00000000004a6bd3 in io_loop_handler_run (ioloop=0x16afb60) at ioloop-epoll.c:180 #14 0x00000000004a5d6d in io_loop_run (ioloop=0x16afb60) at ioloop.c:335 #15 0x0000000000428f20 in main (argc=<value optimized out>, argv=<value optimized out>, envp=0x7fff86f840b8) at main.c:327
And the full backtrace: