Jump to content
  • [6.9.0-beta35] Mover repeatedly crashes and hangs


    starlightk7
    • Minor

    Hitting random crashes with mover that did not occur on 6.8. Here are several samples.

     

    I repro by:

     

    1) Invoke mover

    2) Wait (it dies)

     

     The crash traces below appear in the syslog and mover is hung until its manually stopped with "mover stop" and started again, at which point it runs until it crashes again. I have not observed any other adverse effects on the server in this version as a result of this happening.

     

    I can't validate whether this still occurs on RC2 or not because I get the unresponsive sever issue reported in another thread.

     

    Diagnostics attached.

     

    Note: I omitted mover's logging of filenames in the diagnostics.

    Dec 30 16:46:45 Tower kernel: BUG: unable to handle page fault for address: 0000008000000034
    Dec 30 16:46:45 Tower kernel: #PF: supervisor read access in kernel mode
    Dec 30 16:46:45 Tower kernel: #PF: error_code(0x0000) - not-present page
    Dec 30 16:46:45 Tower kernel: PGD 0 P4D 0 
    Dec 30 16:46:45 Tower kernel: Oops: 0000 [#1] SMP NOPTI
    Dec 30 16:46:45 Tower kernel: CPU: 18 PID: 52982 Comm: shfs Not tainted 5.8.18-Unraid #1
    Dec 30 16:46:45 Tower kernel: Hardware name: System manufacturer System Product Name/ROG ZENITH EXTREME, BIOS 2001 07/31/2019
    Dec 30 16:46:45 Tower kernel: RIP: 0010:__page_cache_add_speculative.constprop.0+0x0/0x1f
    Dec 30 16:46:45 Tower kernel: Code: 89 96 00 00 48 89 ef e8 8e 79 00 00 85 c0 75 02 0f 0b 48 8b 45 00 84 c0 79 0e 48 89 ef 5d be 0f 00 00 00 e9 51 fc ff ff 5d c3 <8b> 57 34 85 d2 74 10 8d 4a 01 89 d0 f0 0f b1 4f 34 74 04 89 c2 eb
    Dec 30 16:46:45 Tower kernel: RSP: 0018:ffffc900501b7958 EFLAGS: 00010246
    Dec 30 16:46:45 Tower kernel: RAX: 0000008000000000 RBX: 000000000000001e RCX: 0000000000000012
    Dec 30 16:46:45 Tower kernel: RDX: 0000000000000000 RSI: ffff8891201f9ff0 RDI: 0000008000000000
    Dec 30 16:46:45 Tower kernel: RBP: 0000000000042452 R08: 0000008000000000 R09: ffffc900501b7960
    Dec 30 16:46:45 Tower kernel: R10: ffffc900501b7960 R11: 0000000000000000 R12: 0000000000100c4a
    Dec 30 16:46:45 Tower kernel: R13: ffff8897ef437320 R14: 0000000000042452 R15: 0000000000001000
    Dec 30 16:46:45 Tower kernel: FS:  000015383e243700(0000) GS:ffff88a03e080000(0000) knlGS:0000000000000000
    Dec 30 16:46:45 Tower kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    Dec 30 16:46:45 Tower kernel: CR2: 0000008000000034 CR3: 0000000fbcc60000 CR4: 00000000003406e0
    Dec 30 16:46:45 Tower kernel: Call Trace:
    Dec 30 16:46:45 Tower kernel: find_get_entry+0x7d/0xc4
    Dec 30 16:46:45 Tower kernel: pagecache_get_page+0x20/0x127
    Dec 30 16:46:45 Tower kernel: grab_cache_page_write_begin+0x17/0x2e
    Dec 30 16:46:45 Tower kernel: iomap_write_begin+0xaf/0x30c
    Dec 30 16:46:45 Tower kernel: ? put_page+0x5/0x14
    Dec 30 16:46:45 Tower kernel: ? iov_iter_advance+0x142/0x241
    Dec 30 16:46:45 Tower kernel: iomap_write_actor+0x9b/0x177
    Dec 30 16:46:45 Tower kernel: iomap_apply+0x103/0x172
    Dec 30 16:46:45 Tower kernel: ? iomap_write_end+0x195/0x195
    Dec 30 16:46:45 Tower kernel: iomap_file_buffered_write+0x48/0x69
    Dec 30 16:46:45 Tower kernel: ? iomap_write_end+0x195/0x195
    Dec 30 16:46:45 Tower kernel: xfs_file_buffered_aio_write+0xdc/0x258 [xfs]
    Dec 30 16:46:45 Tower kernel: do_iter_readv_writev+0xb3/0xf3
    Dec 30 16:46:45 Tower kernel: do_iter_write+0x7c/0xb8
    Dec 30 16:46:45 Tower kernel: iter_file_splice_write+0x215/0x313
    Dec 30 16:46:45 Tower kernel: direct_splice_actor+0x17/0x18
    Dec 30 16:46:45 Tower kernel: splice_direct_to_actor+0x125/0x1cd
    Dec 30 16:46:45 Tower kernel: ? do_splice_from+0x35/0x35
    Dec 30 16:46:45 Tower kernel: do_splice_direct+0x94/0xbd
    Dec 30 16:46:45 Tower kernel: do_sendfile+0x187/0x247
    Dec 30 16:46:45 Tower kernel: __do_sys_sendfile64+0x81/0xa7
    Dec 30 16:46:45 Tower kernel: ? ksys_lseek+0x52/0x5e
    Dec 30 16:46:45 Tower kernel: do_syscall_64+0x7a/0x94
    Dec 30 16:46:45 Tower kernel: entry_SYSCALL_64_after_hwframe+0x44/0xa9
    Dec 30 16:46:45 Tower kernel: RIP: 0033:0x15383ffba34a
    Dec 30 16:46:45 Tower kernel: Code: c3 0f 1f 80 00 00 00 00 4c 89 d2 4c 89 c6 e9 0d fe ff ff 0f 1f 44 00 00 31 c0 c3 0f 1f 44 00 00 49 89 ca b8 28 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 16 3b 0d 00 f7 d8 64 89 01 48
    Dec 30 16:46:45 Tower kernel: RSP: 002b:000015383e242938 EFLAGS: 00000206 ORIG_RAX: 0000000000000028
    Dec 30 16:46:45 Tower kernel: RAX: ffffffffffffffda RBX: 0000153810078000 RCX: 000015383ffba34a
    Dec 30 16:46:45 Tower kernel: RDX: 0000000000000000 RSI: 0000000000000007 RDI: 0000000000000009
    Dec 30 16:46:45 Tower kernel: RBP: 000015383e242980 R08: 0000000000000001 R09: 00001538101d2340
    Dec 30 16:46:45 Tower kernel: R10: 00000000eaa30000 R11: 0000000000000206 R12: 0000153810078000
    Dec 30 16:46:45 Tower kernel: R13: 0000000000000010 R14: 0000000000002008 R15: 000000000045ba90
    Dec 30 16:46:45 Tower kernel: Modules linked in: xt_CHECKSUM ipt_REJECT ip6table_mangle ip6table_nat iptable_mangle ip6table_filter ip6_tables vhost_net tun vhost vhost_iotlb tap xt_nat veth xt_MASQUERADE iptable_filter iptable_nat nf_nat ip_tables xfs dm_crypt dm_mod dax nfsd md_mod nfsv3 nfs lockd grace sunrpc bonding sr_mod cdrom wmi_bmof mxm_wmi edac_mce_amd kvm_amd kvm btusb crct10dif_pclmul btrtl crc32_pclmul btbcm crc32c_intel btintel ghash_clmulni_intel igb aesni_intel bluetooth mvsas nvme i2c_piix4 crypto_simd libsas cryptd i2c_algo_bit input_leds ecdh_generic nvme_core ccp glue_helper i2c_core scsi_transport_sas led_class k10temp rapl ahci ecc libahci wmi button acpi_cpufreq
    Dec 30 16:46:45 Tower kernel: CR2: 0000008000000034
    Dec 30 16:46:45 Tower kernel: ---[ end trace dfa03bb6d908d248 ]---
    Dec 30 16:46:45 Tower kernel: RIP: 0010:__page_cache_add_speculative.constprop.0+0x0/0x1f
    Dec 30 16:46:45 Tower kernel: Code: 89 96 00 00 48 89 ef e8 8e 79 00 00 85 c0 75 02 0f 0b 48 8b 45 00 84 c0 79 0e 48 89 ef 5d be 0f 00 00 00 e9 51 fc ff ff 5d c3 <8b> 57 34 85 d2 74 10 8d 4a 01 89 d0 f0 0f b1 4f 34 74 04 89 c2 eb
    Dec 30 16:46:45 Tower kernel: RSP: 0018:ffffc900501b7958 EFLAGS: 00010246
    Dec 30 16:46:45 Tower kernel: RAX: 0000008000000000 RBX: 000000000000001e RCX: 0000000000000012
    Dec 30 16:46:45 Tower kernel: RDX: 0000000000000000 RSI: ffff8891201f9ff0 RDI: 0000008000000000
    Dec 30 16:46:45 Tower kernel: RBP: 0000000000042452 R08: 0000008000000000 R09: ffffc900501b7960
    Dec 30 16:46:45 Tower kernel: R10: ffffc900501b7960 R11: 0000000000000000 R12: 0000000000100c4a
    Dec 30 16:46:45 Tower kernel: R13: ffff8897ef437320 R14: 0000000000042452 R15: 0000000000001000
    Dec 30 16:46:45 Tower kernel: FS:  000015383e243700(0000) GS:ffff88a03e080000(0000) knlGS:0000000000000000
    Dec 30 16:46:45 Tower kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    Dec 30 16:46:45 Tower kernel: CR2: 0000008000000034 CR3: 0000000fbcc60000 CR4: 00000000003406e0

    Take 2

    Dec 30 17:03:58 Tower kernel: BUG: unable to handle page fault for address: 0000008000000034
    Dec 30 17:03:58 Tower kernel: #PF: supervisor read access in kernel mode
    Dec 30 17:03:58 Tower kernel: #PF: error_code(0x0000) - not-present page
    Dec 30 17:03:58 Tower kernel: PGD 0 P4D 0 
    Dec 30 17:03:58 Tower kernel: Oops: 0000 [#2] SMP NOPTI
    Dec 30 17:03:58 Tower kernel: CPU: 16 PID: 7322 Comm: shfs Tainted: G      D           5.8.18-Unraid #1
    Dec 30 17:03:58 Tower kernel: Hardware name: System manufacturer System Product Name/ROG ZENITH EXTREME, BIOS 2001 07/31/2019
    Dec 30 17:03:58 Tower kernel: RIP: 0010:__page_cache_add_speculative.constprop.0+0x0/0x1f
    Dec 30 17:03:58 Tower kernel: Code: 89 96 00 00 48 89 ef e8 8e 79 00 00 85 c0 75 02 0f 0b 48 8b 45 00 84 c0 79 0e 48 89 ef 5d be 0f 00 00 00 e9 51 fc ff ff 5d c3 <8b> 57 34 85 d2 74 10 8d 4a 01 89 d0 f0 0f b1 4f 34 74 04 89 c2 eb
    Dec 30 17:03:58 Tower kernel: RSP: 0018:ffffc900079ffbf8 EFLAGS: 00010246
    Dec 30 17:03:58 Tower kernel: RAX: 0000008000000000 RBX: 0000000000000000 RCX: 0000000000000012
    Dec 30 17:03:58 Tower kernel: RDX: 0000000000000000 RSI: ffff8890c68a9ff0 RDI: 0000008000000000
    Dec 30 17:03:58 Tower kernel: RBP: 000000000004c492 R08: 0000008000000000 R09: ffffc900079ffc00
    Dec 30 17:03:58 Tower kernel: R10: ffffc900079ffc00 R11: ffff88a01a667300 R12: 0000000000000000
    Dec 30 17:03:58 Tower kernel: R13: ffff8897e44ba820 R14: 000000000004c492 R15: 0000000000001000
    Dec 30 17:03:58 Tower kernel: FS:  000015383f789700(0000) GS:ffff88a03e000000(0000) knlGS:0000000000000000
    Dec 30 17:03:58 Tower kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    Dec 30 17:03:58 Tower kernel: CR2: 0000008000000034 CR3: 0000000fbcc60000 CR4: 00000000003406e0
    Dec 30 17:03:58 Tower kernel: Call Trace:
    Dec 30 17:03:58 Tower kernel: find_get_entry+0x7d/0xc4
    Dec 30 17:03:58 Tower kernel: pagecache_get_page+0x20/0x127
    Dec 30 17:03:58 Tower kernel: generic_file_buffered_read+0xe9/0x4fd
    Dec 30 17:03:58 Tower kernel: xfs_file_buffered_aio_read+0x4b/0x66 [xfs]
    Dec 30 17:03:58 Tower kernel: xfs_file_read_iter+0x6f/0xb6 [xfs]
    Dec 30 17:03:58 Tower kernel: generic_file_splice_read+0xed/0x15b
    Dec 30 17:03:58 Tower kernel: splice_direct_to_actor+0xe4/0x1cd
    Dec 30 17:03:58 Tower kernel: ? do_splice_from+0x35/0x35
    Dec 30 17:03:58 Tower kernel: do_splice_direct+0x94/0xbd
    Dec 30 17:03:58 Tower kernel: do_sendfile+0x187/0x247
    Dec 30 17:03:58 Tower kernel: __do_sys_sendfile64+0x81/0xa7
    Dec 30 17:03:58 Tower kernel: ? ksys_lseek+0x52/0x5e
    Dec 30 17:03:58 Tower kernel: do_syscall_64+0x7a/0x94
    Dec 30 17:03:58 Tower kernel: entry_SYSCALL_64_after_hwframe+0x44/0xa9
    Dec 30 17:03:58 Tower kernel: RIP: 0033:0x15383ffba34a
    Dec 30 17:03:58 Tower kernel: Code: c3 0f 1f 80 00 00 00 00 4c 89 d2 4c 89 c6 e9 0d fe ff ff 0f 1f 44 00 00 31 c0 c3 0f 1f 44 00 00 49 89 ca b8 28 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 16 3b 0d 00 f7 d8 64 89 01 48
    Dec 30 17:03:58 Tower kernel: RSP: 002b:000015383f788938 EFLAGS: 00000206 ORIG_RAX: 0000000000000028
    Dec 30 17:03:58 Tower kernel: RAX: ffffffffffffffda RBX: 0000153830080000 RCX: 000015383ffba34a
    Dec 30 17:03:58 Tower kernel: RDX: 0000000000000000 RSI: 000000000000000c RDI: 0000000000000011
    Dec 30 17:03:58 Tower kernel: RBP: 000015383f788980 R08: 0000000000000001 R09: 000015383008b360
    Dec 30 17:03:58 Tower kernel: R10: 00000001146d8000 R11: 0000000000000206 R12: 0000153830080000
    Dec 30 17:03:58 Tower kernel: R13: 0000000000000010 R14: 0000000000002008 R15: 000000000045ba90
    Dec 30 17:03:58 Tower kernel: Modules linked in: xt_CHECKSUM ipt_REJECT ip6table_mangle ip6table_nat iptable_mangle ip6table_filter ip6_tables vhost_net tun vhost vhost_iotlb tap xt_nat veth xt_MASQUERADE iptable_filter iptable_nat nf_nat ip_tables xfs dm_crypt dm_mod dax nfsd md_mod nfsv3 nfs lockd grace sunrpc bonding sr_mod cdrom wmi_bmof mxm_wmi edac_mce_amd kvm_amd kvm btusb crct10dif_pclmul btrtl crc32_pclmul btbcm crc32c_intel btintel ghash_clmulni_intel igb aesni_intel bluetooth mvsas nvme i2c_piix4 crypto_simd libsas cryptd i2c_algo_bit input_leds ecdh_generic nvme_core ccp glue_helper i2c_core scsi_transport_sas led_class k10temp rapl ahci ecc libahci wmi button acpi_cpufreq
    Dec 30 17:03:58 Tower kernel: CR2: 0000008000000034
    Dec 30 17:03:58 Tower kernel: ---[ end trace dfa03bb6d908d249 ]---
    Dec 30 17:03:58 Tower kernel: RIP: 0010:__page_cache_add_speculative.constprop.0+0x0/0x1f
    Dec 30 17:03:58 Tower kernel: Code: 89 96 00 00 48 89 ef e8 8e 79 00 00 85 c0 75 02 0f 0b 48 8b 45 00 84 c0 79 0e 48 89 ef 5d be 0f 00 00 00 e9 51 fc ff ff 5d c3 <8b> 57 34 85 d2 74 10 8d 4a 01 89 d0 f0 0f b1 4f 34 74 04 89 c2 eb
    Dec 30 17:03:58 Tower kernel: RSP: 0018:ffffc900501b7958 EFLAGS: 00010246
    Dec 30 17:03:58 Tower kernel: RAX: 0000008000000000 RBX: 000000000000001e RCX: 0000000000000012
    Dec 30 17:03:58 Tower kernel: RDX: 0000000000000000 RSI: ffff8891201f9ff0 RDI: 0000008000000000
    Dec 30 17:03:58 Tower kernel: RBP: 0000000000042452 R08: 0000008000000000 R09: ffffc900501b7960
    Dec 30 17:03:58 Tower kernel: R10: ffffc900501b7960 R11: 0000000000000000 R12: 0000000000100c4a
    Dec 30 17:03:58 Tower kernel: R13: ffff8897ef437320 R14: 0000000000042452 R15: 0000000000001000
    Dec 30 17:03:58 Tower kernel: FS:  000015383f789700(0000) GS:ffff88a03e000000(0000) knlGS:0000000000000000
    Dec 30 17:03:58 Tower kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    Dec 30 17:03:58 Tower kernel: CR2: 0000008000000034 CR3: 0000000fbcc60000 CR4: 00000000003406e0

    Take 3

    Dec 30 17:30:12 Tower kernel: BUG: unable to handle page fault for address: ffffea8044cd0834
    Dec 30 17:30:12 Tower kernel: #PF: supervisor read access in kernel mode
    Dec 30 17:30:12 Tower kernel: #PF: error_code(0x0000) - not-present page
    Dec 30 17:30:12 Tower kernel: PGD 0 P4D 0 
    Dec 30 17:30:12 Tower kernel: Oops: 0000 [#3] SMP NOPTI
    Dec 30 17:30:12 Tower kernel: CPU: 51 PID: 11368 Comm: shfs Tainted: G      D           5.8.18-Unraid #1
    Dec 30 17:30:12 Tower kernel: Hardware name: System manufacturer System Product Name/ROG ZENITH EXTREME, BIOS 2001 07/31/2019
    Dec 30 17:30:12 Tower kernel: RIP: 0010:__page_cache_add_speculative.constprop.0+0x0/0x1f
    Dec 30 17:30:12 Tower kernel: Code: 89 96 00 00 48 89 ef e8 8e 79 00 00 85 c0 75 02 0f 0b 48 8b 45 00 84 c0 79 0e 48 89 ef 5d be 0f 00 00 00 e9 51 fc ff ff 5d c3 <8b> 57 34 85 d2 74 10 8d 4a 01 89 d0 f0 0f b1 4f 34 74 04 89 c2 eb
    Dec 30 17:30:12 Tower kernel: RSP: 0018:ffffc9000984bc78 EFLAGS: 00010246
    Dec 30 17:30:12 Tower kernel: RAX: ffffea8044cd0800 RBX: 0000000000000002 RCX: ffff88912431dfd0
    Dec 30 17:30:12 Tower kernel: RDX: 00000000000391d6 RSI: ffffffffffffffff RDI: ffffea8044cd0800
    Dec 30 17:30:12 Tower kernel: RBP: ffffea8044cd0800 R08: ffffea8044cd0800 R09: 0000000000000000
    Dec 30 17:30:12 Tower kernel: R10: ffffc9000984bc80 R11: ffffc9000984bc80 R12: 000000000000000f
    Dec 30 17:30:12 Tower kernel: R13: ffffc9000984bd80 R14: ffffc9000984bd00 R15: 000000000000000f
    Dec 30 17:30:12 Tower kernel: FS:  000015383d6c7700(0000) GS:ffff88a03e2c0000(0000) knlGS:0000000000000000
    Dec 30 17:30:12 Tower kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    Dec 30 17:30:12 Tower kernel: CR2: ffffea8044cd0834 CR3: 0000000fbcc60000 CR4: 00000000003406e0
    Dec 30 17:30:12 Tower kernel: Call Trace:
    Dec 30 17:30:12 Tower kernel: find_get_entries+0x97/0x13a
    Dec 30 17:30:12 Tower kernel: pagevec_lookup_entries+0x15/0x1c
    Dec 30 17:30:12 Tower kernel: truncate_inode_pages_range+0xb9/0x438
    Dec 30 17:30:12 Tower kernel: ? __mmu_notifier_change_pte+0x14/0x62
    Dec 30 17:30:12 Tower kernel: evict+0xc5/0x16b
    Dec 30 17:30:12 Tower kernel: do_unlinkat+0x13c/0x1d3
    Dec 30 17:30:12 Tower kernel: do_syscall_64+0x7a/0x94
    Dec 30 17:30:12 Tower kernel: entry_SYSCALL_64_after_hwframe+0x44/0xa9
    Dec 30 17:30:12 Tower kernel: RIP: 0033:0x15383ffb7277
    Dec 30 17:30:12 Tower kernel: Code: f0 ff ff 73 01 c3 48 8b 0d 16 6c 0d 00 f7 d8 64 89 01 48 83 c8 ff c3 66 2e 0f 1f 84 00 00 00 00 00 66 90 b8 57 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d e9 6b 0d 00 f7 d8 64 89 01 48
    Dec 30 17:30:12 Tower kernel: RSP: 002b:000015383d6c69c8 EFLAGS: 00000213 ORIG_RAX: 0000000000000057
    Dec 30 17:30:12 Tower kernel: RAX: ffffffffffffffda RBX: 000015382c05a330 RCX: 000015383ffb7277
    Dec 30 17:30:12 Tower kernel: RDX: 000015383d6c6ab0 RSI: 000015382c01e280 RDI: 000015382c05a338
    Dec 30 17:30:12 Tower kernel: RBP: 000015383d6c6bf0 R08: 000015382c0008d0 R09: 0000000000000006
    Dec 30 17:30:12 Tower kernel: R10: 0000000000000100 R11: 0000000000000213 R12: 000015382c05a330
    Dec 30 17:30:12 Tower kernel: R13: 0000000000000010 R14: 0000000000002008 R15: 000000000045ba90
    Dec 30 17:30:12 Tower kernel: Modules linked in: xt_CHECKSUM ipt_REJECT ip6table_mangle ip6table_nat iptable_mangle ip6table_filter ip6_tables vhost_net tun vhost vhost_iotlb tap xt_nat veth xt_MASQUERADE iptable_filter iptable_nat nf_nat ip_tables xfs dm_crypt dm_mod dax nfsd md_mod nfsv3 nfs lockd grace sunrpc bonding sr_mod cdrom wmi_bmof mxm_wmi edac_mce_amd kvm_amd kvm btusb crct10dif_pclmul btrtl crc32_pclmul btbcm crc32c_intel btintel ghash_clmulni_intel igb aesni_intel bluetooth mvsas nvme i2c_piix4 crypto_simd libsas cryptd i2c_algo_bit input_leds ecdh_generic nvme_core ccp glue_helper i2c_core scsi_transport_sas led_class k10temp rapl ahci ecc libahci wmi button acpi_cpufreq
    Dec 30 17:30:12 Tower kernel: CR2: ffffea8044cd0834
    Dec 30 17:30:12 Tower kernel: ---[ end trace dfa03bb6d908d24a ]---
    Dec 30 17:30:12 Tower kernel: RIP: 0010:__page_cache_add_speculative.constprop.0+0x0/0x1f
    Dec 30 17:30:12 Tower kernel: Code: 89 96 00 00 48 89 ef e8 8e 79 00 00 85 c0 75 02 0f 0b 48 8b 45 00 84 c0 79 0e 48 89 ef 5d be 0f 00 00 00 e9 51 fc ff ff 5d c3 <8b> 57 34 85 d2 74 10 8d 4a 01 89 d0 f0 0f b1 4f 34 74 04 89 c2 eb
    Dec 30 17:30:12 Tower kernel: RSP: 0018:ffffc900501b7958 EFLAGS: 00010246
    Dec 30 17:30:12 Tower kernel: RAX: 0000008000000000 RBX: 000000000000001e RCX: 0000000000000012
    Dec 30 17:30:12 Tower kernel: RDX: 0000000000000000 RSI: ffff8891201f9ff0 RDI: 0000008000000000
    Dec 30 17:30:12 Tower kernel: RBP: 0000000000042452 R08: 0000008000000000 R09: ffffc900501b7960
    Dec 30 17:30:12 Tower kernel: R10: ffffc900501b7960 R11: 0000000000000000 R12: 0000000000100c4a
    Dec 30 17:30:12 Tower kernel: R13: ffff8897ef437320 R14: 0000000000042452 R15: 0000000000001000
    Dec 30 17:30:12 Tower kernel: FS:  000015383d6c7700(0000) GS:ffff88a03e2c0000(0000) knlGS:0000000000000000
    Dec 30 17:30:12 Tower kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    Dec 30 17:30:12 Tower kernel: CR2: ffffea8044cd0834 CR3: 0000000fbcc60000 CR4: 00000000003406e0

     

    tower-diagnostics-20201230-1733.zip




    User Feedback

    Recommended Comments

    I appreciate the link, but those settings have long been applied on my machine and that's unfortunately not it 😞

     

    I've been using unRAID on this hardware for more than a year now I never encountered any issues like this on earlier versions - I had months of uptime on 6.8.x without encountering anything like this.

     

    I finally decided to try upgrading to 6.9.x even though it wasn't final yet because of the cache write amplification fixes - I was tired of watching unRAID destroy my SSDs when there was a fix available. I am not running overclocked either - I am running a 2990wx and don't want to be bankrupted by my electricity bill 😛

     

    To be clear, the only adverse effect I've observed from this crash is the mover process deadlocking. The rest of the server continues on as if nothing happened - SSH, Web Gui, VMs & Dockers all continue as normal. Mover just deadlocks until its manually killed and restarted.

     

    I noticed this because I've accumulated a several hundred GB backlog now of things waiting to be moved, and mover reliably dies with this crash in about 60-120 seconds, so it's barely moving anything at all right now. I have to ssh in and kill it then start it up again, but it crashes with the same crash a minute or two later. No read/write errors are ever recorded on the main tab, nor do any drives go offline or have health issues flagged by SMART in the UI dashboard.

    Edited by starlightk7
    Link to comment


    Join the conversation

    You can post now and register later. If you have an account, sign in now to post with your account.
    Note: Your post will require moderator approval before it will be visible.

    Guest
    Add a comment...

    ×   Pasted as rich text.   Restore formatting

      Only 75 emoji are allowed.

    ×   Your link has been automatically embedded.   Display as a link instead

    ×   Your previous content has been restored.   Clear editor

    ×   You cannot paste images directly. Upload or insert images from URL.


  • Status Definitions

     

    Open = Under consideration.

     

    Solved = The issue has been resolved.

     

    Solved version = The issue has been resolved in the indicated release version.

     

    Closed = Feedback or opinion better posted on our forum for discussion. Also for reports we cannot reproduce or need more information. In this case just add a comment and we will review it again.

     

    Retest = Please retest in latest release.


    Priority Definitions

     

    Minor = Something not working correctly.

     

    Urgent = Server crash, data loss, or other showstopper.

     

    Annoyance = Doesn't affect functionality but should be fixed.

     

    Other = Announcement or other non-issue.

×
×
  • Create New...