unRAID keeps freezing on new hardware


Go to solution Solved by Hoopster,

Recommended Posts

My unRAID server keeps freezing after migrating to new hardware. As I'm using an AMD processor, I already tried the fixes from the FAQs regarding the c states, but to no avail.

 

I setup a syslog server and here is the last output I got:

 

Jul 15 06:50:06 SilverSurfer kernel: WARNING: CPU: 8 PID: 969 at net/netfilter/nf_conntrack_core.c:1210 __nf_conntrack_confirm+0xa4/0x2b0 [nf_conntrack]
Jul 15 06:50:06 SilverSurfer kernel: Modules linked in: xt_nat xt_tcpudp veth macvlan xt_conntrack xt_MASQUERADE nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 xt_addrtype br_netfilter xfs md_mod zfs(PO) zunicode(PO) zzstd(O) zlua(O) zavl(PO) icp(PO) zcommon(PO) znvpair(PO) spl(O) ip6table_filter ip6_tables iptable_filter ip_tables x_tables efivarfs af_packet 8021q garp mrp bridge stp llc bonding tls amdgpu edac_mce_amd edac_core gpu_sched drm_buddy i2c_algo_bit drm_ttm_helper ttm kvm_amd drm_display_helper drm_kms_helper kvm drm sr_mod btusb crct10dif_pclmul cdrom crc32_pclmul crc32c_intel btrtl btbcm ghash_clmulni_intel btintel sha512_ssse3 wmi_bmof bluetooth aesni_intel agpgart crypto_simd cryptd nvme syscopyarea sysfillrect i2c_piix4 r8169 sysimgblt ahci ecdh_generic rapl i2c_core k10temp nvme_core ccp fb_sys_fops realtek cdc_acm ecc libahci thermal video wmi backlight acpi_tad button acpi_cpufreq unix
Jul 15 06:50:06 SilverSurfer kernel: CPU: 8 PID: 969 Comm: kworker/u64:11 Tainted: P S         O       6.1.36-Unraid #1
Jul 15 06:50:06 SilverSurfer kernel: Hardware name: HP HP Desktop M01-F1xxx/87D6, BIOS F.13 03/29/2021
Jul 15 06:50:06 SilverSurfer kernel: Workqueue: events_unbound macvlan_process_broadcast [macvlan]
Jul 15 06:50:06 SilverSurfer kernel: RIP: 0010:__nf_conntrack_confirm+0xa4/0x2b0 [nf_conntrack]
Jul 15 06:50:06 SilverSurfer kernel: Code: 44 24 10 e8 e2 e1 ff ff 8b 7c 24 04 89 ea 89 c6 89 04 24 e8 7e e6 ff ff 84 c0 75 a2 48 89 df e8 9b e2 ff ff 85 c0 89 c5 74 18 <0f> 0b 8b 34 24 8b 7c 24 04 e8 18 dd ff ff e8 93 e3 ff ff e9 72 01
Jul 15 06:50:06 SilverSurfer kernel: RSP: 0018:ffffc900003d8d98 EFLAGS: 00010202
Jul 15 06:50:06 SilverSurfer kernel: RAX: 0000000000000001 RBX: ffff88812ac27200 RCX: e012e3db32ff73d3
Jul 15 06:50:06 SilverSurfer kernel: RDX: 0000000000000000 RSI: 0000000000000001 RDI: ffff88812ac27200
Jul 15 06:50:06 SilverSurfer kernel: RBP: 0000000000000001 R08: 7e96d5f73588b0b9 R09: c288454296e8d8b6
Jul 15 06:50:06 SilverSurfer kernel: R10: 0e3ad73a02043d6b R11: ffffc900003d8d60 R12: ffffffff82a11440
Jul 15 06:50:06 SilverSurfer kernel: R13: 000000000001e238 R14: ffff8881954aee00 R15: 0000000000000000
Jul 15 06:50:06 SilverSurfer kernel: FS:  0000000000000000(0000) GS:ffff88840f200000(0000) knlGS:0000000000000000
Jul 15 06:50:06 SilverSurfer kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 15 06:50:06 SilverSurfer kernel: CR2: 000000c0001e1010 CR3: 0000000283036000 CR4: 0000000000350ee0
Jul 15 06:50:06 SilverSurfer kernel: Call Trace:
Jul 15 06:50:06 SilverSurfer kernel: <IRQ>
Jul 15 06:50:06 SilverSurfer kernel: ? __warn+0xab/0x122
Jul 15 06:50:06 SilverSurfer kernel: ? report_bug+0x109/0x17e
Jul 15 06:50:06 SilverSurfer kernel: ? __nf_conntrack_confirm+0xa4/0x2b0 [nf_conntrack]
Jul 15 06:50:06 SilverSurfer kernel: ? handle_bug+0x41/0x6f
Jul 15 06:50:06 SilverSurfer kernel: ? exc_invalid_op+0x13/0x60
Jul 15 06:50:06 SilverSurfer kernel: ? asm_exc_invalid_op+0x16/0x20
Jul 15 06:50:06 SilverSurfer kernel: ? __nf_conntrack_confirm+0xa4/0x2b0 [nf_conntrack]
Jul 15 06:50:06 SilverSurfer kernel: ? __nf_conntrack_confirm+0x9e/0x2b0 [nf_conntrack]
Jul 15 06:50:06 SilverSurfer kernel: ? nf_nat_inet_fn+0xc0/0x1a8 [nf_nat]
Jul 15 06:50:06 SilverSurfer kernel: nf_conntrack_confirm+0x25/0x54 [nf_conntrack]
Jul 15 06:50:06 SilverSurfer kernel: nf_hook_slow+0x3d/0x96
Jul 15 06:50:06 SilverSurfer kernel: ? ip_protocol_deliver_rcu+0x164/0x164
Jul 15 06:50:06 SilverSurfer kernel: NF_HOOK.constprop.0+0x79/0xd9
Jul 15 06:50:06 SilverSurfer kernel: ? ip_protocol_deliver_rcu+0x164/0x164
Jul 15 06:50:06 SilverSurfer kernel: __netif_receive_skb_one_core+0x77/0x9c
Jul 15 06:50:06 SilverSurfer kernel: process_backlog+0x8c/0x116
Jul 15 06:50:06 SilverSurfer kernel: __napi_poll.constprop.0+0x2b/0x124
Jul 15 06:50:06 SilverSurfer kernel: net_rx_action+0x159/0x24f
Jul 15 06:50:06 SilverSurfer kernel: __do_softirq+0x129/0x288
Jul 15 06:50:06 SilverSurfer kernel: do_softirq+0x7f/0xab
Jul 15 06:50:06 SilverSurfer kernel: </IRQ>
Jul 15 06:50:06 SilverSurfer kernel: <TASK>
Jul 15 06:50:06 SilverSurfer kernel: __local_bh_enable_ip+0x4c/0x6b
Jul 15 06:50:06 SilverSurfer kernel: netif_rx+0x52/0x5a
Jul 15 06:50:06 SilverSurfer kernel: macvlan_broadcast+0x10a/0x150 [macvlan]
Jul 15 06:50:06 SilverSurfer kernel: macvlan_process_broadcast+0xbc/0x12f [macvlan]
Jul 15 06:50:06 SilverSurfer kernel: process_one_work+0x1ab/0x295
Jul 15 06:50:06 SilverSurfer kernel: worker_thread+0x18b/0x244
Jul 15 06:50:06 SilverSurfer kernel: ? rescuer_thread+0x281/0x281
Jul 15 06:50:06 SilverSurfer kernel: kthread+0xe7/0xef
Jul 15 06:50:06 SilverSurfer kernel: ? kthread_complete_and_exit+0x1b/0x1b
Jul 15 06:50:06 SilverSurfer kernel: ret_from_fork+0x22/0x30
Jul 15 06:50:06 SilverSurfer kernel: </TASK>
Jul 15 06:50:06 SilverSurfer kernel: ---[ end trace 0000000000000000 ]---

 

Does anyone have any suggestions?

Link to comment
  • Solution
1 hour ago, wieli99 said:

My unRAID server keeps freezing after migrating to new hardware. As I'm using an AMD processor, I already tried the fixes from the FAQs regarding the c states, but to no avail.

 

I setup a syslog server and here is the last output I got:

 

Jul 15 06:50:06 SilverSurfer kernel: WARNING: CPU: 8 PID: 969 at net/netfilter/nf_conntrack_core.c:1210 __nf_conntrack_confirm+0xa4/0x2b0 [nf_conntrack]
Jul 15 06:50:06 SilverSurfer kernel: Modules linked in: xt_nat xt_tcpudp veth macvlan xt_conntrack xt_MASQUERADE nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 xt_addrtype br_netfilter xfs md_mod zfs(PO) zunicode(PO) zzstd(O) zlua(O) zavl(PO) icp(PO) zcommon(PO) znvpair(PO) spl(O) ip6table_filter ip6_tables iptable_filter ip_tables x_tables efivarfs af_packet 8021q garp mrp bridge stp llc bonding tls amdgpu edac_mce_amd edac_core gpu_sched drm_buddy i2c_algo_bit drm_ttm_helper ttm kvm_amd drm_display_helper drm_kms_helper kvm drm sr_mod btusb crct10dif_pclmul cdrom crc32_pclmul crc32c_intel btrtl btbcm ghash_clmulni_intel btintel sha512_ssse3 wmi_bmof bluetooth aesni_intel agpgart crypto_simd cryptd nvme syscopyarea sysfillrect i2c_piix4 r8169 sysimgblt ahci ecdh_generic rapl i2c_core k10temp nvme_core ccp fb_sys_fops realtek cdc_acm ecc libahci thermal video wmi backlight acpi_tad button acpi_cpufreq unix
Jul 15 06:50:06 SilverSurfer kernel: CPU: 8 PID: 969 Comm: kworker/u64:11 Tainted: P S         O       6.1.36-Unraid #1
Jul 15 06:50:06 SilverSurfer kernel: Hardware name: HP HP Desktop M01-F1xxx/87D6, BIOS F.13 03/29/2021
Jul 15 06:50:06 SilverSurfer kernel: Workqueue: events_unbound macvlan_process_broadcast [macvlan]
Jul 15 06:50:06 SilverSurfer kernel: RIP: 0010:__nf_conntrack_confirm+0xa4/0x2b0 [nf_conntrack]
Jul 15 06:50:06 SilverSurfer kernel: Code: 44 24 10 e8 e2 e1 ff ff 8b 7c 24 04 89 ea 89 c6 89 04 24 e8 7e e6 ff ff 84 c0 75 a2 48 89 df e8 9b e2 ff ff 85 c0 89 c5 74 18 <0f> 0b 8b 34 24 8b 7c 24 04 e8 18 dd ff ff e8 93 e3 ff ff e9 72 01
Jul 15 06:50:06 SilverSurfer kernel: RSP: 0018:ffffc900003d8d98 EFLAGS: 00010202
Jul 15 06:50:06 SilverSurfer kernel: RAX: 0000000000000001 RBX: ffff88812ac27200 RCX: e012e3db32ff73d3
Jul 15 06:50:06 SilverSurfer kernel: RDX: 0000000000000000 RSI: 0000000000000001 RDI: ffff88812ac27200
Jul 15 06:50:06 SilverSurfer kernel: RBP: 0000000000000001 R08: 7e96d5f73588b0b9 R09: c288454296e8d8b6
Jul 15 06:50:06 SilverSurfer kernel: R10: 0e3ad73a02043d6b R11: ffffc900003d8d60 R12: ffffffff82a11440
Jul 15 06:50:06 SilverSurfer kernel: R13: 000000000001e238 R14: ffff8881954aee00 R15: 0000000000000000
Jul 15 06:50:06 SilverSurfer kernel: FS:  0000000000000000(0000) GS:ffff88840f200000(0000) knlGS:0000000000000000
Jul 15 06:50:06 SilverSurfer kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 15 06:50:06 SilverSurfer kernel: CR2: 000000c0001e1010 CR3: 0000000283036000 CR4: 0000000000350ee0
Jul 15 06:50:06 SilverSurfer kernel: Call Trace:
Jul 15 06:50:06 SilverSurfer kernel: <IRQ>
Jul 15 06:50:06 SilverSurfer kernel: ? __warn+0xab/0x122
Jul 15 06:50:06 SilverSurfer kernel: ? report_bug+0x109/0x17e
Jul 15 06:50:06 SilverSurfer kernel: ? __nf_conntrack_confirm+0xa4/0x2b0 [nf_conntrack]
Jul 15 06:50:06 SilverSurfer kernel: ? handle_bug+0x41/0x6f
Jul 15 06:50:06 SilverSurfer kernel: ? exc_invalid_op+0x13/0x60
Jul 15 06:50:06 SilverSurfer kernel: ? asm_exc_invalid_op+0x16/0x20
Jul 15 06:50:06 SilverSurfer kernel: ? __nf_conntrack_confirm+0xa4/0x2b0 [nf_conntrack]
Jul 15 06:50:06 SilverSurfer kernel: ? __nf_conntrack_confirm+0x9e/0x2b0 [nf_conntrack]
Jul 15 06:50:06 SilverSurfer kernel: ? nf_nat_inet_fn+0xc0/0x1a8 [nf_nat]
Jul 15 06:50:06 SilverSurfer kernel: nf_conntrack_confirm+0x25/0x54 [nf_conntrack]
Jul 15 06:50:06 SilverSurfer kernel: nf_hook_slow+0x3d/0x96
Jul 15 06:50:06 SilverSurfer kernel: ? ip_protocol_deliver_rcu+0x164/0x164
Jul 15 06:50:06 SilverSurfer kernel: NF_HOOK.constprop.0+0x79/0xd9
Jul 15 06:50:06 SilverSurfer kernel: ? ip_protocol_deliver_rcu+0x164/0x164
Jul 15 06:50:06 SilverSurfer kernel: __netif_receive_skb_one_core+0x77/0x9c
Jul 15 06:50:06 SilverSurfer kernel: process_backlog+0x8c/0x116
Jul 15 06:50:06 SilverSurfer kernel: __napi_poll.constprop.0+0x2b/0x124
Jul 15 06:50:06 SilverSurfer kernel: net_rx_action+0x159/0x24f
Jul 15 06:50:06 SilverSurfer kernel: __do_softirq+0x129/0x288
Jul 15 06:50:06 SilverSurfer kernel: do_softirq+0x7f/0xab
Jul 15 06:50:06 SilverSurfer kernel: </IRQ>
Jul 15 06:50:06 SilverSurfer kernel: <TASK>
Jul 15 06:50:06 SilverSurfer kernel: __local_bh_enable_ip+0x4c/0x6b
Jul 15 06:50:06 SilverSurfer kernel: netif_rx+0x52/0x5a
Jul 15 06:50:06 SilverSurfer kernel: macvlan_broadcast+0x10a/0x150 [macvlan]
Jul 15 06:50:06 SilverSurfer kernel: macvlan_process_broadcast+0xbc/0x12f [macvlan]
Jul 15 06:50:06 SilverSurfer kernel: process_one_work+0x1ab/0x295
Jul 15 06:50:06 SilverSurfer kernel: worker_thread+0x18b/0x244
Jul 15 06:50:06 SilverSurfer kernel: ? rescuer_thread+0x281/0x281
Jul 15 06:50:06 SilverSurfer kernel: kthread+0xe7/0xef
Jul 15 06:50:06 SilverSurfer kernel: ? kthread_complete_and_exit+0x1b/0x1b
Jul 15 06:50:06 SilverSurfer kernel: ret_from_fork+0x22/0x30
Jul 15 06:50:06 SilverSurfer kernel: </TASK>
Jul 15 06:50:06 SilverSurfer kernel: ---[ end trace 0000000000000000 ]---

 

Does anyone have any suggestions?

 

You have macvlan call traces.  Try switching the docker network type to ipvlan.  Settings --> Docker --> docker custom network type

 

Prior to Unraid 6.12.x (don't know what version you are running since you did not include full diagnostics) usually macvlan issues only occurred on systems where custom IP addresses were assigned on br0.  Now, (6.12.0+ as mentioned in the release notes) with some hardware, macvlan problems can appear even without custom docker container IP addresses.

 

 

Edited by Hoopster
Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.