Jump to content
daze

WARNING: CPU: 0 PID: 0 at kernel/rcu/tree.c:2725 rcu_process_callbacks+0x320/0x36b

10 posts in this topic Last Reply

Recommended Posts

With -rc5 amd -rc4, I am getting kernel issues. System accepts commands, but they are hung, e.g. reboot, or anything else. I have to physically reboot.

 

I'll attach a diag soon.

 

20819.730247] ------------[ cut here ]------------
[20819.730257] WARNING: CPU: 0 PID: 0 at kernel/rcu/tree.c:2725 rcu_process_callbacks+0x320/0x36b
[20819.730258] Modules linked in: ebtable_filter ebtables ip6table_filter ip6_tables vhost_net tun vhost tap veth xt_nat ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 iptable_filter ip_tables nf_nat dm_crypt algif_skcipher af_alg dm_mod dax nfsd lockd grace sunrpc md_mod bonding e1000e ptp pps_core x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel pcbc aesni_intel aes_x86_64 crypto_simd glue_helper cryptd mpt3sas intel_cstate intel_uncore intel_rapl_perf i2c_i801 i2c_core raid_class scsi_transport_sas wmi_bmof mxm_wmi video backlight thermal button acpi_pad fan wmi [last unloaded: pps_core]
[20819.730312] CPU: 0 PID: 0 Comm: swapper/0 Not tainted 4.14.23-unRAID #1
[20819.730313] Hardware name: ASUS All Series/Z97-A-USB31, BIOS 2801 11/11/2015
[20819.730315] task: ffffffff81c12480 task.stack: ffffffff81c00000
[20819.730319] RIP: 0010:rcu_process_callbacks+0x320/0x36b
[20819.730321] RSP: 0018:ffff88043f403f18 EFLAGS: 00010002
[20819.730323] RAX: ffffffffffffd800 RBX: ffff88043f4214c0 RCX: 0000000095882701
[20819.730325] RDX: 0000000000000001 RSI: ffff88043f403f20 RDI: ffff88043f4214f8
[20819.730326] RBP: ffffffff81c399c0 R08: 0000000000024000 R09: ffffffff8108c8e6
[20819.730327] R10: ffffea000dc66640 R11: 0000000000000001 R12: ffff88043f4214f8
[20819.730329] R13: 7fffffffffffffff R14: 0000000000000246 R15: fffffffffffffffa
[20819.730331] FS:  0000000000000000(0000) GS:ffff88043f400000(0000) knlGS:0000000000000000
[20819.730333] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[20819.730334] CR2: 00000000006fcdf4 CR3: 0000000001c0a005 CR4: 00000000003626f0
[20819.730336] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[20819.730338] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[20819.730339] Call Trace:
[20819.730342]  <IRQ>
[20819.730346]  ? hrtimer_forward+0x74/0x7c
[20819.730352]  __do_softirq+0xcd/0x1c2
[20819.730357]  irq_exit+0x4f/0x8e
[20819.730361]  smp_apic_timer_interrupt+0x7a/0x85
[20819.730364]  apic_timer_interrupt+0x7d/0x90
[20819.730366]  </IRQ>
[20819.730370] RIP: 0010:cpuidle_enter_state+0xe3/0x135
[20819.730372] RSP: 0018:ffffffff81c03ec8 EFLAGS: 00000246 ORIG_RAX: ffffffffffffff10
[20819.730374] RAX: ffff88043f420940 RBX: 0000000000000000 RCX: 000000000000001f
[20819.730376] RDX: 000012ef789ff9b0 RSI: 0000000000020140 RDI: 0000000000000000
[20819.730377] RBP: ffff88043f428800 R08: 00003eab3c3de394 R09: 0000000000000060
[20819.730379] R10: ffffffff81c03ea8 R11: 00000000000c7a94 R12: 0000000000000003
[20819.730380] R13: 000012ef789ff9b0 R14: ffffffff81c591f8 R15: 000012ef78953ac4
[20819.730385]  ? cpuidle_enter_state+0xbb/0x135
[20819.730389]  do_idle+0x11a/0x179
[20819.730392]  cpu_startup_entry+0x18/0x1a
[20819.730396]  start_kernel+0x3e4/0x3ec
[20819.730401]  secondary_startup_64+0xa5/0xb0
[20819.730403] Code: a8 00 00 00 eb 13 48 2b 05 eb f3 ba 00 48 39 c2 7d 07 48 89 93 90 00 00 00 48 83 7b 38 00 0f 94 c1 48 85 d2 0f 94 c0 38 c1 74 02 <0f> 0b 4c 89 f7 57 9d 0f 1f 44 00 00 4c 89 e7 e8 1f 0e 00 00 84
[20819.730449] ---[ end trace 9420d3db205577c8 ]---

Share this post


Link to post

FYI, I'm having this same issue I believe.  It occurred for me on rc4, then I downgraded to rc1 and it continued to happen.  Moved back to stable and all has been well for a few hours.

I'm assuming this will be addressed in a future release?

Mar 5 04:09:56 CruzunRAID kernel: ------------[ cut here ]------------
Mar 5 04:09:56 CruzunRAID kernel: WARNING: CPU: 0 PID: 0 at kernel/rcu/tree.c:2725 rcu_process_callbacks+0x320/0x36b
Mar 5 04:09:56 CruzunRAID kernel: Modules linked in: xt_CHECKSUM iptable_mangle ipt_REJECT nf_reject_ipv4 ebtable_filter ebtables ip6table_filter ip6_tables vhost_net vhost tap tun xt_nat veth ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 iptable_filter ip_tables nf_nat xfs nfsd lockd grace sunrpc md_mod nct6775 hwmon_vid jc42 bonding mlx4_en mlx4_core e1000e ptp pps_core x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel cryptd mpt3sas i2c_i801 i2c_core ahci libahci intel_cstate intel_uncore raid_class intel_rapl_perf scsi_transport_sas ie31200_edac button ipmi_si [last unloaded: mlx4_core]
Mar 5 04:09:56 CruzunRAID kernel: CPU: 0 PID: 0 Comm: swapper/0 Not tainted 4.14.23-unRAID #1
Mar 5 04:09:56 CruzunRAID kernel: Hardware name: Supermicro X9SCL/X9SCM/X9SCL/X9SCM, BIOS 1.1a 09/28/2011
Mar 5 04:09:56 CruzunRAID kernel: task: ffffffff81c12480 task.stack: ffffffff81c00000
Mar 5 04:09:56 CruzunRAID kernel: RIP: 0010:rcu_process_callbacks+0x320/0x36b
Mar 5 04:09:56 CruzunRAID kernel: RSP: 0018:ffff88043fc03f18 EFLAGS: 00010002
Mar 5 04:09:56 CruzunRAID kernel: RAX: ffffffffffffd800 RBX: ffff88043fc214c0 RCX: 0000000100200001
Mar 5 04:09:56 CruzunRAID kernel: RDX: 0000000000000002 RSI: ffff88043fc03f20 RDI: ffff88043fc214f8
Mar 5 04:09:56 CruzunRAID kernel: RBP: ffffffff81c399c0 R08: 0000000000000001 R09: ffffffff8108c800
Mar 5 04:09:56 CruzunRAID kernel: R10: ffffea00010dbcc0 R11: 0000000000000000 R12: ffff88043fc214f8
Mar 5 04:09:56 CruzunRAID kernel: R13: 7fffffffffffffff R14: 0000000000000246 R15: fffffffffffffffe
Mar 5 04:09:56 CruzunRAID kernel: FS: 0000000000000000(0000) GS:ffff88043fc00000(0000) knlGS:0000000000000000
Mar 5 04:09:56 CruzunRAID kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar 5 04:09:56 CruzunRAID kernel: CR2: 0000149ce53af2a9 CR3: 0000000001c0a002 CR4: 00000000000606f0
Mar 5 04:09:56 CruzunRAID kernel: Call Trace:
Mar 5 04:09:56 CruzunRAID kernel: <IRQ>
Mar 5 04:09:56 CruzunRAID kernel: ? hrtimer_forward+0x74/0x7c
Mar 5 04:09:56 CruzunRAID kernel: __do_softirq+0xcd/0x1c2
Mar 5 04:09:56 CruzunRAID kernel: irq_exit+0x4f/0x8e
Mar 5 04:09:56 CruzunRAID kernel: smp_apic_timer_interrupt+0x7a/0x85
Mar 5 04:09:56 CruzunRAID kernel: apic_timer_interrupt+0x7d/0x90
Mar 5 04:09:56 CruzunRAID kernel: </IRQ>
Mar 5 04:09:56 CruzunRAID kernel: RIP: 0010:cpuidle_enter_state+0xe0/0x135
Mar 5 04:09:56 CruzunRAID kernel: RSP: 0018:ffffffff81c03ec8 EFLAGS: 00000246 ORIG_RAX: ffffffffffffff10
Mar 5 04:09:56 CruzunRAID kernel: RAX: ffff88043fc20940 RBX: 0000000000000000 RCX: 000000000000001f
Mar 5 04:09:56 CruzunRAID kernel: RDX: 00001ea2831efaa9 RSI: 0000000000020140 RDI: 0000000000000000
Mar 5 04:09:56 CruzunRAID kernel: RBP: ffff88043fc28800 R08: 0000650941e9923d R09: 0000000000000060
Mar 5 04:09:56 CruzunRAID kernel: R10: ffffffff81c03ea8 R11: 0000000000002be0 R12: 0000000000000004
Mar 5 04:09:56 CruzunRAID kernel: R13: 00001ea2831efaa9 R14: ffffffff81c59258 R15: 00001ea283103470
Mar 5 04:09:56 CruzunRAID kernel: ? cpuidle_enter_state+0xbb/0x135
Mar 5 04:09:56 CruzunRAID kernel: do_idle+0x11a/0x179
Mar 5 04:09:56 CruzunRAID kernel: cpu_startup_entry+0x18/0x1a
Mar 5 04:09:56 CruzunRAID kernel: start_kernel+0x3e4/0x3ec
Mar 5 04:09:56 CruzunRAID kernel: secondary_startup_64+0xa5/0xb0
Mar 5 04:09:56 CruzunRAID kernel: Code: a8 00 00 00 eb 13 48 2b 05 eb f3 ba 00 48 39 c2 7d 07 48 89 93 90 00 00 00 48 83 7b 38 00 0f 94 c1 48 85 d2 0f 94 c0 38 c1 74 02 <0f> 0b 4c 89 f7 57 9d 66 66 90 66 90 4c 89 e7 e8 1f 0e 00 00 84 
Mar 5 04:09:56 CruzunRAID kernel: ---[ end trace ff26ffb566f1f9ef ]---

 

Share this post


Link to post

Same issue as others. Rolled back to 6.4.1 after a hard power cycle, checked parity, and everything seems to be well again. Below is my syslog paste. 

 

Mar  4 22:37:17 unRAID kernel: ------------[ cut here ]------------
Mar  4 22:37:17 unRAID kernel: WARNING: CPU: 0 PID: 0 at kernel/rcu/tree.c:2725 rcu_process_callbacks+0x320/0x36b
Mar  4 22:37:17 unRAID kernel: Modules linked in: xt_CHECKSUM iptable_mangle ipt_REJECT nf_reject_ipv4 ebtable_filter ebtables ip6table_filter ip6_tables vhost_net tun vhost tap xt_nat veth ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 iptable_filter ip_tables nf_nat xfs md_mod bonding bnx2 ipmi_ssif i2c_core intel_powerclamp coretemp kvm_intel kvm crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel pcbc aesni_intel aes_x86_64 crypto_simd glue_helper mpt3sas cryptd i7core_edac wmi raid_class scsi_transport_sas intel_cstate acpi_power_meter intel_uncore ata_piix button ipmi_si acpi_cpufreq [last unloaded: bnx2]
Mar  4 22:37:17 unRAID kernel: CPU: 0 PID: 0 Comm: swapper/0 Tainted: G          I     4.14.23-unRAID #1
Mar  4 22:37:17 unRAID kernel: Hardware name: Dell Inc. PowerEdge R710/00NH4P, BIOS 6.4.0 07/23/2013
Mar  4 22:37:17 unRAID kernel: task: ffffffff81c12480 task.stack: ffffffff81c00000
Mar  4 22:37:17 unRAID kernel: RIP: 0010:rcu_process_callbacks+0x320/0x36b
Mar  4 22:37:17 unRAID kernel: RSP: 0018:ffff881fbf003f18 EFLAGS: 00010002
Mar  4 22:37:17 unRAID kernel: RAX: ffffffffffffd800 RBX: ffff881fbf0214c0 RCX: 0000000091816401
Mar  4 22:37:17 unRAID kernel: RDX: 0000000000000001 RSI: ffff881fbf003f20 RDI: ffff881fbf0214f8
Mar  4 22:37:17 unRAID kernel: RBP: ffffffff81c399c0 R08: 0000000000024080 R09: ffffffff814bf8bc
Mar  4 22:37:17 unRAID kernel: R10: ffffea0072760880 R11: 0000000000000020 R12: ffff881fbf0214f8
Mar  4 22:37:17 unRAID kernel: R13: 7fffffffffffffff R14: 0000000000000246 R15: ffffffffffffffff
Mar  4 22:37:17 unRAID kernel: FS:  0000000000000000(0000) GS:ffff881fbf000000(0000) knlGS:0000000000000000
Mar  4 22:37:17 unRAID kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar  4 22:37:17 unRAID kernel: CR2: 00007f3070070000 CR3: 0000000001c0a005 CR4: 00000000000226f0
Mar  4 22:37:17 unRAID kernel: Call Trace:
Mar  4 22:37:17 unRAID kernel: <IRQ>
Mar  4 22:37:17 unRAID kernel: ? hrtimer_forward+0x74/0x7c
Mar  4 22:37:17 unRAID kernel: __do_softirq+0xcd/0x1c2
Mar  4 22:37:17 unRAID kernel: irq_exit+0x4f/0x8e
Mar  4 22:37:17 unRAID kernel: smp_apic_timer_interrupt+0x7a/0x85
Mar  4 22:37:17 unRAID kernel: apic_timer_interrupt+0x7d/0x90
Mar  4 22:37:17 unRAID kernel: </IRQ>
Mar  4 22:37:17 unRAID kernel: RIP: 0010:cpuidle_enter_state+0xe0/0x135
Mar  4 22:37:17 unRAID kernel: RSP: 0018:ffffffff81c03ec8 EFLAGS: 00000246 ORIG_RAX: ffffffffffffff10
Mar  4 22:37:17 unRAID kernel: RAX: ffff881fbf020940 RBX: 0000000000000000 RCX: 000000000000001f
Mar  4 22:37:17 unRAID kernel: RDX: 0000185f47ce8b1b RSI: 0000000000020140 RDI: 0000000000000000
Mar  4 22:37:17 unRAID kernel: RBP: ffff881fbf028a00 R08: 00004accec2fdce4 R09: 0000000000000060
Mar  4 22:37:17 unRAID kernel: R10: ffffffff81c03ea8 R11: 000000000036f780 R12: 0000000000000004
Mar  4 22:37:17 unRAID kernel: R13: 0000185f47ce8b1b R14: ffffffff81c59258 R15: 0000185f47bf9a89
Mar  4 22:37:17 unRAID kernel: ? cpuidle_enter_state+0xbb/0x135
Mar  4 22:37:17 unRAID kernel: do_idle+0x11a/0x179
Mar  4 22:37:17 unRAID kernel: cpu_startup_entry+0x18/0x1a
Mar  4 22:37:17 unRAID kernel: start_kernel+0x3e4/0x3ec
Mar  4 22:37:17 unRAID kernel: secondary_startup_64+0xa5/0xb0
Mar  4 22:37:17 unRAID kernel: Code: a8 00 00 00 eb 13 48 2b 05 eb f3 ba 00 48 39 c2 7d 07 48 89 93 90 00 00 00 48 83 7b 38 00 0f 94 c1 48 85 d2 0f 94 c0 38 c1 74 02 <0f> 0b 4c 89 f7 57 9d 66 66 90 66 90 4c 89 e7 e8 1f 0e 00 00 84 
Mar  4 22:37:17 unRAID kernel: ---[ end trace 36b2bcd82cb461af ]---

Share this post


Link to post

Either have to wait for to be fixed in linux kernel or unraid patches it manually until kernel is fixed.

Share this post


Link to post

I've been going strong for 20 hours, looks like this fixed it (at least for me). Thanks!

Share this post


Link to post
4 hours ago, PuffyDumpty said:

Updating now. Will report back if issue returns.

 

No issues since upgrade. It happened in about an hour last time, so I think it's good to go. 

Share this post


Link to post

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.