Skip to content
View in the app

A better way to browse. Learn more.

Unraid

A full-screen app on your home screen with push notifications, badges and more.

To install this app on iOS and iPadOS
  1. Tap the Share icon in Safari
  2. Scroll the menu and tap Add to Home Screen.
  3. Tap Add in the top-right corner.
To install this app on Android
  1. Tap the 3-dot menu (⋮) in the top-right corner of the browser.
  2. Tap Add to Home screen or Install app.
  3. Confirm by tapping Install.

Unraid 6.9.2 rcu_sched errors

Featured Replies

I am a relatively new user of Unraid and am very rusty with my linux skills as well.   A few days ago, when I was running the very first parity check after starting up the server, as it was getting towards the 7 or 8 TB mark of 16TB, the system started hanging and being quite unresponsive.  I saw several of the 12 vprocs at 100%.  2 of the 3 data drives that were still being read started spamming read errors and the parity check failed.  I lost all access to machine and had to shut it down.  Unfortunately, I did not capture any logs from that adventure.

 

I brought it back up and parity was marked failed so I started a read check on the 6 data drives.  Shortly after it got past the 8TB mark (max size of the data disks), the throughput started climbing close to 1GB/s (on main dashboard showing progress of read check.  Then the system started behaving similarly to the parity check and was very unresponsive - unable to connect to ssh or any webui ports.   As it said it had about 3 hrs to go at that point, I thought I would just wait it out in case it was actually completing the process.   

I checked back in on the system in about 2 hrs and I was able to get in and the read check had continued.   I didn't check the state of any of the docker containers nor mess with them in any way.  The read check finished successfully at that point and I started looking at the logs and saw this:

Jan  2 15:46:28 kmce kernel: CPU: 4 PID: 11018 Comm: unraidd0 Not tainted 5.10.28-Unraid #1
Jan  2 15:46:28 kmce kernel: Hardware name: Gigabyte Technology Co., Ltd. Default string/X99-UD3-CF, BIOS F23c 06/15/2018
Jan  2 15:46:28 kmce kernel: Call Trace:
Jan  2 15:46:28 kmce kernel: <IRQ>
Jan  2 15:46:28 kmce kernel: dump_stack+0x6b/0x83
Jan  2 15:46:28 kmce kernel: ? lapic_can_unplug_cpu+0x8e/0x8e
Jan  2 15:46:28 kmce kernel: nmi_cpu_backtrace+0x7d/0x8f
Jan  2 15:46:28 kmce kernel: nmi_trigger_cpumask_backtrace+0x56/0xd3
Jan  2 15:46:28 kmce kernel: rcu_dump_cpu_stacks+0x9f/0xc6
Jan  2 15:46:28 kmce kernel: rcu_sched_clock_irq+0x1ec/0x543
Jan  2 15:46:28 kmce kernel: ? trigger_load_balance+0x5a/0x1ca
Jan  2 15:46:28 kmce kernel: update_process_times+0x50/0x6e
Jan  2 15:46:28 kmce kernel: tick_sched_timer+0x36/0x64
Jan  2 15:46:28 kmce kernel: __hrtimer_run_queues+0xb7/0x10b
Jan  2 15:46:28 kmce kernel: ? tick_sched_do_timer+0x39/0x39
Jan  2 15:46:28 kmce kernel: hrtimer_interrupt+0x8d/0x15b
Jan  2 15:46:28 kmce kernel: __sysvec_apic_timer_interrupt+0x5d/0x68
Jan  2 15:46:28 kmce kernel: asm_call_irq_on_stack+0x12/0x20
Jan  2 15:46:28 kmce kernel: </IRQ>
Jan  2 15:46:28 kmce kernel: sysvec_apic_timer_interrupt+0x71/0x95
Jan  2 15:46:28 kmce kernel: asm_sysvec_apic_timer_interrupt+0x12/0x20
Jan  2 15:46:28 kmce kernel: RIP: 0010:schedule_read+0x7b/0x87 [md_mod]
Jan  2 15:46:28 kmce kernel: Code: 00 b8 11 ff ff 01 b9 00 04 00 00 48 c1 e2 29 48 03 95 a8 00 00 00 48 c1 e0 27 48 c1 fa 06 48 c1 e2 0c 48 01 c2 31 c0 48 89 d7 <f3> ab 48 81 4d 00 00 01 00 00 5d c3 8b 87 70 05 00 00 85 c0 75 02
Jan  2 15:46:28 kmce kernel: RSP: 0018:ffffc900007afdb0 EFLAGS: 00010246
Jan  2 15:46:28 kmce kernel: RAX: 0000000000000000 RBX: ffff8881014a5768 RCX: 00000000000001f0
Jan  2 15:46:28 kmce kernel: RDX: ffff888139270000 RSI: 0000000000000003 RDI: ffff888139270840
Jan  2 15:46:28 kmce kernel: RBP: ffff8881014a5aa0 R08: 0000000000000000 R09: 0000000000000001
Jan  2 15:46:28 kmce kernel: R10: ffff8881014a5e10 R11: ffff88880fee2400 R12: ffff8881014a5758
Jan  2 15:46:28 kmce kernel: R13: 0000000000000000 R14: 0000000000000003 R15: 0000000000000003
Jan  2 15:46:28 kmce kernel: unraidd+0xe14/0x12b7 [md_mod]
Jan  2 15:46:28 kmce kernel: ? md_thread+0xee/0x115 [md_mod]
Jan  2 15:46:28 kmce kernel: ? rmw5_write_data+0x178/0x178 [md_mod]
Jan  2 15:46:28 kmce kernel: md_thread+0xee/0x115 [md_mod]
Jan  2 15:46:28 kmce kernel: ? init_wait_entry+0x24/0x24
Jan  2 15:46:28 kmce kernel: ? md_seq_show+0x69e/0x69e [md_mod]
Jan  2 15:46:28 kmce kernel: kthread+0xe5/0xea
Jan  2 15:46:28 kmce kernel: ? __kthread_bind_mask+0x57/0x57
Jan  2 15:46:28 kmce kernel: ret_from_fork+0x22/0x30
Jan  2 15:47:53 kmce kernel: vethc208cd9: renamed from eth0
 

Similar things happen many times in the syslog but not with the same CPU or call trace.

 

At this point, I am rebuilding parity drive but I fear that this is going to happen again the next time the system is under load.

 

I was unable to run diagnostics at the time as it just hung so I have attached a diagnostics from just now, but the additional syslog.txt attached is from when the problem occurred.

 

Any pointers or help appreciated!

kmce-diagnostics-20220103-1011.zip syslog.txt

  • Community Expert

Try updating to v6.10-rc2, some kernel/hardware combinations can sometimes have some issues.

  • Author

Thanks @JorgeB, I will look into that once my parity drive finishes rebuilding...

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

Account

Navigation

Search

Search

Configure browser push notifications

Chrome (Android)
  1. Tap the lock icon next to the address bar.
  2. Tap Permissions → Notifications.
  3. Adjust your preference.
Chrome (Desktop)
  1. Click the padlock icon in the address bar.
  2. Select Site settings.
  3. Find Notifications and adjust your preference.