Unraid Crash


vakilando

Recommended Posts

Heute morgen war meine Heimautomation nicht mehr erreichbar und ich musste aufstehen um die Kaffeemaschine anzuschalten - das war schon mal ein schlechter Start in den Tag.... Noch schlechter: mein Unraid Server ist nachts angestürzt.

Warum ist mir noch nicht  klar, nach einem Hard-Reset läuft er wieder und behauptet es ginge ihm gut (soweit ich das erst mal beurteilen kann).

Anbei Ausschnitte des Syslogs von gestern bis zum Neustart:


(...)

Feb  5 09:20:12 FloBineDATA kernel: rcu: INFO: rcu_bh self-detected stall on CPU
Feb  5 09:20:12 FloBineDATA kernel: rcu: #0112-....: (5578973 ticks this GP) idle=95a/1/0x4000000000000002 softirq=665866432/665917549 fqs=1243552 
Feb  5 09:20:12 FloBineDATA kernel: rcu: #011 (t=5280094 jiffies g=74553 q=140)
Feb  5 09:20:12 FloBineDATA kernel: NMI backtrace for cpu 2
Feb  5 09:20:12 FloBineDATA kernel: CPU: 2 PID: 22418 Comm: shfs Tainted: P        W  O      4.19.107-Unraid #1
Feb  5 09:20:12 FloBineDATA kernel: Hardware name: System manufacturer System Product Name/ROG STRIX X570-E GAMING, BIOS 1409 05/12/2020
Feb  5 09:20:12 FloBineDATA kernel: Call Trace:
Feb  5 09:20:12 FloBineDATA kernel: <IRQ>
Feb  5 09:20:12 FloBineDATA kernel: dump_stack+0x67/0x83
Feb  5 09:20:12 FloBineDATA kernel: nmi_cpu_backtrace+0x71/0x83
Feb  5 09:20:12 FloBineDATA kernel: ? lapic_can_unplug_cpu+0x97/0x97
Feb  5 09:20:12 FloBineDATA kernel: nmi_trigger_cpumask_backtrace+0x57/0xd4
Feb  5 09:20:12 FloBineDATA kernel: rcu_dump_cpu_stacks+0x8b/0xb4
Feb  5 09:20:12 FloBineDATA kernel: rcu_check_callbacks+0x296/0x5a0
Feb  5 09:20:12 FloBineDATA kernel: update_process_times+0x24/0x47
Feb  5 09:20:12 FloBineDATA kernel: tick_sched_timer+0x36/0x64
Feb  5 09:20:12 FloBineDATA kernel: __hrtimer_run_queues+0xb7/0x10b
Feb  5 09:20:12 FloBineDATA kernel: ? tick_sched_handle.isra.0+0x2f/0x2f
Feb  5 09:20:12 FloBineDATA kernel: hrtimer_interrupt+0xf4/0x20e
Feb  5 09:20:12 FloBineDATA kernel: smp_apic_timer_interrupt+0x7b/0x93
Feb  5 09:20:12 FloBineDATA kernel: apic_timer_interrupt+0xf/0x20
Feb  5 09:20:12 FloBineDATA kernel: </IRQ>
Feb  5 09:20:12 FloBineDATA kernel: RIP: 0010:radix_tree_load_root+0x27/0x38
Feb  5 09:20:12 FloBineDATA kernel: Code: 47 04 c3 48 8b 47 08 48 89 c1 48 89 06 83 e1 03 48 ff c9 75 1c 48 83 e0 fe be 40 00 00 00 0f b6 08 48 d3 e6 48 ff ce 48 89 32 <0f> b6 00 83 c0 06 c3 48 c7 02 00 00 00 00 31 c0 c3 48 89 f0 83 e0
Feb  5 09:20:12 FloBineDATA kernel: RSP: 0018:ffffc9000c8a3cf0 EFLAGS: 00000216 ORIG_RAX: ffffffffffffff13
Feb  5 09:20:12 FloBineDATA kernel: RAX: ffff888146aff478 RBX: 0000000000000771 RCX: 0000000000000006
Feb  5 09:20:12 FloBineDATA kernel: RDX: ffffc9000c8a3d00 RSI: 0000000000000fff RDI: ffff888121b7eba8
Feb  5 09:20:12 FloBineDATA kernel: RBP: ffff888121b7ebb0 R08: ffff888121b7ebb0 R09: ffffc9000c8a3d28
Feb  5 09:20:12 FloBineDATA kernel: R10: 0000000000000000 R11: ffff888121b7eba8 R12: 0000000000000000
Feb  5 09:20:12 FloBineDATA kernel: R13: ffff888121b7eba0 R14: ffffc9000c8a3e88 R15: 0000000000000771
Feb  5 09:20:12 FloBineDATA kernel: __radix_tree_lookup+0x39/0xa2
Feb  5 09:20:12 FloBineDATA kernel: radix_tree_lookup_slot+0x1e/0x41
Feb  5 09:20:12 FloBineDATA kernel: find_get_entry+0x14/0x8f
Feb  5 09:20:12 FloBineDATA kernel: pagecache_get_page+0x20/0x1bd
Feb  5 09:20:12 FloBineDATA kernel: generic_file_read_iter+0x1b8/0x6d0
Feb  5 09:20:12 FloBineDATA kernel: xfs_file_buffered_aio_read+0x4b/0x66 [xfs]
Feb  5 09:20:12 FloBineDATA kernel: xfs_file_read_iter+0x6f/0xb6 [xfs]
Feb  5 09:20:12 FloBineDATA kernel: __vfs_read+0xf9/0x132
Feb  5 09:20:12 FloBineDATA kernel: vfs_read+0xa4/0x124
Feb  5 09:20:12 FloBineDATA kernel: ksys_pread64+0x5d/0x79
Feb  5 09:20:12 FloBineDATA kernel: do_syscall_64+0x57/0xf2
Feb  5 09:20:12 FloBineDATA kernel: entry_SYSCALL_64_after_hwframe+0x44/0xa9
Feb  5 09:20:12 FloBineDATA kernel: RIP: 0033:0x14fa07816e27
Feb  5 09:20:12 FloBineDATA kernel: Code: 08 89 3c 24 48 89 4c 24 18 e8 b5 f3 ff ff 4c 8b 54 24 18 48 8b 54 24 10 41 89 c0 48 8b 74 24 08 8b 3c 24 b8 11 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 2d 44 89 c7 48 89 04 24 e8 e5 f3 ff ff 48 8b
Feb  5 09:20:12 FloBineDATA kernel: RSP: 002b:000014f97f7faa50 EFLAGS: 00000297 ORIG_RAX: 0000000000000011
Feb  5 09:20:12 FloBineDATA kernel: RAX: ffffffffffffffda RBX: 0000000000020000 RCX: 000014fa07816e27
Feb  5 09:20:12 FloBineDATA kernel: RDX: 0000000000020000 RSI: 000014f9c84a1000 RDI: 00000000000000fe
Feb  5 09:20:12 FloBineDATA kernel: RBP: 0000000000000000 R08: 0000000000000001 R09: 0000000000000000
Feb  5 09:20:12 FloBineDATA kernel: R10: 000000000076c000 R11: 0000000000000297 R12: 000014f97f7fac18
Feb  5 09:20:12 FloBineDATA kernel: R13: 0000000000000000 R14: 000014f9c80dca78 R15: 0000000000000000


(...) obiges wiederholt sich noch einige male (...)


Feb  5 09:26:12 FloBineDATA kernel: rcu: INFO: rcu_bh self-detected stall on CPU
Feb  5 09:26:12 FloBineDATA kernel: rcu: #0112-....: (5938980 ticks this GP) idle=95a/1/0x4000000000000002 softirq=665866432/665917549 fqs=1328274 
Feb  5 09:26:12 FloBineDATA kernel: rcu: #011 (t=5640101 jiffies g=74553 q=144)
Feb  5 09:26:12 FloBineDATA kernel: NMI backtrace for cpu 2
Feb  5 09:26:12 FloBineDATA kernel: CPU: 2 PID: 22418 Comm: shfs Tainted: P        W  O      4.19.107-Unraid #1
Feb  5 09:26:12 FloBineDATA kernel: Hardware name: System manufacturer System Product Name/ROG STRIX X570-E GAMING, BIOS 1409 05/12/2020
Feb  5 09:26:12 FloBineDATA kernel: Call Trace:
Feb  5 09:26:12 FloBineDATA kernel: <IRQ>
Feb  5 09:26:12 FloBineDATA kernel: dump_stack+0x67/0x83
Feb  5 09:26:12 FloBineDATA kernel: nmi_cpu_backtrace+0x71/0x83
Feb  5 09:26:12 FloBineDATA kernel: ? lapic_can_unplug_cpu+0x97/0x97
Feb  5 09:26:12 FloBineDATA kernel: nmi_trigger_cpumask_backtrace+0x57/0xd4
Feb  5 09:26:12 FloBineDATA kernel: rcu_dump_cpu_stacks+0x8b/0xb4
Feb  5 09:26:12 FloBineDATA kernel: rcu_check_callbacks+0x296/0x5a0
Feb  5 09:26:12 FloBineDATA kernel: update_process_times+0x24/0x47
Feb  5 09:26:12 FloBineDATA kernel: tick_sched_timer+0x36/0x64
Feb  5 09:26:12 FloBineDATA kernel: __hrtimer_run_queues+0xb7/0x10b
Feb  5 09:26:12 FloBineDATA kernel: ? tick_sched_handle.isra.0+0x2f/0x2f
Feb  5 09:26:12 FloBineDATA kernel: hrtimer_interrupt+0xf4/0x20e
Feb  5 09:26:12 FloBineDATA kernel: smp_apic_timer_interrupt+0x7b/0x93
Feb  5 09:26:12 FloBineDATA kernel: apic_timer_interrupt+0xf/0x20
Feb  5 09:26:12 FloBineDATA kernel: </IRQ>
Feb  5 09:26:12 FloBineDATA kernel: RIP: 0010:radix_tree_lookup_slot+0x3c/0x41
Feb  5 09:26:12 FloBineDATA kernel: Code: 24 08 31 c0 48 89 e1 e8 40 ff ff ff 48 85 c0 74 04 48 8b 04 24 48 8b 54 24 08 65 48 33 14 25 28 00 00 00 74 05 e8 6d c3 9d ff <48> 83 c4 10 c3 31 c9 31 d2 e9 14 ff ff ff 41 54 55 48 89 fd 53 48
Feb  5 09:26:12 FloBineDATA kernel: RSP: 0018:ffffc9000c8a3d28 EFLAGS: 00000246 ORIG_RAX: ffffffffffffff13
Feb  5 09:26:12 FloBineDATA kernel: RAX: ffff888103efb3e0 RBX: ffff888121b7eba8 RCX: 0000000000000000
Feb  5 09:26:12 FloBineDATA kernel: RDX: 0000000000000000 RSI: ffffc9000c8a3cf8 RDI: ffff888103efb230
Feb  5 09:26:12 FloBineDATA kernel: RBP: 0000000000000771 R08: ffff888103efb3e0 R09: ffffc9000c8a3d28
Feb  5 09:26:12 FloBineDATA kernel: R10: 0000000000000000 R11: ffff888121b7eba8 R12: 0000000000000000
Feb  5 09:26:12 FloBineDATA kernel: R13: ffff888121b7eba0 R14: ffffc9000c8a3e88 R15: 0000000000000771
Feb  5 09:26:12 FloBineDATA kernel: ? radix_tree_lookup_slot+0x1e/0x41
Feb  5 09:26:12 FloBineDATA kernel: find_get_entry+0x14/0x8f
Feb  5 09:26:12 FloBineDATA kernel: pagecache_get_page+0x20/0x1bd
Feb  5 09:26:12 FloBineDATA kernel: generic_file_read_iter+0x1b8/0x6d0
Feb  5 09:26:12 FloBineDATA kernel: xfs_file_buffered_aio_read+0x4b/0x66 [xfs]
Feb  5 09:26:12 FloBineDATA kernel: xfs_file_read_iter+0x6f/0xb6 [xfs]
Feb  5 09:26:12 FloBineDATA kernel: __vfs_read+0xf9/0x132
Feb  5 09:26:12 FloBineDATA kernel: vfs_read+0xa4/0x124
Feb  5 09:26:12 FloBineDATA kernel: ksys_pread64+0x5d/0x79
Feb  5 09:26:12 FloBineDATA kernel: do_syscall_64+0x57/0xf2
Feb  5 09:26:12 FloBineDATA kernel: entry_SYSCALL_64_after_hwframe+0x44/0xa9
Feb  5 09:26:12 FloBineDATA kernel: RIP: 0033:0x14fa07816e27
Feb  5 09:26:12 FloBineDATA kernel: Code: 08 89 3c 24 48 89 4c 24 18 e8 b5 f3 ff ff 4c 8b 54 24 18 48 8b 54 24 10 41 89 c0 48 8b 74 24 08 8b 3c 24 b8 11 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 2d 44 89 c7 48 89 04 24 e8 e5 f3 ff ff 48 8b
Feb  5 09:26:12 FloBineDATA kernel: RSP: 002b:000014f97f7faa50 EFLAGS: 00000297 ORIG_RAX: 0000000000000011
Feb  5 09:26:12 FloBineDATA kernel: RAX: ffffffffffffffda RBX: 0000000000020000 RCX: 000014fa07816e27
Feb  5 09:26:12 FloBineDATA kernel: RDX: 0000000000020000 RSI: 000014f9c84a1000 RDI: 00000000000000fe
Feb  5 09:26:12 FloBineDATA kernel: RBP: 0000000000000000 R08: 0000000000000001 R09: 0000000000000000
Feb  5 09:26:12 FloBineDATA kernel: R10: 000000000076c000 R11: 0000000000000297 R12: 000014f97f7fac18
Feb  5 09:26:12 FloBineDATA kernel: R13: 0000000000000000 R14: 000014f9c80dca78 R15: 0000000000000000
Feb  5 09:27:20 FloBineDATA kernel: clocksource: timekeeping watchdog on CPU2: Marking clocksource 'tsc' as unstable because the skew is too large:
Feb  5 09:27:20 FloBineDATA kernel: clocksource:                       'hpet' wd_now: 4a26dbb4 wd_last: 90c392aa mask: ffffffff
Feb  5 09:27:20 FloBineDATA kernel: clocksource:                       'tsc' cs_now: 12046d35dc8c14 cs_last: 11f0044b0832fe mask: ffffffffffffffff
Feb  5 09:27:20 FloBineDATA kernel: tsc: Marking TSC unstable due to clocksource watchdog
Feb  5 09:27:20 FloBineDATA kernel: TSC found unstable after boot, most likely due to broken BIOS. Use 'tsc=unstable'.
Feb  5 09:27:20 FloBineDATA kernel: sched_clock: Marking unstable (1337029577016586, 5746791293)<-(1337035581197360, -234276209)
Feb  5 09:27:20 FloBineDATA kernel: clocksource: Switched to clocksource hpet
Feb  5 09:27:20 FloBineDATA crond[2333]: time disparity of 95 minutes detected
Feb  5 09:32:20 FloBineDATA kernel: mdcmd (569): set md_write_method 1
Feb  5 09:32:20 FloBineDATA kernel: 
Feb  5 09:51:41 FloBineDATA kernel: mdcmd (570): spindown 2
Feb  5 09:52:20 FloBineDATA kernel: mdcmd (571): set md_write_method 0
Feb  5 09:52:20 FloBineDATA kernel: 
Feb  5 09:53:03 FloBineDATA kernel: mdcmd (572): spindown 1
Feb  5 11:29:51 FloBineDATA kernel: igb 0000:05:00.0 eth0: mixed HW and IP checksum settings.
Feb  5 11:29:51 FloBineDATA kernel: eth0: renamed from veth03ee9fb
Feb  5 12:29:51 FloBineDATA kernel: mdcmd (573): spindown 2
Feb  5 12:31:08 FloBineDATA kernel: veth03ee9fb: renamed from eth0
Feb  5 12:31:08 FloBineDATA kernel: igb 0000:05:00.0 eth0: mixed HW and IP checksum settings.
Feb  5 14:24:18 FloBineDATA kernel: BUG: Bad page map in process khugepaged  pte:720724d045783067 pmd:103edb067
Feb  5 14:24:18 FloBineDATA kernel: addr:0000000032fdfc44 vm_flags:00100073 anon_vma:00000000433df8ae mapping:          (null) index:43c07c
Feb  5 14:24:18 FloBineDATA kernel: file:          (null) fault:          (null) mmap:          (null) readpage:          (null)
Feb  5 14:24:18 FloBineDATA kernel: CPU: 3 PID: 432 Comm: khugepaged Tainted: P        W  O      4.19.107-Unraid #1
Feb  5 14:24:18 FloBineDATA kernel: Hardware name: System manufacturer System Product Name/ROG STRIX X570-E GAMING, BIOS 1409 05/12/2020
Feb  5 14:24:18 FloBineDATA kernel: Call Trace:
Feb  5 14:24:18 FloBineDATA kernel: dump_stack+0x67/0x83
Feb  5 14:24:18 FloBineDATA kernel: print_bad_pte+0x222/0x23f
Feb  5 14:24:18 FloBineDATA kernel: _vm_normal_page+0x5b/0xc6
Feb  5 14:24:18 FloBineDATA kernel: khugepaged+0x89a/0x1829
Feb  5 14:24:18 FloBineDATA kernel: ? wait_woken+0x6a/0x6a
Feb  5 14:24:18 FloBineDATA kernel: ? collapse_shmem+0xacd/0xacd
Feb  5 14:24:18 FloBineDATA kernel: kthread+0x10c/0x114
Feb  5 14:24:18 FloBineDATA kernel: ? kthread_park+0x89/0x89
Feb  5 14:24:18 FloBineDATA kernel: ret_from_fork+0x22/0x40
Feb  5 15:57:20 FloBineDATA kernel: mdcmd (574): set md_write_method 1
Feb  5 15:57:20 FloBineDATA kernel: 
Feb  5 16:07:06 FloBineDATA kernel: mdcmd (575): spindown 2
Feb  5 16:07:20 FloBineDATA kernel: mdcmd (576): set md_write_method 0
Feb  5 16:07:20 FloBineDATA kernel: 
Feb  5 16:07:35 FloBineDATA kernel: mdcmd (577): spindown 1
Feb  5 22:34:28 FloBineDATA kernel: mdcmd (578): spindown 1
Feb  5 23:22:05 FloBineDATA kernel: usb 1-4: USB disconnect, device number 4
Feb  5 23:22:06 FloBineDATA kernel: usb 1-4: new low-speed USB device number 7 using xhci_hcd
Feb  5 23:22:06 FloBineDATA kernel: input: Logitech USB Receiver as /devices/pci0000:00/0000:00:01.2/0000:01:00.0/0000:02:08.0/0000:06:00.1/usb1/1-4/1-4:1.0/0003:046D:C517.0017/input/input43
Feb  5 23:22:06 FloBineDATA kernel: logitech 0003:046D:C517.0017: input,hidraw1: USB HID v1.10 Keyboard [Logitech USB Receiver] on usb-0000:06:00.1-4/input0
Feb  5 23:22:06 FloBineDATA kernel: logitech 0003:046D:C517.0018: fixing up Logitech keyboard report descriptor
Feb  5 23:22:06 FloBineDATA kernel: input: Logitech USB Receiver as /devices/pci0000:00/0000:00:01.2/0000:01:00.0/0000:02:08.0/0000:06:00.1/usb1/1-4/1-4:1.1/0003:046D:C517.0018/input/input44
Feb  5 23:22:06 FloBineDATA kernel: logitech 0003:046D:C517.0018: input,hiddev97,hidraw2: USB HID v1.10 Mouse [Logitech USB Receiver] on usb-0000:06:00.1-4/input1
Feb  5 23:22:09 FloBineDATA kernel: udevd: 3 output lines suppressed due to ratelimiting
Feb  6 00:00:02 FloBineDATA Plugin Auto Update: Checking for available plugin updates
Feb  6 00:00:09 FloBineDATA Plugin Auto Update: Update available for unassigned.devices-plus.plg (Not set to Auto Update)
Feb  6 00:00:09 FloBineDATA Plugin Auto Update: vmbackup.plg version 2021.02.03 does not meet age requirements to update
Feb  6 00:00:09 FloBineDATA Plugin Auto Update: Community Applications Plugin Auto Update finished
Feb  6 00:15:21 FloBineDATA kernel: mdcmd (579): spindown 2
Feb  6 01:17:07 FloBineDATA kernel: mdcmd (580): spindown 0
Feb  6 01:17:08 FloBineDATA kernel: mdcmd (581): spindown 1
Feb  6 01:45:50 FloBineDATA kernel: ioapp-compute-8: Corrupted page table at address 43c07c000
Feb  6 01:45:50 FloBineDATA kernel: PGD 12b32c067 P4D 12b32c067 PUD fab332067 PMD 103edb067 PTE 720724d045783067
Feb  6 01:45:50 FloBineDATA kernel: Bad pagetable: 000f [#1] SMP NOPTI
Feb  6 01:45:50 FloBineDATA kernel: CPU: 14 PID: 25314 Comm: ioapp-compute-8 Tainted: P    B   W  O      4.19.107-Unraid #1
Feb  6 01:45:50 FloBineDATA kernel: Hardware name: System manufacturer System Product Name/ROG STRIX X570-E GAMING, BIOS 1409 05/12/2020
Feb  6 01:45:50 FloBineDATA kernel: RIP: 0033:0x14c156dff341
Feb  6 01:45:50 FloBineDATA kernel: Code: 4c 8b d0 49 83 c2 18 4d 3b 97 28 01 00 00 0f 83 51 02 00 00 4d 89 97 18 01 00 00 41 0f 18 82 00 01 00 00 4d 8b 91 b8 00 00 00 <4c> 89 10 c7 40 08 d0 b5 12 00 c7 40 0c 00 00 00 00 48 c7 40 10 00
Feb  6 01:45:50 FloBineDATA kernel: RSP: 002b:000014c119fb17e0 EFLAGS: 00010287
Feb  6 01:45:50 FloBineDATA kernel: RAX: 000000043c07c000 RBX: 000000043c07bff0 RCX: 0000000000000000
Feb  6 01:45:50 FloBineDATA kernel: RDX: 000000043c07bf08 RSI: 00000008008d4738 RDI: 000014c149dcc3df
Feb  6 01:45:50 FloBineDATA kernel: RBP: 000000008780f7e3 R08: 000000043c07bff0 R09: 000000080012b5d0
Feb  6 01:45:50 FloBineDATA kernel: R10: 0000000000000005 R11: 000000080089e028 R12: 0000000000000000
Feb  6 01:45:50 FloBineDATA kernel: R13: 000000043c07bfc8 R14: 000000043c07bef8 R15: 000014c11da2c800
Feb  6 01:45:50 FloBineDATA kernel: FS:  000014c119fb2b38 GS:  0000000000000000
Feb  6 01:45:50 FloBineDATA kernel: Modules linked in: vhost_net tun vhost tap kvm_amd kvm ccp xt_CHECKSUM ipt_REJECT ip6table_mangle ip6table_nat nf_nat_ipv6 iptable_mangle ip6table_filter ip6_tables veth macvlan xt_nat ipt_MASQUERADE iptable_filter iptable_nat nf_nat_ipv4 nf_nat ip_tables xfs nfsd lockd grace sunrpc md_mod nct6775 hwmon_vid k10temp igb(O) r8125(O) nvidia_drm(PO) nvidia_modeset(PO) nvidia(PO) mxm_wmi wmi_bmof drm_kms_helper drm btusb btrtl btbcm btintel edac_mce_amd mpt3sas crc32_pclmul pcbc bluetooth syscopyarea sysfillrect sysimgblt fb_sys_fops aesni_intel aes_x86_64 glue_helper crypto_simd ghash_clmulni_intel cryptd raid_class scsi_transport_sas agpgart i2c_piix4 i2c_core cdc_acm ahci ecdh_generic libahci crct10dif_pclmul crc32c_intel button wmi pcc_cpufreq acpi_cpufreq [last unloaded: tun]
Feb  6 01:45:50 FloBineDATA kernel: ---[ end trace 4a61123f3ba74a74 ]---
Feb  6 01:45:50 FloBineDATA kernel: RIP: 0033:0x14c156dff341
Feb  6 01:45:50 FloBineDATA kernel: Code: 4c 8b d0 49 83 c2 18 4d 3b 97 28 01 00 00 0f 83 51 02 00 00 4d 89 97 18 01 00 00 41 0f 18 82 00 01 00 00 4d 8b 91 b8 00 00 00 <4c> 89 10 c7 40 08 d0 b5 12 00 c7 40 0c 00 00 00 00 48 c7 40 10 00
Feb  6 01:45:50 FloBineDATA kernel: RSP: 002b:000014c119fb17e0 EFLAGS: 00010287
Feb  6 01:45:50 FloBineDATA kernel: RAX: 000000043c07c000 RBX: 000000043c07bff0 RCX: 0000000000000000
Feb  6 01:45:50 FloBineDATA kernel: RDX: 000000043c07bf08 RSI: 00000008008d4738 RDI: 000014c149dcc3df
Feb  6 01:45:50 FloBineDATA kernel: RBP: 000000008780f7e3 R08: 000000043c07bff0 R09: 000000080012b5d0
Feb  6 01:45:50 FloBineDATA kernel: R10: 0000000000000005 R11: 000000080089e028 R12: 0000000000000000
Feb  6 01:45:50 FloBineDATA kernel: R13: 000000043c07bfc8 R14: 000000043c07bef8 R15: 000014c11da2c800
Feb  6 01:45:50 FloBineDATA kernel: FS:  000014c119fb2b38(0000) GS:ffff888ffe980000(0000) knlGS:0000000000000000
Feb  6 01:45:50 FloBineDATA kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Feb  6 01:45:50 FloBineDATA kernel: CR2: 000000043c07c000 CR3: 0000000620d8e000 CR4: 0000000000340ee0
Feb  6 02:02:21 FloBineDATA kernel: mdcmd (582): set md_write_method 1
Feb  6 02:02:21 FloBineDATA kernel: 
Feb  6 02:15:22 FloBineDATA kernel: mdcmd (583): spindown 1
Feb  6 02:15:53 FloBineDATA kernel: mdcmd (584): spindown 2
Feb  6 02:17:21 FloBineDATA kernel: mdcmd (585): set md_write_method 0
Feb  6 02:17:21 FloBineDATA kernel: 
Feb  6 03:56:16 FloBineDATA kernel: mdcmd (586): spindown 2
Feb  6 04:55:10 FloBineDATA kernel: mdcmd (587): spindown 1
Feb  6 05:00:53 FloBineDATA root: /etc/libvirt: 1.9 GiB (2041622528 bytes) trimmed on /dev/loop3
Feb  6 05:00:53 FloBineDATA root: /var/lib/docker: 17.4 GiB (18627493888 bytes) trimmed on /dev/loop2
Feb  6 05:00:53 FloBineDATA root: /mnt/disks/UD_SSD480_1: 181.2 GiB (194575491072 bytes) trimmed on /dev/sde1
Feb  6 05:00:53 FloBineDATA root: /mnt/cache: 470.3 GiB (504972673024 bytes) trimmed on /dev/sdg1
Feb  6 05:02:21 FloBineDATA kernel: mdcmd (588): set md_write_method 1
Feb  6 05:02:21 FloBineDATA kernel: 
Feb  6 05:17:57 FloBineDATA kernel: mdcmd (589): spindown 1
Feb  6 05:18:27 FloBineDATA kernel: mdcmd (590): spindown 0
Feb  6 05:18:29 FloBineDATA kernel: mdcmd (591): spindown 2
Feb  6 05:22:21 FloBineDATA kernel: mdcmd (592): set md_write_method 0
Feb  6 05:22:21 FloBineDATA kernel: 
Feb  6 06:33:07 FloBineDATA kernel: ata5.00: exception Emask 0x0 SAct 0x7fffff8 SErr 0x0 action 0x6 frozen
Feb  6 06:33:07 FloBineDATA kernel: ata5.00: failed command: READ FPDMA QUEUED
Feb  6 06:33:07 FloBineDATA kernel: ata5.00: cmd 60/10:18:48:16:c3/00:00:0a:00:00/40 tag 3 ncq dma 8192 in
Feb  6 06:33:07 FloBineDATA kernel:         res 40/00:ff:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
Feb  6 06:33:07 FloBineDATA kernel: ata5.00: status: { DRDY }
Feb  6 06:33:07 FloBineDATA kernel: ata5.00: failed command: READ FPDMA QUEUED
Feb  6 06:33:07 FloBineDATA kernel: ata5.00: cmd 60/08:20:60:16:c3/00:00:0a:00:00/40 tag 4 ncq dma 4096 in
Feb  6 06:33:07 FloBineDATA kernel:         res 40/00:01:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
Feb  6 06:33:07 FloBineDATA kernel: ata5.00: status: { DRDY }
Feb  6 06:33:07 FloBineDATA kernel: ata5.00: failed command: READ FPDMA QUEUED
Feb  6 06:33:07 FloBineDATA kernel: ata5.00: cmd 60/08:28:c0:16:c3/00:00:0a:00:00/40 tag 5 ncq dma 4096 in
Feb  6 06:33:07 FloBineDATA kernel:         res 40/00:01:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
Feb  6 06:33:07 FloBineDATA kernel: ata5.00: status: { DRDY }


(...) obiges wiederholt sich noch einige male (...)


Feb  6 06:33:07 FloBineDATA kernel: ata5.00: failed command: WRITE FPDMA QUEUED
Feb  6 06:33:07 FloBineDATA kernel: ata5.00: cmd 61/08:d0:b8:3e:71/00:00:0c:00:00/40 tag 26 ncq dma 4096 out
Feb  6 06:33:07 FloBineDATA kernel:         res 40/00:ff:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
Feb  6 06:33:07 FloBineDATA kernel: ata5.00: status: { DRDY }
Feb  6 06:33:07 FloBineDATA kernel: ata5: hard resetting link
Feb  6 06:33:07 FloBineDATA kernel: ata5: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
Feb  6 06:33:07 FloBineDATA kernel: ata5.00: configured for UDMA/133
Feb  6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#3 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=0x08
Feb  6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#3 Sense Key : 0x5 [current] 
Feb  6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#3 ASC=0x21 ASCQ=0x4 
Feb  6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#3 CDB: opcode=0x28 28 00 0a c3 16 48 00 00 10 00
Feb  6 06:33:07 FloBineDATA kernel: print_req_error: I/O error, dev sde, sector 180557384
Feb  6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#4 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=0x08
Feb  6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#4 Sense Key : 0x5 [current] 
Feb  6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#4 ASC=0x21 ASCQ=0x4 
Feb  6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#4 CDB: opcode=0x28 28 00 0a c3 16 60 00 00 08 00
Feb  6 06:33:07 FloBineDATA kernel: print_req_error: I/O error, dev sde, sector 180557408
Feb  6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#5 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=0x08
Feb  6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#5 Sense Key : 0x5 [current] 
Feb  6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#5 ASC=0x21 ASCQ=0x4 
Feb  6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#5 CDB: opcode=0x28 28 00 0a c3 16 c0 00 00 08 00
Feb  6 06:33:07 FloBineDATA kernel: print_req_error: I/O error, dev sde, sector 180557504
Feb  6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#6 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=0x08
Feb  6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#6 Sense Key : 0x5 [current] 
Feb  6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#6 ASC=0x21 ASCQ=0x4 
Feb  6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#6 CDB: opcode=0x28 28 00 02 fa 9c 20 00 00 10 00
Feb  6 06:33:07 FloBineDATA kernel: print_req_error: I/O error, dev sde, sector 49978400
Feb  6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#7 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=0x08
Feb  6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#7 Sense Key : 0x5 [current] 
Feb  6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#7 ASC=0x21 ASCQ=0x4 
Feb  6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#7 CDB: opcode=0x28 28 00 03 0b e7 78 00 00 08 00
Feb  6 06:33:07 FloBineDATA kernel: print_req_error: I/O error, dev sde, sector 51111800
Feb  6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#8 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=0x08
Feb  6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#8 Sense Key : 0x5 [current] 
Feb  6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#8 ASC=0x21 ASCQ=0x4 
Feb  6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#8 CDB: opcode=0x28 28 00 0a c3 16 a0 00 00 08 00
Feb  6 06:33:07 FloBineDATA kernel: print_req_error: I/O error, dev sde, sector 180557472
Feb  6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#9 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=0x08
Feb  6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#9 Sense Key : 0x5 [current] 
Feb  6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#9 ASC=0x21 ASCQ=0x4 
Feb  6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#9 CDB: opcode=0x28 28 00 0a c3 16 d0 00 00 08 00
Feb  6 06:33:07 FloBineDATA kernel: print_req_error: I/O error, dev sde, sector 180557520
Feb  6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#11 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=0x08
Feb  6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#11 Sense Key : 0x5 [current] 
Feb  6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#11 ASC=0x21 ASCQ=0x4 
Feb  6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#11 CDB: opcode=0x28 28 00 0a 25 f6 a8 00 00 20 00
Feb  6 06:33:07 FloBineDATA kernel: print_req_error: I/O error, dev sde, sector 170260136
Feb  6 06:33:07 FloBineDATA kernel: ata5: EH complete
Feb  6 07:12:21 FloBineDATA kernel: mdcmd (593): set md_write_method 1
Feb  6 07:12:21 FloBineDATA kernel: 
Feb  6 07:17:34 FloBineDATA kernel: mdcmd (594): spindown 2
Feb  6 07:22:21 FloBineDATA kernel: mdcmd (595): set md_write_method 0
Feb  6 07:22:21 FloBineDATA kernel: 
Feb  6 07:27:21 FloBineDATA kernel: mdcmd (596): set md_write_method 1
Feb  6 07:27:21 FloBineDATA kernel: 


==> AB HIER NEUSTART <==


Feb  6 11:58:50 FloBineDATA cache_dirs: Arguments=-u -l off -a -noleaf -name .Recycle.Bin -prune -o -name log -prune -o -name temp -prune -o -print
Feb  6 11:58:50 FloBineDATA cache_dirs: Max Scan Secs=10, Min Scan Secs=1
Feb  6 11:58:50 FloBineDATA cache_dirs: Scan Type=adaptive
Feb  6 11:58:50 FloBineDATA cache_dirs: Min Scan Depth=4
Feb  6 11:58:50 FloBineDATA cache_dirs: Max Scan Depth=none
Feb  6 11:58:50 FloBineDATA cache_dirs: Use Command='find -noleaf -name .Recycle.Bin -prune -o -name log -prune -o -name temp -prune -o -print'
Feb  6 11:58:50 FloBineDATA cache_dirs: ---------- Caching Directories ---------------

 

/dev/sde ist eine über Unassigned devices eingebundene SSD auf der meine VMs laufen. Der Platte geht es gut...(?)

 

#	Attribute Name	Flag	Value	Worst	Threshold	Type	Updated	Failed	Raw Value
5	Reallocated sector count	0x0032	100	100	000	Old age	Always	Never	0
9	Power on hours	0x0032	100	100	000	Old age	Always	Never	9903 (1y, 1m, 15d, 15h)
12	Power cycle count	0x0032	100	100	000	Old age	Always	Never	40
165	Total write/erase count	0x0032	100	100	000	Old age	Always	Never	3089
166	Min W/E cycle	0x0032	100	100	---	Old age	Always	Never	14
167	Min bad block/die	0x0032	100	100	---	Old age	Always	Never	0
168	Maximum erase cycle	0x0032	100	100	---	Old age	Always	Never	55
169	Total bad block	0x0032	100	100	---	Old age	Always	Never	347
170	Unknown attribute	0x0032	100	100	---	Old age	Always	Never	0
171	Program fail count	0x0032	100	100	000	Old age	Always	Never	0
172	Erase fail count	0x0032	100	100	000	Old age	Always	Never	0
173	Avg write/erase count	0x0032	100	100	000	Old age	Always	Never	14
174	Unexpect power loss count	0x0032	100	100	000	Old age	Always	Never	26
184	End-to-end error	0x0032	100	100	---	Old age	Always	Never	0
187	Reported uncorrect	0x0032	100	100	000	Old age	Always	Never	0
188	Command timeout	0x0032	100	100	---	Old age	Always	Never	0
194	Temperature celsius	0x0022	070	060	000	Old age	Always	Never	30 (min/max 8/60)
199	SATA CRC error	0x0032	100	100	---	Old age	Always	Never	0
230	Perc write/erase count	0x0032	100	100	000	Old age	Always	Never	2130 592 2130
232	Perc avail resrvd space	0x0033	100	100	005	Pre-fail	Always	Never	100
233	Total NAND writes gib	0x0032	100	100	---	Old age	Always	Never	7063
234	Perc write/erase count BC	0x0032	100	100	000	Old age	Always	Never	34747
241	Total writes gib	0x0030	100	100	000	Old age	Offline	Never	11866
242	Total reads gib	0x0030	100	100	000	Old age	Offline	Never	9728
244	Thermal throttle	0x0032	000	100	---	Old age	Always	Never	0

 

 

Link to comment
1 hour ago, vakilando said:

mein Unraid Server ist nachts angestürzt.

 

Sicher? Wie hast du das geprüft?

 

Vielleicht waren nur bestimmte Dienste weg. Ein Test direkt am Server bringt erst Klarheit.

 

1 hour ago, vakilando said:

Feb 6 06:33:07 FloBineDATA kernel: print_req_error: I/O error, dev sde, sector 180557384

 

Das mal recherchiert?

Link to comment
8 minutes ago, mgutt said:

Sicher? Wie hast du das geprüft?

Nicht per Web, SSH oder Ping erreichbar. Monitor an Server in so einem Fall nicht nutzbar, da meine Alltagsmaschine eine VM mit durchgereichter Grafikkarte ist und ich nur den Bootprozess sehe bis zum Start der VM.

 

15 minutes ago, mgutt said:

print_req_error resultiert aus defekten Laufwerken oder fehlerhaften Verbindungen zum Laufwerk

ja, die SSD muss ich noch mal checken... wäre aber kein Grund für einen Freeze oder Crash?

Link to comment
5 minutes ago, vakilando said:

wäre aber kein Grund für einen Freeze oder Crash?

 

Da auf der SSD Docker und VM zusammen laufen (die ja teilweise direkt auf die Hardware zugreifen) und wir noch nicht wissen was die defekte SSD auslöst (evtl ja auch Board kaputt), kann ich dazu keine wirkliche Aussage treffen. Du weißt ja auch nicht ob der Server wirklich komplett tot war. SSH, Web oder Ping kann ja auch einfach nur tote Netzwerk-Verbindung heißen.

 

 

Link to comment

Auf der SSD (/dev/sde) ist nur der default VM storage path: /mnt/disks/UD_SSD480_1/vm-domains-ud/

Die Libvirt- und Docker-Images liegen im Cache und appdata ebenso.

24 minutes ago, mgutt said:

Du weißt ja auch nicht ob der Server wirklich komplett tot war. SSH, Web oder Ping kann ja auch einfach nur tote Netzwerk-Verbindung heißen.

richtig.

Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.