vakilando Posted February 6, 2021

This morning my home automation was unreachable and I had to get up to switch on the coffee machine, which was already a bad start to the day... Even worse: my Unraid server crashed during the night. I don't know why yet; after a hard reset it is running again and claims to be fine (as far as I can tell for now). Below are excerpts from the syslog from yesterday up to the restart:

(...) Feb 5 09:20:12 FloBineDATA kernel: rcu: INFO: rcu_bh self-detected stall on CPU Feb 5 09:20:12 FloBineDATA kernel: rcu: #0112-....: (5578973 ticks this GP) idle=95a/1/0x4000000000000002 softirq=665866432/665917549 fqs=1243552 Feb 5 09:20:12 FloBineDATA kernel: rcu: #011 (t=5280094 jiffies g=74553 q=140) Feb 5 09:20:12 FloBineDATA kernel: NMI backtrace for cpu 2 Feb 5 09:20:12 FloBineDATA kernel: CPU: 2 PID: 22418 Comm: shfs Tainted: P W O 4.19.107-Unraid #1 Feb 5 09:20:12 FloBineDATA kernel: Hardware name: System manufacturer System Product Name/ROG STRIX X570-E GAMING, BIOS 1409 05/12/2020 Feb 5 09:20:12 FloBineDATA kernel: Call Trace: Feb 5 09:20:12 FloBineDATA kernel: <IRQ> Feb 5 09:20:12 FloBineDATA kernel: dump_stack+0x67/0x83 Feb 5 09:20:12 FloBineDATA kernel: nmi_cpu_backtrace+0x71/0x83 Feb 5 09:20:12 FloBineDATA kernel: ? lapic_can_unplug_cpu+0x97/0x97 Feb 5 09:20:12 FloBineDATA kernel: nmi_trigger_cpumask_backtrace+0x57/0xd4 Feb 5 09:20:12 FloBineDATA kernel: rcu_dump_cpu_stacks+0x8b/0xb4 Feb 5 09:20:12 FloBineDATA kernel: rcu_check_callbacks+0x296/0x5a0 Feb 5 09:20:12 FloBineDATA kernel: update_process_times+0x24/0x47 Feb 5 09:20:12 FloBineDATA kernel: tick_sched_timer+0x36/0x64 Feb 5 09:20:12 FloBineDATA kernel: __hrtimer_run_queues+0xb7/0x10b Feb 5 09:20:12 FloBineDATA kernel: ?
tick_sched_handle.isra.0+0x2f/0x2f Feb 5 09:20:12 FloBineDATA kernel: hrtimer_interrupt+0xf4/0x20e Feb 5 09:20:12 FloBineDATA kernel: smp_apic_timer_interrupt+0x7b/0x93 Feb 5 09:20:12 FloBineDATA kernel: apic_timer_interrupt+0xf/0x20 Feb 5 09:20:12 FloBineDATA kernel: </IRQ> Feb 5 09:20:12 FloBineDATA kernel: RIP: 0010:radix_tree_load_root+0x27/0x38 Feb 5 09:20:12 FloBineDATA kernel: Code: 47 04 c3 48 8b 47 08 48 89 c1 48 89 06 83 e1 03 48 ff c9 75 1c 48 83 e0 fe be 40 00 00 00 0f b6 08 48 d3 e6 48 ff ce 48 89 32 <0f> b6 00 83 c0 06 c3 48 c7 02 00 00 00 00 31 c0 c3 48 89 f0 83 e0 Feb 5 09:20:12 FloBineDATA kernel: RSP: 0018:ffffc9000c8a3cf0 EFLAGS: 00000216 ORIG_RAX: ffffffffffffff13 Feb 5 09:20:12 FloBineDATA kernel: RAX: ffff888146aff478 RBX: 0000000000000771 RCX: 0000000000000006 Feb 5 09:20:12 FloBineDATA kernel: RDX: ffffc9000c8a3d00 RSI: 0000000000000fff RDI: ffff888121b7eba8 Feb 5 09:20:12 FloBineDATA kernel: RBP: ffff888121b7ebb0 R08: ffff888121b7ebb0 R09: ffffc9000c8a3d28 Feb 5 09:20:12 FloBineDATA kernel: R10: 0000000000000000 R11: ffff888121b7eba8 R12: 0000000000000000 Feb 5 09:20:12 FloBineDATA kernel: R13: ffff888121b7eba0 R14: ffffc9000c8a3e88 R15: 0000000000000771 Feb 5 09:20:12 FloBineDATA kernel: __radix_tree_lookup+0x39/0xa2 Feb 5 09:20:12 FloBineDATA kernel: radix_tree_lookup_slot+0x1e/0x41 Feb 5 09:20:12 FloBineDATA kernel: find_get_entry+0x14/0x8f Feb 5 09:20:12 FloBineDATA kernel: pagecache_get_page+0x20/0x1bd Feb 5 09:20:12 FloBineDATA kernel: generic_file_read_iter+0x1b8/0x6d0 Feb 5 09:20:12 FloBineDATA kernel: xfs_file_buffered_aio_read+0x4b/0x66 [xfs] Feb 5 09:20:12 FloBineDATA kernel: xfs_file_read_iter+0x6f/0xb6 [xfs] Feb 5 09:20:12 FloBineDATA kernel: __vfs_read+0xf9/0x132 Feb 5 09:20:12 FloBineDATA kernel: vfs_read+0xa4/0x124 Feb 5 09:20:12 FloBineDATA kernel: ksys_pread64+0x5d/0x79 Feb 5 09:20:12 FloBineDATA kernel: do_syscall_64+0x57/0xf2 Feb 5 09:20:12 FloBineDATA kernel: entry_SYSCALL_64_after_hwframe+0x44/0xa9 Feb 5 09:20:12 
FloBineDATA kernel: RIP: 0033:0x14fa07816e27 Feb 5 09:20:12 FloBineDATA kernel: Code: 08 89 3c 24 48 89 4c 24 18 e8 b5 f3 ff ff 4c 8b 54 24 18 48 8b 54 24 10 41 89 c0 48 8b 74 24 08 8b 3c 24 b8 11 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 2d 44 89 c7 48 89 04 24 e8 e5 f3 ff ff 48 8b Feb 5 09:20:12 FloBineDATA kernel: RSP: 002b:000014f97f7faa50 EFLAGS: 00000297 ORIG_RAX: 0000000000000011 Feb 5 09:20:12 FloBineDATA kernel: RAX: ffffffffffffffda RBX: 0000000000020000 RCX: 000014fa07816e27 Feb 5 09:20:12 FloBineDATA kernel: RDX: 0000000000020000 RSI: 000014f9c84a1000 RDI: 00000000000000fe Feb 5 09:20:12 FloBineDATA kernel: RBP: 0000000000000000 R08: 0000000000000001 R09: 0000000000000000 Feb 5 09:20:12 FloBineDATA kernel: R10: 000000000076c000 R11: 0000000000000297 R12: 000014f97f7fac18 Feb 5 09:20:12 FloBineDATA kernel: R13: 0000000000000000 R14: 000014f9c80dca78 R15: 0000000000000000 (...) the above repeats several more times (...) Feb 5 09:26:12 FloBineDATA kernel: rcu: INFO: rcu_bh self-detected stall on CPU Feb 5 09:26:12 FloBineDATA kernel: rcu: #0112-....: (5938980 ticks this GP) idle=95a/1/0x4000000000000002 softirq=665866432/665917549 fqs=1328274 Feb 5 09:26:12 FloBineDATA kernel: rcu: #011 (t=5640101 jiffies g=74553 q=144) Feb 5 09:26:12 FloBineDATA kernel: NMI backtrace for cpu 2 Feb 5 09:26:12 FloBineDATA kernel: CPU: 2 PID: 22418 Comm: shfs Tainted: P W O 4.19.107-Unraid #1 Feb 5 09:26:12 FloBineDATA kernel: Hardware name: System manufacturer System Product Name/ROG STRIX X570-E GAMING, BIOS 1409 05/12/2020 Feb 5 09:26:12 FloBineDATA kernel: Call Trace: Feb 5 09:26:12 FloBineDATA kernel: <IRQ> Feb 5 09:26:12 FloBineDATA kernel: dump_stack+0x67/0x83 Feb 5 09:26:12 FloBineDATA kernel: nmi_cpu_backtrace+0x71/0x83 Feb 5 09:26:12 FloBineDATA kernel: ?
lapic_can_unplug_cpu+0x97/0x97 Feb 5 09:26:12 FloBineDATA kernel: nmi_trigger_cpumask_backtrace+0x57/0xd4 Feb 5 09:26:12 FloBineDATA kernel: rcu_dump_cpu_stacks+0x8b/0xb4 Feb 5 09:26:12 FloBineDATA kernel: rcu_check_callbacks+0x296/0x5a0 Feb 5 09:26:12 FloBineDATA kernel: update_process_times+0x24/0x47 Feb 5 09:26:12 FloBineDATA kernel: tick_sched_timer+0x36/0x64 Feb 5 09:26:12 FloBineDATA kernel: __hrtimer_run_queues+0xb7/0x10b Feb 5 09:26:12 FloBineDATA kernel: ? tick_sched_handle.isra.0+0x2f/0x2f Feb 5 09:26:12 FloBineDATA kernel: hrtimer_interrupt+0xf4/0x20e Feb 5 09:26:12 FloBineDATA kernel: smp_apic_timer_interrupt+0x7b/0x93 Feb 5 09:26:12 FloBineDATA kernel: apic_timer_interrupt+0xf/0x20 Feb 5 09:26:12 FloBineDATA kernel: </IRQ> Feb 5 09:26:12 FloBineDATA kernel: RIP: 0010:radix_tree_lookup_slot+0x3c/0x41 Feb 5 09:26:12 FloBineDATA kernel: Code: 24 08 31 c0 48 89 e1 e8 40 ff ff ff 48 85 c0 74 04 48 8b 04 24 48 8b 54 24 08 65 48 33 14 25 28 00 00 00 74 05 e8 6d c3 9d ff <48> 83 c4 10 c3 31 c9 31 d2 e9 14 ff ff ff 41 54 55 48 89 fd 53 48 Feb 5 09:26:12 FloBineDATA kernel: RSP: 0018:ffffc9000c8a3d28 EFLAGS: 00000246 ORIG_RAX: ffffffffffffff13 Feb 5 09:26:12 FloBineDATA kernel: RAX: ffff888103efb3e0 RBX: ffff888121b7eba8 RCX: 0000000000000000 Feb 5 09:26:12 FloBineDATA kernel: RDX: 0000000000000000 RSI: ffffc9000c8a3cf8 RDI: ffff888103efb230 Feb 5 09:26:12 FloBineDATA kernel: RBP: 0000000000000771 R08: ffff888103efb3e0 R09: ffffc9000c8a3d28 Feb 5 09:26:12 FloBineDATA kernel: R10: 0000000000000000 R11: ffff888121b7eba8 R12: 0000000000000000 Feb 5 09:26:12 FloBineDATA kernel: R13: ffff888121b7eba0 R14: ffffc9000c8a3e88 R15: 0000000000000771 Feb 5 09:26:12 FloBineDATA kernel: ? 
radix_tree_lookup_slot+0x1e/0x41 Feb 5 09:26:12 FloBineDATA kernel: find_get_entry+0x14/0x8f Feb 5 09:26:12 FloBineDATA kernel: pagecache_get_page+0x20/0x1bd Feb 5 09:26:12 FloBineDATA kernel: generic_file_read_iter+0x1b8/0x6d0 Feb 5 09:26:12 FloBineDATA kernel: xfs_file_buffered_aio_read+0x4b/0x66 [xfs] Feb 5 09:26:12 FloBineDATA kernel: xfs_file_read_iter+0x6f/0xb6 [xfs] Feb 5 09:26:12 FloBineDATA kernel: __vfs_read+0xf9/0x132 Feb 5 09:26:12 FloBineDATA kernel: vfs_read+0xa4/0x124 Feb 5 09:26:12 FloBineDATA kernel: ksys_pread64+0x5d/0x79 Feb 5 09:26:12 FloBineDATA kernel: do_syscall_64+0x57/0xf2 Feb 5 09:26:12 FloBineDATA kernel: entry_SYSCALL_64_after_hwframe+0x44/0xa9 Feb 5 09:26:12 FloBineDATA kernel: RIP: 0033:0x14fa07816e27 Feb 5 09:26:12 FloBineDATA kernel: Code: 08 89 3c 24 48 89 4c 24 18 e8 b5 f3 ff ff 4c 8b 54 24 18 48 8b 54 24 10 41 89 c0 48 8b 74 24 08 8b 3c 24 b8 11 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 2d 44 89 c7 48 89 04 24 e8 e5 f3 ff ff 48 8b Feb 5 09:26:12 FloBineDATA kernel: RSP: 002b:000014f97f7faa50 EFLAGS: 00000297 ORIG_RAX: 0000000000000011 Feb 5 09:26:12 FloBineDATA kernel: RAX: ffffffffffffffda RBX: 0000000000020000 RCX: 000014fa07816e27 Feb 5 09:26:12 FloBineDATA kernel: RDX: 0000000000020000 RSI: 000014f9c84a1000 RDI: 00000000000000fe Feb 5 09:26:12 FloBineDATA kernel: RBP: 0000000000000000 R08: 0000000000000001 R09: 0000000000000000 Feb 5 09:26:12 FloBineDATA kernel: R10: 000000000076c000 R11: 0000000000000297 R12: 000014f97f7fac18 Feb 5 09:26:12 FloBineDATA kernel: R13: 0000000000000000 R14: 000014f9c80dca78 R15: 0000000000000000 Feb 5 09:27:20 FloBineDATA kernel: clocksource: timekeeping watchdog on CPU2: Marking clocksource 'tsc' as unstable because the skew is too large: Feb 5 09:27:20 FloBineDATA kernel: clocksource: 'hpet' wd_now: 4a26dbb4 wd_last: 90c392aa mask: ffffffff Feb 5 09:27:20 FloBineDATA kernel: clocksource: 'tsc' cs_now: 12046d35dc8c14 cs_last: 11f0044b0832fe mask: ffffffffffffffff Feb 5 09:27:20 FloBineDATA kernel: 
tsc: Marking TSC unstable due to clocksource watchdog Feb 5 09:27:20 FloBineDATA kernel: TSC found unstable after boot, most likely due to broken BIOS. Use 'tsc=unstable'. Feb 5 09:27:20 FloBineDATA kernel: sched_clock: Marking unstable (1337029577016586, 5746791293)<-(1337035581197360, -234276209) Feb 5 09:27:20 FloBineDATA kernel: clocksource: Switched to clocksource hpet Feb 5 09:27:20 FloBineDATA crond[2333]: time disparity of 95 minutes detected Feb 5 09:32:20 FloBineDATA kernel: mdcmd (569): set md_write_method 1 Feb 5 09:32:20 FloBineDATA kernel: Feb 5 09:51:41 FloBineDATA kernel: mdcmd (570): spindown 2 Feb 5 09:52:20 FloBineDATA kernel: mdcmd (571): set md_write_method 0 Feb 5 09:52:20 FloBineDATA kernel: Feb 5 09:53:03 FloBineDATA kernel: mdcmd (572): spindown 1 Feb 5 11:29:51 FloBineDATA kernel: igb 0000:05:00.0 eth0: mixed HW and IP checksum settings. Feb 5 11:29:51 FloBineDATA kernel: eth0: renamed from veth03ee9fb Feb 5 12:29:51 FloBineDATA kernel: mdcmd (573): spindown 2 Feb 5 12:31:08 FloBineDATA kernel: veth03ee9fb: renamed from eth0 Feb 5 12:31:08 FloBineDATA kernel: igb 0000:05:00.0 eth0: mixed HW and IP checksum settings. 
Feb 5 14:24:18 FloBineDATA kernel: BUG: Bad page map in process khugepaged pte:720724d045783067 pmd:103edb067 Feb 5 14:24:18 FloBineDATA kernel: addr:0000000032fdfc44 vm_flags:00100073 anon_vma:00000000433df8ae mapping: (null) index:43c07c Feb 5 14:24:18 FloBineDATA kernel: file: (null) fault: (null) mmap: (null) readpage: (null) Feb 5 14:24:18 FloBineDATA kernel: CPU: 3 PID: 432 Comm: khugepaged Tainted: P W O 4.19.107-Unraid #1 Feb 5 14:24:18 FloBineDATA kernel: Hardware name: System manufacturer System Product Name/ROG STRIX X570-E GAMING, BIOS 1409 05/12/2020 Feb 5 14:24:18 FloBineDATA kernel: Call Trace: Feb 5 14:24:18 FloBineDATA kernel: dump_stack+0x67/0x83 Feb 5 14:24:18 FloBineDATA kernel: print_bad_pte+0x222/0x23f Feb 5 14:24:18 FloBineDATA kernel: _vm_normal_page+0x5b/0xc6 Feb 5 14:24:18 FloBineDATA kernel: khugepaged+0x89a/0x1829 Feb 5 14:24:18 FloBineDATA kernel: ? wait_woken+0x6a/0x6a Feb 5 14:24:18 FloBineDATA kernel: ? collapse_shmem+0xacd/0xacd Feb 5 14:24:18 FloBineDATA kernel: kthread+0x10c/0x114 Feb 5 14:24:18 FloBineDATA kernel: ? 
kthread_park+0x89/0x89 Feb 5 14:24:18 FloBineDATA kernel: ret_from_fork+0x22/0x40 Feb 5 15:57:20 FloBineDATA kernel: mdcmd (574): set md_write_method 1 Feb 5 15:57:20 FloBineDATA kernel: Feb 5 16:07:06 FloBineDATA kernel: mdcmd (575): spindown 2 Feb 5 16:07:20 FloBineDATA kernel: mdcmd (576): set md_write_method 0 Feb 5 16:07:20 FloBineDATA kernel: Feb 5 16:07:35 FloBineDATA kernel: mdcmd (577): spindown 1 Feb 5 22:34:28 FloBineDATA kernel: mdcmd (578): spindown 1 Feb 5 23:22:05 FloBineDATA kernel: usb 1-4: USB disconnect, device number 4 Feb 5 23:22:06 FloBineDATA kernel: usb 1-4: new low-speed USB device number 7 using xhci_hcd Feb 5 23:22:06 FloBineDATA kernel: input: Logitech USB Receiver as /devices/pci0000:00/0000:00:01.2/0000:01:00.0/0000:02:08.0/0000:06:00.1/usb1/1-4/1-4:1.0/0003:046D:C517.0017/input/input43 Feb 5 23:22:06 FloBineDATA kernel: logitech 0003:046D:C517.0017: input,hidraw1: USB HID v1.10 Keyboard [Logitech USB Receiver] on usb-0000:06:00.1-4/input0 Feb 5 23:22:06 FloBineDATA kernel: logitech 0003:046D:C517.0018: fixing up Logitech keyboard report descriptor Feb 5 23:22:06 FloBineDATA kernel: input: Logitech USB Receiver as /devices/pci0000:00/0000:00:01.2/0000:01:00.0/0000:02:08.0/0000:06:00.1/usb1/1-4/1-4:1.1/0003:046D:C517.0018/input/input44 Feb 5 23:22:06 FloBineDATA kernel: logitech 0003:046D:C517.0018: input,hiddev97,hidraw2: USB HID v1.10 Mouse [Logitech USB Receiver] on usb-0000:06:00.1-4/input1 Feb 5 23:22:09 FloBineDATA kernel: udevd: 3 output lines suppressed due to ratelimiting Feb 6 00:00:02 FloBineDATA Plugin Auto Update: Checking for available plugin updates Feb 6 00:00:09 FloBineDATA Plugin Auto Update: Update available for unassigned.devices-plus.plg (Not set to Auto Update) Feb 6 00:00:09 FloBineDATA Plugin Auto Update: vmbackup.plg version 2021.02.03 does not meet age requirements to update Feb 6 00:00:09 FloBineDATA Plugin Auto Update: Community Applications Plugin Auto Update finished Feb 6 00:15:21 FloBineDATA kernel: mdcmd 
(579): spindown 2 Feb 6 01:17:07 FloBineDATA kernel: mdcmd (580): spindown 0 Feb 6 01:17:08 FloBineDATA kernel: mdcmd (581): spindown 1 Feb 6 01:45:50 FloBineDATA kernel: ioapp-compute-8: Corrupted page table at address 43c07c000 Feb 6 01:45:50 FloBineDATA kernel: PGD 12b32c067 P4D 12b32c067 PUD fab332067 PMD 103edb067 PTE 720724d045783067 Feb 6 01:45:50 FloBineDATA kernel: Bad pagetable: 000f [#1] SMP NOPTI Feb 6 01:45:50 FloBineDATA kernel: CPU: 14 PID: 25314 Comm: ioapp-compute-8 Tainted: P B W O 4.19.107-Unraid #1 Feb 6 01:45:50 FloBineDATA kernel: Hardware name: System manufacturer System Product Name/ROG STRIX X570-E GAMING, BIOS 1409 05/12/2020 Feb 6 01:45:50 FloBineDATA kernel: RIP: 0033:0x14c156dff341 Feb 6 01:45:50 FloBineDATA kernel: Code: 4c 8b d0 49 83 c2 18 4d 3b 97 28 01 00 00 0f 83 51 02 00 00 4d 89 97 18 01 00 00 41 0f 18 82 00 01 00 00 4d 8b 91 b8 00 00 00 <4c> 89 10 c7 40 08 d0 b5 12 00 c7 40 0c 00 00 00 00 48 c7 40 10 00 Feb 6 01:45:50 FloBineDATA kernel: RSP: 002b:000014c119fb17e0 EFLAGS: 00010287 Feb 6 01:45:50 FloBineDATA kernel: RAX: 000000043c07c000 RBX: 000000043c07bff0 RCX: 0000000000000000 Feb 6 01:45:50 FloBineDATA kernel: RDX: 000000043c07bf08 RSI: 00000008008d4738 RDI: 000014c149dcc3df Feb 6 01:45:50 FloBineDATA kernel: RBP: 000000008780f7e3 R08: 000000043c07bff0 R09: 000000080012b5d0 Feb 6 01:45:50 FloBineDATA kernel: R10: 0000000000000005 R11: 000000080089e028 R12: 0000000000000000 Feb 6 01:45:50 FloBineDATA kernel: R13: 000000043c07bfc8 R14: 000000043c07bef8 R15: 000014c11da2c800 Feb 6 01:45:50 FloBineDATA kernel: FS: 000014c119fb2b38 GS: 0000000000000000 Feb 6 01:45:50 FloBineDATA kernel: Modules linked in: vhost_net tun vhost tap kvm_amd kvm ccp xt_CHECKSUM ipt_REJECT ip6table_mangle ip6table_nat nf_nat_ipv6 iptable_mangle ip6table_filter ip6_tables veth macvlan xt_nat ipt_MASQUERADE iptable_filter iptable_nat nf_nat_ipv4 nf_nat ip_tables xfs nfsd lockd grace sunrpc md_mod nct6775 hwmon_vid k10temp igb(O) r8125(O) nvidia_drm(PO) 
nvidia_modeset(PO) nvidia(PO) mxm_wmi wmi_bmof drm_kms_helper drm btusb btrtl btbcm btintel edac_mce_amd mpt3sas crc32_pclmul pcbc bluetooth syscopyarea sysfillrect sysimgblt fb_sys_fops aesni_intel aes_x86_64 glue_helper crypto_simd ghash_clmulni_intel cryptd raid_class scsi_transport_sas agpgart i2c_piix4 i2c_core cdc_acm ahci ecdh_generic libahci crct10dif_pclmul crc32c_intel button wmi pcc_cpufreq acpi_cpufreq [last unloaded: tun] Feb 6 01:45:50 FloBineDATA kernel: ---[ end trace 4a61123f3ba74a74 ]--- Feb 6 01:45:50 FloBineDATA kernel: RIP: 0033:0x14c156dff341 Feb 6 01:45:50 FloBineDATA kernel: Code: 4c 8b d0 49 83 c2 18 4d 3b 97 28 01 00 00 0f 83 51 02 00 00 4d 89 97 18 01 00 00 41 0f 18 82 00 01 00 00 4d 8b 91 b8 00 00 00 <4c> 89 10 c7 40 08 d0 b5 12 00 c7 40 0c 00 00 00 00 48 c7 40 10 00 Feb 6 01:45:50 FloBineDATA kernel: RSP: 002b:000014c119fb17e0 EFLAGS: 00010287 Feb 6 01:45:50 FloBineDATA kernel: RAX: 000000043c07c000 RBX: 000000043c07bff0 RCX: 0000000000000000 Feb 6 01:45:50 FloBineDATA kernel: RDX: 000000043c07bf08 RSI: 00000008008d4738 RDI: 000014c149dcc3df Feb 6 01:45:50 FloBineDATA kernel: RBP: 000000008780f7e3 R08: 000000043c07bff0 R09: 000000080012b5d0 Feb 6 01:45:50 FloBineDATA kernel: R10: 0000000000000005 R11: 000000080089e028 R12: 0000000000000000 Feb 6 01:45:50 FloBineDATA kernel: R13: 000000043c07bfc8 R14: 000000043c07bef8 R15: 000014c11da2c800 Feb 6 01:45:50 FloBineDATA kernel: FS: 000014c119fb2b38(0000) GS:ffff888ffe980000(0000) knlGS:0000000000000000 Feb 6 01:45:50 FloBineDATA kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 Feb 6 01:45:50 FloBineDATA kernel: CR2: 000000043c07c000 CR3: 0000000620d8e000 CR4: 0000000000340ee0 Feb 6 02:02:21 FloBineDATA kernel: mdcmd (582): set md_write_method 1 Feb 6 02:02:21 FloBineDATA kernel: Feb 6 02:15:22 FloBineDATA kernel: mdcmd (583): spindown 1 Feb 6 02:15:53 FloBineDATA kernel: mdcmd (584): spindown 2 Feb 6 02:17:21 FloBineDATA kernel: mdcmd (585): set md_write_method 0 Feb 6 02:17:21 
FloBineDATA kernel: Feb 6 03:56:16 FloBineDATA kernel: mdcmd (586): spindown 2 Feb 6 04:55:10 FloBineDATA kernel: mdcmd (587): spindown 1 Feb 6 05:00:53 FloBineDATA root: /etc/libvirt: 1.9 GiB (2041622528 bytes) trimmed on /dev/loop3 Feb 6 05:00:53 FloBineDATA root: /var/lib/docker: 17.4 GiB (18627493888 bytes) trimmed on /dev/loop2 Feb 6 05:00:53 FloBineDATA root: /mnt/disks/UD_SSD480_1: 181.2 GiB (194575491072 bytes) trimmed on /dev/sde1 Feb 6 05:00:53 FloBineDATA root: /mnt/cache: 470.3 GiB (504972673024 bytes) trimmed on /dev/sdg1 Feb 6 05:02:21 FloBineDATA kernel: mdcmd (588): set md_write_method 1 Feb 6 05:02:21 FloBineDATA kernel: Feb 6 05:17:57 FloBineDATA kernel: mdcmd (589): spindown 1 Feb 6 05:18:27 FloBineDATA kernel: mdcmd (590): spindown 0 Feb 6 05:18:29 FloBineDATA kernel: mdcmd (591): spindown 2 Feb 6 05:22:21 FloBineDATA kernel: mdcmd (592): set md_write_method 0 Feb 6 05:22:21 FloBineDATA kernel: Feb 6 06:33:07 FloBineDATA kernel: ata5.00: exception Emask 0x0 SAct 0x7fffff8 SErr 0x0 action 0x6 frozen Feb 6 06:33:07 FloBineDATA kernel: ata5.00: failed command: READ FPDMA QUEUED Feb 6 06:33:07 FloBineDATA kernel: ata5.00: cmd 60/10:18:48:16:c3/00:00:0a:00:00/40 tag 3 ncq dma 8192 in Feb 6 06:33:07 FloBineDATA kernel: res 40/00:ff:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout) Feb 6 06:33:07 FloBineDATA kernel: ata5.00: status: { DRDY } Feb 6 06:33:07 FloBineDATA kernel: ata5.00: failed command: READ FPDMA QUEUED Feb 6 06:33:07 FloBineDATA kernel: ata5.00: cmd 60/08:20:60:16:c3/00:00:0a:00:00/40 tag 4 ncq dma 4096 in Feb 6 06:33:07 FloBineDATA kernel: res 40/00:01:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout) Feb 6 06:33:07 FloBineDATA kernel: ata5.00: status: { DRDY } Feb 6 06:33:07 FloBineDATA kernel: ata5.00: failed command: READ FPDMA QUEUED Feb 6 06:33:07 FloBineDATA kernel: ata5.00: cmd 60/08:28:c0:16:c3/00:00:0a:00:00/40 tag 5 ncq dma 4096 in Feb 6 06:33:07 FloBineDATA kernel: res 40/00:01:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout) Feb 6 
06:33:07 FloBineDATA kernel: ata5.00: status: { DRDY } (...) the above repeats several more times (...) Feb 6 06:33:07 FloBineDATA kernel: ata5.00: failed command: WRITE FPDMA QUEUED Feb 6 06:33:07 FloBineDATA kernel: ata5.00: cmd 61/08:d0:b8:3e:71/00:00:0c:00:00/40 tag 26 ncq dma 4096 out Feb 6 06:33:07 FloBineDATA kernel: res 40/00:ff:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout) Feb 6 06:33:07 FloBineDATA kernel: ata5.00: status: { DRDY } Feb 6 06:33:07 FloBineDATA kernel: ata5: hard resetting link Feb 6 06:33:07 FloBineDATA kernel: ata5: SATA link up 6.0 Gbps (SStatus 133 SControl 300) Feb 6 06:33:07 FloBineDATA kernel: ata5.00: configured for UDMA/133 Feb 6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#3 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=0x08 Feb 6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#3 Sense Key : 0x5 [current] Feb 6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#3 ASC=0x21 ASCQ=0x4 Feb 6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#3 CDB: opcode=0x28 28 00 0a c3 16 48 00 00 10 00 Feb 6 06:33:07 FloBineDATA kernel: print_req_error: I/O error, dev sde, sector 180557384 Feb 6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#4 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=0x08 Feb 6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#4 Sense Key : 0x5 [current] Feb 6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#4 ASC=0x21 ASCQ=0x4 Feb 6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#4 CDB: opcode=0x28 28 00 0a c3 16 60 00 00 08 00 Feb 6 06:33:07 FloBineDATA kernel: print_req_error: I/O error, dev sde, sector 180557408 Feb 6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#5 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=0x08 Feb 6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#5 Sense Key : 0x5 [current] Feb 6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#5 ASC=0x21 ASCQ=0x4 Feb 6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#5 CDB: opcode=0x28 28 00 0a c3 16 c0 00 00 08 00 Feb 6
06:33:07 FloBineDATA kernel: print_req_error: I/O error, dev sde, sector 180557504 Feb 6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#6 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=0x08 Feb 6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#6 Sense Key : 0x5 [current] Feb 6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#6 ASC=0x21 ASCQ=0x4 Feb 6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#6 CDB: opcode=0x28 28 00 02 fa 9c 20 00 00 10 00 Feb 6 06:33:07 FloBineDATA kernel: print_req_error: I/O error, dev sde, sector 49978400 Feb 6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#7 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=0x08 Feb 6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#7 Sense Key : 0x5 [current] Feb 6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#7 ASC=0x21 ASCQ=0x4 Feb 6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#7 CDB: opcode=0x28 28 00 03 0b e7 78 00 00 08 00 Feb 6 06:33:07 FloBineDATA kernel: print_req_error: I/O error, dev sde, sector 51111800 Feb 6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#8 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=0x08 Feb 6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#8 Sense Key : 0x5 [current] Feb 6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#8 ASC=0x21 ASCQ=0x4 Feb 6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#8 CDB: opcode=0x28 28 00 0a c3 16 a0 00 00 08 00 Feb 6 06:33:07 FloBineDATA kernel: print_req_error: I/O error, dev sde, sector 180557472 Feb 6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#9 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=0x08 Feb 6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#9 Sense Key : 0x5 [current] Feb 6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#9 ASC=0x21 ASCQ=0x4 Feb 6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#9 CDB: opcode=0x28 28 00 0a c3 16 d0 00 00 08 00 Feb 6 06:33:07 FloBineDATA kernel: print_req_error: I/O error, dev sde, sector 180557520 Feb 6 06:33:07 FloBineDATA kernel: sd 
5:0:0:0: [sde] tag#11 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=0x08 Feb 6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#11 Sense Key : 0x5 [current] Feb 6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#11 ASC=0x21 ASCQ=0x4 Feb 6 06:33:07 FloBineDATA kernel: sd 5:0:0:0: [sde] tag#11 CDB: opcode=0x28 28 00 0a 25 f6 a8 00 00 20 00 Feb 6 06:33:07 FloBineDATA kernel: print_req_error: I/O error, dev sde, sector 170260136 Feb 6 06:33:07 FloBineDATA kernel: ata5: EH complete Feb 6 07:12:21 FloBineDATA kernel: mdcmd (593): set md_write_method 1 Feb 6 07:12:21 FloBineDATA kernel: Feb 6 07:17:34 FloBineDATA kernel: mdcmd (594): spindown 2 Feb 6 07:22:21 FloBineDATA kernel: mdcmd (595): set md_write_method 0 Feb 6 07:22:21 FloBineDATA kernel: Feb 6 07:27:21 FloBineDATA kernel: mdcmd (596): set md_write_method 1 Feb 6 07:27:21 FloBineDATA kernel: ==> RESTART FROM HERE <== Feb 6 11:58:50 FloBineDATA cache_dirs: Arguments=-u -l off -a -noleaf -name .Recycle.Bin -prune -o -name log -prune -o -name temp -prune -o -print Feb 6 11:58:50 FloBineDATA cache_dirs: Max Scan Secs=10, Min Scan Secs=1 Feb 6 11:58:50 FloBineDATA cache_dirs: Scan Type=adaptive Feb 6 11:58:50 FloBineDATA cache_dirs: Min Scan Depth=4 Feb 6 11:58:50 FloBineDATA cache_dirs: Max Scan Depth=none Feb 6 11:58:50 FloBineDATA cache_dirs: Use Command='find -noleaf -name .Recycle.Bin -prune -o -name log -prune -o -name temp -prune -o -print' Feb 6 11:58:50 FloBineDATA cache_dirs: ---------- Caching Directories ---------------

/dev/sde is an SSD mounted via Unassigned Devices, and my VMs run on it. The drive seems to be fine...(?)
#    Attribute Name             Flag    Value  Worst  Threshold  Type      Updated  Failed  Raw Value
5    Reallocated sector count   0x0032  100    100    000        Old age   Always   Never   0
9    Power on hours             0x0032  100    100    000        Old age   Always   Never   9903 (1y, 1m, 15d, 15h)
12   Power cycle count          0x0032  100    100    000        Old age   Always   Never   40
165  Total write/erase count    0x0032  100    100    000        Old age   Always   Never   3089
166  Min W/E cycle              0x0032  100    100    ---        Old age   Always   Never   14
167  Min bad block/die          0x0032  100    100    ---        Old age   Always   Never   0
168  Maximum erase cycle        0x0032  100    100    ---        Old age   Always   Never   55
169  Total bad block            0x0032  100    100    ---        Old age   Always   Never   347
170  Unknown attribute          0x0032  100    100    ---        Old age   Always   Never   0
171  Program fail count         0x0032  100    100    000        Old age   Always   Never   0
172  Erase fail count           0x0032  100    100    000        Old age   Always   Never   0
173  Avg write/erase count      0x0032  100    100    000        Old age   Always   Never   14
174  Unexpect power loss count  0x0032  100    100    000        Old age   Always   Never   26
184  End-to-end error           0x0032  100    100    ---        Old age   Always   Never   0
187  Reported uncorrect         0x0032  100    100    000        Old age   Always   Never   0
188  Command timeout            0x0032  100    100    ---        Old age   Always   Never   0
194  Temperature celsius        0x0022  070    060    000        Old age   Always   Never   30 (min/max 8/60)
199  SATA CRC error             0x0032  100    100    ---        Old age   Always   Never   0
230  Perc write/erase count     0x0032  100    100    000        Old age   Always   Never   2130 592 2130
232  Perc avail resrvd space    0x0033  100    100    005        Pre-fail  Always   Never   100
233  Total NAND writes gib      0x0032  100    100    ---        Old age   Always   Never   7063
234  Perc write/erase count BC  0x0032  100    100    000        Old age   Always   Never   34747
241  Total writes gib           0x0030  100    100    000        Old age   Offline  Never   11866
242  Total reads gib            0x0030  100    100    000        Old age   Offline  Never   9728
244  Thermal throttle           0x0032  000    100    ---        Old age   Always   Never   0
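One way to read the "is the drive fine?" question against the table is to look only at the raw error counters. The sketch below does that for four attributes; which attributes to watch is an assumption made for illustration, not something stated in the thread, and the raw values are copied from the table in the post.

```python
# Raw values copied from the SMART table in the post.
# The choice of these four attributes is an assumption for illustration.
smart_raw = {
    5: ("Reallocated sector count", 0),
    187: ("Reported uncorrect", 0),
    188: ("Command timeout", 0),
    199: ("SATA CRC error", 0),
}


def nonzero_attributes(table):
    """Return (id, name, raw) for every watched attribute with a non-zero raw value."""
    return [
        (attr_id, name, raw)
        for attr_id, (name, raw) in sorted(table.items())
        if raw != 0
    ]


flagged = nonzero_attributes(smart_raw)
print(flagged if flagged else "all watched counters are zero")
```

All four counters are zero here, which fits the impression that the drive itself looks healthy. It does not rule out a cable, port, or controller problem: the syslog shows command timeouts on ata5 even though attribute 188 reports 0.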
mgutt Posted February 6, 2021

1 hour ago, vakilando said: my Unraid server crashed during the night.

Are you sure? How did you check that? Maybe only certain services were down. Only a test directly at the server will give you certainty.

1 hour ago, vakilando said: Feb 6 06:33:07 FloBineDATA kernel: print_req_error: I/O error, dev sde, sector 180557384

Have you looked into that one?
mgutt Posted February 6, 2021

My research turned up the following: print_req_error results from defective drives or from faulty connections to the drive.
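The sector numbers in the print_req_error lines can be cross-checked against the CDB bytes logged just before them. Opcode 0x28 is the SCSI READ(10) command: bytes 2 to 5 hold the logical block address and bytes 7 to 8 the transfer length, both big-endian. The function name below is made up for this sketch, and the 512-byte sector size is an assumption (it is consistent with the "ncq dma 8192 in" entry for tag 3).

```python
from typing import Tuple


def decode_read10(cdb_hex: str) -> Tuple[int, int]:
    """Decode a READ(10) CDB given as hex bytes into (start LBA, block count)."""
    cdb = bytes.fromhex(cdb_hex)
    if len(cdb) != 10 or cdb[0] != 0x28:
        raise ValueError("not a 10-byte READ(10) CDB")
    lba = int.from_bytes(cdb[2:6], "big")      # bytes 2-5: logical block address
    blocks = int.from_bytes(cdb[7:9], "big")   # bytes 7-8: transfer length in blocks
    return lba, blocks


# CDB of tag#3 from the syslog excerpt
lba, blocks = decode_read10("28 00 0a c3 16 48 00 00 10 00")
print(lba, blocks)     # 180557384 16
print(blocks * 512)    # 8192 bytes, assuming 512-byte sectors
```

The decoded address matches the logged "sector 180557384", so the kernel is reporting the start of the request that was aborted when the link was reset. That alone does not say whether that sector is physically bad.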
vakilando Posted February 6, 2021 Author

8 minutes ago, mgutt said: Are you sure? How did you check that?

It was not reachable via web, SSH, or ping. A monitor on the server is no help in this situation, because my everyday machine is a VM with a passed-through graphics card, so I only see the boot process up to the point where the VM starts.

15 minutes ago, mgutt said: print_req_error results from defective drives or from faulty connections to the drive

Yes, I still need to check the SSD again... but that wouldn't be a reason for a freeze or crash, would it?
mgutt Posted February 6, 2021

5 minutes ago, vakilando said: but that wouldn't be a reason for a freeze or crash, would it?

Since Docker and the VMs run together on the SSD (and some of them access the hardware directly), and we don't yet know what is causing the SSD errors (the board could be faulty too), I can't really say. You also don't know whether the server was really completely dead. No SSH, web, or ping can also simply mean a dead network connection.
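To make the "not reachable via web or SSH" check repeatable from another machine, a small TCP probe can be used. This is only a sketch: the function name, the example address, and the ports are assumptions, and a failed probe still cannot tell a crashed server apart from a dead network path. Only a look at the local console can do that.

```python
import socket


def tcp_port_open(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable networks alike.
        return False


# Example call (the address and ports are placeholders, not taken from the thread):
#   for name, port in (("SSH", 22), ("web UI", 80)):
#       print(name, "open" if tcp_port_open("192.0.2.10", port) else "unreachable")
```

If both probes fail while another device on the same switch still answers, the problem is more likely on the server side than in the network as a whole.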
vakilando Posted February 6, 2021 Author

The SSD (/dev/sde) only holds the default VM storage path: /mnt/disks/UD_SSD480_1/vm-domains-ud/ The libvirt and Docker images are on the cache, and so is appdata.

24 minutes ago, mgutt said: You also don't know whether the server was really completely dead. No SSH, web, or ping can also simply mean a dead network connection.

Correct.