Thomas K Posted June 8, 2022 Share Posted June 8, 2022 (edited) Hi, on my new HP Microserver Gen10 Plus UnRaid crashes sporadically about once a week. Happened with 6.9.2 and also the brand new 6.10.2. Diagnosis was run, but I don't see anything to diag there, as I was only able to run it after a cold reset. Thanks for any support, thomas tower-diagnostics-20220608-1452.zip Edited June 8, 2022 by Thomas K typo Quote Link to comment
JorgeB Posted June 8, 2022 Share Posted June 8, 2022 Enable the syslog server and post that after a crash, hopefully some clues there. 1 Quote Link to comment
Thomas K Posted June 24, 2022 Author Share Posted June 24, 2022 (edited) Thanks, finally was able to catch it via syslog. Hopefully someone can interpret it. Quote Jun 23 12:09:45 Tower kernel: BUG: unable to handle page fault for address: fffff8efd7617548 Jun 23 12:09:45 Tower kernel: #PF: supervisor read access in kernel mode Jun 23 12:09:45 Tower kernel: #PF: error_code(0x0000) - not-present page Jun 23 12:09:45 Tower kernel: PGD 0 P4D 0 Jun 23 12:09:45 Tower kernel: Oops: 0000 [#1] SMP PTI Jun 23 12:09:45 Tower kernel: CPU: 2 PID: 24465 Comm: docker Tainted: G W 5.15.43-Unraid #1 Jun 23 12:09:45 Tower kernel: Hardware name: HPE ProLiant MicroServer Gen10 Plus/ProLiant MicroServer Gen10 Plus, BIOS U48 09/16/2021 Jun 23 12:09:45 Tower kernel: RIP: 0010:_compound_head+0x0/0x11 Jun 23 12:09:45 Tower kernel: Code: b8 98 0f 00 00 00 00 f0 7f 48 85 c2 0f 95 c0 0f b6 c0 c3 48 c1 ef 3a 83 ff 1f 0f 94 c0 48 83 ff 1e 0f 94 c2 09 d0 0f b6 c0 c3 <48> 8b 57 08 48 89 f8 f6 c2 01 74 04 48 8d 42 ff c3 e8 ea ff ff ff Jun 23 12:09:45 Tower kernel: RSP: 0018:ffffc9000073fba0 EFLAGS: 00010202 Jun 23 12:09:45 Tower kernel: RAX: 0000000000000001 RBX: 0000000002184000 RCX: 000000000000011e Jun 23 12:09:45 Tower kernel: RDX: 7c00000000000000 RSI: ffffea0020bb7e40 RDI: fffff8efd7617540 Jun 23 12:09:45 Tower kernel: RBP: ffffc9000073fce0 R08: ffffea0020bb7e40 R09: ffff88810018a000 Jun 23 12:09:45 Tower kernel: R10: ffff88885ed29388 R11: 0000000000000297 R12: fffff8efd7617540 Jun 23 12:09:45 Tower kernel: R13: 7c00003bbf5d85d5 R14: ffff888144f45498 R15: ffff8881f6fa1540 Jun 23 12:09:45 Tower kernel: FS: 0000000000000000(0000) GS:ffff88885ed00000(0000) knlGS:0000000000000000 Jun 23 12:09:45 Tower kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 Jun 23 12:09:45 Tower kernel: CR2: fffff8efd7617548 CR3: 0000000140a6a005 CR4: 00000000003706e0 Jun 23 12:09:45 Tower kernel: Call Trace: Jun 23 12:09:45 Tower kernel: <TASK> Jun 23 12:09:45 Tower kernel: pfn_swap_entry_to_page+0x26/0x35 Jun 23 12:09:45 Tower kernel: unmap_page_range+0x4c3/0x711 Jun 23 12:09:45 Tower kernel: unmap_vmas+0x6f/0x9d Jun 23 12:09:45 Tower kernel: exit_mmap+0xd6/0x145 Jun 23 12:09:45 Tower kernel: __mmput+0x43/0xd7 Jun 23 12:09:45 Tower kernel: do_exit+0x385/0x915 Jun 23 12:09:45 Tower kernel: do_group_exit+0x93/0x93 Jun 23 12:09:45 Tower kernel: get_signal+0x5d6/0x5fc Jun 23 12:09:45 Tower kernel: ? hrtimer_try_to_cancel+0x28/0xb5 Jun 23 12:09:45 Tower kernel: arch_do_signal_or_restart+0x39/0x6d0 Jun 23 12:09:45 Tower kernel: ? hrtimer_nanosleep+0x75/0xdf Jun 23 12:09:45 Tower kernel: exit_to_user_mode_prepare+0x79/0x131 Jun 23 12:09:45 Tower kernel: syscall_exit_to_user_mode+0x18/0x23 Jun 23 12:09:45 Tower kernel: do_syscall_64+0x9f/0xa5 Jun 23 12:09:45 Tower kernel: entry_SYSCALL_64_after_hwframe+0x44/0xae Jun 23 12:09:45 Tower kernel: RIP: 0033:0x4d60fd Jun 23 12:09:45 Tower kernel: Code: Unable to access opcode bytes at RIP 0x4d60d3. Jun 23 12:09:45 Tower kernel: RSP: 002b:00001521fcf79c00 EFLAGS: 00000206 ORIG_RAX: 0000000000000023 Jun 23 12:09:45 Tower kernel: RAX: 0000000000000000 RBX: 000000c00005f000 RCX: 00000000004d60fd Jun 23 12:09:45 Tower kernel: RDX: 0000000000000000 RSI: 0000000000000000 RDI: 00001521fcf79c00 Jun 23 12:09:45 Tower kernel: RBP: 00001521fcf79c10 R08: 0000000000000000 R09: 0000000000000000 Jun 23 12:09:45 Tower kernel: R10: 0000000000000002 R11: 0000000000000206 R12: 000349a661af3ee3 Jun 23 12:09:45 Tower kernel: R13: 000000c000392a80 R14: 0000000000000000 R15: 0000000000000000 Jun 23 12:09:45 Tower kernel: </TASK> Jun 23 12:09:45 Tower kernel: Modules linked in: xt_CHECKSUM ipt_REJECT nf_reject_ipv4 ip6table_mangle ip6table_nat iptable_mangle vhost_net tun vhost vhost_iotlb tap veth macvlan xt_nat xt_tcpudp xt_conntrack nf_conntrack_netlink nfnetlink xt_addrtype br_netfilter xfs md_mod efivarfs iptable_nat xt_MASQUERADE nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 wireguard curve25519_x86_64 libcurve25519_generic libchacha20poly1305 chacha_x86_64 poly1305_x86_64 ip6_udp_tunnel udp_tunnel libblake2s blake2s_x86_64 libblake2s_generic libchacha ip6table_filter ip6_tables iptable_filter ip_tables x_tables bonding igb i2c_algo_bit x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm ipmi_ssif crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel aesni_intel crypto_simd cryptd rapl intel_cstate acpi_ipmi intel_uncore ahci i2c_core libahci ipmi_si acpi_tad wmi intel_pch_thermal button acpi_power_meter [last unloaded: i2c_algo_bit] Jun 23 12:09:45 Tower kernel: CR2: fffff8efd7617548 Jun 23 12:09:45 Tower kernel: ---[ end trace 9ba3d6387c07963a ]--- Jun 23 12:09:45 Tower kernel: RIP: 0010:_compound_head+0x0/0x11 Jun 23 12:09:45 Tower kernel: Code: b8 98 0f 00 00 00 00 f0 7f 48 85 c2 0f 95 c0 0f b6 c0 c3 48 c1 ef 3a 83 ff 1f 0f 94 c0 48 83 ff 1e 0f 94 c2 09 d0 0f b6 c0 c3 <48> 8b 57 08 48 89 f8 f6 c2 01 74 04 48 8d 42 ff c3 e8 ea ff ff ff Jun 23 12:09:45 Tower kernel: RSP: 0018:ffffc9000073fba0 EFLAGS: 00010202 Jun 23 12:09:45 Tower kernel: RAX: 0000000000000001 RBX: 0000000002184000 RCX: 000000000000011e Jun 23 12:09:45 Tower kernel: RDX: 7c00000000000000 RSI: ffffea0020bb7e40 RDI: fffff8efd7617540 Jun 23 12:09:45 Tower kernel: RBP: ffffc9000073fce0 R08: ffffea0020bb7e40 R09: ffff88810018a000 Jun 23 12:09:45 Tower kernel: R10: ffff88885ed29388 R11: 0000000000000297 R12: fffff8efd7617540 Jun 23 12:09:45 Tower kernel: R13: 7c00003bbf5d85d5 R14: ffff888144f45498 R15: ffff8881f6fa1540 Jun 23 12:09:45 Tower kernel: FS: 0000000000000000(0000) GS:ffff88885ed00000(0000) knlGS:0000000000000000 Jun 23 12:09:45 Tower kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 Jun 23 12:09:45 Tower kernel: CR2: fffff8efd7617548 CR3: 0000000140a6a005 CR4: 00000000003706e0 Jun 23 12:09:45 Tower kernel: Fixing recursive fault but reboot is needed! Edited June 24, 2022 by Thomas K Quote Link to comment
JorgeB Posted June 25, 2022 Share Posted June 25, 2022 Assuming there's nothing more before or after that in the syslog can't really say what that's about. Quote Link to comment
Thomas K Posted June 25, 2022 Author Share Posted June 25, 2022 (edited) Hm, two hours before the mover did run and afterwards the cold boot is logged. Lets see if maybe next time more info or the same is logged. Thanks in the meantime. Edited June 25, 2022 by Thomas K Add notes Quote Link to comment
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.