kernel panic - how to troubleshoot?


I've been experiencing occasional kernel panics. I haven't been able to find a pattern or cause for them, and I'm not sure what the logical next steps for troubleshooting are. Hoping to get some insights from the forum!

 

FYI, I'm on Unraid 6.10.0-rc3, but I had these on 6.9.x as well. I was never able to capture the error before (it took me a while to figure out how to mirror the syslog to flash).

 

 

Mar 16 11:18:23 Tower kernel: BUG: unable to handle page fault for address: fffff8efd4bf5708
Mar 16 11:18:23 Tower kernel: #PF: supervisor read access in kernel mode
Mar 16 11:18:23 Tower kernel: #PF: error_code(0x0000) - not-present page
Mar 16 11:18:23 Tower kernel: PGD 0 P4D 0 
Mar 16 11:18:23 Tower kernel: Oops: 0000 [#1] SMP PTI
Mar 16 11:18:23 Tower kernel: CPU: 1 PID: 5942 Comm: docker Tainted: G        W         5.15.27-Unraid #1
Mar 16 11:18:23 Tower kernel: Hardware name: HPE ProLiant MicroServer Gen10 Plus/ProLiant MicroServer Gen10 Plus, BIOS U48 10/21/2021
Mar 16 11:18:23 Tower kernel: RIP: 0010:_compound_head+0x0/0x11
Mar 16 11:18:23 Tower kernel: Code: b8 98 0f 00 00 00 00 f0 7f 48 85 c2 0f 95 c0 0f b6 c0 c3 48 c1 ef 3a 83 ff 1f 0f 94 c0 48 83 ff 1e 0f 94 c2 09 d0 0f b6 c0 c3 <48> 8b 57 08 48 89 f8 f6 c2 01 74 04 48 8d 42 ff c3 e8 ea ff ff ff
Mar 16 11:18:23 Tower kernel: RSP: 0018:ffffc900039d7ba0 EFLAGS: 00010202
Mar 16 11:18:23 Tower kernel: RAX: 0000000000000001 RBX: 0000000000a3c000 RCX: 000000000000014a
Mar 16 11:18:23 Tower kernel: RDX: 7c00000000000000 RSI: ffffea001f14d140 RDI: fffff8efd4bf5700
Mar 16 11:18:23 Tower kernel: RBP: ffffc900039d7ce0 R08: 0000000000000012 R09: ffff88810018a000
Mar 16 11:18:23 Tower kernel: R10: ffff88885ea68288 R11: 0000000000000297 R12: fffff8efd4bf5700
Mar 16 11:18:23 Tower kernel: R13: ffff88815a0546d8 R14: 7c00003bbf52fd5c R15: ffff8881afe85600
Mar 16 11:18:23 Tower kernel: FS:  0000000000000000(0000) GS:ffff88885ea40000(0000) knlGS:0000000000000000
Mar 16 11:18:23 Tower kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar 16 11:18:23 Tower kernel: CR2: fffff8efd4bf5708 CR3: 000000030b822005 CR4: 00000000003706e0
Mar 16 11:18:23 Tower kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Mar 16 11:18:23 Tower kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Mar 16 11:18:23 Tower kernel: Call Trace:
Mar 16 11:18:23 Tower kernel: <TASK>
Mar 16 11:18:23 Tower kernel: pfn_swap_entry_to_page+0x26/0x35
Mar 16 11:18:23 Tower kernel: unmap_page_range+0x4b0/0x6c7
Mar 16 11:18:23 Tower kernel: unmap_vmas+0x6f/0x9d
Mar 16 11:18:23 Tower kernel: exit_mmap+0xd6/0x145
Mar 16 11:18:23 Tower kernel: __mmput+0x43/0xd7
Mar 16 11:18:23 Tower kernel: do_exit+0x385/0x915
Mar 16 11:18:23 Tower kernel: do_group_exit+0x93/0x93
Mar 16 11:18:23 Tower kernel: get_signal+0x5d6/0x5fc
Mar 16 11:18:23 Tower kernel: arch_do_signal_or_restart+0x39/0x6d0
Mar 16 11:18:23 Tower kernel: ? enqueue_hrtimer+0x62/0x69
Mar 16 11:18:23 Tower kernel: ? __do_sys_futex+0x157/0x17c
Mar 16 11:18:23 Tower kernel: exit_to_user_mode_prepare+0x79/0x131
Mar 16 11:18:23 Tower kernel: syscall_exit_to_user_mode+0x18/0x23
Mar 16 11:18:23 Tower kernel: do_syscall_64+0x9f/0xa5
Mar 16 11:18:23 Tower kernel: entry_SYSCALL_64_after_hwframe+0x44/0xae
Mar 16 11:18:23 Tower kernel: RIP: 0033:0x4d58a3
Mar 16 11:18:23 Tower kernel: Code: Unable to access opcode bytes at RIP 0x4d5879.
Mar 16 11:18:23 Tower kernel: RSP: 002b:00007ffff08f1a78 EFLAGS: 00000286 ORIG_RAX: 00000000000000ca
Mar 16 11:18:23 Tower kernel: RAX: fffffffffffffe00 RBX: 0000000002c94180 RCX: 00000000004d58a3
Mar 16 11:18:23 Tower kernel: RDX: 0000000000000000 RSI: 0000000000000080 RDI: 0000000002c942d0
Mar 16 11:18:23 Tower kernel: RBP: 00007ffff08f1ac0 R08: 0000000000000000 R09: 0000000000000000
Mar 16 11:18:23 Tower kernel: R10: 0000000000000000 R11: 0000000000000286 R12: 0000000000000000
Mar 16 11:18:23 Tower kernel: R13: 0000000000000003 R14: 0000000000000000 R15: 0000000000000000
Mar 16 11:18:23 Tower kernel: </TASK>
Mar 16 11:18:23 Tower kernel: Modules linked in: veth xt_nat xt_tcpudp macvlan xt_conntrack nf_conntrack_netlink nfnetlink xt_addrtype br_netfilter xfs nfsd auth_rpcgss oid_registry lockd grace sunrpc md_mod iptable_nat xt_MASQUERADE nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 wireguard curve25519_x86_64 libcurve25519_generic libchacha20poly1305 chacha_x86_64 poly1305_x86_64 ip6_udp_tunnel udp_tunnel libblake2s blake2s_x86_64 libblake2s_generic libchacha ip6table_filter ip6_tables iptable_filter ip_tables x_tables bonding igb i2c_algo_bit ipmi_ssif x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel aesni_intel crypto_simd cryptd rapl input_leds intel_cstate intel_uncore acpi_ipmi led_class ipmi_si acpi_tad nvme i2c_core ahci nvme_core libahci wmi acpi_power_meter button intel_pch_thermal [last unloaded: i2c_algo_bit]
Mar 16 11:18:23 Tower kernel: CR2: fffff8efd4bf5708
Mar 16 11:18:23 Tower kernel: ---[ end trace 9b521ca04836a20f ]---
Mar 16 11:18:23 Tower kernel: RIP: 0010:_compound_head+0x0/0x11
Mar 16 11:18:23 Tower kernel: Code: b8 98 0f 00 00 00 00 f0 7f 48 85 c2 0f 95 c0 0f b6 c0 c3 48 c1 ef 3a 83 ff 1f 0f 94 c0 48 83 ff 1e 0f 94 c2 09 d0 0f b6 c0 c3 <48> 8b 57 08 48 89 f8 f6 c2 01 74 04 48 8d 42 ff c3 e8 ea ff ff ff
Mar 16 11:18:23 Tower kernel: RSP: 0018:ffffc900039d7ba0 EFLAGS: 00010202
Mar 16 11:18:23 Tower kernel: RAX: 0000000000000001 RBX: 0000000000a3c000 RCX: 000000000000014a
Mar 16 11:18:23 Tower kernel: RDX: 7c00000000000000 RSI: ffffea001f14d140 RDI: fffff8efd4bf5700
Mar 16 11:18:23 Tower kernel: RBP: ffffc900039d7ce0 R08: 0000000000000012 R09: ffff88810018a000
Mar 16 11:18:23 Tower kernel: R10: ffff88885ea68288 R11: 0000000000000297 R12: fffff8efd4bf5700
Mar 16 11:18:23 Tower kernel: R13: ffff88815a0546d8 R14: 7c00003bbf52fd5c R15: ffff8881afe85600
Mar 16 11:18:23 Tower kernel: FS:  0000000000000000(0000) GS:ffff88885ea40000(0000) knlGS:0000000000000000
Mar 16 11:18:23 Tower kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar 16 11:18:23 Tower kernel: CR2: fffff8efd4bf5708 CR3: 000000030b822005 CR4: 00000000003706e0
Mar 16 11:18:23 Tower kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Mar 16 11:18:23 Tower kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Mar 16 11:18:23 Tower kernel: Fixing recursive fault but reboot is needed!
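
In case it helps anyone skimming the trace, here is a rough Python sketch that pulls the headline fields (faulting address, process name, kernel version, and the kernel-mode RIP symbol) out of a paste like the one above. The helper name `summarize_oops` and the regular expressions are my own assumptions based on the line formats in this log. It is not an Unraid or kernel tool, and other kernels may format these lines differently.

```python
import re

# Hypothetical helper, written only against the line formats in the log above.
# Each pattern captures one headline field from a kernel oops.
PATTERNS = {
    # "BUG: unable to handle page fault for address: <hex>"
    "fault_address": re.compile(
        r"BUG: unable to handle page fault for address: ([0-9a-f]+)"
    ),
    # "Comm: <process name>"
    "process": re.compile(r"Comm: (\S+)"),
    # Kernel release printed on the same line as Comm, just before " #<build>"
    "kernel": re.compile(r"Comm: .*? (\d+\.\d+\.\d+\S*) #\d+"),
    # Kernel-mode instruction pointer (code segment 0010), e.g. symbol+off/len
    "rip": re.compile(r"RIP: 0010:(\S+)"),
}


def summarize_oops(log_text):
    """Return the first match for each field found in the log text."""
    summary = {}
    for line in log_text.splitlines():
        for key, pattern in PATTERNS.items():
            if key in summary:
                continue  # keep the first occurrence only
            match = pattern.search(line)
            if match:
                summary[key] = match.group(1)
    return summary
```

To try it, read the saved syslog into a string and pass it to `summarize_oops`. Having the faulting function and process name side by side makes it easier to compare against the next panic, if there is one.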


 

 

 

 

 
