Jump to content
  • unresponsive and syslog shows fixing recursive fault but reboot is needed


    Syrkel2
    • Retest Minor

    server was unresponsive and the syslog shows fixing recursive fault but reboot is needed.  What else is needed to pinpoint this problem? this did not happen in RC4; only since upgrading to RC5

    hpemicroserver-diagnostics-20220428-0232.zip




    User Feedback

    Recommended Comments

    10 minutes ago, limetech said:

    I don't see anything like "fixing recursive fault" in the system log.

    I have system logging turned on so i dont lose any log files after a reboot... Here is a snippet of it:

     

    Apr 27 18:22:58 HPEMicroserver kernel: BUG: unable to handle page fault for address: fffff8ef92d41008
    Apr 27 18:22:58 HPEMicroserver kernel: #PF: supervisor read access in kernel mode
    Apr 27 18:22:58 HPEMicroserver kernel: #PF: error_code(0x0000) - not-present page
    Apr 27 18:22:58 HPEMicroserver kernel: PGD 0 P4D 0 
    Apr 27 18:22:58 HPEMicroserver kernel: Oops: 0000 [#1] SMP PTI
    Apr 27 18:22:58 HPEMicroserver kernel: CPU: 3 PID: 10094 Comm: sleep Tainted: P        W  O      5.15.35-Unraid #1
    Apr 27 18:22:58 HPEMicroserver kernel: Hardware name: HPE ProLiant MicroServer Gen10 Plus/ProLiant MicroServer Gen10 Plus, BIOS U48 01/20/2022
    Apr 27 18:22:58 HPEMicroserver kernel: RIP: 0010:_compound_head+0x0/0x11
    Apr 27 18:22:58 HPEMicroserver kernel: Code: b8 98 0f 00 00 00 00 f0 7f 48 85 c2 0f 95 c0 0f b6 c0 c3 48 c1 ef 3a 83 ff 1f 0f 94 c0 48 83 ff 1e 0f 94 c2 09 d0 0f b6 c0 c3 <48> 8b 57 08 48 89 f8 f6 c2 01 74 04 48 8d 42 ff c3 e8 ea ff ff ff
    Apr 27 18:22:58 HPEMicroserver kernel: RSP: 0018:ffffc9000259fcc0 EFLAGS: 00010202
    Apr 27 18:22:58 HPEMicroserver kernel: RAX: 0000000000000001 RBX: 000014f70081c000 RCX: 0000000000000113
    Apr 27 18:22:58 HPEMicroserver kernel: RDX: 7c00000000000000 RSI: ffffea0006731180 RDI: fffff8ef92d41000
    Apr 27 18:22:58 HPEMicroserver kernel: RBP: ffffc9000259fe00 R08: ffffea0006731180 R09: ffff8881a87e5000
    Apr 27 18:22:58 HPEMicroserver kernel: R10: ffff88885eda8248 R11: 0000000000000297 R12: fffff8ef92d41000
    Apr 27 18:22:58 HPEMicroserver kernel: R13: 7c00003bbe4b5040 R14: ffff8883695f7e98 R15: ffff88836a78d600
    Apr 27 18:22:58 HPEMicroserver kernel: FS:  0000000000000000(0000) GS:ffff88885ed80000(0000) knlGS:0000000000000000
    Apr 27 18:22:58 HPEMicroserver kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    Apr 27 18:22:58 HPEMicroserver kernel: CR2: fffff8ef92d41008 CR3: 000000036e2f2003 CR4: 00000000003706e0
    Apr 27 18:22:58 HPEMicroserver kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
    Apr 27 18:22:58 HPEMicroserver kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
    Apr 27 18:22:58 HPEMicroserver kernel: Call Trace:
    Apr 27 18:22:58 HPEMicroserver kernel: <TASK>
    Apr 27 18:22:58 HPEMicroserver kernel: pfn_swap_entry_to_page+0x26/0x35
    Apr 27 18:22:58 HPEMicroserver kernel: unmap_page_range+0x4c3/0x711
    Apr 27 18:22:58 HPEMicroserver kernel: unmap_vmas+0x6f/0x9d
    Apr 27 18:22:58 HPEMicroserver kernel: exit_mmap+0xd6/0x145
    Apr 27 18:22:58 HPEMicroserver kernel: __mmput+0x43/0xd7
    Apr 27 18:22:58 HPEMicroserver kernel: do_exit+0x385/0x915
    Apr 27 18:22:58 HPEMicroserver kernel: do_group_exit+0x93/0x93
    Apr 27 18:22:58 HPEMicroserver kernel: __x64_sys_exit_group+0x14/0x14
    Apr 27 18:22:58 HPEMicroserver kernel: do_syscall_64+0x83/0xa5
    Apr 27 18:22:58 HPEMicroserver kernel: entry_SYSCALL_64_after_hwframe+0x44/0xae
    Apr 27 18:22:58 HPEMicroserver kernel: RIP: 0033:0x14f70080ff41
    Apr 27 18:22:58 HPEMicroserver kernel: Code: Unable to access opcode bytes at RIP 0x14f70080ff17.
    Apr 27 18:22:58 HPEMicroserver kernel: RSP: 002b:00007ffceb0d2c58 EFLAGS: 00000246 ORIG_RAX: 00000000000000e7
    Apr 27 18:22:58 HPEMicroserver kernel: RAX: ffffffffffffffda RBX: 000014f700906470 RCX: 000014f70080ff41
    Apr 27 18:22:58 HPEMicroserver kernel: RDX: 000000000000003c RSI: 00000000000000e7 RDI: 0000000000000000
    Apr 27 18:22:58 HPEMicroserver kernel: RBP: 0000000000000000 R08: ffffffffffffff88 R09: 0000000000000001
    Apr 27 18:22:58 HPEMicroserver kernel: R10: 0000000000000000 R11: 0000000000000246 R12: 000014f700906470
    Apr 27 18:22:58 HPEMicroserver kernel: R13: 0000000000000002 R14: 000014f700906948 R15: 0000000000000000
    Apr 27 18:22:58 HPEMicroserver kernel: </TASK>
    Apr 27 18:22:58 HPEMicroserver kernel: Modules linked in: nvidia_modeset(PO) nvidia_uvm(PO) xt_connmark xt_comment iptable_raw wireguard curve25519_x86_64 libcurve25519_generic libchacha20poly1305 chacha_x86_64 poly1305_x86_64 ip6_udp_tunnel udp_tunnel libblake2s blake2s_x86_64 libblake2s_generic libchacha xt_mark xt_nat veth xt_CHECKSUM ipt_REJECT nf_reject_ipv4 xt_tcpudp ip6table_mangle iptable_mangle vhost_net tun vhost vhost_iotlb tap macvlan xt_conntrack nf_conntrack_netlink nfnetlink xt_addrtype iptable_nat br_netfilter xfs xt_MASQUERADE ip6table_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 nfsd auth_rpcgss oid_registry lockd grace sunrpc md_mod nvidia(PO) drm backlight ipmi_devintf efivarfs ip6table_filter ip6_tables iptable_filter ip_tables x_tables bonding igb i2c_algo_bit x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm crct10dif_pclmul crc32_pclmul ipmi_ssif crc32c_intel ghash_clmulni_intel aesni_intel crypto_simd cryptd rapl intel_cstate intel_uncore ahci acpi_ipmi libahci
    Apr 27 18:22:58 HPEMicroserver kernel: i2c_core wmi intel_pch_thermal ipmi_si button acpi_power_meter acpi_tad [last unloaded: i2c_algo_bit]
    Apr 27 18:22:58 HPEMicroserver kernel: CR2: fffff8ef92d41008
    Apr 27 18:22:58 HPEMicroserver kernel: ---[ end trace 4647401355f3fe32 ]---
    Apr 27 18:22:58 HPEMicroserver kernel: RIP: 0010:_compound_head+0x0/0x11
    Apr 27 18:22:58 HPEMicroserver kernel: Code: b8 98 0f 00 00 00 00 f0 7f 48 85 c2 0f 95 c0 0f b6 c0 c3 48 c1 ef 3a 83 ff 1f 0f 94 c0 48 83 ff 1e 0f 94 c2 09 d0 0f b6 c0 c3 <48> 8b 57 08 48 89 f8 f6 c2 01 74 04 48 8d 42 ff c3 e8 ea ff ff ff
    Apr 27 18:22:58 HPEMicroserver kernel: RSP: 0018:ffffc9000259fcc0 EFLAGS: 00010202
    Apr 27 18:22:58 HPEMicroserver kernel: RAX: 0000000000000001 RBX: 000014f70081c000 RCX: 0000000000000113
    Apr 27 18:22:58 HPEMicroserver kernel: RDX: 7c00000000000000 RSI: ffffea0006731180 RDI: fffff8ef92d41000
    Apr 27 18:22:58 HPEMicroserver kernel: RBP: ffffc9000259fe00 R08: ffffea0006731180 R09: ffff8881a87e5000
    Apr 27 18:22:58 HPEMicroserver kernel: R10: ffff88885eda8248 R11: 0000000000000297 R12: fffff8ef92d41000
    Apr 27 18:22:58 HPEMicroserver kernel: R13: 7c00003bbe4b5040 R14: ffff8883695f7e98 R15: ffff88836a78d600
    Apr 27 18:22:58 HPEMicroserver kernel: FS:  0000000000000000(0000) GS:ffff88885ed80000(0000) knlGS:0000000000000000
    Apr 27 18:22:58 HPEMicroserver kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    Apr 27 18:22:58 HPEMicroserver kernel: CR2: fffff8ef92d41008 CR3: 000000036e2f2003 CR4: 00000000003706e0
    Apr 27 18:22:58 HPEMicroserver kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
    Apr 27 18:22:58 HPEMicroserver kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
    Apr 27 18:22:58 HPEMicroserver kernel: Fixing recursive fault but reboot is needed!

    Link to comment
    13 hours ago, bonienl said:

    Check for the latest BIOS firmware of your system and see if that makes a difference.

     

    It's currently the latest bio release.  I haven't had it happen again yet but I will update if I do.

    Link to comment


    Join the conversation

    You can post now and register later. If you have an account, sign in now to post with your account.
    Note: Your post will require moderator approval before it will be visible.

    Guest
    Add a comment...

    ×   Pasted as rich text.   Restore formatting

      Only 75 emoji are allowed.

    ×   Your link has been automatically embedded.   Display as a link instead

    ×   Your previous content has been restored.   Clear editor

    ×   You cannot paste images directly. Upload or insert images from URL.


  • Status Definitions

     

    Open = Under consideration.

     

    Solved = The issue has been resolved.

     

    Solved version = The issue has been resolved in the indicated release version.

     

    Closed = Feedback or opinion better posted on our forum for discussion. Also for reports we cannot reproduce or need more information. In this case just add a comment and we will review it again.

     

    Retest = Please retest in latest release.


    Priority Definitions

     

    Minor = Something not working correctly.

     

    Urgent = Server crash, data loss, or other showstopper.

     

    Annoyance = Doesn't affect functionality but should be fixed.

     

    Other = Announcement or other non-issue.

×
×
  • Create New...