• Consistent Kernel Errors and System Crashes on Unraid 6.12.8 with ZFS Operations


    user2579
    • Urgent

    Hello everyone,

    I've been experiencing consistent kernel errors and system crashes on my Unraid setup (version 6.12.8) that seem to revolve around ZFS operations and memory handling issues. Below, I've detailed the symptoms, hardware specifics, and logs for reference.

    System Information:

    Unraid Version: 6.12.8

    Hardware: ASUSTeK COMPUTER INC. System Product Name/Pro WS W680-ACE IPMI, BIOS version 3101 dated 12/08/2023.

    Storage: ZFS file system in use

    Symptoms:

    The system encounters kernel NULL pointer dereferences and page faults leading to crashes.

    Errors often mention ZFS-related operations.

    Mutex lock failures are frequently observed in the log right before a crash.

    Error Logs: I've condensed the logs to highlight key errors below:

    BUG: kernel NULL pointer dereference, address: 0000000000000020

    BUG: unable to handle page fault for address: 000000000001cb80

    Involvement of ZFS modules like rrw_exit, zfs_getattr_fast, buf_hash_remove, and arc_change_state.constprop.0

    Repeated failures around mutex_lock

    Attempts to Resolve:

    - 80+ hour memtest, changed out memory completely, reseated CPU, boot in safe mode, etc

     

    I am seeking advice on further troubleshooting steps and any known fixes for these issues. Has anyone experienced similar problems or have insights into potential causes and solutions? Any help or guidance would be greatly appreciated.

    Thank you in advance!

    nas846-diagnostics-20240316-1226.zip




    User Feedback

    Recommended Comments

    Possibly a problem with one of the existing zfs filesystems, but since you have multiple, it may not be easy to find out which, I would probably start with the pool.

    Link to comment


    Join the conversation

    You can post now and register later. If you have an account, sign in now to post with your account.
    Note: Your post will require moderator approval before it will be visible.

    Guest
    Add a comment...

    ×   Pasted as rich text.   Restore formatting

      Only 75 emoji are allowed.

    ×   Your link has been automatically embedded.   Display as a link instead

    ×   Your previous content has been restored.   Clear editor

    ×   You cannot paste images directly. Upload or insert images from URL.


  • Status Definitions

     

    Open = Under consideration.

     

    Solved = The issue has been resolved.

     

    Solved version = The issue has been resolved in the indicated release version.

     

    Closed = Feedback or opinion better posted on our forum for discussion. Also for reports we cannot reproduce or need more information. In this case just add a comment and we will review it again.

     

    Retest = Please retest in latest release.


    Priority Definitions

     

    Minor = Something not working correctly.

     

    Urgent = Server crash, data loss, or other showstopper.

     

    Annoyance = Doesn't affect functionality but should be fixed.

     

    Other = Announcement or other non-issue.