• [6.10.0RC2] Hardware Error from Samsung U.2 NVME SSD


    AlfredHaas
    • Minor

    Since 6.10.0 RC2, my log file shows constant Hardware Error messages from my Samsung U.2 SSDs.

     

    Motherboard : Asus KRPA-U16 with onboard U.2 NVME Ports

    CPU : AMD Epyc 7352

    NVME SSD : 2 x Samsung PM9A3 U.2 SSD

     

    01:00.0 Non-Volatile memory controller [0108]: Samsung Electronics Co Ltd NVMe SSD Controller PM9A1/PM9A3/980PRO [144d:a80a]
    81:00.0 Non-Volatile memory controller [0108]: Samsung Electronics Co Ltd NVMe SSD Controller PM9A1/PM9A3/980PRO [144d:a80a]

     

    The problem occurs only with 6.10; there is no problem with Unraid 6.9.2.

     

    The disks are online as a BTRFS RAID 1 pool device, but idle CPU load is much higher than it normally was on 6.9.2 (this may be a different problem).

     

    Hardware Error.txt

    diagnostics.zip
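
    For anyone who wants to compare their own log against the attached one, here is a rough Python sketch that counts the hardware error lines by message. It assumes the kernel tags these reports with "[Hardware Error]:", and the default path /var/log/syslog is also an assumption; pass a different file as the first argument if yours lives elsewhere.

    ```python
    import re
    import sys
    from collections import Counter

    # Assumption: the kernel marks these reports with "[Hardware Error]:".
    TAG = re.compile(r"\[Hardware Error\]:\s*(.*)")


    def summarize(lines):
        """Count hardware error lines, grouped by the text after the tag."""
        counts = Counter()
        for line in lines:
            match = TAG.search(line)
            if match:
                counts[match.group(1).strip()] += 1
        return counts


    if __name__ == "__main__":
        # Default path is an assumption; override it on the command line.
        path = sys.argv[1] if len(sys.argv) > 1 else "/var/log/syslog"
        with open(path, errors="replace") as fh:
            for message, n in summarize(fh).most_common(10):
                print(f"{n:6d}  {message}")
    ```

    Running it once on 6.9.2 and once on 6.10.0 RC2 would show whether the messages are new or just more frequent.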




    User Feedback

    Recommended Comments

    Booting a clean install of 6.10.0 RC2 is no longer possible; the system boots only without the Samsung PM9A3 SSDs.

     

    The system is not usable with the Samsung PM9A3 U.2 SSDs on 6.10.0 RC2. I went back to 6.9.2 without problems, and the system now boots normally.


    Hi

     

    After a long process with a lot of mistakes, I figured out that something is wrong with the GHES (Generic Hardware Error Source) subsystem in the new Unraid kernel 5.14.

     

    Adding ghes.disable=1 to the kernel parameters temporarily solves the boot problem, but it cannot be the final solution.

     

    By the way, pci=noaer and pcie_aspm=off don't help.
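
    To double-check which of these parameters the running kernel actually booted with, something like the following Python sketch could be used. It reads /proc/cmdline and reports the three parameters named in this thread. The helper names are made up for illustration, and the parsing is deliberately simple: it does not handle quoted values, and if a parameter appears more than once only the last value is kept.

    ```python
    from pathlib import Path

    # Parameters mentioned in this thread; only ghes.disable=1 was reported to help.
    WATCHED = ("ghes.disable", "pci", "pcie_aspm")


    def parse_cmdline(cmdline: str) -> dict:
        """Split a kernel command line into {name: value}; bare flags map to ''."""
        params = {}
        for token in cmdline.split():
            name, _, value = token.partition("=")
            params[name] = value
        return params


    def watched_params(cmdline: str) -> dict:
        """Return the value of each watched parameter, or None if it is absent."""
        params = parse_cmdline(cmdline)
        return {name: params.get(name) for name in WATCHED}


    if __name__ == "__main__":
        text = Path("/proc/cmdline").read_text()
        for name, value in watched_params(text).items():
            print(f"{name}: {value if value is not None else 'not set'}")
    ```

    If ghes.disable shows as "not set" after a reboot, the boot entry that was edited is probably not the one the system actually started from.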




