Jump to content
  • [6.11] Dell HBA330 Mini with mpt3sas driver stability issues


    JohnHedge
    • Urgent

    After upgrade to 6.11, UnRaid no longer sees the drives attached to the adapter consistently, and when it does, they often experience a reset or read/write errors which show up in dmesg.

     

    System is a Dell R7515 with latest BIOS 2.8.5

    CPU is AMD Epyc R7313p

    HBA330 Mini is latest firmware from Dell: 16.17.01.00, A08

     

    I have power cycled the machine several times, updated whatever I can and also checked kernel change logs and haven't seen anything relevant.

     

    I have also tried disabling msix for the mpt3sas driver to no avail.

    unraid-diagnostics-20220930-1710.zip




    User Feedback

    Recommended Comments

    This seems like a great tip, and seems like what would fix my issue.

     

    I've tried flashing from both Linux and EFI shell and sasflash always ends up erroring out:

     

    Unfortunately, Dell's latest firmware for HBA330 (16.17.01.00) only contains 16.00.11.00 firmware for the SAS3008

    Card details:

    root@UnRAID:/tmp# ./sas3flash -list
    Avago Technologies SAS3 Flash Utility
    Version 15.00.00.00 (2016.11.17) 
    Copyright 2008-2016 Avago Technologies. All rights reserved.
    
            Adapter Selected is a Avago SAS: SAS3008(C0)
    
            Controller Number              : 0
            Controller                     : SAS3008(C0)
            PCI Address                    : 00:01:00:00
            SAS Address                    : 52cea7f-0-c84e-e100
            NVDATA Version (Default)       : 0e.01.00.39
            NVDATA Version (Persistent)    : 0e.01.00.39
            Firmware Product ID            : 0x2221 (IT)
            Firmware Version               : 16.00.11.00
            NVDATA Vendor                  : LSI
            NVDATA Product ID              : Dell HBA330 Mini
            BIOS Version                   : 08.37.02.00
            UEFI BSD Version               : 18.00.03.00
            FCODE Version                  : N/A
            Board Name                     : Dell HBA330 Mini
            Board Assembly                 : N/A
            Board Tracer Number            : N/A
    
            Finished Processing Commands Successfully.
            Exiting SAS3Flash.

    Error details:

    root@UnRAID:/tmp# ./sas3flash -o -f SAS9300_8i_IT.bin
    Avago Technologies SAS3 Flash Utility
    Version 15.00.00.00 (2016.11.17)
    Copyright 2008-2016 Avago Technologies. All rights reserved.
    
            Advanced Mode Set
    
            Adapter Selected is a Avago SAS: SAS3008(C0)
    
            Executing Operation: Flash Firmware Image
    
                    Firmware Image has a Valid Checksum.
                    Firmware Version 16.00.12.00
                    Firmware Image compatible with Controller.
    
                    Valid NVDATA Image found.
                    NVDATA Major Version 0e.01
                    Checking for a compatible NVData image...
    
                    NVDATA Device ID and Chip Revision match verified.
                    NVDATA Versions Compatible.
                    Valid Initialization Image verified.
                    Valid BootLoader Image verified.
    
                    Beginning Firmware Download...
                    Firmware Download Failed!
    
            Firmware Returned Exception. IOCStatus=0x4, IOCLogInfo=0x30010408
            Due to error remaining commands will not be executed.
            Unable to Process Commands.
            Exiting SAS3Flash.
    Edited by JohnHedge
    Link to comment

    I've been told that Dell version 16.10.11.00 is equivalent to 16.00.12.00 from LSI. Digging a bit deeper.

    Edited by JohnHedge
    Link to comment
    32 minutes ago, JohnHedge said:

    I've been told that Dell version 16.10.11.00 is equivalent to 16.00.12.00 from LSI

    I find that hard to believe, but it's not impossible I guess, tried to search for the error you are getting flashing but didn't find anything relevant, might be because the current firmware is not from LSI, though SAS2FLASH it's not detecting any issues before beginning.

    Link to comment

    I find the version number information hard to believe as well, especially when the changelog on Dell's site for 16.00.11.00 does not mention either of the issues addressed in 16.00.12.00

     

    Apparently the issue with flashing is due to some flags that are set in one of the other flash regions in the Dell HBA330 that lock it to firmware from them. I'm curious if a full flash wipe would enable it, but not curious enough to try it...yet.

    Link to comment

    I spent a few hours in chat / on phone with Dell today trying to "rule out" issues with the hardware. We eventually got it to Tier 3 support via email who confirmed there are no new HBA330 firmware releases planned as far out as they can check (November 2022). They were slightly hostile that I had non-"Dell Certified" drives in the server. Am not hopeful of any progress on this front, but thought I'd try

    Link to comment

    In the meantime, I have discovered that running

    echo "- - -" > /sys/class/scsi_host/host1/scan

    will cause any drives that didn't enumerate to become usable, thus solving the immediate problem

    Link to comment


    Join the conversation

    You can post now and register later. If you have an account, sign in now to post with your account.
    Note: Your post will require moderator approval before it will be visible.

    Guest
    Add a comment...

    ×   Pasted as rich text.   Restore formatting

      Only 75 emoji are allowed.

    ×   Your link has been automatically embedded.   Display as a link instead

    ×   Your previous content has been restored.   Clear editor

    ×   You cannot paste images directly. Upload or insert images from URL.


  • Status Definitions

     

    Open = Under consideration.

     

    Solved = The issue has been resolved.

     

    Solved version = The issue has been resolved in the indicated release version.

     

    Closed = Feedback or opinion better posted on our forum for discussion. Also for reports we cannot reproduce or need more information. In this case just add a comment and we will review it again.

     

    Retest = Please retest in latest release.


    Priority Definitions

     

    Minor = Something not working correctly.

     

    Urgent = Server crash, data loss, or other showstopper.

     

    Annoyance = Doesn't affect functionality but should be fixed.

     

    Other = Announcement or other non-issue.

×
×
  • Create New...