• 6.9.0beta35 Intel ixgbe device crash


    Kaldek
    • Solved Minor

    My unRAID 6.9.0beta35 server (I need this release because I'm running Nvidia acceleration) keeps having issues with its Intel dual-port XFP card, which uses the ixgbe module.  The machine loses network connectivity: the link lights stay up, but the OS unloads the module.

    I frequently get the errors below. If they self-correct, the machine keeps going, but a fatal error requires a reboot:

    Quote

    Nov 23 13:04:30 unraid kernel: pcieport 0000:00:01.0: AER: Corrected error received: 0000:03:00.1
    Nov 23 13:04:30 unraid kernel: ixgbe 0000:03:00.1: AER: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Receiver ID)
    Nov 23 13:04:30 unraid kernel: ixgbe 0000:03:00.1: AER: device [8086:10c6] error status/mask=00000040/00002000
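For anyone reading these log lines: the status/mask values are bit fields from the PCIe AER Correctable Error Status and Mask registers. The following is a minimal sketch, not an official tool, that decodes them; the function names `decode_bits` and `decode_aer_line` are made up for illustration, and the bit table only applies to lines logged with severity=Corrected.

```python
import re

# Bit positions in the PCIe AER Correctable Error Status/Mask registers.
CORRECTABLE_BITS = {
    0: "Receiver Error",
    6: "Bad TLP",
    7: "Bad DLLP",
    8: "REPLAY_NUM Rollover",
    12: "Replay Timer Timeout",
    13: "Advisory Non-Fatal Error",
    14: "Corrected Internal Error",
    15: "Header Log Overflow",
}

# Matches the "device [vvvv:dddd] error status/mask=xxxxxxxx/xxxxxxxx" part
# of a kernel AER log line like the one quoted above.
STATUS_RE = re.compile(
    r"AER: device \[(?P<vendor>[0-9a-f]{4}):(?P<device>[0-9a-f]{4})\] "
    r"error status/mask=(?P<status>[0-9a-f]{8})/(?P<mask>[0-9a-f]{8})"
)


def decode_bits(value):
    """Return the names of all known bits set in a register value."""
    return [name for bit, name in sorted(CORRECTABLE_BITS.items())
            if value & (1 << bit)]


def decode_aer_line(line):
    """Decode one corrected-error status line; return None if it does not match."""
    m = STATUS_RE.search(line)
    if m is None:
        return None
    return {
        "device_id": f"{m['vendor']}:{m['device']}",
        "status": decode_bits(int(m["status"], 16)),
        "mask": decode_bits(int(m["mask"], 16)),
    }


line = ("Nov 23 13:04:30 unraid kernel: ixgbe 0000:03:00.1: AER: "
        "device [8086:10c6] error status/mask=00000040/00002000")
print(decode_aer_line(line))
```

For the line quoted above, the status value 00000040 has only bit 6 set (Bad TLP), and the mask value 00002000 has only bit 13 set (Advisory Non-Fatal Error), which fits the "Data Link Layer" type reported in the log.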

     

    Unfortunately, I only have an image of the fatal error rather than the text.

     




    User Feedback

    Recommended Comments

    It looks like this has been seen before on the X99 chipset, and the reported solution is to add the kernel boot parameter "pcie_aspm=off", which disables PCIe Active State Power Management.

     

    I have set this, will monitor, and update this issue over the next couple of days if I no longer see these errors.
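To confirm the flag actually took effect after a reboot, the running kernel's command line can be checked. This is a minimal sketch, assuming a Linux system that exposes `/proc/cmdline`; the helper names `has_boot_param` and `running_with_param` are hypothetical, and the sample command line in the example is made up rather than copied from a real unRAID boot entry.

```python
from pathlib import Path


def has_boot_param(cmdline, param="pcie_aspm=off"):
    """Return True if param appears as a whole word in a kernel command line."""
    return param in cmdline.split()


def running_with_param(param="pcie_aspm=off", path="/proc/cmdline"):
    """Check the command line of the currently running kernel (Linux only)."""
    return has_boot_param(Path(path).read_text(), param)


# Illustrative command line only; a real one will differ.
sample = "BOOT_IMAGE=/bzimage pcie_aspm=off initrd=/bzroot"
print(has_boot_param(sample))
```

On the server itself, calling `running_with_param()` after the reboot should return True if the flag was picked up.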


