• [6.10.0-rc5] Intel NIC - multiple ixgbe_poll call traces - Unraid is not available over SSH


    Freender
    • Urgent

     

    Upgraded from rc4 to rc5 - a few hours later unraid was not available over ssh. I did hard reset and rollbacked to rc4.

    I managed to pull syslog from the Graylog, see attached (let me know if something needs to be obfuscated)

     

    Might be related to this issue(or may not :) ) - https://lore.kernel.org/netdev/YfQMQWsFqCIPBBqO@boxer/T/

     

    Docker config:

    Docker custom network type: macvlan

    Host access to custom networks: Enabled

    Preserve user defined networks: Yes

    I also use IPv6 

     

    Network Cards:

    IOMMU group 16:[8086:10fb] 01:00.0 Ethernet controller: Intel Corporation 82599ES 10-Gigabit SFI/SFP+ Network Connection (rev 01)

    IOMMU group 17:[8086:10fb] 01:00.1 Ethernet controller: Intel Corporation 82599ES 10-Gigabit SFI/SFP+ Network Connection (rev 01)

     

    In logs there are multiple call traces related to NIC driver (search for ixgbe_poll)

     

    2022-04-28T00:58:03.000Z	Tower	Tower kernel: ixgbe_poll+0xd0a/0xdc4 [ixgbe]

     

    and as a final accord - 2 OOM events

    2022-04-28T01:35:18.000Z Tower Tower kernel: sleep invoked oom-killer: gfp_mask=0x1100dca(GFP_HIGHUSER_MOVABLE|__GFP_ZERO), order=0, oom_score_adj=0
    2022-04-28T01:36:56.000Z Tower Tower kernel: unraid-api invoked oom-killer: gfp_mask=0xcd0(GFP_KERNEL|__GFP_RECLAIMABLE), order=0, oom_score_adj=0

     

    syslog_rc5.csv

    • Upvote 1



    User Feedback

    Recommended Comments

    Thank you for mention it. I have the same error. After 40-45 Minutes my unraid dashboard isnt avaiable anymore and pings and sshs are dead. Need to forcerestart the Server. Also rolled back to rc4.

    I use a

    Intel Corporation Ethernet Controller 10-Gigabit X540-AT2 (rev 01)

    Dual Port Network Card.

    Sadly cant fix this myself. 😞

    Edited by RiDDiX
    Link to comment

    I feel like this linked to my problem and post as well.  Only I couldn't even get mine to work on the first boot and rolled back asap.

    Booting Stable this is my driver. 

    kernel: ixgbe: Intel(R) 10 Gigabit PCI Express Network Driver

    Link to comment
    9 minutes ago, unkdarkstar said:

    I feel like this linked to my problem and post as well.  Only I couldn't even get mine to work on the first boot and rolled back asap.

    Booting Stable this is my driver. 

    kernel: ixgbe: Intel(R) 10 Gigabit PCI Express Network Driver

     

    Strange behaving... Mine works flawlessy for nearly 40-45 Mins than Unraid just hang up and I need to force restart my complete server. No Dashboard or SSH is accessible even all my share and VMs and so one are just dropped xD

    Kernel Panic

    • Like 1
    Link to comment

    I decided to give rc5 another try:

    deleted all custom docker networks and upgraded to RC5 - no issues so far.

    Link to comment
    12 hours ago, Freender said:

    I decided to give rc5 another try:

    deleted all custom docker networks and upgraded to RC5 - no issues so far.

     

    So you just delete all docker custom networks? "docker0" all of these?

    Link to comment

    What I did:

    1) Stop all docker containers

    2) Go to Settings -> Docker

    3) Set

    Enable Docker: No

    Preserve user defined networks: No

    Host access to custom networks: Disables

    4) Install RC5

    5) Reboot

    6) Go to Settings -> Docker

    7) Set

    Enable Docker: Yes

    Preserve user defined networks: Yes

    Host access to custom networks: Enabled

    Link to comment


    Join the conversation

    You can post now and register later. If you have an account, sign in now to post with your account.
    Note: Your post will require moderator approval before it will be visible.

    Guest
    Add a comment...

    ×   Pasted as rich text.   Restore formatting

      Only 75 emoji are allowed.

    ×   Your link has been automatically embedded.   Display as a link instead

    ×   Your previous content has been restored.   Clear editor

    ×   You cannot paste images directly. Upload or insert images from URL.


  • Status Definitions

     

    Open = Under consideration.

     

    Solved = The issue has been resolved.

     

    Solved version = The issue has been resolved in the indicated release version.

     

    Closed = Feedback or opinion better posted on our forum for discussion. Also for reports we cannot reproduce or need more information. In this case just add a comment and we will review it again.

     

    Retest = Please retest in latest release.


    Priority Definitions

     

    Minor = Something not working correctly.

     

    Urgent = Server crash, data loss, or other showstopper.

     

    Annoyance = Doesn't affect functionality but should be fixed.

     

    Other = Announcement or other non-issue.