Jump to content
  • "out of memory" crashes on 2990WX under certain circumstance


    testdasi
    • Annoyance

    I have "reliably" caused "Out of memory" crashes under the following condition:

    • Pin dies without direct memory access to some dockers
    • Put those dockers under high load (like running 3 simultaneous Handbrake dockers transcoding H265)
    • Leave about 1%-2% of memory free (about 80% occupied by VM / dockers, 18-19% by RAM cache)

     

    Under this condition, after about 10-15 minutes, some processes are automatically killed by unRAID with Out of Memory error. That's unexpected in this scenario because there's constantly 1%-2% memory totally free (manually monitored) + 18% "buffer" (RAM cache).

     

    Spreading the pinned cores across all 4 dies do not lead to this Out of Memory error so I suspect this has something to do with Threadripper 2 optimisation so probably a kernel problem and not actually unRAID bug.

     

    At this point, it's more an annoyance for me but I'm sure there's someone out there who might be caught off guard.




    User Feedback

    Recommended Comments

    There are no comments to display.



    Join the conversation

    You can post now and register later. If you have an account, sign in now to post with your account.
    Note: Your post will require moderator approval before it will be visible.

    Guest
    Add a comment...

    ×   Pasted as rich text.   Restore formatting

      Only 75 emoji are allowed.

    ×   Your link has been automatically embedded.   Display as a link instead

    ×   Your previous content has been restored.   Clear editor

    ×   You cannot paste images directly. Upload or insert images from URL.


  • Status Definitions

     

    Open = Under consideration.

     

    Solved = The issue has been resolved.

     

    Solved version = The issue has been resolved in the indicated release version.

     

    Closed = Feedback or opinion better posted on our forum for discussion. Also for reports we cannot reproduce or need more information. In this case just add a comment and we will review it again.

     

    Retest = Please retest in latest release.


    Priority Definitions

     

    Minor = Something not working correctly.

     

    Urgent = Server crash, data loss, or other showstopper.

     

    Annoyance = Doesn't affect functionality but should be fixed.

     

    Other = Announcement or other non-issue.

×
×
  • Create New...