Jump to content
  • No alert for failing cache SSDs


    maciekish
    • Urgent

    Hi, my server has been acting strange for months, freezing at random when there is any greater write activity. I haven't been able to narrow this down to the SSDs until now. The wear leveling count for both drives is 1 (1% life remaining), and the LBAs written equal almost 5x the rated TBW for a Samsung 860 EVO! One of the drives even had a CRC error count of 6. How come there is no notification about this state? I'm receiving notifications about parity checks just fine. This is either a huge bug or tremendous oversight. If a low wear leveling count or CRC errors appear, there should be an immediate notification. I only got lucky that i thought about checking these numbers manually.

     

    image.thumb.png.fb61174959a7dcba1481652fe4889c6d.png

     

    Here are my notification settings, i just tested email and Pushover manually and they work, and i am receiving other unRAID alerts regularly.

     

    image.thumb.png.286285a40f9fa50c2c531b47cf405c9e.png




    User Feedback

    Recommended Comments

    By default you are notified of CRC errors, though those are usually a cable problem, not device, but the SMART you posted has 0 CRC errors, so it won't trigger a notification.

     

    Life remaining will only trigger a notification if the SMART attribute is reported by the device as "Failing NOW", just because the SSD is past its predicted life doesn't mean it's failing or about to fail, I for example had an SSD with 500TBW that failed recently at almost 2PB written.

     

    Recommend you post the diagnostics after you experience the issue to see if we can confirm it's really a device problem.

    Link to comment


    Join the conversation

    You can post now and register later. If you have an account, sign in now to post with your account.
    Note: Your post will require moderator approval before it will be visible.

    Guest
    Add a comment...

    ×   Pasted as rich text.   Restore formatting

      Only 75 emoji are allowed.

    ×   Your link has been automatically embedded.   Display as a link instead

    ×   Your previous content has been restored.   Clear editor

    ×   You cannot paste images directly. Upload or insert images from URL.


  • Status Definitions

     

    Open = Under consideration.

     

    Solved = The issue has been resolved.

     

    Solved version = The issue has been resolved in the indicated release version.

     

    Closed = Feedback or opinion better posted on our forum for discussion. Also for reports we cannot reproduce or need more information. In this case just add a comment and we will review it again.

     

    Retest = Please retest in latest release.


    Priority Definitions

     

    Minor = Something not working correctly.

     

    Urgent = Server crash, data loss, or other showstopper.

     

    Annoyance = Doesn't affect functionality but should be fixed.

     

    Other = Announcement or other non-issue.

×
×
  • Create New...