"array health report [FAIL]" - But Can't Find an Issue


extrobe


I frequently wake up to a status message on my server stating

 

Quote
unRAID Status: 15-02-2018 00:20
Notice [DEMETER] - array health report [FAIL]
Array has 20 disks (including parity & cache)

 

However, I can't see what the issue is. The report isn't consistent either: roughly half the time it comes back fine.

 

Parity is valid, and last checked a few days ago.

When I run Fix Common Problems, there are no errors or warnings.

 

The only thing I can think of is that when the mover is running, the cache drive sometimes gets warm (40-42 °C). Could this be triggering the FAIL notification?

2 hours ago, Squid said:

You should post your diagnostics before you reboot

 

Thanks Squid - I eventually found the very useful Archived Notification menu where it gave me some more detail.

I found...

Quote

Cache 2 - SAMSUNG_MZ7WD480HAGM-00003_S16MNYAF107743 (sde) - active 40 C (disk is hot) [NOK]

 

So it was indeed the temperature setting off the failure. I've raised the WARN threshold to 50 °C, so that should sort it out (the drive never runs below 38-39 °C, so it doesn't take much to get it to 40).

  • 3 years later...

For anybody else looking up similar questions some 3+ years later :) ... The very helpful "archived notification menu" can be found from:

 

WebUI > Tools > Archived Notifications

 

You can then click on a notification, and it will give you further details, as in extrobe's "NOK" paste.

 

(Thanks for the pointers, resolved my issue, also ended up being a hot disk :D )
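
If the detailed report is long, the line that caused the FAIL can be picked out automatically. Below is a minimal Python sketch that does this, assuming the per-disk lines always look like the ones pasted in this thread (slot, device ID, device name in parentheses, state, temperature, optional reason, then [OK] or [NOK]). The regular expression and the `failing_disks` helper are my own guesses from those pastes, not an official Unraid format or API.

```python
import re

# Pattern inferred from the report lines quoted in this thread (an assumption,
# not a documented format):
#   <slot> - <device id> (<dev>) - <state> <temp> C [(<reason>)] [OK|NOK]
LINE_RE = re.compile(
    r"^(?P<slot>.+?) - (?P<device>\S+) \((?P<dev>\w+)\) - "
    r"(?P<state>\w+) (?P<temp>\d+) C"
    r"(?: \((?P<reason>[^)]*)\))? \[(?P<status>OK|NOK)\]$"
)


def failing_disks(report):
    """Return one dict per disk line that is marked [NOK]."""
    problems = []
    for line in report.splitlines():
        match = LINE_RE.match(line.strip())
        if match and match.group("status") == "NOK":
            info = match.groupdict()
            info["temp"] = int(info["temp"])  # temperature in degrees C
            problems.append(info)
    return problems


if __name__ == "__main__":
    sample = (
        "Cache 2 - SAMSUNG_MZ7WD480HAGM-00003_S16MNYAF107743 (sde)"
        " - active 40 C (disk is hot) [NOK]"
    )
    for disk in failing_disks(sample):
        print(disk["slot"], disk["dev"], disk["temp"], disk["reason"])
```

Lines that do not match the pattern (headers, the parity summary, blank lines) are simply skipped, so the whole notification text can be pasted in as-is.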

  • 8 months later...

In my case I got this:

 

"Unraid Status
Notice [TOWER] - array health report [FAIL]
Array has 4 disks (including parity & cache)
warning

Parity - ST10000NM0226_E_ZA21794D0000J651V8A0_35000c50086bb8633 (sdc) - active 34 C (disk has read errors) [NOK]
Disk 1 - ST10000NM0226_E_ZA21796H0000J7167T4Z_35000c50086bb826f (sdf) - active 37 C [OK]
Disk 2 - ST10000NM0226_E_ZA21AAH10000C7258S57_35000c50093ae2913 (sde) - active 39 C [OK]
Cache - OCZ-TRION150_26OB318DK1CU (sdb) - active 27 C [OK]

Parity is valid
Last checked on Sun 07 Nov 2021 08:16:47 PM PST (today), finding 0 errors.
Duration: 10 hours, 27 minutes, 10 seconds. Average speed: 263.9 MB/s"

 

If I click on the error, it takes me to the Main tab. Should I replace the parity disk?
