Jump to content
casperse

[SOLVED] Got array health report [FAIL] but Green thumb

6 posts in this topic Last Reply

Recommended Posts

Posted (edited)

Hi All

 

I keep getting a drive fail and my disk 20 is showing read errors (I have run extended SMART disk check)

 

image.png.ac064a9932823d18b628000ae3ec74b8.png

 

image.png.1e0b2f4201111007e94352636ca317d7.png

 

 

Attached smart report & Diag below

 

 

BTW it looks like the Parity check is started again now at 21% (the error notification E-mail said it was:

 

Parity check in progress.
Total size: 12 TB
Elapsed time: 21 hours, 20 minutes
Current position: 8.95 TB (74.6 %)
Estimated speed: 139.1 MB/sec
Estimated finish: 6 hours, 5 minutes
Sync errors corrected: 0

 

And yes the cache drive is often high temperature but according to others that's normal for a NVMe drive?

I am considering adding this NVME heatsink I know it should be high but I think its to high (I have 3 industri fans full speed in the case + 2 on the back)

image.png.2957262beaa7fd5637e36c0547289c1f.png

 

Looking forward to hearing from you - should the disk be replaced?

 

Best regards

Casperse

 

Update it looks like the error i a week old:

image.png.9786848c4452a260a75753898ddf1216.png

can I acknowledge the error somewhere to stop getting it in the notification e-mail? (id the drive is ok now?)

Edited by casperse

Share this post


Link to post

It's logged as an actual disk problem, but since it passed the extended test it's OK for now, keep monitoring, you'll need to clear the errors to stop getting the failed report.

 

Cache temp is normal if it's after some writes, you can set a custom temp warning for it.

Share this post


Link to post
1 hour ago, johnnie.black said:

It's logged as an actual disk problem, but since it passed the extended test it's OK for now, keep monitoring, you'll need to clear the errors to stop getting the failed report.

 

Cache temp is normal if it's after some writes, you can set a custom temp warning for it.

Where can I clear the errors? Thanks 👍

Share this post


Link to post
On 7/31/2020 at 9:06 AM, johnnie.black said:

It's logged as an actual disk problem, but since it passed the extended test it's OK for now, keep monitoring, you'll need to clear the errors to stop getting the failed report.

 

Cache temp is normal if it's after some writes, you can set a custom temp warning for it.

Isn't this a little high in temperature (My former samsung 1TB NVME was lower)

 

Event: Unraid Cache disk temperature
Subject: Alert [PLEXZONE] - Cache disk overheated (78 C)
Description: Samsung_SSD_970_EVO_Plus_2TB_S4J4NG0M919326F (nvme0n1)
Importance: alert

 

Share this post


Link to post
5 minutes ago, casperse said:

Isn't this a little high in temperature (My former samsung 1TB NVME was lower)

It is, and it might be throttling down, you can get an NVMe cooler.

Share this post


Link to post

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.