Jump to content

Read errors on a 2.5 years old HDD: do I need to replace it or is it some diagnostic error?


Recommended Posts

I've got a 2.5 years old 8TB Western Digital that has had three errors today when reading from a sector. This has never happened before.

 

I'm not experienced in this. Is this some small random error? Is the drive dying? It's not been used too much, so this comes as a surprise to me. Are there some tests that I should run on it? Is there a shell script I can execute to try and read from those specific sectors again to confirm that the error is persistent?

 

The disk in question is disk1 in the diagnostics.

uxie-diagnostics-20210712-1628.zip

Edited by n0stalghia
Link to comment
4 minutes ago, trurl said:

The usually monitored SMART attributes look OK, but syslog seems to indicate an actual disk problem.

 

Run an extended SMART test on the disk.

Thanks, the fast test reported no errors, the extended one will be done in 14 hours.

Edited by n0stalghia
Link to comment
48 minutes ago, trurl said:

The usually monitored SMART attributes look OK, but syslog seems to indicate an actual disk problem.

 

Run an extended SMART test on the disk.

SMART test is still running, EDIT: Just realized that this message also killed the extended SMART test. Errors occured - Check SMART report.

 

I just got this message about "current pending sector" being 8. I assume nothing good? The drive is 2.17 years old, so should still be under warranty (3 years).

 

EDIT: Added a second SMART report, this one only contains one file for some reason.

Screenshot 2021-07-12 172359.jpg

uxie-smart-20210712-1725.zip

Edited by n0stalghia
Link to comment
1 hour ago, n0stalghia said:

Can the drive handle one more parity check, or should I power down and replace immediately? I'd lose like 3 TV series, so no big deal (all important data is backed up to the cloud every Monday)

That statement does not compute. Parity is realtime, a check just verifies that all drives can be read accurately and the realtime parity has stayed in sync.

 

If everything is working properly you should be able to replace the drive at any time and not lose anything.

  • Thanks 1
Link to comment
10 minutes ago, jonathanm said:

That statement does not compute. Parity is realtime, a check just verifies that all drives can be read accurately and the realtime parity has stayed in sync.

 

If everything is working properly you should be able to replace the drive at any time and not lose anything.

I was not aware that parity is real-time, I always assumed it computes it during check. TIL. Well this is great news.

 

On a "fun" side note, I tried to get a list of all data using find . -print > files.txt and the file got corrupted and threw even more mistakes - I took the server offline and already got an RMA procedure running.

 

Thank you for the help.

Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

×
×
  • Create New...