
Disk missing, how worried should I be?


DrJake


Hi all,

 

I've been using Unraid for 6 months now and successfully dealt with a failed disk a couple of months ago. I got really worried yesterday when I got 7 read errors on 2 disks (disk 1 and disk 2). When I stopped the array, these 2 disks, along with an unassigned device, could not be detected by the system. Below are some details.

missing..PNG.a89eaedf60f0c5242a528b56be0c08ed.PNG

 

Below is the unassigned device (expected to fail soon).

1029234310_missingtoo.PNG.3fe83e94f793ce431fb515829e70d435.PNG

 

The syslog was full of these, repeated every couple of seconds for almost a day and a half...

1568470222_sinceyesterday..PNG.e50afdfd5e652592e8a84e300d863e26.PNG

tower-syslog-20201108-0739.zip
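For anyone wanting to see which message dominates a syslog like this, a small script can group lines by message text and count them. This is only a sketch: it assumes the classic `Mon DD HH:MM:SS host ...` prefix, the function name `top_messages` is made up for illustration, and the sample lines are placeholders, not lines from the attached syslog.

```python
import re
from collections import Counter

# Assumed classic syslog prefix: "Nov  7 03:12:45 hostname "
PREFIX = re.compile(r"^[A-Z][a-z]{2}\s+\d{1,2}\s+\d{2}:\d{2}:\d{2}\s+\S+\s+")


def top_messages(lines, n=5):
    """Return the n most frequent messages, ignoring timestamp and hostname."""
    counts = Counter()
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            continue
        # Strip the timestamp/host prefix so identical messages group together.
        counts[PREFIX.sub("", line)] += 1
    return counts.most_common(n)


if __name__ == "__main__":
    # Placeholder lines for illustration only.
    sample = [
        "Nov  7 03:12:45 Tower demo: placeholder message A",
        "Nov  7 03:12:47 Tower demo: placeholder message A",
        "Nov  7 03:12:49 Tower demo: placeholder message B",
    ]
    for message, count in top_messages(sample):
        print(f"{count:6d}  {message}")
```

To run it against a real file, pass the open file object instead of the sample list, for example `top_messages(open("syslog.txt"))`.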

 

Since the system was not detecting 3 HDDs, I restarted the server in safe mode. All 3 missing HDDs came back. So I started running extended SMART tests on each.

disk 1 smart 2020-11-08.txt

disk 2 smart 2020-11-08.txt

unassigned smart 2020-11-08.txt

 

All 3 HDDs which went missing completed SMART with no issues.

 

My Disk 1 is new, used for only a couple of months, but it still shows many "pre-fail" and "old-age" attributes...

958599066_disk1smart2020-11-08.thumb.PNG.7e21b5f53c7b4979d6b0324567393ac7.PNG

 

My Disk 2 is not new but hasn't been heavily used, and it also shows many "pre-fail" and "old-age" attributes...

1569451413_disk2smart2020-11-08.thumb.PNG.72554bb8ade494e46ea9a64dfa7437f7.PNG
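A note on reading those tables: in `smartctl -A` output, the TYPE column ("Pre-fail" or "Old_age") is the category of the attribute, not its current state. An attribute counts as failed when its normalized VALUE has dropped to or below THRESH, which smartctl reports in the WHEN_FAILED column. The sketch below filters saved `smartctl -A` text down to attributes that are actually flagged. The function name `failing_attributes` is hypothetical, and the sample rows are illustrative values, not taken from the attached reports.

```python
def failing_attributes(smartctl_output):
    """Return (name, type, when_failed) for attributes that are flagged.

    Expects the text of the attribute table printed by `smartctl -A`.
    """
    failing = []
    for line in smartctl_output.splitlines():
        # Columns: ID# NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW
        # RAW_VALUE may contain spaces, so split into at most 10 fields.
        fields = line.split(None, 9)
        if len(fields) < 10 or not fields[0].isdigit():
            continue  # header or non-table line
        try:
            value, thresh = int(fields[3]), int(fields[5])
        except ValueError:
            continue  # skip rows without numeric VALUE/THRESH
        name, attr_type, when_failed = fields[1], fields[6], fields[8]
        if when_failed != "-" or (thresh > 0 and value <= thresh):
            failing.append((name, attr_type, when_failed))
    return failing


# Illustrative rows only (made-up values in the real column layout).
SAMPLE = """\
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
  9 Power_On_Hours          0x0032   098   098   000    Old_age   Always       -       1500
"""

if __name__ == "__main__":
    print(failing_attributes(SAMPLE))
```

An empty list means no attribute is at or below its threshold, even though every row is still labelled "Pre-fail" or "Old_age".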

 

My unassigned disk is expected to fail any day now, but there's nothing critical on it and it shouldn't cause any havoc on the system. Even this one came back as fine...

117202168_wd1tb_cctvsmart2020-11-08.thumb.PNG.7e802b74325a5eec0c142cdbe1c7d5d9.PNG

 

So, after the SMART tests completed without error, I rebooted the system back into normal mode.

717471503_runningnormal.thumb.PNG.b828a72d611bbac528701a53a023345e.PNG

All 3 HDDs were detected, and many dockers and VMs started automatically without issue. The parity check is scheduled for later today. So that happened... What I'm wondering is: how worried should I be, and what preventative measures should I take?

1 hour ago, JorgeB said:

Those diags are from after rebooting, and the syslog you posted before is incomplete and doesn't show the beginning of the problem. Did you save the complete diags before rebooting?

Unfortunately no. If it happens again, I'll do that for sure. Anything else to note?
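For next time: the full diagnostics zip from the web UI (Tools > Diagnostics) is what gets asked for here, and it has to be taken before rebooting because the live syslog is kept in RAM. As a fallback, a timestamped copy of the syslog can be written to the flash drive first. The sketch below assumes Python is installed on the server (it is not part of a stock Unraid install), that the live log is at `/var/log/syslog`, and that `/boot/logs` on the flash drive is an acceptable destination. The function name `save_log_copy` is made up for illustration.

```python
import shutil
import time
from pathlib import Path


def save_log_copy(src="/var/log/syslog", dest_dir="/boot/logs"):
    """Copy the current log to a timestamped file and return the new path.

    Both default paths are assumptions about a typical Unraid setup.
    """
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    stamp = time.strftime("%Y%m%d-%H%M%S")
    target = dest / f"syslog-{stamp}.txt"
    # copyfile copies contents only, which avoids permission-metadata
    # problems on a FAT-formatted flash drive.
    shutil.copyfile(src, target)
    return target


if __name__ == "__main__":
    print(save_log_copy())
```

This only preserves the syslog, so it is not a substitute for the complete diagnostics.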

13 hours ago, JorgeB said:

Just that the disks look fine, so it's likely a controller or power/connection problem. My money would be on a controller issue, since it's quite common with Ryzen boards and v6.8.

Thank you Jorge. That's a relief, I was really worried for my data for a while because the error spread to multiple disks. 
