Jump to content

Double disk failure?


Recommended Posts

 

I got this email a couple of days ago and I'm just getting around to looking into it. 
 

Event: Unraid Disk 1 error
Subject: Alert [TOWER] - Disk 1 in error state (disk dsbl)
Description: MD3000GSA6472E_1_P8H1R6XR (sdd)
Importance: alert

 

When I web into the server I see this:

image.png.9c857cf87ff86264198a98174e8dada3.png

 

The problem is disks 1 and 5 are gone...

When I click on the main tab I see this:

image.thumb.png.f6acd17cca141347f0ebbe5a61a45fd7.png

 

When I download the syslog I get an empty zip file.  How do I proceed?  Did I just have a double disk failure?  I replaced Disk 1 about a month ago, if that matters.

Link to comment
  • 1 month later...

Ok I ended up replacing the sata cable and rebuilt the disk onto itself.  It was fine for a few weeks and now I'm back to a disabled disk1 .  I've attached the smart test.  This disk is only about 3 - 4 months old.  Did i get a bad replacement disk or is an issue with my sata controller?  Its a really old machine, wondering If I should replace the mobo/cpu/ram.  

 

tower-diagnostics-20230524-0834.zip

tower-smart-20230524-0831.zip

Edited by cobolstinks
Link to comment
29 minutes ago, trurl said:

SMART report looks fine. Bad connections are much more common than bad disks.

 

Attach diagnostics to your NEXT post in this thread.

Thank you for the help.  I believe I attached the diagnostics zip file to my post along with the SMART test results, unless I'm posting the wrong diagnostics.

I hear you on the bad connections, I guess I'm not sure how to proceed here.  I can rebuild again on disk1, but if it fails again I'm not sure how to fix this long term.  I've replaced the sata cable and the disk is 3-4 old replacing what I thought was a previous disk failure.  So as far as hardware that leaves the disk and the sata port itself... right?  I don't have any spare sata ports on my mobo.  So if this isn't the disk but rather the port, I'm looking at a mobo/cpu/ram upgrade or maybe a PCI sata card, right?

Edited by cobolstinks
Link to comment
3 hours ago, cobolstinks said:

I believe I attached the diagnostics zip file to my post

You edited your post to attach them after I requested them.

 

Your syslog indicates you booted Dec 31, but NTP corrected that. So your server isn't keeping time after reboot. Check your CMOS battery.

 

You rebooted after the problems occurred, so syslog from diagnostics can't tell us anything that happened before.

 

You barely have enough RAM for just NAS duty, best if you just disable Docker and VM Manager and don't attempt that.

 

SMART for all drives looks OK except for the age of some. Why are your disk controllers IDE instead of AHCI?

 

Do you have a spare you can use to rebuild disk1?

 

 

Link to comment

I started the array and have begun reb uilding disk1 onto itself.  

I can check if AHCI is an option in my mobo settings.  Yes I'm aware that my CMOS battery is dead, every time i loose power I have to reset the boot priority.  I should just replace the battery.

I do not user VMs or Docker I only use this machine as a NAS.  I do not have a spare disk.

tower-diagnostics-20230524-1314.zip

Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

×
×
  • Create New...