Neldonado Posted May 25, 2023
This morning I got this warning: Unraid Skynetcache disk SMART health [5] Warning [SKYNET] - reallocated sector ct is 24. A couple of hours later I got this: fstrim: /mnt/skynetcache: FITRIM ioctl failed: Input/output error. I logged in and noticed my drive had dropped offline, but I can’t figure out why. I’ve uploaded my diagnostics. It’s worth mentioning that I’ve also been having intermittent issues with my /mnt/mediacache drive dropping offline.
skynet-diagnostics-20230525-0813.zip
JorgeB Posted May 25, 2023
skynetcache1 dropped offline:
May 25 04:26:47 Skynet kernel: ata1: hard resetting link
May 25 04:26:52 Skynet kernel: ata1: softreset failed (device not ready)
May 25 04:26:52 Skynet kernel: ata1: reset failed, giving up
May 25 04:26:52 Skynet kernel: ata1.00: disable device
May 25 04:26:52 Skynet kernel: ata1: EH complete
Because it dropped there's no SMART report, but this is usually a power/connection problem. Check/replace cables and run a correcting scrub.
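For anyone who wants to spot this pattern in their own syslog, here is a minimal Python sketch that scans log lines for the ata messages quoted above. The function names, the regular expression, and the list of marker strings are my own assumptions based only on this excerpt; other kernels or drivers may word these messages differently.

```python
import re

# Matches kernel libata lines such as:
#   "May 25 04:26:52 Skynet kernel: ata1.00: disable device"
ATA_LINE = re.compile(
    r"^(?P<ts>\w{3}\s+\d+\s[\d:]{8})\s+\S+\s+kernel:\s+"
    r"(?P<port>ata\d+)(?:\.\d+)?:\s+(?P<msg>.*)$"
)

# Message fragments taken from the log excerpt in this thread (not exhaustive).
DROP_MARKERS = (
    "hard resetting link",
    "softreset failed",
    "reset failed",
    "disable device",
)


def find_ata_drops(lines):
    """Return {port: [(timestamp, message), ...]} for lines matching a marker."""
    events = {}
    for line in lines:
        m = ATA_LINE.match(line.strip())
        if not m:
            continue
        msg = m.group("msg")
        if any(marker in msg for marker in DROP_MARKERS):
            events.setdefault(m.group("port"), []).append((m.group("ts"), msg))
    return events


def port_was_disabled(events, port):
    """True if the port logged 'disable device', i.e. the kernel gave up on it."""
    return any("disable device" in msg for _, msg in events.get(port, []))


SAMPLE = """\
May 25 04:26:47 Skynet kernel: ata1: hard resetting link
May 25 04:26:52 Skynet kernel: ata1: softreset failed (device not ready)
May 25 04:26:52 Skynet kernel: ata1: reset failed, giving up
May 25 04:26:52 Skynet kernel: ata1.00: disable device
May 25 04:26:52 Skynet kernel: ata1: EH complete
""".splitlines()

if __name__ == "__main__":
    events = find_ata_drops(SAMPLE)
    for port, hits in events.items():
        state = "disabled" if port_was_disabled(events, port) else "reset only"
        print(port, len(hits), state)
```

A port that shows resets but never "disable device" recovered on its own; one that ends in "disable device" stays offline until the drive is power cycled or reconnected.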
Neldonado Posted May 25, 2023
I’ll check the cables, but they are relatively new. I was dealing with this months ago and replaced all the cables. Could my power supply be giving up ever so slightly?
Neldonado Posted May 25, 2023
I rebooted the server and the drive is completely missing from the GUI now. How do I proceed?
JorgeB Posted May 25, 2023
JorgeB said: check/replace cables
Or power cycle the server to see if it comes back; a reboot usually isn't enough when a drive drops.
Neldonado Posted May 25, 2023
Yeah, the reboot didn't work, now the drive won't even show up to mount.
Neldonado Posted May 26, 2023
I rebooted, tried several data and power cables, and tried different data ports, but the reallocated sector count just kept climbing, and then a UDMA error started popping up. I swapped this drive with another SSD in the machine, and it still had issues while the other SSD was fine, so I think it’s safe to assume the drive is failing. I pulled the drive from the pool and am running the pool with one drive. If I stop the array and add a new SSD (240 GB vs. the current 256 GB), can I just start it and do a scrub?
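The swap test described above can be written down as a small decision rule. This is only a sketch of that reasoning in Python: the `SwapTest` record, the `fault_follows_drive` function, and the drive and port labels are all hypothetical names for illustration, not anything Unraid provides.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SwapTest:
    drive: str        # hypothetical label, e.g. "ssd_a"
    port: str         # hypothetical label, e.g. "port_1"
    had_errors: bool  # True if SMART or link errors appeared during the test


def fault_follows_drive(tests, suspect):
    """Return True when the evidence points at the drive rather than the port.

    Requires that the suspect errored on every port it was tried on (at least
    two ports), and that some other drive ran clean on one of those ports.
    """
    suspect_ports = {t.port for t in tests if t.drive == suspect}
    suspect_bad = {t.port for t in tests if t.drive == suspect and t.had_errors}
    clean_elsewhere = any(
        t.drive != suspect and t.port in suspect_ports and not t.had_errors
        for t in tests
    )
    return len(suspect_ports) >= 2 and suspect_ports == suspect_bad and clean_elsewhere


# Illustrative observations only: the suspect fails in both positions,
# while the other SSD is fine in both.
tests = [
    SwapTest("ssd_a", "port_1", True),
    SwapTest("ssd_b", "port_2", False),
    SwapTest("ssd_a", "port_2", True),   # after swapping positions
    SwapTest("ssd_b", "port_1", False),
]

if __name__ == "__main__":
    print(fault_follows_drive(tests, "ssd_a"))
```

If the errors had stayed with the port or cable instead of moving with the SSD, the same function would return False for the drive, which would point back at cabling or power.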
JorgeB Posted May 26, 2023 (Solution)
You just need to assign the new device and start the array; the data will be duplicated to it.
Neldonado Posted July 3, 2023
I ran into this issue again, this time with the OTHER SSD, the other one of the pair I originally had in here. The drive dropped offline. Am I just really unlucky to have two SSDs fail so close to each other?
skynet-diagnostics-20230702-1659.zip
Neldonado Posted July 3, 2023
I am going to stop the array and replace the SSD with a different brand of SSD.
JorgeB Posted July 3, 2023
The device dropped offline. It could be cables, or it could just be bad devices.