sunbear Posted August 29, 2022
About every other day for the past week, I have been getting a handful of read errors on one drive that correct themselves. I've run several short SMART tests and one extended SMART test with no errors. Then today something finally bit the dust: I ended up with 3974 errors (both read and write), including messages like "blk_update_request: I/O error, op 0x1:(WRITE) flags 0x4000 prio class 0" and "Synchronize Cache(10) failed", and the drive got disabled. This drive is actually one of my newer ones and unfortunately also my good CMR 6TB drive (instead of my screwed-over-by-WD secretly-SMR 6TB drive), so I'm hoping I don't have to replace it. Would someone smarter than me mind taking a look at my diagnostics and letting me know if I just need to replace the drive? Is there anything I can do besides more SMART tests to see what the issue is? Should I do a preclear? Thanks.
Diagnostics attached: serv-x370-diagnostics-20220829-1135.zip
sunbear Posted August 29, 2022 (Author)
Possibly related: I have also been having some issues with an SSD passed through to a VM, which I am pretty sure is a bad drive that I plan on replacing. Is it possible for potential corruption issues with that drive to somehow spill over to my array drives? I did notice that my file integrity scans found some corruption on the bad array disk around the same time as the errors (see the system log in the diagnostics). That isn't totally unusual though; I get it every once in a while on certain files that are shared with multiple dockers/SMB.
trurl Posted August 29, 2022
sunbear said: "Is it possible for potential corruption issues with that drive to somehow spill over to my array drives?"
No, but if you have hardware issues that caused that corruption, they could affect other drives. Save me the trouble of opening every SMART report by telling me which drives are involved. Do any have SMART warnings on the Dashboard page?
JorgeB Posted August 29, 2022
The disk dropped offline, so there's no SMART report, but it's not logged as a disk problem. Swap cables/slot with a different disk and see if the problem follows the disk. You're also having what look like power/connection issues with cache2; check/replace cables.
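One way to see whether the errors follow the disk after a swap is to tally the kernel I/O error lines per device in the syslog before and after. Below is a minimal Python sketch, assuming the log lines carry a `dev sdX` field in the shape recent Linux kernels typically print. The sample lines and device names are made up for illustration and are not taken from the attached diagnostics, and `tally_io_errors` is a hypothetical helper name.

```python
import re
from collections import Counter

# Assumed line shape (not copied from the attached diagnostics):
#   "... blk_update_request: I/O error, dev sdf, sector 100 op 0x1:(WRITE) flags ..."
IO_ERROR = re.compile(
    r"I/O error, dev (\w+), sector \d+ op 0x[0-9a-fA-F]+:\((\w+)\)"
)


def tally_io_errors(lines):
    """Count I/O error lines per (device, operation) pair."""
    counts = Counter()
    for line in lines:
        match = IO_ERROR.search(line)
        if match:
            device, operation = match.group(1), match.group(2)
            counts[(device, operation)] += 1
    return counts


# Hypothetical sample lines, for illustration only.
sample = [
    "kernel: blk_update_request: I/O error, dev sdf, sector 100 "
    "op 0x1:(WRITE) flags 0x4000 phys_seg 1 prio class 0",
    "kernel: blk_update_request: I/O error, dev sdf, sector 200 "
    "op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0",
    "kernel: sd 7:0:0:0: [sdf] Synchronize Cache(10) failed",
    "kernel: blk_update_request: I/O error, dev sdg, sector 300 "
    "op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0",
]

for (device, operation), count in sorted(tally_io_errors(sample).items()):
    print(device, operation, count)
```

Device letters such as `sdf` can change between boots, so when comparing before and after a swap it is safer to match each letter to the drive's serial number first.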
sunbear Posted August 29, 2022 (Author)
Uggh. I've had so many damn cable/connection issues. Thanks. I will try swapping things around once I can get everything shut down. Just FYI, I've attached the latest SMART report for the drive that was having issues. This was taken after the recent repeating read errors but before the drive went offline today.
WDC_WD60EFRX-68L0BN1_WD-WX21D583NR7P-20220824-1237[1].txt
sunbear Posted August 29, 2022 (Author)
OK, I'm pretty confident it's not the cable but the plug on this drive (I've had issues with it before). I've cleaned and re-seated the plug. I'm starting the system back up and was going to run a filesystem check on that drive, if that's possible for a disabled drive. Do y'all have any recommendations for testing this drive to make sure I've got a good connection?
JorgeB Posted August 30, 2022
The disk is showing some SMART issues, though the tests passed. I would still recommend swapping with a different disk to completely rule out the cables; if it happens again, you'll know it's the disk.
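The post above does not say which SMART attributes look off. As a rough aid when reading a saved report, here is a minimal Python sketch that pulls a few commonly watched attributes out of a `smartctl`-style attribute table and flags non-zero raw values. The choice of attributes is an assumption on my part, `flag_attributes` is a hypothetical helper, and the sample table is invented, not copied from the attached report.

```python
# Attributes commonly watched on SATA drives (an assumed selection):
# 5/197/198 relate to bad sectors; 199 (UDMA CRC errors) usually points
# at the cable or connector rather than the disk itself.
WATCHED = {
    5: "Reallocated_Sector_Ct",
    197: "Current_Pending_Sector",
    198: "Offline_Uncorrectable",
    199: "UDMA_CRC_Error_Count",
}


def flag_attributes(report_text):
    """Return {attribute_name: raw_value} for watched attributes with raw > 0."""
    flagged = {}
    for line in report_text.splitlines():
        fields = line.split()
        # Attribute rows have 10+ columns and start with a numeric ID.
        if len(fields) < 10 or not fields[0].isdigit():
            continue
        if int(fields[0]) not in WATCHED:
            continue
        try:
            raw_value = int(fields[9])
        except ValueError:
            continue  # Skip raw values that are not plain integers.
        if raw_value > 0:
            flagged[fields[1]] = raw_value
    return flagged


# Hypothetical sample table, for illustration only.
sample_report = """
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   200   200   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       12
"""

print(flag_attributes(sample_report))
```

A rising CRC error count with clean sector counts would be consistent with the cable/connector theory, but it does not prove it; swapping with a different disk remains the more direct test.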
Solution
trurl Posted August 30, 2022
Latching cables can have problems with some WD drives: https://support-en.wd.com/app/answers/detailweb/a_id/15954