BrianAz, posted July 11:

This morning I encountered 128 read errors on one of my drives (Disk 17, a 6TB WD Red). Nothing else seems to be amiss. I've been running without any issue for years in an enterprise chassis. I run a monthly parity check, and it's been 6+ months since I've physically touched any of the hardware. There was nothing unusual about this morning (no one bumped the case, no power hiccups, etc.).

Would someone please take a look at my attached diagnostics and let me know if anything stands out?

Disk 17: SMART short test completed without error, running the long test now.

(Side note: I have 2x 12TB drives on the way. The plan was to upgrade parity and move the 2x 10TB drives I currently have as parity to data drives. Hopefully I can still do that.)

Thanks in advance.

Attached: tower-diagnostics-20240711-1511.zip
JorgeB, posted July 12:

It's logged as a disk problem, but these can sometimes be intermittent. Run an extended SMART test for that disk.
BrianAz (Author), posted July 13:

No issues during the extended SMART test for Disk 17.

SMART self-test history:

    Num  Test_Description    Status                   Remaining  LifeTime(hours)  LBA_of_first_error
    # 1  Extended offline    Completed without error  00%        64998            -
    # 2  Short offline       Completed without error  00%        64984            -

The SMART Error Log item still shows "No Errors Logged". Still just the 128 read errors. Current diagnostics attached. Thanks again.

Attached: tower-diagnostics-20240713-1617.zip
JorgeB, posted July 14:

Disk is OK for now, keep monitoring, especially these attributes:

Vendor Specific SMART Attributes with Thresholds:

    ID#  ATTRIBUTE_NAME         FLAGS   VALUE  WORST  THRESH  FAIL  RAW_VALUE
      1  Raw_Read_Error_Rate    POSR-K  200    200    051     -     1
    200  Multi_Zone_Error_Rate  ---R--  200    200    000     -     0

If they increase you are likely going to get more read errors, and in that case I would consider replacing it.
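One way to follow the monitoring advice above is to save the attribute table from time to time and compare the raw values of the two attributes JorgeB named. The following is only a minimal sketch, not part of Unraid or smartmontools: the function names `parse_raw_values` and `increased` are hypothetical, and it assumes the report text uses the same column layout as the table quoted above (ID#, name, six-character flags, value, worst, threshold, fail, raw value).

```python
import re

# The two attributes singled out in the thread, keyed by SMART ID.
WATCHED = {1: "Raw_Read_Error_Rate", 200: "Multi_Zone_Error_Rate"}

# One row of the attribute table, assuming the layout quoted in the thread:
# ID# ATTRIBUTE_NAME FLAGS VALUE WORST THRESH FAIL RAW_VALUE
ROW = re.compile(
    r"^\s*(\d+)\s+(\S+)\s+([A-Z-]{6})\s+(\d+)\s+(\d+)\s+(\d+)\s+(\S+)\s+(\d+)"
)


def parse_raw_values(report: str) -> dict[int, int]:
    """Return {attribute_id: raw_value} for the watched attributes only."""
    values = {}
    for line in report.splitlines():
        match = ROW.match(line)
        if match and int(match.group(1)) in WATCHED:
            values[int(match.group(1))] = int(match.group(8))
    return values


def increased(old_report: str, new_report: str) -> dict[str, tuple[int, int]]:
    """Return {name: (old_raw, new_raw)} for watched attributes that went up."""
    before = parse_raw_values(old_report)
    after = parse_raw_values(new_report)
    return {
        WATCHED[attr_id]: (before[attr_id], after[attr_id])
        for attr_id in WATCHED
        if attr_id in before and attr_id in after
        and after[attr_id] > before[attr_id]
    }


if __name__ == "__main__":
    # Rows copied from the July 14 post; a later snapshot would be compared
    # against this one with increased(old, new).
    snapshot = (
        "  1 Raw_Read_Error_Rate    POSR-K 200 200 051 - 1\n"
        "200 Multi_Zone_Error_Rate  ---R-- 200 200 000 - 0\n"
    )
    print(parse_raw_values(snapshot))
```

An empty result from `increased` means neither raw value has grown since the earlier snapshot. A non-empty result is the situation JorgeB describes, where more read errors are likely and replacing the disk is worth considering.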
BrianAz (Author), posted July 14:

Thank you for taking the time to evaluate. I will pay attention to the attributes you highlighted.
BrianAz (Author), posted July 16:

Today, I have 130 read errors on a different drive (Disk 19). This time it looks like Unraid has disabled the drive and it's now being emulated. New diagnostics attached. Please help. The new drives should hopefully get here soon.

Attached: tower-diagnostics-20240715-1936.zip
JorgeB, posted July 16:

The disk dropped offline in this case, which is usually a power/connection issue. Reboot and post new diags after array start so the SMART report can be checked.
BrianAz (Author), posted July 16, replying to JorgeB's request to reboot and post new diags after array start:

Attached. Thanks.

Attached: tower-diagnostics-20240716-0903.zip
JorgeB, posted July 16 (Solution):

SMART looks OK. Check/replace cables and, assuming the contents of the emulated disk look correct, you can rebuild on top.
BrianAz (Author), posted July 27:

Thanks again for taking a look. I rebuilt on top and then went through the entire process to add 2x 12TB parity disks one at a time and then moved my old 2x 10TB parity disks down to data disks. No errors.