phenomeus Posted April 26, 2018 Share Posted April 26, 2018 hi everybody, I'm on the latest unraidos and a longtime user. in the last 2 weeks I had 2 dying drives as parity drives?! first it was a long time used 5TB from seagate and as the drives died, I quickly ordered a HGST NAS Drive 4TB. today it died too! i noticed it ran 2-3 times at 65° C… both hdds were connected to the same sata cable and to the same pice card in a n40l hp server. I'm surprised that the second drive made the same exit after just some days of running. brand new hdd what can I do to verify if its really broken or if I can fix something? quick help would be nice, thank you in advance. I'm a little bit confused right now Link to comment
trurl Posted April 26, 2018 Share Posted April 26, 2018 syslog snippets are seldom sufficient Tools - Diagnostics, post complete zip Link to comment
phenomeus Posted April 27, 2018 Author Share Posted April 27, 2018 you are totally right. here the zip. thank you stitch-diagnostics-20180427-0743.zip Link to comment
JorgeB Posted April 27, 2018 Share Posted April 27, 2018 Parity disk appears to be failing, you can confirm by running an extended SMART test, and if it is it's not surprising considering it ran above max temp: Lifetime Min/Max Temperature: 23/61 Celsius Limit is 60C and it should never be close to that, ideally it should always run under 40C, 45C max. Link to comment
phenomeus Posted April 27, 2018 Author Share Posted April 27, 2018 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Extended offline Completed without error 00% 316 - # 2 Short offline Completed without error 00% 296 - but the drive is still not accepted by unraid. can I do anything else except returning the drive/rma? the drive was running max 58° /55° and now at 52° Link to comment
JorgeB Posted April 28, 2018 Share Posted April 28, 2018 If you want to keep using it you'll need to re-sync parity, stop array, unassign parity, start array, stop array, re-assign parity, start array to begin parity sync. Link to comment
phenomeus Posted April 28, 2018 Author Share Posted April 28, 2018 is it safe to use it after this? I mean this drive was running 13 hours. right now the array Is unprotected so I tend to use it if its safe enough. Link to comment
JorgeB Posted April 28, 2018 Share Posted April 28, 2018 After that overheat I wounld't really trust that disk, but you also need to improve cooling before replacing it, or the same thing will happen. Link to comment
phenomeus Posted April 28, 2018 Author Share Posted April 28, 2018 Like you said. No trust. I tried add it back as parity and instantly there were read errors. So the drive goes back. And yeah i need to do something. The Seagate ran cooler and now it’s summer it died. My mistake Link to comment
Recommended Posts
Archived
This topic is now archived and is closed to further replies.