Squnkraid, posted January 16, 2021 (edited January 16, 2021)

TL;DR:
- Power loss on a server with 12 disks.
- 2 parity checks with 1024/1124 errors.
- 3 disks with a few "Reported uncorrect" errors.
- All 3 disks passed an extended SMART test.
- Reboot; parity check (without correction): 0 errors.
- All 3 disks passed a short SMART test.
- Another test? Or acknowledge the errors and get on with it?

Story:
I had an accidental power loss a few days ago. I was using a Blitzwolff plug to measure the server's power draw (before the UPS), but when I opened the app to watch the power consumption, a ghost touch on the screen switched the power off... So, never doing that again.

I searched online for what to do next. These are the steps I followed:
- After the reboot, a parity check ran. Result: 1024 errors, parity valid.
- Three disks (in an array of 12) showed an orange thumbs-down:
  - parity drive: Reported uncorrect 10
  - parity2 drive: Reported uncorrect 26
  - disk 10: Reported uncorrect 6
- Ran an extended SMART test on all three drives: passed.
- Ran another parity check. Result: 1124 errors, parity valid.

That concerned me a bit, but after reading here I did the following:
- Rebooted.
- Reran the parity check (without error correction). Result: 0 errors, parity valid.
- Ran another short SMART test on all three drives: passed.

So am I good to go? Do I just acknowledge the errors on the three drives and try to forget this ever happened? Or should I run another extended SMART test or parity check (with or without error correction)?

Your advice is greatly appreciated!
JorgeB, posted January 16, 2021

2 hours ago, Squnkraid said: "2 Parity Checks with 1024/1124 errors"

Were these consecutive checks? If so, I would worry about the differing number of sync errors.
Squnkraid, posted January 16, 2021 (edited January 16, 2021)

1 hour ago, JorgeB said: "Were these consecutive checks? If so I would worry about a different number of sync errors."

I just double-checked the Parity/Read-Check history, and I had written it down wrong; both were 1124. The history, newest first:

2021-01-16, 12:52:45 - 14 hr, 59 min, 32 sec - 92.7 MB/s - Status OK - Errors 0
Reboot
2021-01-15, 07:33:25 - 15 hr, 53 min, 47 sec - 87.4 MB/s - Status OK - Errors 1124
2021-01-14, 00:34:02 - 15 hr, 28 min, 25 sec - 89.8 MB/s - Status OK - Errors 1124
Startup Powerloss
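Side note for anyone who would rather check a pasted history than compare it by eye: below is a minimal Python sketch, assuming the history lines look exactly like the ones pasted in this post. The names `parse_history` and `mismatched_pairs` are made up for illustration and are not part of Unraid. It only flags back-to-back checks that both found errors but report different counts, which is the case JorgeB was worried about.

```python
import re
from typing import List, Tuple

# History text copied from the post above; marker lines such as "Reboot" are ignored.
HISTORY = """\
2021-01-16, 12:52:45 - 14 hr, 59 min, 32 sec - 92.7 MB/s - Status OK - Errors 0
Reboot
2021-01-15, 07:33:25 - 15 hr, 53 min, 47 sec - 87.4 MB/s - Status OK - Errors 1124
2021-01-14, 00:34:02 - 15 hr, 28 min, 25 sec - 89.8 MB/s - Status OK - Errors 1124
"""

# Assumed line layout: "<date>, <time> - <duration> - <speed> - Status <x> - Errors <n>"
LINE_RE = re.compile(r"^(\d{4}-\d{2}-\d{2}), (\d{2}:\d{2}:\d{2}) - .* - Errors (\d+)$")

Entry = Tuple[str, int]  # (timestamp, error count)


def parse_history(text: str) -> List[Entry]:
    """Return (timestamp, errors) pairs sorted oldest first; other lines are skipped."""
    entries = []
    for line in text.splitlines():
        match = LINE_RE.match(line.strip())
        if match:
            timestamp = f"{match.group(1)} {match.group(2)}"
            entries.append((timestamp, int(match.group(3))))
    return sorted(entries)


def mismatched_pairs(entries: List[Entry]) -> List[Tuple[Entry, Entry]]:
    """Return adjacent checks that both found errors but disagree on the count."""
    return [
        (older, newer)
        for older, newer in zip(entries, entries[1:])
        if older[1] > 0 and newer[1] > 0 and older[1] != newer[1]
    ]


if __name__ == "__main__":
    checks = parse_history(HISTORY)
    for timestamp, errors in checks:
        print(timestamp, errors)
    print("mismatched pairs:", mismatched_pairs(checks))
```

With the history from this thread, the two checks that found errors agree with each other, so nothing is flagged. Had the first check really been 1024, that pair would be reported.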
JorgeB, posted January 17, 2021

OK, that's much better. Assuming the first check was non-correcting, you should be OK, but I would run one more non-correcting check just to make sure.
Squnkraid, posted January 18, 2021

On 1/17/2021 at 9:41 AM, JorgeB said: "Ok, that's much better, assuming the first check was non correct, you should be OK, but I would just run another non correcting check to make sure."

The very first check after the power loss was "correcting", right? Does that matter? Unraid started that parity check by itself, and "correcting" is enabled by default, right?

Anyway, I ran another parity check (non-correcting) as you suggested, and it passed:

Parity check finished (0 errors)
Duration: 14 hours, 44 minutes, 35 seconds. Average speed: 94.2 MB/s

Do I need to do anything else before I bring my dockers online again? The 3 affected drives still show an "error" in the dashboard array window. I assume I just need to "acknowledge" the errors and get on with it, since I already ran a short and an extended SMART test on all 3 drives and they all passed.
JorgeB, posted January 18, 2021

6 minutes ago, Squnkraid said: "The very first check after the powerloss was 'correcting', right? Does that matter? Unraid started that Parity Check by itself and by default "correcting" is enabled, right?"

The automatic check after a power loss is non-correcting.

6 minutes ago, Squnkraid said: "Do I need to do anything else before I bring my dockers online again?"

Should be fine.

6 minutes ago, Squnkraid said: "I assume I just need to "acknowledge" the errors and get on with it?"

Yep.
Squnkraid, posted January 18, 2021

1 hour ago, JorgeB said: "Auto check after a power loss is non correct."

I didn't know that. Thinking about it, it makes more sense, and it explains why both checks found 1124 errors: the first one did not correct anything.

I rebooted the server and everything is running great again. Thank you again for being so helpful! I greatly appreciate it!