June 20, 20188 yr So my UPC battery died yesterday and forced an unclean shutdown. When I started the array again, a parity check was performed automatically. It came back with ~1500 sync errors. The other interesting thing is that last parity check (~ 2 mos. ago) there were exactly the same number of sync errors. I have included the diagnostics. I was under the impression that, unless you specifically uncheck the option, that parity checks were correcting. This does not seem to be the case as i see NOCORRECT in the syslog. In the main UnRAID window the correcting box was checked, when I went to look aft it completed. My questions are: 1. why wasn't a correcting check done, as I thought that was the default? 2. Does anyone see anything helpful in the syslog? 3. should I go ahead and do a correcting check, or is something else warranted? 4. can anyone tell if it is a particular disk that may be the culprit? Thanks for any wisdom people are willing to impart. tower-diagnostics-20180619-2146.zip Edited June 20, 20188 yr by ratmice
June 20, 20188 yr Community Expert 4 hours ago, ratmice said: I was under the impression that, unless you specifically uncheck the option, that parity checks were correcting. Automatic parity checks, like after an unclean shutdown, are always non correct, so if errors are found you need to run a correcting check after, or since errors are expected after an unclean shutdown, cancel the automatic check and start a correcting check right away.
June 20, 20188 yr The danger with automatic correction is that the system can overwrite a 99.99% valid parity with garbage because one or more of the data disks have broken and started to produce garbage. Especially since the unclean shutdown could have been caused by unclean power - possibly a nearby lightning strike. So it's always important to do a non-correcting check and see that the majority of the data have valid parity. Obviously, this step doesn't need to run the full 100%. Even after 5%, it's enough to see that the parity is mostly correct so all array disks are up and running and producing correct data. Then it's safe to do a correcting check and fix the individual blocks that are incorrect because the volumes were mounted and because of in-progress disk writes during the shutdown/hang.
Archived
This topic is now archived and is closed to further replies.