[Plugin] Parity Check Tuning


14 hours ago, Szene said:

I'm running the current latest stable version, 6.12.11.

I’ll do some checking against the 6.12.11 release in case something changed there and Unraid now writes the history entry when the check starts rather than when it finishes.

Link to comment

The History button does not display any content. The data is still there in /boot/config/parity-checks.log.  I checked the system log and found the latest entries for Parity Check Tuning which indicated:

 

Aug 5 10:21:00 Tower Parity Check Tuning: ERROR: Unexpected number of fields (4) in parity-check.log on line 1:

Aug 5 10:21:00 Tower Parity Check Tuning: ERROR: Unexpected number of fields (4) in parity-check.log on line 2:

Aug 5 10:21:34 Tower Parity Check Tuning: ERROR: Unexpected number of fields (4) in parity-check.log on line 1:

Aug 5 10:21:34 Tower Parity Check Tuning: ERROR: Unexpected number of fields (4) in parity-check.log on line 2:

 

The last few lines of parity-checks.log:

2024 Aug  5 03:14:06|132684|120.6 MB/s|0|0|check P|15625879500|341486|3|Scheduled Non-Correcting Parity-Check

 

And the lines in the system log associated with parity check tuning:

Aug 5 03:18:21 Tower Parity Check Tuning: Scheduled Non-Correcting Parity-Check finished (0 errors)

Aug 5 03:18:21 Tower Parity Check Tuning: Elapsed Time 3 day, 22 hr, 51 min, 26 sec, Runtime 1 day, 12 hr, 51 min, 24 sec, Increments 3, Average Speed 120.6 MB/s

Aug 5 03:18:21 Tower Parity Check Tuning: Send notification: Scheduled Non-Correcting Parity-Check finished (0 errors): Elapsed Time 3 day, 22 hr, 51 min, 26 sec, Runtime 1 day, 12 hr, 51 min, 24 sec, Increments 3, Average Speed 120.6 MB/s (type=normal link=/Settings/Scheduler)
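For anyone comparing the two logs, the record in parity-checks.log and the syslog summary appear to carry the same numbers. Below is a minimal Python sketch, assuming (from this single sample, not from any documented format) that the second `|`-separated field is the runtime in seconds and the eighth is the elapsed time in seconds. The function name is hypothetical.

```python
def format_duration(seconds: int) -> str:
    """Render seconds in the 'D day, H hr, M min, S sec' style seen in the syslog."""
    days, rem = divmod(seconds, 86400)
    hours, rem = divmod(rem, 3600)
    minutes, secs = divmod(rem, 60)
    return f"{days} day, {hours} hr, {minutes} min, {secs} sec"


# The record quoted above from parity-checks.log.
record = ("2024 Aug  5 03:14:06|132684|120.6 MB/s|0|0|check P|"
          "15625879500|341486|3|Scheduled Non-Correcting Parity-Check")
fields = record.split("|")

# Field positions are assumptions inferred from this one record.
runtime = format_duration(int(fields[1]))
elapsed = format_duration(int(fields[7]))

print(len(fields))  # 10
print(runtime)      # 1 day, 12 hr, 51 min, 24 sec
print(elapsed)      # 3 day, 22 hr, 51 min, 26 sec
```

The decoded values match the Runtime and Elapsed Time in the syslog lines, and this record splits into 10 fields rather than the 4 named in the errors. That suggests the most recent entry is intact and the entries being rejected are the ones the errors point to, on lines 1 and 2 of the file.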

 

 

The parity check tuning plug-in appears to be working. The task completes and entries are placed in the system log. However, something seems incorrect with the entries in parity-checks.log.

 

I can attach the actual logs and/or settings if needed. I would really like to get the history displaying again if possible.

Link to comment

Hi there,

 

I have had some trouble with the plugin lately. The check was scheduled to run from Monday. It paused during the designated times as expected, and it also paused because of overheating (which it should), but it never resumed. It could be that the resume threshold wasn't reached, but I doubt the temperature wouldn't have dropped enough within four hours.

 

Anyway, that is an issue I have to figure out myself. To get the check finished, I wanted to resume it manually. I opened the unRAID console and typed parity.check resume, but nothing happened and the console gave no feedback. The "elapsed" timer started counting up again, but the parity check itself did not resume. To get it going I have now adjusted the pause times, but I don't believe that is the intended way. Did I do something wrong, or do I have to type the command somewhere else?

 

Best Regards

Link to comment
2 hours ago, CameraRick said:

Hi there,

 

I have had some trouble with the plugin lately. The check was scheduled to run from Monday. It paused during the designated times as expected, and it also paused because of overheating (which it should), but it never resumed. It could be that the resume threshold wasn't reached, but I doubt the temperature wouldn't have dropped enough within four hours.

 

Anyway, that is an issue I have to figure out myself. To get the check finished, I wanted to resume it manually. I opened the unRAID console and typed parity.check resume, but nothing happened and the console gave no feedback. The "elapsed" timer started counting up again, but the parity check itself did not resume. To get it going I have now adjusted the pause times, but I don't believe that is the intended way. Did I do something wrong, or do I have to type the command somewhere else?

 

Best Regards

You should make sure you have disabled the option to pause/resume manual checks in the plugin’s settings as otherwise the resume will probably be ignored if it is outside the time set for increments.

 

I can look into whether I can let manual pause/resume commands override the settings for such a use case.

Link to comment
19 hours ago, terag1e said:

I can attach the actual logs and/or settings if needed. I would really like to get the history displaying again if possible.

The error messages are there to catch a case that should not occur in practice. It implies that an incorrect entry was somehow written, or that a manual edit has caused an entry to be split over two lines.

 

At this stage I would be grateful for a copy of the config/parity-checks.log file from the flash drive to see what is wrong with the entries. Whether it will give any clue as to why this happened I am not sure. That file is a text file, so removing the offending lines will stop the message occurring.
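Before editing the file by hand, it can help to see how many `|`-separated fields each line has. Below is a minimal Python sketch, assuming the file is at /boot/config/parity-checks.log as mentioned earlier in the thread. The function names are illustrative and this is not how the plugin itself validates entries.

```python
from collections import Counter
from pathlib import Path


def field_counts(lines):
    """Return (line_number, field_count) for each non-empty line, 1-indexed."""
    return [(n, len(line.rstrip("\n").split("|")))
            for n, line in enumerate(lines, start=1) if line.strip()]


def summarize(lines):
    """Count how many lines share each field count."""
    return Counter(count for _, count in field_counts(lines))


if __name__ == "__main__":
    log = Path("/boot/config/parity-checks.log")  # path as given in the thread
    if log.exists():
        lines = log.read_text().splitlines()
        print(summarize(lines))
        for n, count in field_counts(lines):
            if count == 4:  # the field count named in the error messages
                print(f"line {n}: {count} fields")
```

Treat the output as a list of candidates to inspect rather than proof of corruption, and take a copy of the file before removing anything.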

Link to comment
2 hours ago, CameraRick said:

it also paused because of overheating (which it should), but it never resumed. It could be that the resume threshold wasn't reached, but I doubt the temperature wouldn't have dropped enough within four hours.

It is always possible that this can occur, depending on your settings and what cooling you have. If you want to check for yourself, turning on the ‘debug’ level of logging in the plugin’s settings will result in information about this being logged every time the monitor task runs.

 

If you think there IS a bug then please enable the ‘testing’ mode of logging in the plugin’s settings. If you can then recreate the issue, the syslog should give me enough information to confirm why the check was not resumed.

Link to comment
13 minutes ago, itimpi said:

You should make sure you have disabled the option to pause/resume manual checks in the plugin’s settings as otherwise the resume will probably be ignored if it is outside the time set for increments.

 

I can look into whether I can let manual pause/resume commands override the settings for such a use case.

Hi itimpi,

 

I have set the option "Use increments for manual Parity Check" to "no", which from the description seems more suitable for me.

Maybe I misunderstand the option, but surely a manual resume shouldn't work only during the set increment times, when the check is running anyway?

Link to comment
3 minutes ago, CameraRick said:

Maybe I misunderstand the option, but surely a manual resume shouldn't work only during the set increment times, when the check is running anyway?

The current implementation would resume and then very quickly pause when it detects it is outside the increment time.
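To illustrate the behaviour described, here is a purely hypothetical Python sketch of an increment-window test. None of these names or values come from the plugin. The only point is that a monitor which pauses whenever the current time falls outside the configured window will undo a manual resume the next time it runs.

```python
from datetime import time


def in_increment_window(now: time, start: time, end: time) -> bool:
    """True if `now` is inside the window; handles windows that span midnight."""
    if start <= end:
        return start <= now < end
    return now >= start or now < end


def monitor_action(running: bool, now: time, start: time, end: time) -> str:
    """What a monitor with no notion of a manual override would do on each run."""
    inside = in_increment_window(now, start, end)
    if running and not inside:
        return "pause"
    if not running and inside:
        return "resume"
    return "none"


# Example window of 22:00 to 06:00 (made-up values): a check resumed by hand
# at noon is running outside the window, so the next monitor run pauses it.
print(monitor_action(True, time(12, 0), time(22, 0), time(6, 0)))
```

Honouring a manual resume would need some extra state, such as an override flag that lasts until the next increment starts, which this sketch deliberately leaves out.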

 

As I said, I will look into what it would take to honour a manual instruction from the user as an override until the time of the next increment, even when the option to pause/resume manual checks is set in the settings.

Link to comment
2 hours ago, itimpi said:

The current implementation would resume and then very quickly pause when it detects it is outside the increment time.

 

As I said, I will look into what it would take to honour a manual instruction from the user as an override until the time of the next increment, even when the option to pause/resume manual checks is set in the settings.

Thanks for clarifying!

 

Just for my understanding, what is it used for, then? Or is it meant more for resuming after I have manually paused the check?

Link to comment
3 hours ago, CameraRick said:

Just for my understanding, what is it used for, then? Or is it meant more for resuming after I have manually paused the check?

That was how I had thought it would be used. However, your case does sound sensible, so I will see if it can be accommodated.

Link to comment
8 hours ago, itimpi said:

The error messages are there to catch a case that should not occur in practice. It implies that an incorrect entry was somehow written, or that a manual edit has caused an entry to be split over two lines.

 

At this stage I would be grateful for a copy of the config/parity-checks.log file from the flash drive to see what is wrong with the entries. Whether it will give any clue as to why this happened I am not sure. That file is a text file, so removing the offending lines will stop the message occurring.

Thanks. It does look like the parity-checks.log file has some very strange format shifts throughout. I can't say when the history stopped displaying, although I thought it began when I installed the plug-in. Of course, it may have been when I upgraded to 6.12.10, or it may have just been ill fortune for whatever reason. I've attached the log file. Thanks for taking a look.

parity-checks.log

Link to comment
On 8/6/2024 at 9:09 PM, terag1e said:

Thanks. It does look like the parity-checks.log file has some very strange format shifts throughout. I can't say when the history stopped displaying, although I thought it began when I installed the plug-in. Of course, it may have been when I upgraded to 6.12.10, or it may have just been ill fortune for whatever reason. I've attached the log file. Thanks for taking a look.

parity-checks.log

Thanks for that file. I can confirm that it does break the code in the plugin that displays parity history, and I am working on fixing that.

 

The file is very useful as it contains entries going back to at least 2018 (and possibly earlier, as the year was not included in the earliest entries). The format of the entries that Unraid uses has changed over time, but the plugin is meant to detect this and handle it appropriately, so having a file with such a wide variety of entries from different Unraid releases is very useful.

Link to comment

I have uploaded a new version of the plugin that should fix the issue of the Parity History not displaying correctly.    
 

@terag1e you will notice that this update ignores any history records that are older than 2018 (which existed in the sample history file you provided me), as those entries do not have the year as part of the date field. I could display them giving just the month and day, but there does not seem much point.

Link to comment
1 hour ago, Journeyman1 said:

Hello, can anyone explain why I have this message on the Docker container options?

Docker is working fine; I have 6.12.10.

[Screenshot attached]

This is because I have not yet finished implementing this feature in the plugin. It is something that was requested relatively recently, but I am not sure how useful it will actually be in practice. It has nothing to do with whether Docker itself is operating correctly or not.

Link to comment
16 hours ago, itimpi said:

This is because I have not yet finished implementing this feature in the plugin. It is something that was requested relatively recently, but I am not sure how useful it will actually be in practice. It has nothing to do with whether Docker itself is operating correctly or not.

It's surely quite useful to stop some containers that generate reads on the reference volume. Is there any alternative way to implement this? For example, can I run a user script using this plugin?

Link to comment
1 hour ago, Journeyman1 said:

It's surely quite useful to stop some containers that generate reads on the reference volume. Is there any alternative way to implement this? For example, can I run a user script using this plugin?

I cannot think of an easy way to do what you mention. There is nothing built into the plugin to enable you to run a User Script.

 

If you cannot find a way to do what you want independently of the plugin, you will just have to wait until I complete the implementation I have started to get this capability.

Link to comment

I used the Parity Problems Assistant today. It worked great for checking a problematic area of a disk that keeps getting disabled only when I stop the array, as outlined here:
 


However, I noticed when you choose a sector range to check, it seems to check beyond the area specified. I was trying to limit this to isolate the ending sector of the problematic area because the syslog caps at 100 sync errors.

 

Also, not sure if it's within the scope of this sort of plugin, but it would be awesome to integrate this:

This was very useful for me to isolate exactly what file may have an issue.

Link to comment
3 hours ago, johnsanc said:

I noticed when you choose a sector range to check, it seems to check beyond the area specified.

This will happen, and I cannot see a way around it as the plugin does not have fine-grained control over the process. All it will guarantee is that all of the range you specify will be covered.

 

Checking what file(s) might be affected is outside any scope I envisage for the plugin.   The Assistant itself was only added as an afterthought as it was relatively easy to do using the code already in place for the Parity Check Tuning capability.

Link to comment
