
4 hours ago, jimbo123 said:

Given I'm now running with a different board and different memory, is this now pointing to a disk?

Possibly. Unfortunately there's no way of knowing which one; you'd need to test them one by one. But if you do a standard replacement there's a chance of data corruption. If possible, I would recommend first trying a new parity disk. If there are still errors, create a new array with the new parity disk and without one of the existing data disks, re-sync parity, then run a check. Repeat, leaving out a different data disk each time, to see if you find the culprit.
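To illustrate why a parity check alone can't say which disk is at fault, here is a minimal sketch. It assumes single parity computed as a plain byte-wise XOR across the data disks; the disk contents and the helper names `xor_parity` and `count_mismatches` are made up for the example and are not Unraid code.

```python
from functools import reduce

def xor_parity(blocks):
    """Return the byte-wise XOR of equally sized blocks."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))

def count_mismatches(data_blocks, parity_block):
    """Count byte positions where stored parity disagrees with recomputed parity."""
    recomputed = xor_parity(data_blocks)
    return sum(1 for p, r in zip(parity_block, recomputed) if p != r)

# Hypothetical contents of three data disks (one 4-byte stripe each).
data = [b"\x11\x22\x33\x44", b"\xaa\xbb\xcc\xdd", b"\x01\x02\x03\x04"]
parity = xor_parity(data)

# Case A: the second data disk silently returns one wrong byte.
bad_data = [data[0], b"\xaa\xbb\xcc\xde", data[2]]
errors_bad_data = count_mismatches(bad_data, parity)

# Case B: the parity disk returns one wrong byte instead.
bad_parity = parity[:3] + bytes([parity[3] ^ 0x03])
errors_bad_parity = count_mismatches(data, bad_parity)

# Both cases report the same single mismatch, so the check by itself
# cannot tell a bad data disk from a bad parity disk.
print(errors_bad_data, errors_bad_parity)
```

In both cases the check sees one mismatching byte at the same position, which is why the disks have to be swapped out or excluded one at a time.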


Hmmm,.. the 2nd part has me confused.

 

The first thing I should do is buy another disk to use as a 'new' parity disk with the existing data disks, replace the parity disk in the system, run the checks and see what happens. If the problem goes away, the 'original' parity disk was probably the culprit. Problem solved, phew.

 

If the problem is still there, the original parity disk is probably OK? And it's probably a fair assumption that the original board and memory were OK as well?

 

The next thing is what I'm a little confused about, especially the part about a new array with existing disks.

 

Can you expand on that, please?

 

I was sort of thinking maybe something like the below,...

Leave things in my array and current 'prod' hardware alone, build up my spare microserver (using the original board, memory and parity disk) and essentially build a new Unraid server (I can back up the USB to keep for the 'prod' server; that's just a simple file copy, isn't it?) and introduce a single data disk at a time?

If the parity check is OK, put the disk back into the 'prod' server and get the next one? Rinse and repeat until, hopefully, I get errors with a disk?

 

Thoughts? Maybe your version is easier, but it scares me.

1 hour ago, jimbo123 said:

If the problem is still there, the original parity disk is probably OK? And it's probably a fair assumption that the original board and memory were OK as well?

Probably yes.

 

1 hour ago, jimbo123 said:

The next thing is what I'm a little confused about, especially the part about a new array with existing disks.

If it's not parity, it could be one of the data disks. But if you just do a standard rebuild it will likely corrupt data, unless by luck the culprit is the first disk you pick. So I would make a new array without one of the data disks and test. If there are no more issues, it was that disk; if there are still errors, repeat the process leaving out a different data disk until, hopefully, you find the culprit.
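A rough sketch of why a standard rebuild is risky here, assuming single XOR parity and a parity disk that is itself correct. The disk contents, the one-bit fault and the helpers `read_disk` and `rebuild` are hypothetical, chosen only to show the effect.

```python
from functools import reduce

def xor_blocks(blocks):
    """Return the byte-wise XOR of equally sized blocks."""
    return bytes(reduce(lambda a, b: a ^ b, col) for col in zip(*blocks))

# Hypothetical true contents of data disks 0..2, and parity built from them.
good = [b"AAAA", b"BBBB", b"CCCC"]
parity = xor_blocks(good)

def read_disk(index, flaky_index):
    """Simulate a read; the flaky disk flips one bit in its first byte."""
    block = good[index]
    if index == flaky_index:
        return bytes([block[0] ^ 0x01]) + block[1:]
    return block

def rebuild(replaced_index, flaky_index):
    """Reconstruct the replaced disk from parity plus all the other data disks."""
    others = [read_disk(i, flaky_index)
              for i in range(len(good)) if i != replaced_index]
    return xor_blocks(others + [parity])

# Replacing a healthy disk while disk 1 is flaky: the bad read from
# disk 1 is baked into the rebuilt contents of disk 0.
wrong_pick = rebuild(replaced_index=0, flaky_index=1)

# Replacing the flaky disk itself: it is not read during the rebuild,
# so the reconstructed contents are correct.
right_pick = rebuild(replaced_index=1, flaky_index=1)

print(wrong_pick == good[0], right_pick == good[1])
```

Only the lucky case, where the disk being replaced is the faulty one, gives a clean rebuild. Making a new array without one disk and re-syncing parity avoids writing reconstructed data to any data disk.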


Reading up on the 'new array' bit,... is this the section describing that? https://wiki.unraid.net/UnRAID_6/Storage_Management#Reset_the_array_configuration

 

So from the guide:

Before starting any array changes, save super.dat from the config folder.
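Saving the file is just a copy. As a hedged sketch, a timestamped copy could look like the following; the paths `/boot/config/super.dat` and `/boot/config/backup` are assumptions about where the flash drive is mounted, and `backup_file` is a made-up helper. It can be run from any machine that can see the flash drive, and copying the file by hand works just as well.

```python
import shutil
import time
from pathlib import Path

def backup_file(source, backup_dir):
    """Copy source into backup_dir with a timestamp suffix and return the new path."""
    source = Path(source)
    backup_dir = Path(backup_dir)
    backup_dir.mkdir(parents=True, exist_ok=True)
    stamp = time.strftime("%Y%m%d-%H%M%S")
    target = backup_dir / f"{source.name}.{stamp}.bak"
    shutil.copy2(source, target)  # copy2 also preserves file timestamps
    return target

if __name__ == "__main__":
    # Assumed locations; adjust to wherever the flash drive is mounted.
    config_file = Path("/boot/config/super.dat")
    if config_file.exists():
        print(backup_file(config_file, "/boot/config/backup"))
```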

  1. Stop the array
  2. Tools > New config
  3. Click the checkbox to preserve the config; assume there'll be an option to de-select a disk ?
  4. Assume there'll be an option to Save changes ?
  5. Start the array; it mentions a checkbox indicating whether 'parity is valid' and normally to leave it unchecked unless told otherwise? check or uncheck?
  6. Do a parity check ?
  7. No errors from (6): chances are the disk de-selected in (3) is the suspect; stop here and replace the disk? I will reach out on how to bring in the new disk given the array config changes that have occurred (I was thinking of restoring the original config, super.old to super.dat, and then doing a standard disk swap?)
  8. Errors from (6): back to (1), and this time for (3) de-select a different disk and re-include the one de-selected in the previous run

And all of this is only after I:

  • buy a new parity disk
  • replace the parity disk
  • do 2 checks
  • and still have errors after the 2nd check

Sound right ?
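The loop in steps 1 to 8 can be sketched as a leave-one-out search. This is only a model of the manual procedure: `sync_without` and `run_check` are hypothetical callbacks standing for "New config without this disk, start the array, let parity sync" and "run one parity check and return the error count". Nothing here talks to Unraid, and the disk names are placeholders.

```python
def find_suspect(data_disks, sync_without, run_check, confirmations=2):
    """Return the first disk whose exclusion gives only clean checks, else None."""
    for disk in data_disks:
        sync_without(disk)  # new config without this disk, then parity sync
        # all() stops at the first check that reports errors, which mirrors
        # cancelling the check and moving on to the next disk.
        if all(run_check() == 0 for _ in range(confirmations)):
            return disk
    return None

class SimulatedArray:
    """Toy stand-in for the server: errors appear unless the culprit is excluded."""

    def __init__(self, culprit):
        self.culprit = culprit
        self.excluded = None

    def sync_without(self, disk):
        self.excluded = disk

    def run_check(self):
        return 0 if self.excluded == self.culprit else 1

array = SimulatedArray(culprit="disk2")
suspect = find_suspect(["disk1", "disk2", "disk3"],
                       array.sync_without, array.run_check)
print(suspect)  # disk2
```

The `confirmations=2` default reflects the idea of running a second check before trusting a clean result.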

 

 

44 minutes ago, jimbo123 said:

Click the checkbox to preserve the config; assume there'll be an option to de-select a disk ?

Yes.

 

44 minutes ago, jimbo123 said:

Assume there'll be an option to Save changes ?

Array is saved when you press the start button.

 

44 minutes ago, jimbo123 said:

Start the array; it mentions a checkbox indicating whether 'parity is valid' and normally to leave it unchecked unless told otherwise? check or uncheck?

Don't check it. Parity won't be valid with a missing disk; it will be synced after the array starts.
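A small sketch of why the old parity stops being valid once a disk is dropped from the array, again assuming single parity as a byte-wise XOR. The disk names and contents are invented for the example.

```python
from functools import reduce

def xor_parity(blocks):
    """Return the byte-wise XOR of equally sized blocks."""
    return bytes(reduce(lambda a, b: a ^ b, col) for col in zip(*blocks))

# Hypothetical two-byte stripes on three data disks.
disks = {"disk1": b"\x0f\x0f", "disk2": b"\xf0\x01", "disk3": b"\x33\x00"}

# Parity as it was computed with all three disks in the array.
old_parity = xor_parity(list(disks.values()))

# Parity the array needs once disk1 is left out of the new config.
remaining = [blocks for name, blocks in disks.items() if name != "disk1"]
new_parity = xor_parity(remaining)

# The two differ unless the removed disk held nothing but zeros,
# so the 'parity is valid' box must stay unchecked and a sync is needed.
parity_still_valid = old_parity == new_parity
print(parity_still_valid)  # False
```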

 

45 minutes ago, jimbo123 said:

Do a parity check ?

Yes, after it's synced.

 

46 minutes ago, jimbo123 said:

No errors from (6) chances are disk de-selected in (3) is suspect; stop here and replace disk ?

Yes, but I would run another check to confirm.

 


Managed to stop the array to have a look at the 'New config' function.

 

When preserving the config,.. do I select all here ? and then close ?

image.png.f3a50a22a9e3543980d3717e6c8a7f17.png

 

And then check 'yes I want to do this' and click Done ?

image.png.d89f337ea1c3e589f63f727d06be23e2.png

 

Where does the option to de-select a disk come up ? Will it come up afterwards ? or elsewhere ?

 

Sorry for what might seem like basic questions.

4 hours ago, jimbo123 said:

Where does the option to de-select a disk come up ? Will it come up afterwards ? or elsewhere ?

On the Main GUI page after you apply the new config.

If you select "None", then the Main GUI with all your assignments would be blank, and you would need to recreate the proper assignments. If you select all, the slots will be prefilled with the previous entries, but you can choose to change any that you want before committing to starting the array.


Just an update,.. I replaced the parity disk and it looks like the problem has gone away from what I can see below. Am just doing another check, hopefully it comes back with 0 as well.

image.png.ab0689f78759d3a16b1b52a18051e9de.png


Damn, I would now redo the array without disk1 or disk2 (or both), since they are the oldest ones.


I cancelled the check.

Think I did all the right things.

Deselected just one of the older disks first.

image.png.b3550b74b081a7cb282f7ec00503db3a.png

It's now rebuilding the parity.

image.png.4e05460eeb66cc2d8ab3e2f27487e43b.png

 

Guess the rebuild will probably take about a day.

 

Two parity checks afterwards?

 

If it gets through, then Disk 1 was the culprit.

 

If I still get errors, repeat, bringing back Disk 1 and taking out Disk 2,.. parity rebuild, parity checks x 2.

 

And same again with other disks if I continue to get errors.


OK, so after the parity rebuild following the removal of Disk 1, I ran a check and got errors, and a repeat check started to report errors again, as per the screenshots below.

 

I figure there's no point in letting the check finish - is that right ?

 

image.png.6dc0ba56e81ea80011825d6a96eade12.png image.png.e7f7a017beb52cb95c5eafafdbf31f5c.png

 

4 hours ago, jimbo123 said:

figure there's no point in letting the check finish - is that right ?

Yep, cancel and try again leaving out a different disk.


Thanks.

 

Retrying without Disk 2 now, (the other older 2TB drive)

 

Gut feel is that it's the 4TB that's running hotter than the rest. It's also sitting vertically in the cage, not at the top (Disk 1 is at the top in the DVD bay), which is the one I'd have thought would be the warmest.

 

image.png.8430cca2a5c2ad6f595d58c8a7aa8436.png


Latest update is looking good so far. Assume if it gets thru this check with 0 errors, that Disk 2 was the issue ?

 

image.png.edd080c76e6ddab5ea80302a3b76cd5e.png image.png.db37a5b98530fc6d88209ddbba24cc51.png

2 hours ago, jimbo123 said:

Assume if it gets thru this check with 0 errors, that Disk 2 was the issue ?

Most likely. I would run another one just to make sure. It's kind of worrying that a disk starts corrupting data out of the blue without any apparent reason while SMART looks normal.


Almost done now,.. I'll run another overnight then.

 

Would it be a safe assumption that original parity disk is still good?

 

I figure that if 'Disk 2' is the culprit, I'll replace it with the original parity disk.

 

image.png.ed3916795d872430f0fb037e665affbf.png

42 minutes ago, jimbo123 said:

Would it be a safe assumption that original parity disk is still good?

I would think so. It's extremely unlikely to have 2 disks returning corrupt data; having one is already very unusual.
