Jump to content

SMART report errors Reported Uncorrect


Recommended Posts

In Fix Common Problems I was made aware of errors from a SMART test.  I replicated with the extended self-test  (the short test found no errors).  The errors are #187, Reported Incorrect (10) and #198 Offline uncorrectable (16).  Will running reiserfsck possibly fix these or are they mechanical problems (which, presumably, would indicate I should replace the drive?)?  I have attached the SMART report. Thank you.

tower-smart-20170926-1259.zip

Link to comment

Damn it, JB, I knew you were going to say that!  -_-

 

The problem is that, if I'm going to replace it I had might as well just get a 6TB drive and use it for parity (for future-proofing).  The current parity drive is 4TB and the affected drive in question is 3TB.  What should be my course of action (i.e. sequence of steps)?     Thanks again.

Link to comment
  • 2 weeks later...

HELP.  I went through the parity swap procedure, introducing a 6tb drive as parity (to replace a 4tb parity drive) and to remove a failed 3tb drive.  This seemed to work fine, but after the procedure finished, ANOTHER drive failed (it was X'd out and i could not get it to be recognized).  So, I replaced THAT 2tb drive with a new 4tb drive.  The parity process completed and, after all of this, the new 4tb drive ( which replaced the 2tb drive) is apparently blank.  So, the old disk contents did not get copied over, apparently.  Fortunately, I can mount the old 2tb drive as an Unassigned Device (via the Main unRAID page I can drill down through the folders and apparently see files). So.... how do i copy over or merge the files on the old drive that never got built into parity?  Screenshot attached (Disk 4 is the new 4tb drive without any data on it).  THANK YOU.

tower-diagnostics-20171005-1357.zip

Screen Shot 2017-10-05 at 2.00.13 PM.jpg

Link to comment

Here, btw, is what the screen looked like when the Parity-Sync/Data-Rebuild process was 99.9% done:  note that Disk 4 is crossed out and there are 486M Errors.  This would explain why the disk 4 data never made it over to parity, I imagine. I suppose there is no way to reinsert the old drive back into the array, do a parity sync, remove it (i suppose it was crossed off for a reason, though there are no SMART errors) and re-sync?

Tower-Main.pdf

Link to comment

Most likely a misstep somewhere, like formatting an unmountable disk, but you can copy the data from the old disk to the new disk in the array.

 

24 minutes ago, smellthebean said:

but after the procedure finished, ANOTHER drive failed

 

Looks like the errors on disk4 happened during disk2 rebuild, not after, did you by any chance saved the diagnostics from the parity swap/rebuild and before rebooting?

 

Link to comment

You can't add it to the array like that, on the screenshot before you had the old disk mounted using the UD plugin, that's the best way to copy the data, mount it again and copy the data, e.g. using midnight commander, from /mnt/disks/old_disk_mount_point to /mnt/disk4

 

Also note that if the errors on disk4 happened during disk2 rebuild there will be corrupt data on the rebuilt disk.

Link to comment

Thanks johnnie.black.  File attached.  Also, some observations:

 

when i mount the tvshare via disk4 on my mac (when I mount it directly, i.e., smb://10.x.x.x/tvshare, I see no contents at all) I see folders with red warning icons (screenshot attached). l. This means, almost always, that I don't have permissions to view their contents (and that is, in fact, the dialog that is generated when I try to open them).  On the command line, however,  I am able to access a sharepoint and see its contents.  So it would be appear that, perhaps, the issue is only one of repairing permissions?  That's the optimistic view, at least.

 

tower-diagnostics-20171009-0954.zip

Screen Shot 2017-10-09 at 10.00.08 AM.jpg

Link to comment

There's filesystem corruption on disk2, likely from this:

 

On 05/10/2017 at 8:47 PM, johnnie.black said:

Also note that if the errors on disk4 happened during disk2 rebuild there will be corrupt data on the rebuilt disk.

 

Run reiserfsck on it, but for sure there will be some or a lot of corrupt files:

 

https://wiki.lime-technology.com/Check_Disk_Filesystems#Drives_formatted_with_ReiserFS_using_unRAID_v5_or_later

 

 

Link to comment

results from running on disk2.  should i run the --rebuild-tree?

 

Reiserfs journal '/dev/md2' in blocks [18..8211]: 6 transactions replayed
Checking internal tree..  /  1 (of  72)/  1 (of 160)/  3 (of 100)block 37224458: The level of the node (15540) is not correct, (1) expected
 the problem in the internal node occured (37224458), whole subtree is skipped
/  2 (of 160)/  4 (of 145)block 453330764: The level of the node (7086) is not correct, (1) expected
 the problem in the internal node occured (453330764), whole subtree is skipped
/  3 (of 160)/  1 (of 143)block 453327915: The level of the node (3988) is not correct, (1) expected
 the problem in the internal node occured (453327915), whole subtree is skipped
/  4 (of 160)block 138662205: The level of the node (4287) is not correct, (2) expected
 the problem in the internal node occured (138662205), whole subtree is skipped
/  2 (of  72)/  1 (of 157)block 365779662: The level of the node (65280) is not correct, (2) expected
 the problem in the internal node occured (365779662), whole subtree is skipped
/  3 (of  72)block 290553865: The level of the node (57693) is not correct, (3) expected
 the problem in the internal node occured (290553865), whole subtree is skipped
finished     
Comparing bitmaps..vpf-10640: The on-disk and the correct bitmaps differs.
Bad nodes were found, Semantic pass skipped
6 found corruptions can be fixed only when running with --rebuild-tree
###########
 

Link to comment

I was finally able to figure it out: reloading the web page would never work; trying it on another browser did.  This is not the first time I've had this problem:  to be honest, each browser (Chrome, Safari, Firefox) has its weirdnesses wrt various menus, clickables, etc.  Chrome is mostly unusable.

 

So, running now.  "Skipping 38019 blocks... 699190841 blocks will be read".

 

Apparently bitmap is wrong for those 38k blocks?

Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

×
×
  • Create New...