Brand new HDD went into "Error State" during an Unraid Data-Rebuild



33 minutes ago, Nanuk_ said:

Can I ask which log file in the diag you are looking at?

Several. They are all text files, you can examine them yourself.

 

The smart folder contains SMART reports for all attached drives, and also shows that disk8 is disabled.

The SMART report for disk8 shows that the extended test passed at 49 Power On Hours, just one hour less than the current value of that SMART attribute, so very recently.
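To make that comparison concrete, here is a minimal sketch of the arithmetic. The sample row is hypothetical: it is laid out the way smartctl typically prints a self-test log entry, not copied from the actual diagnostics, and the function names are invented for illustration. The 49 and 50 hour figures are the ones mentioned above.

```python
import re

# Hypothetical sample row, in the layout smartctl typically uses for the
# self-test log (not copied from the real diagnostics).
SAMPLE_LOG_LINE = "# 1  Extended offline    Completed without error       00%        49         -"


def parse_selftest_line(line):
    """Split one self-test log row into columns separated by 2+ spaces."""
    fields = re.split(r"\s{2,}", line.strip())
    # fields -> ['# 1', 'Extended offline', 'Completed without error', '00%', '49', '-']
    return {
        "test": fields[1],
        "status": fields[2],
        "lifetime_hours": int(fields[4]),
    }


def hours_since_test(entry, power_on_hours):
    """How many power-on hours have passed since the test was logged."""
    return power_on_hours - entry["lifetime_hours"]


entry = parse_selftest_line(SAMPLE_LOG_LINE)
# Current Power On Hours is 50, so the test finished about one hour ago.
print(entry["status"], hours_since_test(entry, 50))
```

A small difference between the two values means the test result describes the drive as it is now, not as it was weeks ago.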

 

system/df.txt shows all mounted filesystems; you can see that disk8 is mounted and contains 904G of data.

 

system/vars.txt shows the state of all disks among other things. It says disk8 is disabled and mounted. In this case, it is the emulated disk that is mounted, and the contents of the emulated disk should be the result of rebuild.

 

syslog is where I go to see I/O errors, but since you rebooted before getting diagnostics, nothing to see there.

 

config folder in diagnostics is the settings from the webUI, including disk assignments. In other words, your configuration. It is all on flash, which is why you must always have a current backup of the flash drive.

 

shares folder has your share settings; shareDisks.txt shows which disks they are on.

 

Other things in system folder help figure out your hardware, which plugins you have, ...
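If you want an overview before opening individual files, a rough sketch like the following lists what each folder of an extracted diagnostics zip contains. It assumes you have already unzipped the diagnostics somewhere; `index_diagnostics` is a made-up helper name and not part of Unraid.

```python
from collections import defaultdict
from pathlib import Path


def index_diagnostics(root):
    """Map each folder (relative to root) to the sorted names of its files."""
    root = Path(root)
    index = defaultdict(list)
    for path in root.rglob("*"):
        if path.is_file():
            # Files sitting directly in root are grouped under ".".
            folder = path.parent.relative_to(root).as_posix()
            index[folder].append(path.name)
    return {folder: sorted(names) for folder, names in sorted(index.items())}


def print_index(root):
    """Print one folder per block with its files indented underneath."""
    for folder, names in index_diagnostics(root).items():
        print(f"{folder}/")
        for name in names:
            print(f"    {name}")
```

Calling `print_index()` with the path of the extracted folder should show the smart, system, config and shares folders described above, each with its text files.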

21 hours ago, itimpi said:

It is normally better to run it from the GUI if at all possible, as it is easy to make a mistake with the command when running it via the command line.

Sorry, how do I run it from the GUI? I didn't want to ask, but a Google search only yielded the CLI version.

 


Finally found the section of the documentation that teaches it. Thanks @itimpi
Ran it with the -v flag.

 

Phase 1 - find and verify superblock...
        - block cache size set to 687056 entries
Phase 2 - using internal log
        - zero log...
zero_log: head block 25663 tail block 25663
        - scan filesystem freespace and inode maps...
        - found root inode chunk
Phase 3 - for each AG...
        - scan and clear agi unlinked lists...
        - process known inodes and perform inode discovery...
        - agno = 0
        - agno = 1
        - agno = 2
        - agno = 3
        - agno = 4
        - agno = 5
        - agno = 6
        - agno = 7
        - agno = 8
        - agno = 9
        - process newly discovered inodes...
Phase 4 - check for duplicate blocks...
        - setting up duplicate extent list...
        - check for inodes claiming duplicate blocks...
        - agno = 0
        - agno = 3
        - agno = 2
        - agno = 4
        - agno = 6
        - agno = 7
        - agno = 8
        - agno = 9
        - agno = 5
        - agno = 1
Phase 5 - rebuild AG headers and trees...
        - agno = 0
        - agno = 1
        - agno = 2
        - agno = 3
        - agno = 4
        - agno = 5
        - agno = 6
        - agno = 7
        - agno = 8
        - agno = 9
        - reset superblock...
Phase 6 - check inode connectivity...
        - resetting contents of realtime bitmap and summary inodes
        - traversing filesystem ...
        - agno = 0
        - agno = 1
        - agno = 2
        - agno = 3
        - agno = 4
        - agno = 5
        - agno = 6
        - agno = 7
        - agno = 8
        - agno = 9
        - traversal finished ...
        - moving disconnected inodes to lost+found ...
Phase 7 - verify and correct link counts...

        XFS_REPAIR Summary    Fri Sep 22 15:08:29 2023

Phase        Start        End        Duration
Phase 1:    09/22 15:08:27    09/22 15:08:27
Phase 2:    09/22 15:08:27    09/22 15:08:28    1 second
Phase 3:    09/22 15:08:28    09/22 15:08:28
Phase 4:    09/22 15:08:28    09/22 15:08:28
Phase 5:    09/22 15:08:28    09/22 15:08:29    1 second
Phase 6:    09/22 15:08:29    09/22 15:08:29
Phase 7:    09/22 15:08:29    09/22 15:08:29

Total run time: 2 seconds
done
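For anyone who wants to double-check the timing table, a small sketch that recomputes each phase's duration from the start and end stamps. The rows are the ones pasted above; the stamps carry no year, so it is passed in separately (2023, taken from the summary header). `phase_seconds` is just an illustrative name.

```python
import re
from datetime import datetime

# The seven summary rows from the xfs_repair output above.
SUMMARY = """\
Phase 1:    09/22 15:08:27    09/22 15:08:27
Phase 2:    09/22 15:08:27    09/22 15:08:28    1 second
Phase 3:    09/22 15:08:28    09/22 15:08:28
Phase 4:    09/22 15:08:28    09/22 15:08:28
Phase 5:    09/22 15:08:28    09/22 15:08:29    1 second
Phase 6:    09/22 15:08:29    09/22 15:08:29
Phase 7:    09/22 15:08:29    09/22 15:08:29
"""

PHASE_ROW = re.compile(
    r"^Phase (\d+):\s+(\d\d/\d\d \d\d:\d\d:\d\d)\s+(\d\d/\d\d \d\d:\d\d:\d\d)"
)


def phase_seconds(summary, year):
    """Return {phase number: duration in whole seconds} for each summary row."""
    fmt = "%Y/%m/%d %H:%M:%S"
    durations = {}
    for line in summary.splitlines():
        match = PHASE_ROW.match(line.strip())
        if not match:
            continue  # skip headers, blank lines and anything unexpected
        phase, start, end = match.groups()
        began = datetime.strptime(f"{year}/{start}", fmt)
        ended = datetime.strptime(f"{year}/{end}", fmt)
        durations[int(phase)] = int((ended - began).total_seconds())
    return durations


durations = phase_seconds(SUMMARY, 2023)
print(durations, "total:", sum(durations.values()))
```

The per-phase values add up to the 2 seconds reported as the total run time.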

23 minutes ago, Nanuk_ said:

Finally found the section of the documentation that teaches it. Thanks @itimpi
Ran it with the -v flag.

 


 

That looks good.

 

When you start the array in normal mode, does it now mount OK and show the expected contents? What is shown on the emulated drive is what you would end up with after a rebuild.

23 minutes ago, Nanuk_ said:

Looks normal now, no emulation. But if errors start happening again, do I just repeat the procedure? At what point should I RMA?

 

The only time it is definite that you should RMA is if a drive starts failing the Extended SMART test, as cable issues are by far the most common cause of situations like this. However, if a fault follows a drive even after things like cable swaps, then maybe it is also time to RMA the drive.
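Written out as a tiny decision helper, that rule of thumb looks something like this. It is only a sketch of the reasoning in this post; the function name, parameter names and return strings are made up for illustration.

```python
def rma_advice(extended_test_failed, fault_followed_drive_after_swaps):
    """Rule of thumb: a failed extended test is decisive, cables are the usual suspect."""
    if extended_test_failed:
        # The one case where an RMA is definitely warranted.
        return "RMA the drive"
    if fault_followed_drive_after_swaps:
        # The problem moved with the drive despite new cables or ports.
        return "probably time to RMA the drive"
    # Otherwise the cabling is by far the most likely culprit.
    return "check or swap cables first"


print(rma_advice(extended_test_failed=False, fault_followed_drive_after_swaps=False))
```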


Thanks @JorgeB. Any chance you know a good tutorial for this bit?

image.png.bcfbbebce9b679e1e5ac058ca2d6ad83.png


I have Krusader set up using Space Invader One's walkthrough, but I don't know how to copy them to the flash drive/boot drive (I believe you mean the Unraid flash drive). Sorry, his video doesn't show how to move a file there, and your walkthrough assumes I know how.


I think I found his old walkthrough on how to add the boot drive, but I think it is no longer valid. It's timestamped.

 

He basically says to add /boot/ in Host Path:, but it doesn't seem to be the path for the flash drive anymore. I'm assuming the video is too old (see bottom of post). I'm searching the forums for it but haven't found the flash drive path yet.

Or is it a permission that I have to change to allow me to access it? I'm going to try running "Docker Safe New Permissions" to see if that works.
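For reference, the copy itself is simple once the flash drive's location is known. A rough sketch, with loud assumptions: it takes /boot as the flash mount point only because that is the path the video gives, it needs a Python interpreter (which a stock install may not provide), and `copy_to_flash` and its `subdir` parameter are hypothetical names, not anything from Unraid or Krusader.

```python
import shutil
from pathlib import Path


def copy_to_flash(source, flash_root="/boot", subdir=""):
    """Copy one file into a folder on the flash drive and return the new path.

    flash_root defaults to /boot, the path mentioned in the video; treat it as
    an assumption and pass a different path if your system differs.
    """
    source = Path(source)
    flash_root = Path(flash_root)
    if not flash_root.is_dir():
        raise FileNotFoundError(
            f"{flash_root} is not a directory; is the flash drive mounted there?"
        )
    target_dir = flash_root / subdir
    target_dir.mkdir(parents=True, exist_ok=True)
    # copyfile copies contents only, without trying to carry over permissions.
    return Path(shutil.copyfile(source, target_dir / source.name))
```

If the function raises FileNotFoundError, the flash drive is not where the sketch expects it, which would answer the path question one way or the other.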


image.png.5b8a4f0835622bfbdeaf55172570a0e9.png


image.png.8f9ee3715a960d1eed221ad65dd52f14.png
 

 

