6.8.3 unraid server keeps freezing



Hello

I'm getting really desperate at this point.

My Unraid server had been working fine for a few weeks until everything froze while I was gaming on a VM.

I had to hard shut down my PC using the power button and boot again. Everything worked fine until it froze again after a couple of hours.

I was seeing some errors in my logs, and after googling them it seemed like I had to reformat my cache drives.

I made a couple of attempts to copy the current data from the cache to my array, but with no luck because it kept freezing on me. (Once the server froze, it kept running but nothing would work: I couldn't get to the UI, couldn't SSH in, couldn't connect to the Docker containers that were running, etc.)

So I gave up on copying my cache, thinking I would only lose one day of data by doing a reformat anyway.
Sadly, I lost my VMs and all my appdata. Pissed off as I already was, that was not the worst part: my server still keeps freezing in less than a few hours!

This last time that I got it running again, it isn't mounting my cache 1 drive because it says the filesystem was not found.

 

I have no idea where to look anymore!

Here are my diagnostics.

Can someone please help me?

tower-diagnostics-20200623-1809.zip

Link to comment

Thanks for the quick answer! I have two cache drives. Can I recover one using the other, or do I have to format both again and lose my appdata again? The wiki says to run a scrub, but that button is disabled for me now :(

Edited by nickyhvm
Link to comment

The freeze happened a few minutes after the previous diagnostics that I posted.

Mirror syslog to flash is enabled, but I can't seem to find the corresponding logs on my flash drive. Also, taking new diagnostics only gives me logs starting from the fresh boot?

In the meantime I will look at disk 4 again and post the results.

Link to comment

This is the output of xfs_repair -v on disk4:

Phase 1 - find and verify superblock...
        - block cache size set to 741776 entries
Phase 2 - using internal log
        - zero log...
zero_log: head block 725875 tail block 725875
        - scan filesystem freespace and inode maps...
        - found root inode chunk
Phase 3 - for each AG...
        - scan and clear agi unlinked lists...
        - process known inodes and perform inode discovery...
        - agno = 0
        - agno = 1
        - agno = 2
        - agno = 3
        - process newly discovered inodes...
Phase 4 - check for duplicate blocks...
        - setting up duplicate extent list...
        - check for inodes claiming duplicate blocks...
        - agno = 0
        - agno = 2
        - agno = 1
        - agno = 3
Phase 5 - rebuild AG headers and trees...
        - agno = 0
        - agno = 1
        - agno = 2
        - agno = 3
        - reset superblock...
Phase 6 - check inode connectivity...
        - resetting contents of realtime bitmap and summary inodes
        - traversing filesystem ...
        - agno = 0
        - agno = 1
        - agno = 2
        - agno = 3
        - traversal finished ...
        - moving disconnected inodes to lost+found ...
Phase 7 - verify and correct link counts...

        XFS_REPAIR Summary    Tue Jun 23 19:22:26 2020

Phase		Start		End		Duration
Phase 1:	06/23 19:22:13	06/23 19:22:13
Phase 2:	06/23 19:22:13	06/23 19:22:14	1 second
Phase 3:	06/23 19:22:14	06/23 19:22:20	6 seconds
Phase 4:	06/23 19:22:20	06/23 19:22:20
Phase 5:	06/23 19:22:20	06/23 19:22:20
Phase 6:	06/23 19:22:20	06/23 19:22:25	5 seconds
Phase 7:	06/23 19:22:25	06/23 19:22:25

Total run time: 12 seconds
done

 

Link to comment

Now I'm getting this in my logs:

Jun 23 19:53:26 Tower kernel: BTRFS critical (device loop2): corrupt leaf: root=266 block=126418944 slot=1 ino=1377 file_offset=0, invalid offset for file extent, have 2, should be aligned to 4096
Jun 23 19:53:26 Tower kernel: BTRFS critical (device loop2): corrupt leaf: root=266 block=126418944 slot=1 ino=1377 file_offset=0, invalid offset for file extent, have 2, should be aligned to 4096
Jun 23 19:53:26 Tower kernel: BTRFS error (device loop2): error loading props for ino 1377 (root 266): -5
Jun 23 19:53:26 Tower kernel: BTRFS critical (device loop2): corrupt leaf: root=266 block=126418944 slot=1 ino=1377 file_offset=0, invalid offset for file extent, have 2, should be aligned to 4096
Jun 23 19:53:26 Tower kernel: BTRFS critical (device

 

Is my cache getting corrupted again?
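
Since the same message repeats so often, it may help to check whether every report points at the same inode and block or at several different ones. Below is a minimal Python sketch for that, not an Unraid tool: the function name summarize_corrupt_leaves and the regular expression are assumptions written against the lines pasted above, and the sample text is copied from this post (including the final cut-off line, which is simply skipped).

```python
import re
from collections import Counter

# Regex written against the "corrupt leaf" lines quoted in this post (an assumption,
# not an official format description). Truncated lines will not match and are ignored.
CORRUPT_LEAF = re.compile(
    r"BTRFS critical \(device (?P<device>[^)]+)\): corrupt leaf: "
    r"root=(?P<root>\d+) block=(?P<block>\d+) slot=(?P<slot>\d+) ino=(?P<ino>\d+)"
)


def summarize_corrupt_leaves(log_text):
    """Count corrupt-leaf reports per (device, root, block, ino)."""
    counts = Counter()
    for match in CORRUPT_LEAF.finditer(log_text):
        key = (
            match["device"],
            int(match["root"]),
            int(match["block"]),
            int(match["ino"]),
        )
        counts[key] += 1
    return counts


# Sample copied from the log excerpt in this post.
_CRITICAL = (
    "Jun 23 19:53:26 Tower kernel: BTRFS critical (device loop2): corrupt leaf: "
    "root=266 block=126418944 slot=1 ino=1377 file_offset=0, invalid offset for "
    "file extent, have 2, should be aligned to 4096\n"
)
sample = (
    _CRITICAL
    + _CRITICAL
    + "Jun 23 19:53:26 Tower kernel: BTRFS error (device loop2): "
    "error loading props for ino 1377 (root 266): -5\n"
    + _CRITICAL
    + "Jun 23 19:53:26 Tower kernel: BTRFS critical (device"
)

if __name__ == "__main__":
    for (device, root, block, ino), n in summarize_corrupt_leaves(sample).items():
        print(f"{device}: root={root} block={block} ino={ino} reported {n} time(s)")
```

The function takes any text, so the contents of a saved syslog file could be passed in instead of the inline sample.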

Link to comment
