First server crash, now myriad new issues...


rmp5s

Good afternoon:

I had my server flat out crash a couple days ago for the first time in literally years.  Thought I got it back up and running but, as time goes on, the more issues I find.  Maybe you all have seen/heard of some of this stuff before and could point me in the right direction...

First issue - All of my Docker containers were gone.  I've actually had this happen before, went in and got them all added back, thought that was it.

Second issue - Noticed that none of them appeared to be working.  Looked at the logs and they all seemed to be complaining about directories not existing.  Weird.

Third issue - Did some more digging and found these "path does not exist" errors...

image.thumb.png.34c5e998f00e9acf796fd5a80b393661.png

Fourth issue - Opened up the command line to try to go there and see what was up and got the bizarre "transport endpoint is not connected" error that is also pictured.  No earthly idea wtf that even means...
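For anyone else who lands here with that message: "transport endpoint is not connected" is the text Linux prints for the `ENOTCONN` error code, and on a filesystem path it usually points at a FUSE mount whose backing process has died while the mount point is still there. A minimal sketch for telling that apart from a plain missing directory is below. The `/mnt/user` path and the `mount_state` helper are assumptions for illustration, not something taken from the diagnostics.

```python
import errno
import os


def mount_state(path):
    """Classify a path as 'ok', 'missing', or 'disconnected'.

    'disconnected' means os.stat() failed with ENOTCONN, the errno behind
    the "Transport endpoint is not connected" message.
    """
    try:
        os.stat(path)
    except OSError as exc:
        if exc.errno == errno.ENOTCONN:
            return "disconnected"
        if exc.errno == errno.ENOENT:
            return "missing"
        raise  # anything else (e.g. permission denied) is left to the caller
    return "ok"


if __name__ == "__main__":
    # Assumed path: where user shares are expected to be mounted.
    print(mount_state("/mnt/user"))
```

If this reports "disconnected", the directory itself still exists but whatever serves it has gone away, which would also explain containers logging "path does not exist" for anything underneath it.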

Anyone seen any of this before?  Log attached.

Thanks.

tower-diagnostics-20221112-1508.zip

36 minutes ago, trurl said:

You should disable Docker and VM Manager in Settings until you get things stable again. Many of your array disks are very full; you really want to keep more free space on each disk in case you need to do filesystem repair on any of them.

I've added more space recently and yeah, definitely want more free space on the disks...is there a way to "balance" them?

Rebooted again and all my shares are back.  That's nice.  Gotta say, that doesn't instill great faith in this operating system...

Either way, all that's back.  Some Dockers are back up and running, which is awesome, but I also have a bunch of them not acting right.  Going to have to go through and see wtf is up in the morning.  Good times.

Aaaaaaaaaaaand the shares disappeared again, just as was happening to other people in this other thread.

I rebooted and tried running chmod as mentioned in that other thread and it worked.  We'll see if it makes any difference long-term.

Anyone have any idea why my cache disk is suddenly unmountable?  I don't want to format it and lose whatever's on it...is there a way to get it back without formatting?  I'm thinking I may just have to format it...

Seems like the shares are sticking around now but I still seem to be having TONS of permissions issues...my Plex container is telling me this...

image.png.b08ec352e10a397f4d754b20d32ba07e.png

...and my ZeroTier container is telling me this...

image.png.ea11e747cbd3aeff357633e2187cf3ab.png

Anyone have any idea what's going on or something I could try?  

Cache is still saying "unmountable", too...what a cluster...

2 minutes ago, JorgeB said:
parent transid verify failed on 2446869233664 wanted 20726247 found 20726245

This error is fatal. It means some writes were lost because the device/kernel reported them as done when they were not; this is usually a controller/device firmware problem.

If there's important data in the pool there are some recovery options here.

Hmmm...that's what's making the shares disappear? Because they're still there since my last reboot...knock on wood...

 

The server is QUITE old...I suppose something in there could be starting to crap out...
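For reference, the two numbers in that log line can be compared directly. As far as I understand the message format, "wanted" is the transaction generation the parent block expects and "found" is the generation actually stored in the block on disk, so the difference shows how many transactions behind the on-disk copy is. A rough sketch that pulls the fields out of the quoted line (the `parse_transid_error` helper and its field names are made up for illustration):

```python
import re

# Matches the btrfs message exactly as quoted in this thread.
TRANSID_RE = re.compile(
    r"parent transid verify failed on (?P<bytenr>\d+) "
    r"wanted (?P<wanted>\d+) found (?P<found>\d+)"
)


def parse_transid_error(line):
    """Return the numeric fields of a transid error line, or None if no match."""
    match = TRANSID_RE.search(line)
    if match is None:
        return None
    info = {name: int(value) for name, value in match.groupdict().items()}
    # How many generations the on-disk block lags behind what was expected.
    info["behind"] = info["wanted"] - info["found"]
    return info


line = "parent transid verify failed on 2446869233664 wanted 20726247 found 20726245"
print(parse_transid_error(line))
# {'bytenr': 2446869233664, 'wanted': 20726247, 'found': 20726245, 'behind': 2}
```

Here the block is two generations behind, which fits the explanation that a couple of recent writes never actually made it to the device.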
