Jump to content

Trouble in Paradise


Recommended Posts

Posted

I have been having trouble with Unraid for the last week or so where I had two Kernel Panics and today all docker containers were frozen and I couldn't restart the server from the webUI or from ssh poweroff script or shutdown -h now

managed to copy the syslog before I hard rebooted

any idea whats wrong?

syslog

Posted
30 minutes ago, johnnie.black said:

Macvlan call traces are usually caused by having dockers with a custom IP address:

 

 

I don't think this is the culprit

 

I suspect Disk1 which holds docker image?

Any indication that it's failing?

Posted
18 minutes ago, johnnie.black said:

Not on the syslog, there might be if you post the complete diagnostics instead.

Here you go

I'm suspecting Disk1 is failing because docker image is on that hdd and it has been loading the docker containers very slowly recently

Thanks a lot for your help!

 

tower-diagnostics-20200623-1822.zip

Posted
48 minutes ago, johnnie.black said:

SMART report is incomplete due to a likely bug with smatmoontools, not sure if the new beta uses a newer version, you could try that.

Disk1 is a SAS drive and SMART report never worked on it I guess

any other way to check if Disk1 is failing?

Also if it's not Disk1 anything else in the logs or diagnostics that shows any other problems?

 

Posted
16 minutes ago, DjSamLb said:

Disk1 is a SAS drive and SMART report never worked

It will work with an older Unraid version, and possibly with the new beta, so you can run a long test and check the result.

Posted
1 hour ago, johnnie.black said:

It will work with an older Unraid version, and possibly with the new beta, so you can run a long test and check the result.

I don't remember it ever did with the older versions; any version in particular?

I am trying now moving the docker image to the cache SSD and clearing disk1 and check if the problem reoccurs

Posted

I will remove the Mellanox NIC and restart

Thank you!

14 minutes ago, johnnie.black said:

Last call trace is related to the Mellanox NIC, see if you don't get them by temporarily running without it.

 

Posted
13 minutes ago, johnnie.black said:

Now it's the other NIC, possibly you have some custom IP config that is causing issues, did you check the link above about docker with custom IP addresses?

 

I've always had custom IP addresses and never had any problems, I don't think it's the issue here

any other ideas?

Posted

I would start by ruling out any config issue first, by running in safe mode with all dockers/VMs disable, if stable start enabling one by one.

 

If it's a hardware problem it's difficult to catch anything in the logs, you'd need to start swapping some hardware and test.

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

×
×
  • Create New...