
Dockers Intermittently Unresponsive - vLAN / macvlan


Go to solution Solved by JorgeB,


I have recently started experiencing a new issue: dockers on my custom VLANs (br0, br0.4, br0.5, br0.6) intermittently become unresponsive. I can still ping IPs on any of the networks, but I cannot reach the web services running on any of them while the issue is occurring, including the Unraid webUI on br0.

 

No network changes have been made, and my Unraid server has been up for 3.5 months so far.

 

The issue occurs when there is heavier network load on them. For example, Plex mobile app downloading content locally to view offline while another docker is performing downloads, or if multiple people are watching Plex.

 

The Uptime-Kuma docker webUI also becomes inaccessible during the issue, but once it resolves, Uptime-Kuma shows docker monitor events stating:

'Knex: Timeout acquiring a connection. The pool is probably full. Are you missing a .transacting(trx) call?'

Additionally, external PRTG monitors show the HTTPS checks for Plex and other dockers timing out.

I also cannot reach the Unraid mgmt. webUI on br0 while the issue is occurring.

 

***While the issue is occurring, Unraid CPU, RAM, and network utilization are all low.

 

The issue resolves itself after approx. 3-5 min.

 

I'm on Unraid version 6.12.8.

 

My network config (not changed in over a year):

  • Two physical eth interfaces (eth0, eth1) with bonding and bridging enabled.
  • bond0 (eth0, eth1) is connected to a Cisco switch using a LAG port config.
  • All VLANs use parent interface bond0.
  • Docker VLANs br0, br0.5, br0.6 use the upstream OPNsense firewall for their DHCP pool.
  • Docker custom network type: macvlan

 

I do not see any kernel 'call trace' in my enhanced syslog plugin output.

 

Can someone help me narrow this down and/or recommend whether I should try switching the Docker custom network type to ipvlan?

 

Edited by gurulee
4 minutes ago, JorgeB said:

First thing I would recommend is updating to the latest stable release.

I have been holding off because of all the issues I'm reading about in the release notes and from other users.

But if the latest release has a specific fix for this, then I will plan for it.

46 minutes ago, gurulee said:

Thank you! I will report back in 48 hours.

The issue just reoccurred at around 3:30pm / 15:30. All webUI connectivity was lost to the Unraid mgmt int (br0) and to all my dockers, but I was still able to ping the interfaces and the dockers that have static IPs on the custom bridge VLANs.

 

The issue resolved itself after approx. 3 min, and the webUIs of the Unraid mgmt int and the dockers became accessible again.

 

I looked at my enhanced syslog plugin, and the last line below is the only entry around the time of the issue:

 

Sep 10 11:05:13 Tower root: Fix Common Problems: Error: Macvlan and Bridging found ** Ignored
Sep 10 11:10:40 Tower kernel: eth0: renamed from vethe9b73b7
Sep 10 11:15:06 Tower webGUI: Successful login user root from 192.168.100.90
Sep 10 11:18:48 Tower kernel: vethb78d931: renamed from eth0
Sep 10 11:19:03 Tower kernel: eth0: renamed from vethd9921c1
Sep 10 12:17:41 Tower emhttpd: spinning down /dev/sdf
Sep 10 12:45:53 Tower emhttpd: read SMART /dev/sdf
Sep 10 13:06:06 Tower emhttpd: spinning down /dev/sdd
Sep 10 13:16:28 Tower emhttpd: spinning down /dev/sdf
Sep 10 13:19:08 Tower emhttpd: read SMART /dev/sdd
Sep 10 13:54:51 Tower emhttpd: read SMART /dev/sdf
Sep 10 13:55:06 Tower emhttpd: spinning down /dev/sdd
Sep 10 15:30:46 Tower emhttpd: read SMART /dev/sdd
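
For anyone repeating this check against a longer log, here is a minimal Python sketch that keeps only the syslog lines within a few minutes of an incident time. The year (2024), the 5-minute window, and the function names are assumptions for illustration; classic syslog timestamps omit the year, so one has to be supplied.

```python
from datetime import datetime, timedelta


def parse_syslog_time(line, year):
    """Parse the leading 'Mon DD HH:MM:SS' stamp of a classic syslog line."""
    # The first 15 characters hold the timestamp, e.g. "Sep 10 15:30:46".
    return datetime.strptime(f"{year} {line[:15]}", "%Y %b %d %H:%M:%S")


def lines_near(lines, incident, minutes=5):
    """Return the lines whose timestamp is within +/- `minutes` of `incident`."""
    window = timedelta(minutes=minutes)
    hits = []
    for line in lines:
        try:
            stamp = parse_syslog_time(line, incident.year)
        except ValueError:
            continue  # skip lines that do not start with a timestamp
        if abs(stamp - incident) <= window:
            hits.append(line)
    return hits


# A few lines copied from the log excerpt above.
SAMPLE = [
    "Sep 10 11:05:13 Tower root: Fix Common Problems: Error: Macvlan and Bridging found ** Ignored",
    "Sep 10 13:55:06 Tower emhttpd: spinning down /dev/sdd",
    "Sep 10 15:30:46 Tower emhttpd: read SMART /dev/sdd",
]

# Assumption: the outage was noticed at about 15:30; the year is a guess.
incident = datetime(2024, 9, 10, 15, 30)
for line in lines_near(SAMPLE, incident):
    print(line)
# prints: Sep 10 15:30:46 Tower emhttpd: read SMART /dev/sdd
```

On the excerpt above this leaves only the SMART read on /dev/sdd, which matches the observation that nothing network-related was logged at the time.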

 

1 hour ago, JorgeB said:

Unfortunately there's nothing relevant logged.

Agreed, that is apparent. Is there a way to enable more verbose (debug-level) logging?

 

Can someone advise on next steps to narrow down the cause of this issue?

 

Essentially, all HTTP / HTTPS webUIs intermittently become inaccessible for approx. 3-5 min, both on the Unraid mgmt int on br0 and on all dockers on br0.4 and br0.5. During that time I can still ping all of the interfaces.

The issue seems intermittent, with no obvious pattern.
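
One way to narrow this down is to record, during an outage, whether the TCP handshake itself fails or whether the connection opens but no HTTP response comes back. The following is a rough Python sketch of such a probe, not a definitive diagnostic. The function names, the 3-second timeout, and the address in the usage comment are hypothetical, and it only speaks plain HTTP (HTTPS services would need `http.client.HTTPSConnection` instead).

```python
import http.client
import socket
import time


def tcp_connect_time(host, port, timeout=3.0):
    """Return seconds taken to complete a TCP handshake, or None on failure."""
    start = time.monotonic()
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return time.monotonic() - start
    except OSError:  # covers refused connections and timeouts
        return None


def http_status(host, port, path="/", timeout=3.0):
    """Return the HTTP status code for a GET request, or None on failure."""
    conn = http.client.HTTPConnection(host, port, timeout=timeout)
    try:
        conn.request("GET", path)
        return conn.getresponse().status
    except (OSError, http.client.HTTPException):
        return None
    finally:
        conn.close()


def probe(host, port):
    """Run both checks once and return a timestamped result."""
    tcp = tcp_connect_time(host, port)
    # Only try HTTP if the handshake worked; otherwise it would fail too.
    status = http_status(host, port) if tcp is not None else None
    return {
        "time": time.strftime("%Y-%m-%d %H:%M:%S"),
        "host": host,
        "port": port,
        "tcp_seconds": tcp,
        "http_status": status,
    }


# Usage (hypothetical address and port; replace with a real webUI):
#     print(probe("192.168.100.10", 80))
```

Running `probe` every few seconds from another machine on the LAN and appending the results to a file would give timestamps to line up against the syslog. As a rough heuristic, `tcp_seconds` being `None` while ping still works would point below the applications (bridge, macvlan, or firewall state), whereas a successful handshake with `http_status` of `None` would suggest the services themselves are stalled.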
