February 8, 20233 yr Hi all, So I've just built up a new server and watching media on JF the server will randomly lag out. As in it becomes unresponsive from a UI point of view, and the video I'm watching just stops buffering so it gets to the end and just stops. I've only noticed then when watching JF, and it happened today at 14:30 while watching a show. I have added my diagnostics after the server became responsive again. I can log into the console via IPMI when the UI is dead. During the time of lockup, I am unable to hit any docker UI, and the unRAID UI I dont believe I was transcoding at the time, but I do have HW transcoding setup with a p2000. I'm not sure what I can test or check regarding this. And again Ive not noticed the UI or server locked at any other point other than when watching JF. And it can happen within 5mins or within 30mins. I did run a RAM check and it didnt return any errors. namek-diagnostics-20230208-1449.zip Edited February 8, 20233 yr by SavellM
February 10, 20233 yr Author On 2/8/2023 at 4:19 PM, JorgeB said: Enable the syslog server and post that after it happens. So happened again, and came back around 11:05am Here is what is captured in this syslog I have another file from before this thats over 1mb if you would like that one too. Looks to be something with the network if I'm reading it right? syslog-10.0.0.10.log
February 10, 20233 yr Community Expert Feb 10 09:00:38 Namek dhcpcd[1873]: br0: failed to renew DHCP, rebinding Try using a static IP address.
February 10, 20233 yr Author 2 hours ago, JorgeB said: Feb 10 09:00:38 Namek dhcpcd[1873]: br0: failed to renew DHCP, rebinding Try using a static IP address. Added Static, and started up the array. Instantly lost access. Updated syslog, started array around 14:52 iesh Hopefully now its online it'll stay that way. Is this a failure on my OPNsense side of thing? Cable issue? Or new motherboard NIC issue? Or just software based issue? syslog-10.0.0.10 (1).log
February 10, 20233 yr Community Expert Unlikely that the server is the problem, could be the DHCP server or a lan configuration issue.
February 13, 20233 yr Author On 2/10/2023 at 3:07 PM, JorgeB said: Unlikely that the server is the problem, could be the DHCP server or a lan configuration issue. Unfortunately still happening. Although when I noticed at 10am it never came back even after 30mins, so restarted and went out. Came back and still could not access... Any other thoughts I can do to fix this? syslog-10.0.0.10 (2).log
February 13, 20233 yr Community Expert Do you know the date/time it happened last? Not seeing anything obvious in the log.
February 26, 20233 yr Author So I changed from OPNsense to pfSense - no fix Changed network cables from 1gb port on my switch to 10gb SFP+ - no fix Changed network cables - no fix Changed the 10gbe NIC on the server itself from eth0 to eth1 - no fix Set to fixed IP - no fix I'm still getting these time outs where unRAID isnt responsive I'm desperate for a fix here, as its hella annoying Latest syslog added... All bios is up to date syslog-10.0.0.10.log
February 26, 20233 yr Author 4 hours ago, JorgeB said: Time it last happened? sorry just saw this. it was 10am I believe
February 27, 20233 yr Community Expert Feb 26 10:05:00 Namek kernel: i40e 0000:1a:00.1 eth1: NIC Link is Down Feb 26 10:05:00 Namek kernel: bond0: (slave eth1): link status definitely down, disabling slave Feb 26 10:05:00 Namek kernel: device eth1 left promiscuous mode Feb 26 10:05:00 Namek kernel: bond0: now running without any active interface! Feb 26 10:05:00 Namek kernel: br0: port 1(bond0) entered disabled state Feb 26 10:05:01 Namek kernel: i40e 0000:1a:00.1: leaving allmulti mode. Feb 26 10:05:01 Namek dhcpcd[1770]: br0: carrier lost NIC link was lost at that time.
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.