dlikhten Posted May 10, 2023 Hi everyone, I'm trying to figure out what is happening to my server and how to diagnose it. I am using new CAT6 cables. I have also tried different cables, including Cat 5e. When I plug in the server (or unplug and replug it), the link comes up at 1000 Mbps full duplex and works totally fine. At some point within a few days it drops to 100 Mbps and stays there until I replug it with the same cables. It is plugged directly into my home router. Any ideas on how to diagnose or solve this? Any advice would be much appreciated.
Tom3 Posted May 10, 2023 Share Posted May 10, 2023 Speculation... If sufficient errors were detected, the link might auto re-negotiate the link speed. That might fail for 1 GbE, then retry and succeed for 100 MbE. Can you disable auto negotiation, and set the link to only 1 GbE at the router? At the server? -- Tom Quote Link to comment
MAM59 Posted May 11, 2023 Another speculation... Your router or NIC may be set to "green mode", which means "drop down to 100 Mbps to save energy when idle". If the wakeup does not work for any reason, it will stay at 100 forever. So check the port settings in the router (most likely the culprit).
Frank1940 Posted May 11, 2023 Share Posted May 11, 2023 21 hours ago, dlikhten said: When plugging in the server (or unplug/replug) the server works at 1000mbps in full duplex. Works totally fine. At some point in a few days it reverts to 100mbps until I replug with the same cables. It is plugged in directly into my home router. I would suggest getting a Gb switch. Each device you plug into router (or switch for that matter) will increase the power used. It is my opinion that you should have only two things plugged into your cat5 ports on your router-- The WAN cable to the modem and the LAN cable to the switch. That will prolong your router life, which probably costs about five times the cost of that switch, by reducing its temperature. 8 port switches start at a $20US and 5 port switches start at about $15US. Quote Link to comment
dlikhten Posted May 11, 2023 Author Okay, 3 replies, so I'm going to address all 3: 1) I do already have a small switch right by the router (I needed more ports), and it is gigabit. 2) I did not know about green settings. I will check the router. HOWEVER 3) I will plug everything into the switch if it does indeed help with power draw AND improve the lifetime of the router. The switch does indeed cost peanuts comparatively.
MAM59 Posted May 11, 2023 Share Posted May 11, 2023 6 minutes ago, dlikhten said: I did not know about green settings I will check the router HOWEVER if you plug it into the switch, look there (in case it is a managed one and allows looking, the unmanaged cheap ones do not know about green mode at all) Quote Link to comment
dlikhten Posted May 11, 2023 Author Share Posted May 11, 2023 (edited) 6 minutes ago, MAM59 said: if you plug it into the switch, look there (in case it is a managed one and allows looking, the unmanaged cheap ones do not know about green mode at all) I apparently was already plugged into a switch. This was a very cheap unmanaged one so no internal settings are exposed. However I did switch the duplex mode from auto negotiation to 1000mbps full duplex on the router. Don't know if that will be significant enough. Guess I'll know more in about a day. Edited May 11, 2023 by dlikhten Quote Link to comment
MAM59 Posted May 11, 2023 Share Posted May 11, 2023 7 minutes ago, dlikhten said: I apparently was already plugged into a switch. ok, then forget my speculation, it does not apply to you. turn on logging at the server, wait for speed drop to happen and save the logs. maybe there is something that reveals a clue in which direction to search? A speed change never slips away unnoted. Quote Link to comment
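One way to catch the moment of the speed drop, as suggested above, is to poll the negotiated link speed and note every change. The following is only a minimal sketch: it assumes a Linux server whose interface is named eth0 and which exposes the speed under /sys/class/net/eth0/speed. The function names and the 60-second interval are made up for illustration.

```python
import time
from pathlib import Path


def read_link_speed(iface="eth0", sysfs_root="/sys/class/net"):
    """Return the negotiated speed in Mbps, or None if the link is down or unreadable."""
    try:
        value = int((Path(sysfs_root) / iface / "speed").read_text().strip())
    except (OSError, ValueError):
        # Reading the file can fail while the link is down or if the interface is missing.
        return None
    return value if value > 0 else None


def watch_link_speed(iface="eth0", interval=60, read=read_link_speed,
                     sleep=time.sleep, max_polls=None):
    """Print a timestamped line whenever the speed changes and return the list of changes."""
    changes = []
    last = object()  # sentinel, so the very first reading is always recorded
    polls = 0
    while max_polls is None or polls < max_polls:
        speed = read(iface)
        if speed != last:
            stamp = time.strftime("%Y-%m-%d %H:%M:%S")
            label = "link down" if speed is None else f"{speed} Mbps"
            print(f"{stamp} {iface}: {label}")
            changes.append(speed)
            last = speed
        polls += 1
        sleep(interval)
    return changes
```

Calling watch_link_speed() with no arguments loops until interrupted. Redirecting its output to a file on persistent storage would keep the record across a reboot.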
Vr2Io Posted May 11, 2023 Some suggestions: Always leave auto-negotiation on. If the problem happens both when plugged into the switch and when plugged into the router, then it is a server-side issue. 1. Change the NIC, or 2. Change the PSU power cable and confirm the earth wire is really connected to earth.
dlikhten Posted May 12, 2023 Author Share Posted May 12, 2023 SO FAR setting it to 1000 without auto negotiation solved the issue... I can't change the NIC, its part of the motherboard. To note, this is the only device on my network with this problem. Even another NAS stays at 1000mbps. Quote Link to comment
dlikhten Posted May 16, 2023 Author Share Posted May 16, 2023 Edit. Long term, didn't help. Still switches back. Quote Link to comment
Frank1940 Posted May 16, 2023 Share Posted May 16, 2023 Do this: On 5/11/2023 at 7:32 PM, Vr2Io said: 1. Change NIC Or 2. Change PSU power cable and confirm earth wire really connect to earth. What is cable length between server and switch? Is this cable a reasonable distance from other electric wiring? Quote Link to comment
MAM59 Posted May 16, 2023 Share Posted May 16, 2023 maybe its time for you to supply us with DIAGNOSTIC Data? Best would be 2 runs, one WITH slowed down LAN, one WITHOUT. Quote Link to comment
dlikhten Posted May 16, 2023 Author Share Posted May 16, 2023 Frank1940: 3ft cable, unfortunately its around the electronics of the nas, and network gear. But that's about it. MAM59. Absolutely!!!! How do I go about this? Quote Link to comment
Frank1940 Posted May 16, 2023 Share Posted May 16, 2023 (edited) 2 hours ago, dlikhten said: Frank1940: 3ft cable, unfortunately its around the electronics of the nas, and network gear. But that's about it. MAM59. Absolutely!!!! How do I go about this? See here: Edited May 16, 2023 by Frank1940 To delete Link to wrong FAQ--- Sorry! Quote Link to comment
itimpi Posted May 16, 2023 Share Posted May 16, 2023 35 minutes ago, dlikhten said: Absolutely!!!! How do I go about this? Tools->Diagnostics Quote Link to comment
MAM59 Posted May 16, 2023 35 minutes ago, dlikhten said: MAM59. Absolutely!!!! How do I go about this? Simple. In the Unraid GUI, select "tools" from the side and "diagnostics" from the main menu. There you will find a button to generate and download the diagnostic zip file. Create and save one while you have 1000 Mbps, then wait until the line slows down and create another zip. Upload both of them here.
dlikhten Posted May 16, 2023 Author Share Posted May 16, 2023 Attached is the 1000mbps speed diagnosis. Might be a day or 2 till I get the 100mbps one. Will reply here later. THANK YOU AGAIN EVERYONE. piratetrove-diagnostics-20230516-1842.zip Quote Link to comment
Solution Frank1940 Posted May 16, 2023 (edited) I had a quick look at your Diagnostics file and found this in the ifconfig.txt file in the /system folder: Notice the number of 'RX' errors. I checked five other Diagnostics files from five different servers and found zero 'RX' errors. I would first try another port on the switch. If your cable is a quality cable and you still have speed drops after changing ports, I would be looking for a new NIC board. BTW, you can look at this ifconfig report any time you want by opening the GUI terminal (the >_ icon on the right side of the GUI toolbar) and typing the following on the command line: ifconfig EDIT: I would start checking and recording the error numbers over the next few days. Perhaps there is some type of pattern... Edited May 16, 2023 by Frank1940
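Recording the RX error count over a few days, as suggested above, could be scripted rather than done by hand. The sketch below is an assumption-laden example, not part of the original advice: it reads the same counter from /proc/net/dev (the third receive column, "errs") instead of parsing ifconfig output, assumes the interface is eth0, and uses made-up function names and a made-up log file path.

```python
import time


def rx_errors(proc_net_dev_text, iface="eth0"):
    """Return the receive error count for iface from /proc/net/dev text, or None if absent."""
    for line in proc_net_dev_text.splitlines():
        name, sep, counters = line.partition(":")
        if sep and name.strip() == iface:
            # Receive columns are: bytes packets errs drop fifo frame compressed multicast
            return int(counters.split()[2])
    return None


def record_rx_errors(log_path, iface="eth0", source="/proc/net/dev"):
    """Append one 'timestamp,iface,count' line to log_path and return the count."""
    with open(source) as f:
        count = rx_errors(f.read(), iface)
    stamp = time.strftime("%Y-%m-%d %H:%M:%S")
    with open(log_path, "a") as log:
        log.write(f"{stamp},{iface},{count}\n")
    return count
```

Running record_rx_errors("/boot/rx_errors.csv") on a schedule (the path is hypothetical) would build up a small table that makes it easier to see whether the errors climb steadily or jump at particular times.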
dlikhten Posted May 17, 2023 Author Share Posted May 17, 2023 Just switched ports, and still see RX errors rising. Maybe the NIC is just bad? This is an old old motherboard, so no USB3.0 for a usb-based network adapter. Quote Link to comment
Frank1940 Posted May 17, 2023 Share Posted May 17, 2023 1 hour ago, dlikhten said: Just switched ports, and still see RX errors rising. Maybe the NIC is just bad? This is an old old motherboard, so no USB3.0 for a usb-based network adapter. 1 hour ago, JonathanM said: No PCIE slots? I just looked on Amazon and there are Gb Network cards for both PCI-E and PCI slots. A quick check also found some for PCI on E-bay. Quote Link to comment
dlikhten Posted May 17, 2023 Author Share Posted May 17, 2023 I might give that a shot honestly. there is one PCI-e slot that I had to use for a graphics card because the mobo wouldn't boot without one, but I do have an old PCI slot in there, and if there's a gigabit card for it, might do the trick! Quote Link to comment
MAM59 Posted May 17, 2023 Yeah, reception errors should not happen anymore these days. This is surely a hardware problem. Besides a bad card, another possible source would be a badly shielded cable close to a powerful electrical machine, like an elevator or something else with an electric motor. But this is very rare, so going for a different card is surely the better way to go. (But if those things are that old already, going for a new computer might be wise too :-))) )
MAM59 Posted May 17, 2023 No need to wait for a 2nd diagnosis, it already happened:

May 7 05:44:33 PirateTrove kernel: tg3 0000:02:00.0 eth0: Link is down
May 7 05:44:33 PirateTrove kernel: bond0: (slave eth0): link status definitely down, disabling slave
May 7 05:44:33 PirateTrove kernel: device eth0 left promiscuous mode
May 7 05:44:33 PirateTrove kernel: bond0: now running without any active interface!
May 7 05:44:33 PirateTrove kernel: br0: port 1(bond0) entered disabled state
May 7 05:44:34 PirateTrove dhcpcd[916]: br0: carrier lost
May 7 05:44:34 PirateTrove avahi-daemon[5938]: Withdrawing address record for 192.168.0.97 on br0.
May 7 05:44:34 PirateTrove avahi-daemon[5938]: Leaving mDNS multicast group on interface br0.IPv4 with address 192.168.0.97.
May 7 05:44:34 PirateTrove avahi-daemon[5938]: Interface br0.IPv4 no longer relevant for mDNS.
May 7 05:44:34 PirateTrove dhcpcd[916]: br0: deleting route to 192.168.0.0/24
May 7 05:44:34 PirateTrove dhcpcd[916]: br0: deleting default route via 192.168.0.1
May 7 05:44:34 PirateTrove dnsmasq[6890]: no servers found in /etc/resolv.conf, will retry
May 7 05:44:36 PirateTrove ntpd[1052]: Deleting interface #8 br0, 192.168.0.97#123, interface stats: received=2052, sent=2068, dropped=0, active_time=539210 secs
May 7 05:44:36 PirateTrove ntpd[1052]: 216.239.35.0 local addr 192.168.0.97 -> <null>
May 7 05:44:36 PirateTrove ntpd[1052]: 216.239.35.4 local addr 192.168.0.97 -> <null>
May 7 05:44:36 PirateTrove ntpd[1052]: 216.239.35.8 local addr 192.168.0.97 -> <null>
May 7 05:44:36 PirateTrove ntpd[1052]: 216.239.35.12 local addr 192.168.0.97 -> <null>
May 7 05:44:37 PirateTrove emhttpd: read SMART /dev/sdf
May 7 05:44:58 PirateTrove kernel: tg3 0000:02:00.0 eth0: Link is up at 100 Mbps, full duplex
May 7 05:44:58 PirateTrove kernel: tg3 0000:02:00.0 eth0: Flow control is on for TX and on for RX
May 7 05:44:58 PirateTrove kernel: bond0: (slave eth0): link status definitely up, 100 Mbps full duplex
May 7 05:44:58 PirateTrove kernel: bond0: (slave eth0): making interface the new active one
May 7 05:44:58 PirateTrove kernel: device eth0 entered promiscuous mode
May 7 05:44:58 PirateTrove kernel: bond0: active interface up!
May 7 05:44:58 PirateTrove kernel: br0: port 1(bond0) entered blocking state
May 7 05:44:58 PirateTrove kernel: br0: port 1(bond0) entered forwarding state
May 7 05:44:58 PirateTrove dhcpcd[916]: br0: carrier acquired
May 7 05:44:59 PirateTrove dhcpcd[916]: br0: rebinding lease of 192.168.0.97

So you lost your connection completely, and about 25 seconds later it was re-established at only 100 Mbps. That ancient "tg3" seems to have produced lots of trouble over the ages; you will find a lot of reports. The driver and Linux never really made friends, it seems. In 2006 they managed to produce something working, and I assume this code is still used today. But the Linux kernel has moved on, so there is a good chance that the driver does not work well anymore today and nobody will fix it.

But scrolling through your logs I saw some other issues you may want to think about and maybe reconsider:

* You are running mover every hour. That's ok. But you also want to spin down drives to save energy. That's ok too. But the two do not work together. Mover will wake up the drives every hour just to have them shut off again after the defined timeout. This will stress the drives more instead of saving energy, and they will wear out much faster. I would say you should run mover less frequently, maybe once a day?
* You have activated bonding for the NIC, although there is no 2nd NIC to be bonded with. It does no harm, but it is useless and only consumes extra time. So I would say "turn off bonding".

Nothing more to complain about yet (but why does somebody in 2023 still use the XVID codec for Star Trek Picard??? 🤣)