j5i7 Posted January 17, 2023 Share Posted January 17, 2023 Hello all, I've been really enjoying Unraid the last couple months I've had it, but I'm running into an annoying hiccup I was hoping some log / network wizard could help me with. Sporadically (every 1-2 weeks) my Unraid server will completely lose network connectivity. The server itself is still running just fine if I physically access it with a mouse and keyboard, but no network connectivity goes in or out (docker/ping/dashboard inaccessible form LAN or WAN and I can't ping out from the server to my router). If I reboot the server, everything works fine until the error occurs again. I can't pinpoint anything that makes it happen. I finally caught it again today and have attached my syslog from the error. I've also attached my diagnostics zip but this is from after rebooting. The syslog attached is prior to a reboot and I've put the critical excerpt at the bottom of this message. I'm not great at understanding this, but is this a intel driver error or a hardware error? Thanks for any and all guidance. Worst case... maybe the server could just gracefully reboot if this error is detected. Quote Jan 16 14:17:25 N1 kernel: igc 0000:04:00.0 eth0: PCIe link lost, device now detached Jan 16 14:17:25 N1 kernel: ------------[ cut here ]------------ Jan 16 14:17:25 N1 kernel: igc: Failed to read reg 0xc030! Jan 16 14:17:25 N1 kernel: WARNING: CPU: 5 PID: 25835 at drivers/net/ethernet/intel/igc/igc_main.c:6186 igc_rd32+0x76/0x8b [igc] Jan 16 14:17:25 N1 kernel: Modules linked in: xt_connmark xt_mark iptable_mangle xt_comment iptable_raw xt_nat xt_tcpudp veth xt_conntrack nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo xt_addrtype br_netfilter xfs md_mod nct6775 nct6775_core hwmon_vid efivarfs iptable_nat xt_MASQUERADE nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 wireguard curve25519_x86_64 libcurve25519_generic libchacha20poly1305 chacha_x86_64 poly1305_x86_64 ip6_udp_tunnel udp_tunnel libchacha ip6table_filter ip6_tables iptable_filter ip_tables x_tables af_packet 8021q garp mrp bridge stp llc bonding tls wmi_bmof x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel aesni_intel crypto_simd cryptd rapl intel_cstate intel_uncore i915 iosf_mbi drm_buddy i2c_algo_bit ttm drm_display_helper drm_kms_helper drm ahci i2c_i801 intel_gtt igc agpgart i2c_smbus libahci i2c_core nvme cp210x syscopyarea input_leds sysfillrect usbserial joydev led_class nvme_core vmd Jan 16 14:17:25 N1 kernel: sysimgblt fb_sys_fops thermal fan wmi video backlight tpm_crb tpm_tis tpm_tis_core tpm acpi_tad acpi_pad button unix Jan 16 14:17:25 N1 kernel: CPU: 5 PID: 25835 Comm: kworker/5:3 Not tainted 5.19.17-Unraid #2 Jan 16 14:17:25 N1 kernel: Hardware name: ASUS System Product Name/ROG STRIX B660-I GAMING WIFI, BIOS 2012 10/11/2022 Jan 16 14:17:25 N1 kernel: Workqueue: events igc_watchdog_task [igc] Jan 16 14:17:25 N1 kernel: RIP: 0010:igc_rd32+0x76/0x8b [igc] Jan 16 14:17:25 N1 kernel: Code: 8b bb 28 ff ff ff e8 a3 b5 26 e1 84 c0 75 0d 83 c8 ff eb 1a 8b 02 ff c0 75 f5 eb bf 89 ee 48 c7 c7 70 8f 22 a0 e8 b3 0d 60 e1 <0f> 0b eb e1 5b 5d 41 5c c3 cc cc cc cc 83 c8 ff c3 cc cc cc cc 0f Jan 16 14:17:25 N1 kernel: RSP: 0018:ffffc90002c53e00 EFLAGS: 00010286 Jan 16 14:17:25 N1 kernel: RAX: 0000000000000000 RBX: ffff8881549f0b98 RCX: 0000000000000027 Jan 16 14:17:25 N1 kernel: RDX: 0000000000000002 RSI: ffffffff820d7be1 RDI: 00000000ffffffff Jan 16 14:17:25 N1 kernel: RBP: 000000000000c030 R08: 0000000000000000 R09: ffffffff82244bd0 Jan 16 14:17:25 N1 kernel: R10: 00007fffffffffff R11: ffffffff82877526 R12: ffff8881549f0000 Jan 16 14:17:25 N1 kernel: R13: 000000000000c030 R14: ffff888106c4cc80 R15: 0000000000000000 Jan 16 14:17:25 N1 kernel: FS: 0000000000000000(0000) GS:ffff88885f540000(0000) knlGS:0000000000000000 Jan 16 14:17:25 N1 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 Jan 16 14:17:25 N1 kernel: CR2: 00001470f9928740 CR3: 000000000420a002 CR4: 0000000000770ee0 Jan 16 14:17:25 N1 kernel: PKRU: 55555554 Jan 16 14:17:25 N1 kernel: Call Trace: Jan 16 14:17:25 N1 kernel: <TASK> Jan 16 14:17:25 N1 kernel: igc_update_stats+0x70/0x6a2 [igc] Jan 16 14:17:25 N1 kernel: igc_watchdog_task+0x322/0x44b [igc] Jan 16 14:17:25 N1 kernel: process_one_work+0x1a8/0x295 Jan 16 14:17:25 N1 kernel: worker_thread+0x18b/0x244 Jan 16 14:17:25 N1 kernel: ? rescuer_thread+0x281/0x281 Jan 16 14:17:25 N1 kernel: kthread+0xe4/0xef Jan 16 14:17:25 N1 kernel: ? kthread_complete_and_exit+0x1b/0x1b Jan 16 14:17:25 N1 kernel: ret_from_fork+0x1f/0x30 Jan 16 14:17:25 N1 kernel: </TASK> Jan 16 14:17:25 N1 kernel: ---[ end trace 0000000000000000 ]--- syslog n1-diagnostics-20230116-1703.zip Quote Link to comment
Vr2Io Posted January 17, 2023 Share Posted January 17, 2023 Look like igc driver issue ( I225 NIC ), using different kind NIC should be a quick way to verify problem, but your ITX board seems haven't PCIe slot available. Quote Link to comment
MAM59 Posted January 17, 2023 Share Posted January 17, 2023 Check the cables. LAN >1G is much more depending on correct and shielded connections. They SAY you can use normal cables, they LIE! "if there is a link, all is fine" is not true anymore. Quote Link to comment
j5i7 Posted January 17, 2023 Author Share Posted January 17, 2023 10 hours ago, Vr2Io said: Look like igc driver issue ( I225 NIC ), using different kind NIC should be a quick way to verify problem, but your ITX board seems haven't PCIe slot available. I do have the pcie slot on my motherboard free and could put a half-height NIC in if need be (or a USB to ethernet adapter, but no clue of the USB overhead here). I don't currently own one, but may get one if need be. Do you think the the log looks like this was an intel driver issue? Or is this a PCI-E link issue that then resulted in a driver throwing an error (or hard to say)? 9 minutes ago, MAM59 said: Check the cables. LAN >1G is much more depending on correct and shielded connections. They SAY you can use normal cables, they LIE! "if there is a link, all is fine" is not true anymore. The only 2.5gb cable that is isolated to this connection is a single ~6ft cable from my 2.5gb switch to my unraid box. I'll try swapping it for another cable that's known to be at least cat 6 and see from there (the current cable came from my steam link I think). Quote Link to comment
j5i7 Posted January 17, 2023 Author Share Posted January 17, 2023 (edited) It happened again right before I changed the ethernet cable. The log is a bit different and attached here again and partially quoted below. I updated my mobo to the latest BIOS revision which should be the only change between this and the prior one. Since this error message I've changed my ethernet cable. If I get another error, I'm tempted to turn off the ASPM in the bios and my next step (hoping that won't significantly affect power draw). Quote Jan 17 04:40:01 N1 root: Fix Common Problems Version 2022.12.18 Jan 17 09:14:33 N1 kernel: pcieport 0000:00:1c.2: AER: Corrected error received: 0000:00:1c.2 Jan 17 09:14:33 N1 kernel: pcieport 0000:00:1c.2: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Receiver ID) Jan 17 09:14:33 N1 kernel: pcieport 0000:00:1c.2: device [8086:7aba] error status/mask=00000040/00002000 Jan 17 09:14:33 N1 kernel: pcieport 0000:00:1c.2: [ 6] BadTLP Jan 17 09:14:33 N1 kernel: pcieport 0000:00:1c.2: AER: Uncorrected (Fatal) error received: 0000:00:1c.2 Jan 17 09:14:33 N1 kernel: pcieport 0000:00:1c.2: PCIe Bus Error: severity=Uncorrected (Fatal), type=Data Link Layer, (Receiver ID) Jan 17 09:14:33 N1 kernel: pcieport 0000:00:1c.2: device [8086:7aba] error status/mask=00000010/00000000 Jan 17 09:14:33 N1 kernel: pcieport 0000:00:1c.2: [ 4] DLP (First) Jan 17 09:14:33 N1 kernel: bond0: (slave eth0): link status definitely down, disabling slave Jan 17 09:14:33 N1 kernel: device eth0 left promiscuous mode Jan 17 09:14:33 N1 kernel: bond0: now running without any active interface! Jan 17 09:14:33 N1 kernel: br0: port 1(bond0) entered disabled state Jan 17 09:14:34 N1 dhcpcd[1065]: br0: carrier lost Jan 17 09:14:34 N1 avahi-daemon[7398]: Withdrawing address record for 10.2.2.2 on br0. Jan 17 09:14:34 N1 avahi-daemon[7398]: Leaving mDNS multicast group on interface br0.IPv4 with address 10.2.2.2. Jan 17 09:14:34 N1 avahi-daemon[7398]: Interface br0.IPv4 no longer relevant for mDNS. Jan 17 09:14:34 N1 dhcpcd[1065]: br0: deleting route to 10.2.2.0/24 Jan 17 09:14:34 N1 dhcpcd[1065]: br0: deleting default route via 10.2.2.1 Jan 17 09:14:34 N1 dnsmasq[8883]: no servers found in /etc/resolv.conf, will retry Jan 17 09:14:34 N1 kernel: pcieport 0000:00:1c.2: AER: Root Port link has been reset (0) Jan 17 09:14:34 N1 kernel: igc 0000:04:00.0: Unable to change power state from D3cold to D0, device inaccessible Jan 17 09:14:34 N1 kernel: igc 0000:04:00.0 eth0: PCIe link lost, device now detached Jan 17 09:14:36 N1 ntpd[1274]: Deleting interface #1 br0, 10.2.2.2#123, interface stats: received=0, sent=0, dropped=0, active_time=54185 secs Jan 17 09:14:40 N1 kernel: genirq: Flags mismatch irq 173. 00000000 (eth0) vs. 00000000 (eth0) Jan 17 09:14:40 N1 kernel: ------------[ cut here ]------------ Jan 17 09:14:40 N1 kernel: remove_proc_entry: removing non-empty directory 'irq/173', leaking at least 'eth0' Jan 17 09:14:40 N1 kernel: WARNING: CPU: 5 PID: 183 at fs/proc/generic.c:718 remove_proc_entry+0x13e/0x16a Jan 17 09:14:40 N1 kernel: Modules linked in: xt_connmark xt_comment iptable_raw xt_mark xt_CHECKSUM ipt_REJECT nf_reject_ipv4 ip6table_mangle ip6table_nat iptable_mangle vhost_net tun vhost vhost_iotlb tap xt_nat xt_tcpudp veth xt_conntrack nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo xt_addrtype br_netfilter xfs md_mod nct6775 nct6775_core hwmon_vid efivarfs iptable_nat xt_MASQUERADE nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 wireguard curve25519_x86_64 libcurve25519_generic libchacha20poly1305 chacha_x86_64 poly1305_x86_64 ip6_udp_tunnel udp_tunnel libchacha ip6table_filter ip6_tables iptable_filter ip_tables x_tables af_packet 8021q garp mrp bridge stp llc bonding tls wmi_bmof x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel aesni_intel crypto_simd cryptd rapl intel_cstate intel_uncore i915 iosf_mbi drm_buddy i2c_algo_bit ttm drm_display_helper drm_kms_helper ahci i2c_i801 igc i2c_smbus drm libahci nvme Jan 17 09:14:40 N1 kernel: intel_gtt cp210x agpgart input_leds joydev led_class usbserial nvme_core i2c_core syscopyarea sysfillrect vmd sysimgblt fb_sys_fops thermal fan wmi video backlight tpm_crb tpm_tis tpm_tis_core tpm acpi_tad acpi_pad button unix Jan 17 09:14:40 N1 kernel: CPU: 5 PID: 183 Comm: irq/125-aerdrv Not tainted 5.19.17-Unraid #2 Jan 17 09:14:40 N1 kernel: Hardware name: ASUS System Product Name/ROG STRIX B660-I GAMING WIFI, BIOS 2212 12/13/2022 Jan 17 09:14:40 N1 kernel: RIP: 0010:remove_proc_entry+0x13e/0x16a Jan 17 09:14:40 N1 kernel: Code: 53 78 48 c7 c6 c0 b8 e2 81 48 c7 c7 13 7b 0e 82 48 8b 8b a0 00 00 00 4c 8b 80 a0 00 00 00 48 8b 92 a0 00 00 00 e8 c8 0b 59 00 <0f> 0b 48 89 df e8 75 fc ff ff 48 8b 44 24 10 65 48 2b 04 25 28 00 Jan 17 09:14:40 N1 kernel: RSP: 0018:ffffc90004817bb0 EFLAGS: 00010282 Jan 17 09:14:40 N1 kernel: RAX: 0000000000000000 RBX: ffff8881050a43c0 RCX: 0000000000000027 Jan 17 09:14:40 N1 kernel: RDX: 0000000000000001 RSI: ffffffff820d7be1 RDI: 00000000ffffffff Jan 17 09:14:40 N1 kernel: RBP: ffffc90004817bee R08: 0000000000000000 R09: ffffffff82244bd0 Jan 17 09:14:40 N1 kernel: R10: 00007fffffffffff R11: ffffc90004817bf1 R12: ffff8881001bfe40 Jan 17 09:14:40 N1 kernel: R13: 00000000000000ae R14: ffffffff82227bc0 R15: ffff888101732000 Jan 17 09:14:40 N1 kernel: FS: 0000000000000000(0000) GS:ffff88885f540000(0000) knlGS:0000000000000000 Jan 17 09:14:40 N1 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 Jan 17 09:14:40 N1 kernel: CR2: 00007ffe4d36ef80 CR3: 000000000420a005 CR4: 0000000000770ee0 Jan 17 09:14:40 N1 kernel: PKRU: 55555554 Jan 17 09:14:40 N1 kernel: Call Trace: Jan 17 09:14:40 N1 kernel: <TASK> Jan 17 09:14:40 N1 kernel: unregister_irq_proc+0xe0/0x100 Jan 17 09:14:40 N1 kernel: free_desc+0x1b/0x54 Jan 17 09:14:40 N1 kernel: irq_free_descs+0x41/0x75 Jan 17 09:14:40 N1 kernel: __msi_domain_free_irqs+0x69/0x9a Jan 17 09:14:40 N1 kernel: msi_domain_free_irqs_descs_locked+0x18/0x3e Jan 17 09:14:40 N1 kernel: pci_msi_teardown_msi_irqs+0x2a/0x40 Jan 17 09:14:40 N1 kernel: free_msi_irqs+0xe/0x2e Jan 17 09:14:40 N1 kernel: pci_disable_msix+0xcd/0xe7 Jan 17 09:14:40 N1 kernel: igc_reset_interrupt_capability+0x20/0x5b [igc] Jan 17 09:14:40 N1 kernel: __igc_open+0x136/0x454 [igc] Jan 17 09:14:40 N1 kernel: ? aer_isr+0x173/0x173 Jan 17 09:14:40 N1 kernel: igc_io_resume+0x2a/0x56 [igc] Jan 17 09:14:40 N1 kernel: report_resume+0x53/0x74 Jan 17 09:14:40 N1 kernel: pci_walk_bus+0x79/0x82 Jan 17 09:14:40 N1 kernel: ? aer_dev_correctable_show+0x98/0x98 Jan 17 09:14:40 N1 kernel: pcie_do_recovery+0x132/0x1a6 Jan 17 09:14:40 N1 kernel: aer_process_err_devices+0xbe/0xd7 Jan 17 09:14:40 N1 kernel: aer_isr+0x143/0x173 Jan 17 09:14:40 N1 kernel: ? __schedule+0x59e/0x5f6 Jan 17 09:14:40 N1 kernel: ? irq_finalize_oneshot+0xaa/0xaa Jan 17 09:14:40 N1 kernel: irq_thread_fn+0x1b/0x45 Jan 17 09:14:40 N1 kernel: irq_thread+0x11a/0x182 Jan 17 09:14:40 N1 kernel: ? irq_forced_thread_fn+0x6b/0x6b Jan 17 09:14:40 N1 kernel: ? free_percpu_irq+0x53/0x53 Jan 17 09:14:40 N1 kernel: kthread+0xe4/0xef Jan 17 09:14:40 N1 kernel: ? kthread_complete_and_exit+0x1b/0x1b Jan 17 09:14:40 N1 kernel: ret_from_fork+0x1f/0x30 Jan 17 09:14:40 N1 kernel: </TASK> Jan 17 09:14:40 N1 kernel: ---[ end trace 0000000000000000 ]--- Jan 17 09:14:40 N1 kernel: ------------[ cut here ]------------ Jan 17 09:14:40 N1 kernel: remove_proc_entry: removing non-empty directory 'irq/174', leaking at least 'eth0-TxRx-0' Jan 17 09:14:40 N1 kernel: WARNING: CPU: 5 PID: 183 at fs/proc/generic.c:718 remove_proc_entry+0x13e/0x16a Jan 17 09:14:40 N1 kernel: Modules linked in: xt_connmark xt_comment iptable_raw xt_mark xt_CHECKSUM ipt_REJECT nf_reject_ipv4 ip6table_mangle ip6table_nat iptable_mangle vhost_net tun vhost vhost_iotlb tap xt_nat xt_tcpudp veth xt_conntrack nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo xt_addrtype br_netfilter xfs md_mod nct6775 nct6775_core hwmon_vid efivarfs iptable_nat xt_MASQUERADE nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 wireguard curve25519_x86_64 libcurve25519_generic libchacha20poly1305 chacha_x86_64 poly1305_x86_64 ip6_udp_tunnel udp_tunnel libchacha ip6table_filter ip6_tables iptable_filter ip_tables x_tables af_packet 8021q garp mrp bridge stp llc bonding tls wmi_bmof x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel aesni_intel crypto_simd cryptd rapl intel_cstate intel_uncore i915 iosf_mbi drm_buddy i2c_algo_bit ttm drm_display_helper drm_kms_helper ahci i2c_i801 igc i2c_smbus drm libahci nvme Jan 17 09:14:40 N1 kernel: intel_gtt cp210x agpgart input_leds joydev led_class usbserial nvme_core i2c_core syscopyarea sysfillrect vmd sysimgblt fb_sys_fops thermal fan wmi video backlight tpm_crb tpm_tis tpm_tis_core tpm acpi_tad acpi_pad button unix Jan 17 09:14:40 N1 kernel: CPU: 5 PID: 183 Comm: irq/125-aerdrv Tainted: G W 5.19.17-Unraid #2 Jan 17 09:14:40 N1 kernel: Hardware name: ASUS System Product Name/ROG STRIX B660-I GAMING WIFI, BIOS 2212 12/13/2022 Jan 17 09:14:40 N1 kernel: RIP: 0010:remove_proc_entry+0x13e/0x16a Jan 17 09:14:40 N1 kernel: Code: 53 78 48 c7 c6 c0 b8 e2 81 48 c7 c7 13 7b 0e 82 48 8b 8b a0 00 00 00 4c 8b 80 a0 00 00 00 48 8b 92 a0 00 00 00 e8 c8 0b 59 00 <0f> 0b 48 89 df e8 75 fc ff ff 48 8b 44 24 10 65 48 2b 04 25 28 00 Jan 17 09:14:40 N1 kernel: RSP: 0018:ffffc90004817bb0 EFLAGS: 00010282 Jan 17 09:14:40 N1 kernel: RAX: 0000000000000000 RBX: ffff8881050a4a80 RCX: 0000000000000027 Jan 17 09:14:40 N1 kernel: RDX: 0000000000000001 RSI: ffffffff820d7be1 RDI: 00000000ffffffff Jan 17 09:14:40 N1 kernel: RBP: ffffc90004817bee R08: 0000000000000000 R09: ffffffff82244bd0 Jan 17 09:14:40 N1 kernel: R10: 00007fffffffffff R11: ffffc90004817bf1 R12: ffff8881001bfe40 Jan 17 09:14:40 N1 kernel: R13: 00000000000000af R14: ffffffff82227bc0 R15: ffff888101732000 Jan 17 09:14:40 N1 kernel: FS: 0000000000000000(0000) GS:ffff88885f540000(0000) knlGS:0000000000000000 Jan 17 09:14:40 N1 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 Jan 17 09:14:40 N1 kernel: CR2: 00007ffe4d36ef80 CR3: 000000000420a005 CR4: 0000000000770ee0 Jan 17 09:14:40 N1 kernel: PKRU: 55555554 Jan 17 09:14:40 N1 kernel: Call Trace: Jan 17 09:14:40 N1 kernel: <TASK> Jan 17 09:14:40 N1 kernel: unregister_irq_proc+0xe0/0x100 Jan 17 09:14:40 N1 kernel: free_desc+0x1b/0x54 Jan 17 09:14:40 N1 kernel: irq_free_descs+0x41/0x75 Jan 17 09:14:40 N1 kernel: __msi_domain_free_irqs+0x69/0x9a Jan 17 09:14:40 N1 kernel: msi_domain_free_irqs_descs_locked+0x18/0x3e Jan 17 09:14:40 N1 kernel: pci_msi_teardown_msi_irqs+0x2a/0x40 Jan 17 09:14:40 N1 kernel: free_msi_irqs+0xe/0x2e Jan 17 09:14:40 N1 kernel: pci_disable_msix+0xcd/0xe7 Jan 17 09:14:40 N1 kernel: igc_reset_interrupt_capability+0x20/0x5b [igc] Jan 17 09:14:40 N1 kernel: __igc_open+0x136/0x454 [igc] Jan 17 09:14:40 N1 kernel: ? aer_isr+0x173/0x173 Jan 17 09:14:40 N1 kernel: igc_io_resume+0x2a/0x56 [igc] Jan 17 09:14:40 N1 kernel: report_resume+0x53/0x74 Jan 17 09:14:40 N1 kernel: pci_walk_bus+0x79/0x82 Jan 17 09:14:40 N1 kernel: ? aer_dev_correctable_show+0x98/0x98 Jan 17 09:14:40 N1 kernel: pcie_do_recovery+0x132/0x1a6 Jan 17 09:14:40 N1 kernel: aer_process_err_devices+0xbe/0xd7 Jan 17 09:14:40 N1 kernel: aer_isr+0x143/0x173 Jan 17 09:14:40 N1 kernel: ? __schedule+0x59e/0x5f6 Jan 17 09:14:40 N1 kernel: ? irq_finalize_oneshot+0xaa/0xaa Jan 17 09:14:40 N1 kernel: irq_thread_fn+0x1b/0x45 Jan 17 09:14:40 N1 kernel: irq_thread+0x11a/0x182 Jan 17 09:14:40 N1 kernel: ? irq_forced_thread_fn+0x6b/0x6b Jan 17 09:14:40 N1 kernel: ? free_percpu_irq+0x53/0x53 Jan 17 09:14:40 N1 kernel: kthread+0xe4/0xef Jan 17 09:14:40 N1 kernel: ? kthread_complete_and_exit+0x1b/0x1b Jan 17 09:14:40 N1 kernel: ret_from_fork+0x1f/0x30 Jan 17 09:14:40 N1 kernel: </TASK> Jan 17 09:14:40 N1 kernel: ---[ end trace 0000000000000000 ]--- Jan 17 09:14:40 N1 kernel: ------------[ cut here ]------------ Jan 17 09:14:40 N1 kernel: remove_proc_entry: removing non-empty directory 'irq/175', leaking at least 'eth0-TxRx-1' Jan 17 09:14:40 N1 kernel: WARNING: CPU: 5 PID: 183 at fs/proc/generic.c:718 remove_proc_entry+0x13e/0x16a Jan 17 09:14:40 N1 kernel: Modules linked in: xt_connmark xt_comment iptable_raw xt_mark xt_CHECKSUM ipt_REJECT nf_reject_ipv4 ip6table_mangle ip6table_nat iptable_mangle vhost_net tun vhost vhost_iotlb tap xt_nat xt_tcpudp veth xt_conntrack nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo xt_addrtype br_netfilter xfs md_mod nct6775 nct6775_core hwmon_vid efivarfs iptable_nat xt_MASQUERADE nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 wireguard curve25519_x86_64 libcurve25519_generic libchacha20poly1305 chacha_x86_64 poly1305_x86_64 ip6_udp_tunnel udp_tunnel libchacha ip6table_filter ip6_tables iptable_filter ip_tables x_tables af_packet 8021q garp mrp bridge stp llc bonding tls wmi_bmof x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel aesni_intel crypto_simd cryptd rapl intel_cstate intel_uncore i915 iosf_mbi drm_buddy i2c_algo_bit ttm drm_display_helper drm_kms_helper ahci i2c_i801 igc i2c_smbus drm libahci nvme Jan 17 09:14:40 N1 kernel: intel_gtt cp210x agpgart input_leds joydev led_class usbserial nvme_core i2c_core syscopyarea sysfillrect vmd sysimgblt fb_sys_fops thermal fan wmi video backlight tpm_crb tpm_tis tpm_tis_core tpm acpi_tad acpi_pad button unix Jan 17 09:14:40 N1 kernel: CPU: 5 PID: 183 Comm: irq/125-aerdrv Tainted: G W 5.19.17-Unraid #2 Jan 17 09:14:40 N1 kernel: Hardware name: ASUS System Product Name/ROG STRIX B660-I GAMING WIFI, BIOS 2212 12/13/2022 Jan 17 09:14:40 N1 kernel: RIP: 0010:remove_proc_entry+0x13e/0x16a Jan 17 09:14:40 N1 kernel: Code: 53 78 48 c7 c6 c0 b8 e2 81 48 c7 c7 13 7b 0e 82 48 8b 8b a0 00 00 00 4c 8b 80 a0 00 00 00 48 8b 92 a0 00 00 00 e8 c8 0b 59 00 <0f> 0b 48 89 df e8 75 fc ff ff 48 8b 44 24 10 65 48 2b 04 25 28 00 Jan 17 09:14:40 N1 kernel: RSP: 0018:ffffc90004817bb0 EFLAGS: 00010282 Jan 17 09:14:40 N1 kernel: RAX: 0000000000000000 RBX: ffff88815cc0f180 RCX: 0000000000000003 Jan 17 09:14:40 N1 kernel: RDX: 0000000000000001 RSI: 0000000000000003 RDI: 00000000ffffffff Jan 17 09:14:40 N1 kernel: RBP: ffffc90004817bee R08: 0000000000000000 R09: ffffffff82244bd0 Jan 17 09:14:40 N1 kernel: R10: 00007fffffffffff R11: ffffc90004817bf1 R12: ffff8881001bfe40 Jan 17 09:14:40 N1 kernel: R13: 00000000000000b0 R14: ffffffff82227bc0 R15: ffff888101732000 Jan 17 09:14:40 N1 kernel: FS: 0000000000000000(0000) GS:ffff88885f540000(0000) knlGS:0000000000000000 Jan 17 09:14:40 N1 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 Jan 17 09:14:40 N1 kernel: CR2: 00007ffe4d36ef80 CR3: 000000000420a005 CR4: 0000000000770ee0 Jan 17 09:14:40 N1 kernel: PKRU: 55555554 Jan 17 09:14:40 N1 kernel: Call Trace: Jan 17 09:14:40 N1 kernel: <TASK> Jan 17 09:14:40 N1 kernel: unregister_irq_proc+0xe0/0x100 Jan 17 09:14:40 N1 kernel: free_desc+0x1b/0x54 Jan 17 09:14:40 N1 kernel: irq_free_descs+0x41/0x75 Jan 17 09:14:40 N1 kernel: __msi_domain_free_irqs+0x69/0x9a Jan 17 09:14:40 N1 kernel: msi_domain_free_irqs_descs_locked+0x18/0x3e Jan 17 09:14:40 N1 kernel: pci_msi_teardown_msi_irqs+0x2a/0x40 Jan 17 09:14:40 N1 kernel: free_msi_irqs+0xe/0x2e Jan 17 09:14:40 N1 kernel: pci_disable_msix+0xcd/0xe7 Jan 17 09:14:40 N1 kernel: igc_reset_interrupt_capability+0x20/0x5b [igc] Jan 17 09:14:40 N1 kernel: __igc_open+0x136/0x454 [igc] Jan 17 09:14:40 N1 kernel: ? aer_isr+0x173/0x173 Jan 17 09:14:40 N1 kernel: igc_io_resume+0x2a/0x56 [igc] Jan 17 09:14:40 N1 kernel: report_resume+0x53/0x74 Jan 17 09:14:40 N1 kernel: pci_walk_bus+0x79/0x82 Jan 17 09:14:40 N1 kernel: ? aer_dev_correctable_show+0x98/0x98 Jan 17 09:14:40 N1 kernel: pcie_do_recovery+0x132/0x1a6 Jan 17 09:14:40 N1 kernel: aer_process_err_devices+0xbe/0xd7 Jan 17 09:14:40 N1 kernel: aer_isr+0x143/0x173 Jan 17 09:14:40 N1 kernel: ? __schedule+0x59e/0x5f6 Jan 17 09:14:40 N1 kernel: ? irq_finalize_oneshot+0xaa/0xaa Jan 17 09:14:40 N1 kernel: irq_thread_fn+0x1b/0x45 Jan 17 09:14:40 N1 kernel: irq_thread+0x11a/0x182 Jan 17 09:14:40 N1 kernel: ? irq_forced_thread_fn+0x6b/0x6b Jan 17 09:14:40 N1 kernel: ? free_percpu_irq+0x53/0x53 Jan 17 09:14:40 N1 kernel: kthread+0xe4/0xef Jan 17 09:14:40 N1 kernel: ? kthread_complete_and_exit+0x1b/0x1b Jan 17 09:14:40 N1 kernel: ret_from_fork+0x1f/0x30 Jan 17 09:14:40 N1 kernel: </TASK> Jan 17 09:14:40 N1 kernel: ---[ end trace 0000000000000000 ]--- Jan 17 09:14:40 N1 kernel: ------------[ cut here ]------------ Jan 17 09:14:40 N1 kernel: remove_proc_entry: removing non-empty directory 'irq/176', leaking at least 'eth0-TxRx-2' Jan 17 09:14:40 N1 kernel: WARNING: CPU: 5 PID: 183 at fs/proc/generic.c:718 remove_proc_entry+0x13e/0x16a Jan 17 09:14:40 N1 kernel: Modules linked in: xt_connmark xt_comment iptable_raw xt_mark xt_CHECKSUM ipt_REJECT nf_reject_ipv4 ip6table_mangle ip6table_nat iptable_mangle vhost_net tun vhost vhost_iotlb tap xt_nat xt_tcpudp veth xt_conntrack nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo xt_addrtype br_netfilter xfs md_mod nct6775 nct6775_core hwmon_vid efivarfs iptable_nat xt_MASQUERADE nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 wireguard curve25519_x86_64 libcurve25519_generic libchacha20poly1305 chacha_x86_64 poly1305_x86_64 ip6_udp_tunnel udp_tunnel libchacha ip6table_filter ip6_tables iptable_filter ip_tables x_tables af_packet 8021q garp mrp bridge stp llc bonding tls wmi_bmof x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel aesni_intel crypto_simd cryptd rapl intel_cstate intel_uncore i915 iosf_mbi drm_buddy i2c_algo_bit ttm drm_display_helper drm_kms_helper ahci i2c_i801 igc i2c_smbus drm libahci nvme Jan 17 09:14:40 N1 kernel: intel_gtt cp210x agpgart input_leds joydev led_class usbserial nvme_core i2c_core syscopyarea sysfillrect vmd sysimgblt fb_sys_fops thermal fan wmi video backlight tpm_crb tpm_tis tpm_tis_core tpm acpi_tad acpi_pad button unix Jan 17 09:14:40 N1 kernel: CPU: 5 PID: 183 Comm: irq/125-aerdrv Tainted: G W 5.19.17-Unraid #2 Jan 17 09:14:40 N1 kernel: Hardware name: ASUS System Product Name/ROG STRIX B660-I GAMING WIFI, BIOS 2212 12/13/2022 Jan 17 09:14:40 N1 kernel: RIP: 0010:remove_proc_entry+0x13e/0x16a Jan 17 09:14:40 N1 kernel: Code: 53 78 48 c7 c6 c0 b8 e2 81 48 c7 c7 13 7b 0e 82 48 8b 8b a0 00 00 00 4c 8b 80 a0 00 00 00 48 8b 92 a0 00 00 00 e8 c8 0b 59 00 <0f> 0b 48 89 df e8 75 fc ff ff 48 8b 44 24 10 65 48 2b 04 25 28 00 Jan 17 09:14:40 N1 kernel: RSP: 0018:ffffc90004817bb0 EFLAGS: 00010282 Jan 17 09:14:40 N1 kernel: RAX: 0000000000000000 RBX: ffff88815cc0f840 RCX: 0000000000000027 Jan 17 09:14:40 N1 kernel: RDX: 0000000000000001 RSI: ffffffff820d7be1 RDI: 00000000ffffffff Jan 17 09:14:40 N1 kernel: RBP: ffffc90004817bee R08: 0000000000000000 R09: ffffffff82244bd0 Jan 17 09:14:40 N1 kernel: R10: 00007fffffffffff R11: ffffc90004817bf1 R12: ffff8881001bfe40 Jan 17 09:14:40 N1 kernel: R13: 00000000000000b1 R14: ffffffff82227bc0 R15: ffff888101732000 Jan 17 09:14:40 N1 kernel: FS: 0000000000000000(0000) GS:ffff88885f540000(0000) knlGS:0000000000000000 Jan 17 09:14:40 N1 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 Jan 17 09:14:40 N1 kernel: CR2: 00007ffe4d36ef80 CR3: 000000000420a005 CR4: 0000000000770ee0 Jan 17 09:14:40 N1 kernel: PKRU: 55555554 Jan 17 09:14:40 N1 kernel: Call Trace: Jan 17 09:14:40 N1 kernel: <TASK> Jan 17 09:14:40 N1 kernel: unregister_irq_proc+0xe0/0x100 Jan 17 09:14:40 N1 kernel: free_desc+0x1b/0x54 Jan 17 09:14:40 N1 kernel: irq_free_descs+0x41/0x75 Jan 17 09:14:40 N1 kernel: __msi_domain_free_irqs+0x69/0x9a Jan 17 09:14:40 N1 kernel: msi_domain_free_irqs_descs_locked+0x18/0x3e Jan 17 09:14:40 N1 kernel: pci_msi_teardown_msi_irqs+0x2a/0x40 Jan 17 09:14:40 N1 kernel: free_msi_irqs+0xe/0x2e Jan 17 09:14:40 N1 kernel: pci_disable_msix+0xcd/0xe7 Jan 17 09:14:40 N1 kernel: igc_reset_interrupt_capability+0x20/0x5b [igc] Jan 17 09:14:40 N1 kernel: __igc_open+0x136/0x454 [igc] Jan 17 09:14:40 N1 kernel: ? aer_isr+0x173/0x173 Jan 17 09:14:40 N1 kernel: igc_io_resume+0x2a/0x56 [igc] Jan 17 09:14:40 N1 kernel: report_resume+0x53/0x74 Jan 17 09:14:40 N1 kernel: pci_walk_bus+0x79/0x82 Jan 17 09:14:40 N1 kernel: ? aer_dev_correctable_show+0x98/0x98 Jan 17 09:14:40 N1 kernel: pcie_do_recovery+0x132/0x1a6 Jan 17 09:14:40 N1 kernel: aer_process_err_devices+0xbe/0xd7 Jan 17 09:14:40 N1 kernel: aer_isr+0x143/0x173 Jan 17 09:14:40 N1 kernel: ? __schedule+0x59e/0x5f6 Jan 17 09:14:40 N1 kernel: ? irq_finalize_oneshot+0xaa/0xaa Jan 17 09:14:40 N1 kernel: irq_thread_fn+0x1b/0x45 Jan 17 09:14:40 N1 kernel: irq_thread+0x11a/0x182 Jan 17 09:14:40 N1 kernel: ? irq_forced_thread_fn+0x6b/0x6b Jan 17 09:14:40 N1 kernel: ? free_percpu_irq+0x53/0x53 Jan 17 09:14:40 N1 kernel: kthread+0xe4/0xef Jan 17 09:14:40 N1 kernel: ? kthread_complete_and_exit+0x1b/0x1b Jan 17 09:14:40 N1 kernel: ret_from_fork+0x1f/0x30 Jan 17 09:14:40 N1 kernel: </TASK> Jan 17 09:14:40 N1 kernel: ---[ end trace 0000000000000000 ]--- Jan 17 09:14:40 N1 kernel: ------------[ cut here ]------------ Jan 17 09:14:40 N1 kernel: remove_proc_entry: removing non-empty directory 'irq/177', leaking at least 'eth0-TxRx-3' Jan 17 09:14:40 N1 kernel: WARNING: CPU: 5 PID: 183 at fs/proc/generic.c:718 remove_proc_entry+0x13e/0x16a Jan 17 09:14:40 N1 kernel: Modules linked in: xt_connmark xt_comment iptable_raw xt_mark xt_CHECKSUM ipt_REJECT nf_reject_ipv4 ip6table_mangle ip6table_nat iptable_mangle vhost_net tun vhost vhost_iotlb tap xt_nat xt_tcpudp veth xt_conntrack nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo xt_addrtype br_netfilter xfs md_mod nct6775 nct6775_core hwmon_vid efivarfs iptable_nat xt_MASQUERADE nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 wireguard curve25519_x86_64 libcurve25519_generic libchacha20poly1305 chacha_x86_64 poly1305_x86_64 ip6_udp_tunnel udp_tunnel libchacha ip6table_filter ip6_tables iptable_filter ip_tables x_tables af_packet 8021q garp mrp bridge stp llc bonding tls wmi_bmof x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel aesni_intel crypto_simd cryptd rapl intel_cstate intel_uncore i915 iosf_mbi drm_buddy i2c_algo_bit ttm drm_display_helper drm_kms_helper ahci i2c_i801 igc i2c_smbus drm libahci nvme Jan 17 09:14:40 N1 kernel: intel_gtt cp210x agpgart input_leds joydev led_class usbserial nvme_core i2c_core syscopyarea sysfillrect vmd sysimgblt fb_sys_fops thermal fan wmi video backlight tpm_crb tpm_tis tpm_tis_core tpm acpi_tad acpi_pad button unix Jan 17 09:14:40 N1 kernel: CPU: 5 PID: 183 Comm: irq/125-aerdrv Tainted: G W 5.19.17-Unraid #2 Jan 17 09:14:40 N1 kernel: Hardware name: ASUS System Product Name/ROG STRIX B660-I GAMING WIFI, BIOS 2212 12/13/2022 Jan 17 09:14:40 N1 kernel: RIP: 0010:remove_proc_entry+0x13e/0x16a Jan 17 09:14:40 N1 kernel: Code: 53 78 48 c7 c6 c0 b8 e2 81 48 c7 c7 13 7b 0e 82 48 8b 8b a0 00 00 00 4c 8b 80 a0 00 00 00 48 8b 92 a0 00 00 00 e8 c8 0b 59 00 <0f> 0b 48 89 df e8 75 fc ff ff 48 8b 44 24 10 65 48 2b 04 25 28 00 Jan 17 09:14:40 N1 kernel: RSP: 0018:ffffc90004817bb0 EFLAGS: 00010282 Jan 17 09:14:40 N1 kernel: RAX: 0000000000000000 RBX: ffff88815cc0ff00 RCX: 0000000000000027 Jan 17 09:14:40 N1 kernel: RDX: 0000000000000001 RSI: ffffffff820d7be1 RDI: 00000000ffffffff Jan 17 09:14:40 N1 kernel: RBP: ffffc90004817bee R08: 0000000000000000 R09: ffffffff82244bd0 Jan 17 09:14:40 N1 kernel: R10: 00007fffffffffff R11: ffffc90004817bf1 R12: ffff8881001bfe40 Jan 17 09:14:40 N1 kernel: R13: 00000000000000b2 R14: ffffffff82227bc0 R15: ffff888101732000 Jan 17 09:14:40 N1 kernel: FS: 0000000000000000(0000) GS:ffff88885f540000(0000) knlGS:0000000000000000 Jan 17 09:14:40 N1 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 Jan 17 09:14:40 N1 kernel: CR2: 00007ffe4d36ef80 CR3: 000000000420a005 CR4: 0000000000770ee0 Jan 17 09:14:40 N1 kernel: PKRU: 55555554 Jan 17 09:14:40 N1 kernel: Call Trace: Jan 17 09:14:40 N1 kernel: <TASK> Jan 17 09:14:40 N1 kernel: unregister_irq_proc+0xe0/0x100 Jan 17 09:14:40 N1 kernel: free_desc+0x1b/0x54 Jan 17 09:14:40 N1 kernel: irq_free_descs+0x41/0x75 Jan 17 09:14:40 N1 kernel: __msi_domain_free_irqs+0x69/0x9a Jan 17 09:14:40 N1 kernel: msi_domain_free_irqs_descs_locked+0x18/0x3e Jan 17 09:14:40 N1 kernel: pci_msi_teardown_msi_irqs+0x2a/0x40 Jan 17 09:14:40 N1 kernel: free_msi_irqs+0xe/0x2e Jan 17 09:14:40 N1 kernel: pci_disable_msix+0xcd/0xe7 Jan 17 09:14:40 N1 kernel: igc_reset_interrupt_capability+0x20/0x5b [igc] Jan 17 09:14:40 N1 kernel: __igc_open+0x136/0x454 [igc] Jan 17 09:14:40 N1 kernel: ? aer_isr+0x173/0x173 Jan 17 09:14:40 N1 kernel: igc_io_resume+0x2a/0x56 [igc] Jan 17 09:14:40 N1 kernel: report_resume+0x53/0x74 Jan 17 09:14:40 N1 kernel: pci_walk_bus+0x79/0x82 Jan 17 09:14:40 N1 kernel: ? aer_dev_correctable_show+0x98/0x98 Jan 17 09:14:40 N1 kernel: pcie_do_recovery+0x132/0x1a6 Jan 17 09:14:40 N1 kernel: aer_process_err_devices+0xbe/0xd7 Jan 17 09:14:40 N1 kernel: aer_isr+0x143/0x173 Jan 17 09:14:40 N1 kernel: ? __schedule+0x59e/0x5f6 Jan 17 09:14:40 N1 kernel: ? irq_finalize_oneshot+0xaa/0xaa Jan 17 09:14:40 N1 kernel: irq_thread_fn+0x1b/0x45 Jan 17 09:14:40 N1 kernel: irq_thread+0x11a/0x182 Jan 17 09:14:40 N1 kernel: ? irq_forced_thread_fn+0x6b/0x6b Jan 17 09:14:40 N1 kernel: ? free_percpu_irq+0x53/0x53 Jan 17 09:14:40 N1 kernel: kthread+0xe4/0xef Jan 17 09:14:40 N1 kernel: ? kthread_complete_and_exit+0x1b/0x1b Jan 17 09:14:40 N1 kernel: ret_from_fork+0x1f/0x30 Jan 17 09:14:40 N1 kernel: </TASK> Jan 17 09:14:40 N1 kernel: ---[ end trace 0000000000000000 ]--- Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2: AER: device recovery successful Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2: AER: Multiple Corrected error received: 0000:00:1c.2 Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID) Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2: device [8086:7aba] error status/mask=00003140/00002000 Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2: [ 6] BadTLP Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2: [ 8] Rollover Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2: [12] Timeout Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2: AER: Multiple Corrected error received: 0000:00:1c.2 Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2: AER: can't find device of ID00e2 Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2: AER: Multiple Corrected error received: 0000:00:1c.2 Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2: AER: can't find device of ID00e2 Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2: AER: Multiple Corrected error received: 0000:00:1c.2 Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2: AER: can't find device of ID00e2 Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2: AER: Multiple Corrected error received: 0000:00:1c.2 Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2: AER: can't find device of ID00e2 Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2: AER: Multiple Corrected error received: 0000:00:1c.2 Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2: AER: can't find device of ID00e2 Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2: AER: Multiple Corrected error received: 0000:00:1c.2 Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2: AER: can't find device of ID00e2 Jan 17 09:15:50 N1 login: pam_unix(login:session): session opened for user root(uid=0) by LOGIN(uid=0) Jan 17 09:15:50 N1 login: ROOT LOGIN ON tty1 Jan 17 09:17:01 N1 root: ACPI action up is not defined Jan 17 09:17:02 N1 root: ACPI action left is not defined Jan 17 09:17:02 N1 root: ACPI action left is not defined Jan 17 09:17:02 N1 root: ACPI action left is not defined Jan 17 09:17:02 N1 root: ACPI action left is not defined Jan 17 09:17:02 N1 root: ACPI action left is not defined Jan 17 09:17:03 N1 root: ACPI action left is not defined Jan 17 09:17:03 N1 root: ACPI action left is not defined syslog2 Edited January 17, 2023 by j5i7 Quote Link to comment
Vr2Io Posted January 17, 2023 Share Posted January 17, 2023 (edited) 2 hours ago, j5i7 said: I do have the pcie slot on my motherboard free Overlook your mobo support 2 onboard M2 nvme, so I assume the slot in use. 2 hours ago, j5i7 said: or hard to say)? Yes As NIC driver crash, so, it won't work again even you plug / unplug cable until reboot. And generate PCIe AER error also not surprise. Found a similar report, but haven't confirm solve, pls also try disable docker / VM. ** OP change from Realtek to Intel still got problem. ** Edited January 17, 2023 by Vr2Io Quote Link to comment
j5i7 Posted February 8, 2023 Author Share Posted February 8, 2023 Not a big update but I went ~20 days without an issue, then had it happen again. The only correlation I can fathom is that I turned on download overnight and it happened that same night. However, I haven't had any issues when streaming countless hours of TV off my NAS. Anyway, rather than throwing alternative hardware at it for the moment, I just created a script that scans the syslog for the ethernet crash message and will reboot the server if detected. Hopefully some driver update down the line fixes my issue, but for my current use case, it's not the end of the world for the server to gracefully restart every few weeks. 2 Quote Link to comment
MACGoof Posted January 5 Share Posted January 5 @j5i7- Would you mind sharing your script please? I am having similar issues and am considering purchasing a pcie nic for testing. In the meantime, your script could be a big help. I am hesitant to create a new post for my issue as it seems redundant. Quote Link to comment
j5i7 Posted January 5 Author Share Posted January 5 1 hour ago, MACGoof said: @j5i7- Would you mind sharing your script please? I am having similar issues and am considering purchasing a pcie nic for testing. In the meantime, your script could be a big help. I am hesitant to create a new post for my issue as it seems redundant. I’ll share it when I can, unfortunately it’s an unlucky time as I’m in the middle of a move and my server is packed away and I don’t think I have a copy of the script elsewhere. However, in general I searched for an existing script that monitored the syslog, then modified it to look for the Ethernet disconnected message and when found, restart the server. Then I set the script to run every hour. I also added logging etc, I think it’s run about twice in the last six months or so. If I don’t post it in the upcoming week, remind me. 1 Quote Link to comment
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.