
Intel Ethernet Adapter Sporadically Losing Connectivity Until Reboot


j5i7


Hello all, I've been really enjoying Unraid for the couple of months I've had it, but I'm running into an annoying hiccup that I was hoping some log / network wizard could help me with.

 

Sporadically (every 1-2 weeks) my Unraid server completely loses network connectivity. The server itself keeps running fine if I physically access it with a mouse and keyboard, but no traffic goes in or out: Docker, ping, and the dashboard are inaccessible from LAN or WAN, and I can't ping my router from the server. If I reboot the server, everything works fine until the error occurs again. I can't pinpoint anything that triggers it.

 

I finally caught it again today and have attached the syslog from the error. I've also attached my diagnostics zip, but that was taken after rebooting. The attached syslog is from before the reboot, and I've put the critical excerpt at the bottom of this message. I'm not great at understanding this, but is this an Intel driver error or a hardware error?

 

Thanks for any and all guidance. Worst case... maybe the server could just gracefully reboot if this error is detected.

 

Quote

Jan 16 14:17:25 N1 kernel: igc 0000:04:00.0 eth0: PCIe link lost, device now detached
Jan 16 14:17:25 N1 kernel: ------------[ cut here ]------------
Jan 16 14:17:25 N1 kernel: igc: Failed to read reg 0xc030!
Jan 16 14:17:25 N1 kernel: WARNING: CPU: 5 PID: 25835 at drivers/net/ethernet/intel/igc/igc_main.c:6186 igc_rd32+0x76/0x8b [igc]
Jan 16 14:17:25 N1 kernel: Modules linked in: xt_connmark xt_mark iptable_mangle xt_comment iptable_raw xt_nat xt_tcpudp veth xt_conntrack nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo xt_addrtype br_netfilter xfs md_mod nct6775 nct6775_core hwmon_vid efivarfs iptable_nat xt_MASQUERADE nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 wireguard curve25519_x86_64 libcurve25519_generic libchacha20poly1305 chacha_x86_64 poly1305_x86_64 ip6_udp_tunnel udp_tunnel libchacha ip6table_filter ip6_tables iptable_filter ip_tables x_tables af_packet 8021q garp mrp bridge stp llc bonding tls wmi_bmof x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel aesni_intel crypto_simd cryptd rapl intel_cstate intel_uncore i915 iosf_mbi drm_buddy i2c_algo_bit ttm drm_display_helper drm_kms_helper drm ahci i2c_i801 intel_gtt igc agpgart i2c_smbus libahci i2c_core nvme cp210x syscopyarea input_leds sysfillrect usbserial joydev led_class nvme_core vmd
Jan 16 14:17:25 N1 kernel: sysimgblt fb_sys_fops thermal fan wmi video backlight tpm_crb tpm_tis tpm_tis_core tpm acpi_tad acpi_pad button unix
Jan 16 14:17:25 N1 kernel: CPU: 5 PID: 25835 Comm: kworker/5:3 Not tainted 5.19.17-Unraid #2
Jan 16 14:17:25 N1 kernel: Hardware name: ASUS System Product Name/ROG STRIX B660-I GAMING WIFI, BIOS 2012 10/11/2022
Jan 16 14:17:25 N1 kernel: Workqueue: events igc_watchdog_task [igc]
Jan 16 14:17:25 N1 kernel: RIP: 0010:igc_rd32+0x76/0x8b [igc]
Jan 16 14:17:25 N1 kernel: Code: 8b bb 28 ff ff ff e8 a3 b5 26 e1 84 c0 75 0d 83 c8 ff eb 1a 8b 02 ff c0 75 f5 eb bf 89 ee 48 c7 c7 70 8f 22 a0 e8 b3 0d 60 e1 <0f> 0b eb e1 5b 5d 41 5c c3 cc cc cc cc 83 c8 ff c3 cc cc cc cc 0f
Jan 16 14:17:25 N1 kernel: RSP: 0018:ffffc90002c53e00 EFLAGS: 00010286
Jan 16 14:17:25 N1 kernel: RAX: 0000000000000000 RBX: ffff8881549f0b98 RCX: 0000000000000027
Jan 16 14:17:25 N1 kernel: RDX: 0000000000000002 RSI: ffffffff820d7be1 RDI: 00000000ffffffff
Jan 16 14:17:25 N1 kernel: RBP: 000000000000c030 R08: 0000000000000000 R09: ffffffff82244bd0
Jan 16 14:17:25 N1 kernel: R10: 00007fffffffffff R11: ffffffff82877526 R12: ffff8881549f0000
Jan 16 14:17:25 N1 kernel: R13: 000000000000c030 R14: ffff888106c4cc80 R15: 0000000000000000
Jan 16 14:17:25 N1 kernel: FS:  0000000000000000(0000) GS:ffff88885f540000(0000) knlGS:0000000000000000
Jan 16 14:17:25 N1 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jan 16 14:17:25 N1 kernel: CR2: 00001470f9928740 CR3: 000000000420a002 CR4: 0000000000770ee0
Jan 16 14:17:25 N1 kernel: PKRU: 55555554
Jan 16 14:17:25 N1 kernel: Call Trace:
Jan 16 14:17:25 N1 kernel: <TASK>
Jan 16 14:17:25 N1 kernel: igc_update_stats+0x70/0x6a2 [igc]
Jan 16 14:17:25 N1 kernel: igc_watchdog_task+0x322/0x44b [igc]
Jan 16 14:17:25 N1 kernel: process_one_work+0x1a8/0x295
Jan 16 14:17:25 N1 kernel: worker_thread+0x18b/0x244
Jan 16 14:17:25 N1 kernel: ? rescuer_thread+0x281/0x281
Jan 16 14:17:25 N1 kernel: kthread+0xe4/0xef
Jan 16 14:17:25 N1 kernel: ? kthread_complete_and_exit+0x1b/0x1b
Jan 16 14:17:25 N1 kernel: ret_from_fork+0x1f/0x30
Jan 16 14:17:25 N1 kernel: </TASK>
Jan 16 14:17:25 N1 kernel: ---[ end trace 0000000000000000 ]---
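
On the "gracefully reboot if this error is detected" idea, here is a minimal sketch of how a watchdog script could spot the failure signature from the excerpt above. It only decides whether the signature is present; the function names, the default syslog path, and hooking the result up to cron and a reboot command are assumptions for illustration, not an existing Unraid feature.

```python
import re

# Signature the igc driver logged when the NIC dropped off the bus
# (taken from the excerpt above). The capture group is the interface name.
LINK_LOST = re.compile(r"kernel: igc \S+ (\S+): PCIe link lost, device now detached")


def find_detached_interfaces(lines):
    """Return interface names (e.g. 'eth0') that igc reported as detached."""
    found = []
    for line in lines:
        match = LINK_LOST.search(line)
        if match and match.group(1) not in found:
            found.append(match.group(1))
    return found


def should_reboot(syslog_path="/var/log/syslog"):
    """Hypothetical helper: True if the syslog contains the signature.

    The default path is an assumption; adjust it to wherever the log lives.
    """
    with open(syslog_path, errors="replace") as handle:
        return bool(find_detached_interfaces(handle))
```

A scheduled job could call `should_reboot()` every few minutes and trigger a clean shutdown only when it returns `True`. That would be a workaround rather than a fix, since the link loss itself would still happen.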

 

syslog n1-diagnostics-20230116-1703.zip

10 hours ago, Vr2Io said:

Looks like an igc driver issue (I225 NIC). Using a different kind of NIC should be a quick way to verify the problem, but your ITX board doesn't seem to have a PCIe slot available.

 

I do have the PCIe slot on my motherboard free and could put a half-height NIC in if need be (or a USB-to-Ethernet adapter, but I have no clue about the USB overhead here). I don't currently own one, but may get one if need be.

 

Do you think the log looks like this was an Intel driver issue? Or is this a PCIe link issue that then resulted in the driver throwing an error (or is it hard to say)?

 

9 minutes ago, MAM59 said:

Check the cables. LAN >1G is much more dependent on correct and shielded connections. They SAY you can use normal cables, they LIE!

"if there is a link, all is fine" is not true anymore.

 

 

 

The only 2.5 Gb cable that is isolated to this connection is a single ~6 ft cable from my 2.5 Gb switch to my Unraid box. I'll try swapping it for another cable that's known to be at least Cat 6 and go from there (I think the current cable came with my Steam Link).

 

 


It happened again right before I changed the Ethernet cable. The log is a bit different this time; it's attached here again and partially quoted below. I updated my mobo to the latest BIOS revision, which should be the only change between this occurrence and the prior one. Since this error message I've changed my Ethernet cable. If I get another error, I'm tempted to turn off ASPM in the BIOS as my next step (hoping that won't significantly affect power draw).
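
Before changing the BIOS setting, it may help to record which ASPM policy the kernel is currently using, so there is a before/after comparison. A minimal sketch, assuming the kernel was built with PCIe ASPM support and exposes the usual sysfs file, where the active policy is the entry in square brackets; the function names are made up for illustration.

```python
def active_aspm_policy(policy_text):
    """Return the bracketed (active) entry from the pcie_aspm policy string."""
    for token in policy_text.split():
        if token.startswith("[") and token.endswith("]"):
            return token[1:-1]
    return None


def read_aspm_policy(path="/sys/module/pcie_aspm/parameters/policy"):
    """Read the policy from sysfs; the file is absent if ASPM support is not built in."""
    with open(path) as handle:
        return active_aspm_policy(handle.read())
```

For a file containing `[default] performance powersave powersupersave`, `active_aspm_policy` returns `default`, meaning the kernel leaves ASPM as the firmware configured it.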

 

 

Quote

Jan 17 04:40:01 N1 root: Fix Common Problems Version 2022.12.18
Jan 17 09:14:33 N1 kernel: pcieport 0000:00:1c.2: AER: Corrected error received: 0000:00:1c.2
Jan 17 09:14:33 N1 kernel: pcieport 0000:00:1c.2: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Receiver ID)
Jan 17 09:14:33 N1 kernel: pcieport 0000:00:1c.2:   device [8086:7aba] error status/mask=00000040/00002000
Jan 17 09:14:33 N1 kernel: pcieport 0000:00:1c.2:    [ 6] BadTLP                
Jan 17 09:14:33 N1 kernel: pcieport 0000:00:1c.2: AER: Uncorrected (Fatal) error received: 0000:00:1c.2
Jan 17 09:14:33 N1 kernel: pcieport 0000:00:1c.2: PCIe Bus Error: severity=Uncorrected (Fatal), type=Data Link Layer, (Receiver ID)
Jan 17 09:14:33 N1 kernel: pcieport 0000:00:1c.2:   device [8086:7aba] error status/mask=00000010/00000000
Jan 17 09:14:33 N1 kernel: pcieport 0000:00:1c.2:    [ 4] DLP                    (First)
Jan 17 09:14:33 N1 kernel: bond0: (slave eth0): link status definitely down, disabling slave
Jan 17 09:14:33 N1 kernel: device eth0 left promiscuous mode
Jan 17 09:14:33 N1 kernel: bond0: now running without any active interface!
Jan 17 09:14:33 N1 kernel: br0: port 1(bond0) entered disabled state
Jan 17 09:14:34 N1  dhcpcd[1065]: br0: carrier lost
Jan 17 09:14:34 N1  avahi-daemon[7398]: Withdrawing address record for 10.2.2.2 on br0.
Jan 17 09:14:34 N1  avahi-daemon[7398]: Leaving mDNS multicast group on interface br0.IPv4 with address 10.2.2.2.
Jan 17 09:14:34 N1  avahi-daemon[7398]: Interface br0.IPv4 no longer relevant for mDNS.
Jan 17 09:14:34 N1  dhcpcd[1065]: br0: deleting route to 10.2.2.0/24
Jan 17 09:14:34 N1  dhcpcd[1065]: br0: deleting default route via 10.2.2.1
Jan 17 09:14:34 N1 dnsmasq[8883]: no servers found in /etc/resolv.conf, will retry
Jan 17 09:14:34 N1 kernel: pcieport 0000:00:1c.2: AER: Root Port link has been reset (0)
Jan 17 09:14:34 N1 kernel: igc 0000:04:00.0: Unable to change power state from D3cold to D0, device inaccessible
Jan 17 09:14:34 N1 kernel: igc 0000:04:00.0 eth0: PCIe link lost, device now detached
Jan 17 09:14:36 N1  ntpd[1274]: Deleting interface #1 br0, 10.2.2.2#123, interface stats: received=0, sent=0, dropped=0, active_time=54185 secs
Jan 17 09:14:40 N1 kernel: genirq: Flags mismatch irq 173. 00000000 (eth0) vs. 00000000 (eth0)
Jan 17 09:14:40 N1 kernel: ------------[ cut here ]------------
Jan 17 09:14:40 N1 kernel: remove_proc_entry: removing non-empty directory 'irq/173', leaking at least 'eth0'
Jan 17 09:14:40 N1 kernel: WARNING: CPU: 5 PID: 183 at fs/proc/generic.c:718 remove_proc_entry+0x13e/0x16a
Jan 17 09:14:40 N1 kernel: Modules linked in: xt_connmark xt_comment iptable_raw xt_mark xt_CHECKSUM ipt_REJECT nf_reject_ipv4 ip6table_mangle ip6table_nat iptable_mangle vhost_net tun vhost vhost_iotlb tap xt_nat xt_tcpudp veth xt_conntrack nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo xt_addrtype br_netfilter xfs md_mod nct6775 nct6775_core hwmon_vid efivarfs iptable_nat xt_MASQUERADE nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 wireguard curve25519_x86_64 libcurve25519_generic libchacha20poly1305 chacha_x86_64 poly1305_x86_64 ip6_udp_tunnel udp_tunnel libchacha ip6table_filter ip6_tables iptable_filter ip_tables x_tables af_packet 8021q garp mrp bridge stp llc bonding tls wmi_bmof x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel aesni_intel crypto_simd cryptd rapl intel_cstate intel_uncore i915 iosf_mbi drm_buddy i2c_algo_bit ttm drm_display_helper drm_kms_helper ahci i2c_i801 igc i2c_smbus drm libahci nvme
Jan 17 09:14:40 N1 kernel: intel_gtt cp210x agpgart input_leds joydev led_class usbserial nvme_core i2c_core syscopyarea sysfillrect vmd sysimgblt fb_sys_fops thermal fan wmi video backlight tpm_crb tpm_tis tpm_tis_core tpm acpi_tad acpi_pad button unix
Jan 17 09:14:40 N1 kernel: CPU: 5 PID: 183 Comm: irq/125-aerdrv Not tainted 5.19.17-Unraid #2
Jan 17 09:14:40 N1 kernel: Hardware name: ASUS System Product Name/ROG STRIX B660-I GAMING WIFI, BIOS 2212 12/13/2022
Jan 17 09:14:40 N1 kernel: RIP: 0010:remove_proc_entry+0x13e/0x16a
Jan 17 09:14:40 N1 kernel: Code: 53 78 48 c7 c6 c0 b8 e2 81 48 c7 c7 13 7b 0e 82 48 8b 8b a0 00 00 00 4c 8b 80 a0 00 00 00 48 8b 92 a0 00 00 00 e8 c8 0b 59 00 <0f> 0b 48 89 df e8 75 fc ff ff 48 8b 44 24 10 65 48 2b 04 25 28 00
Jan 17 09:14:40 N1 kernel: RSP: 0018:ffffc90004817bb0 EFLAGS: 00010282
Jan 17 09:14:40 N1 kernel: RAX: 0000000000000000 RBX: ffff8881050a43c0 RCX: 0000000000000027
Jan 17 09:14:40 N1 kernel: RDX: 0000000000000001 RSI: ffffffff820d7be1 RDI: 00000000ffffffff
Jan 17 09:14:40 N1 kernel: RBP: ffffc90004817bee R08: 0000000000000000 R09: ffffffff82244bd0
Jan 17 09:14:40 N1 kernel: R10: 00007fffffffffff R11: ffffc90004817bf1 R12: ffff8881001bfe40
Jan 17 09:14:40 N1 kernel: R13: 00000000000000ae R14: ffffffff82227bc0 R15: ffff888101732000
Jan 17 09:14:40 N1 kernel: FS:  0000000000000000(0000) GS:ffff88885f540000(0000) knlGS:0000000000000000
Jan 17 09:14:40 N1 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jan 17 09:14:40 N1 kernel: CR2: 00007ffe4d36ef80 CR3: 000000000420a005 CR4: 0000000000770ee0
Jan 17 09:14:40 N1 kernel: PKRU: 55555554
Jan 17 09:14:40 N1 kernel: Call Trace:
Jan 17 09:14:40 N1 kernel: <TASK>
Jan 17 09:14:40 N1 kernel: unregister_irq_proc+0xe0/0x100
Jan 17 09:14:40 N1 kernel: free_desc+0x1b/0x54
Jan 17 09:14:40 N1 kernel: irq_free_descs+0x41/0x75
Jan 17 09:14:40 N1 kernel: __msi_domain_free_irqs+0x69/0x9a
Jan 17 09:14:40 N1 kernel: msi_domain_free_irqs_descs_locked+0x18/0x3e
Jan 17 09:14:40 N1 kernel: pci_msi_teardown_msi_irqs+0x2a/0x40
Jan 17 09:14:40 N1 kernel: free_msi_irqs+0xe/0x2e
Jan 17 09:14:40 N1 kernel: pci_disable_msix+0xcd/0xe7
Jan 17 09:14:40 N1 kernel: igc_reset_interrupt_capability+0x20/0x5b [igc]
Jan 17 09:14:40 N1 kernel: __igc_open+0x136/0x454 [igc]
Jan 17 09:14:40 N1 kernel: ? aer_isr+0x173/0x173
Jan 17 09:14:40 N1 kernel: igc_io_resume+0x2a/0x56 [igc]
Jan 17 09:14:40 N1 kernel: report_resume+0x53/0x74
Jan 17 09:14:40 N1 kernel: pci_walk_bus+0x79/0x82
Jan 17 09:14:40 N1 kernel: ? aer_dev_correctable_show+0x98/0x98
Jan 17 09:14:40 N1 kernel: pcie_do_recovery+0x132/0x1a6
Jan 17 09:14:40 N1 kernel: aer_process_err_devices+0xbe/0xd7
Jan 17 09:14:40 N1 kernel: aer_isr+0x143/0x173
Jan 17 09:14:40 N1 kernel: ? __schedule+0x59e/0x5f6
Jan 17 09:14:40 N1 kernel: ? irq_finalize_oneshot+0xaa/0xaa
Jan 17 09:14:40 N1 kernel: irq_thread_fn+0x1b/0x45
Jan 17 09:14:40 N1 kernel: irq_thread+0x11a/0x182
Jan 17 09:14:40 N1 kernel: ? irq_forced_thread_fn+0x6b/0x6b
Jan 17 09:14:40 N1 kernel: ? free_percpu_irq+0x53/0x53
Jan 17 09:14:40 N1 kernel: kthread+0xe4/0xef
Jan 17 09:14:40 N1 kernel: ? kthread_complete_and_exit+0x1b/0x1b
Jan 17 09:14:40 N1 kernel: ret_from_fork+0x1f/0x30
Jan 17 09:14:40 N1 kernel: </TASK>
Jan 17 09:14:40 N1 kernel: ---[ end trace 0000000000000000 ]---
Jan 17 09:14:40 N1 kernel: ------------[ cut here ]------------
Jan 17 09:14:40 N1 kernel: remove_proc_entry: removing non-empty directory 'irq/174', leaking at least 'eth0-TxRx-0'
Jan 17 09:14:40 N1 kernel: WARNING: CPU: 5 PID: 183 at fs/proc/generic.c:718 remove_proc_entry+0x13e/0x16a
Jan 17 09:14:40 N1 kernel: Modules linked in: xt_connmark xt_comment iptable_raw xt_mark xt_CHECKSUM ipt_REJECT nf_reject_ipv4 ip6table_mangle ip6table_nat iptable_mangle vhost_net tun vhost vhost_iotlb tap xt_nat xt_tcpudp veth xt_conntrack nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo xt_addrtype br_netfilter xfs md_mod nct6775 nct6775_core hwmon_vid efivarfs iptable_nat xt_MASQUERADE nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 wireguard curve25519_x86_64 libcurve25519_generic libchacha20poly1305 chacha_x86_64 poly1305_x86_64 ip6_udp_tunnel udp_tunnel libchacha ip6table_filter ip6_tables iptable_filter ip_tables x_tables af_packet 8021q garp mrp bridge stp llc bonding tls wmi_bmof x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel aesni_intel crypto_simd cryptd rapl intel_cstate intel_uncore i915 iosf_mbi drm_buddy i2c_algo_bit ttm drm_display_helper drm_kms_helper ahci i2c_i801 igc i2c_smbus drm libahci nvme
Jan 17 09:14:40 N1 kernel: intel_gtt cp210x agpgart input_leds joydev led_class usbserial nvme_core i2c_core syscopyarea sysfillrect vmd sysimgblt fb_sys_fops thermal fan wmi video backlight tpm_crb tpm_tis tpm_tis_core tpm acpi_tad acpi_pad button unix
Jan 17 09:14:40 N1 kernel: CPU: 5 PID: 183 Comm: irq/125-aerdrv Tainted: G        W         5.19.17-Unraid #2
Jan 17 09:14:40 N1 kernel: Hardware name: ASUS System Product Name/ROG STRIX B660-I GAMING WIFI, BIOS 2212 12/13/2022
Jan 17 09:14:40 N1 kernel: RIP: 0010:remove_proc_entry+0x13e/0x16a
Jan 17 09:14:40 N1 kernel: Code: 53 78 48 c7 c6 c0 b8 e2 81 48 c7 c7 13 7b 0e 82 48 8b 8b a0 00 00 00 4c 8b 80 a0 00 00 00 48 8b 92 a0 00 00 00 e8 c8 0b 59 00 <0f> 0b 48 89 df e8 75 fc ff ff 48 8b 44 24 10 65 48 2b 04 25 28 00
Jan 17 09:14:40 N1 kernel: RSP: 0018:ffffc90004817bb0 EFLAGS: 00010282
Jan 17 09:14:40 N1 kernel: RAX: 0000000000000000 RBX: ffff8881050a4a80 RCX: 0000000000000027
Jan 17 09:14:40 N1 kernel: RDX: 0000000000000001 RSI: ffffffff820d7be1 RDI: 00000000ffffffff
Jan 17 09:14:40 N1 kernel: RBP: ffffc90004817bee R08: 0000000000000000 R09: ffffffff82244bd0
Jan 17 09:14:40 N1 kernel: R10: 00007fffffffffff R11: ffffc90004817bf1 R12: ffff8881001bfe40
Jan 17 09:14:40 N1 kernel: R13: 00000000000000af R14: ffffffff82227bc0 R15: ffff888101732000
Jan 17 09:14:40 N1 kernel: FS:  0000000000000000(0000) GS:ffff88885f540000(0000) knlGS:0000000000000000
Jan 17 09:14:40 N1 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jan 17 09:14:40 N1 kernel: CR2: 00007ffe4d36ef80 CR3: 000000000420a005 CR4: 0000000000770ee0
Jan 17 09:14:40 N1 kernel: PKRU: 55555554
Jan 17 09:14:40 N1 kernel: Call Trace:
Jan 17 09:14:40 N1 kernel: <TASK>
Jan 17 09:14:40 N1 kernel: unregister_irq_proc+0xe0/0x100
Jan 17 09:14:40 N1 kernel: free_desc+0x1b/0x54
Jan 17 09:14:40 N1 kernel: irq_free_descs+0x41/0x75
Jan 17 09:14:40 N1 kernel: __msi_domain_free_irqs+0x69/0x9a
Jan 17 09:14:40 N1 kernel: msi_domain_free_irqs_descs_locked+0x18/0x3e
Jan 17 09:14:40 N1 kernel: pci_msi_teardown_msi_irqs+0x2a/0x40
Jan 17 09:14:40 N1 kernel: free_msi_irqs+0xe/0x2e
Jan 17 09:14:40 N1 kernel: pci_disable_msix+0xcd/0xe7
Jan 17 09:14:40 N1 kernel: igc_reset_interrupt_capability+0x20/0x5b [igc]
Jan 17 09:14:40 N1 kernel: __igc_open+0x136/0x454 [igc]
Jan 17 09:14:40 N1 kernel: ? aer_isr+0x173/0x173
Jan 17 09:14:40 N1 kernel: igc_io_resume+0x2a/0x56 [igc]
Jan 17 09:14:40 N1 kernel: report_resume+0x53/0x74
Jan 17 09:14:40 N1 kernel: pci_walk_bus+0x79/0x82
Jan 17 09:14:40 N1 kernel: ? aer_dev_correctable_show+0x98/0x98
Jan 17 09:14:40 N1 kernel: pcie_do_recovery+0x132/0x1a6
Jan 17 09:14:40 N1 kernel: aer_process_err_devices+0xbe/0xd7
Jan 17 09:14:40 N1 kernel: aer_isr+0x143/0x173
Jan 17 09:14:40 N1 kernel: ? __schedule+0x59e/0x5f6
Jan 17 09:14:40 N1 kernel: ? irq_finalize_oneshot+0xaa/0xaa
Jan 17 09:14:40 N1 kernel: irq_thread_fn+0x1b/0x45
Jan 17 09:14:40 N1 kernel: irq_thread+0x11a/0x182
Jan 17 09:14:40 N1 kernel: ? irq_forced_thread_fn+0x6b/0x6b
Jan 17 09:14:40 N1 kernel: ? free_percpu_irq+0x53/0x53
Jan 17 09:14:40 N1 kernel: kthread+0xe4/0xef
Jan 17 09:14:40 N1 kernel: ? kthread_complete_and_exit+0x1b/0x1b
Jan 17 09:14:40 N1 kernel: ret_from_fork+0x1f/0x30
Jan 17 09:14:40 N1 kernel: </TASK>
Jan 17 09:14:40 N1 kernel: ---[ end trace 0000000000000000 ]---
Jan 17 09:14:40 N1 kernel: ------------[ cut here ]------------
Jan 17 09:14:40 N1 kernel: remove_proc_entry: removing non-empty directory 'irq/175', leaking at least 'eth0-TxRx-1'
Jan 17 09:14:40 N1 kernel: WARNING: CPU: 5 PID: 183 at fs/proc/generic.c:718 remove_proc_entry+0x13e/0x16a
Jan 17 09:14:40 N1 kernel: Modules linked in: xt_connmark xt_comment iptable_raw xt_mark xt_CHECKSUM ipt_REJECT nf_reject_ipv4 ip6table_mangle ip6table_nat iptable_mangle vhost_net tun vhost vhost_iotlb tap xt_nat xt_tcpudp veth xt_conntrack nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo xt_addrtype br_netfilter xfs md_mod nct6775 nct6775_core hwmon_vid efivarfs iptable_nat xt_MASQUERADE nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 wireguard curve25519_x86_64 libcurve25519_generic libchacha20poly1305 chacha_x86_64 poly1305_x86_64 ip6_udp_tunnel udp_tunnel libchacha ip6table_filter ip6_tables iptable_filter ip_tables x_tables af_packet 8021q garp mrp bridge stp llc bonding tls wmi_bmof x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel aesni_intel crypto_simd cryptd rapl intel_cstate intel_uncore i915 iosf_mbi drm_buddy i2c_algo_bit ttm drm_display_helper drm_kms_helper ahci i2c_i801 igc i2c_smbus drm libahci nvme
Jan 17 09:14:40 N1 kernel: intel_gtt cp210x agpgart input_leds joydev led_class usbserial nvme_core i2c_core syscopyarea sysfillrect vmd sysimgblt fb_sys_fops thermal fan wmi video backlight tpm_crb tpm_tis tpm_tis_core tpm acpi_tad acpi_pad button unix
Jan 17 09:14:40 N1 kernel: CPU: 5 PID: 183 Comm: irq/125-aerdrv Tainted: G        W         5.19.17-Unraid #2
Jan 17 09:14:40 N1 kernel: Hardware name: ASUS System Product Name/ROG STRIX B660-I GAMING WIFI, BIOS 2212 12/13/2022
Jan 17 09:14:40 N1 kernel: RIP: 0010:remove_proc_entry+0x13e/0x16a
Jan 17 09:14:40 N1 kernel: Code: 53 78 48 c7 c6 c0 b8 e2 81 48 c7 c7 13 7b 0e 82 48 8b 8b a0 00 00 00 4c 8b 80 a0 00 00 00 48 8b 92 a0 00 00 00 e8 c8 0b 59 00 <0f> 0b 48 89 df e8 75 fc ff ff 48 8b 44 24 10 65 48 2b 04 25 28 00
Jan 17 09:14:40 N1 kernel: RSP: 0018:ffffc90004817bb0 EFLAGS: 00010282
Jan 17 09:14:40 N1 kernel: RAX: 0000000000000000 RBX: ffff88815cc0f180 RCX: 0000000000000003
Jan 17 09:14:40 N1 kernel: RDX: 0000000000000001 RSI: 0000000000000003 RDI: 00000000ffffffff
Jan 17 09:14:40 N1 kernel: RBP: ffffc90004817bee R08: 0000000000000000 R09: ffffffff82244bd0
Jan 17 09:14:40 N1 kernel: R10: 00007fffffffffff R11: ffffc90004817bf1 R12: ffff8881001bfe40
Jan 17 09:14:40 N1 kernel: R13: 00000000000000b0 R14: ffffffff82227bc0 R15: ffff888101732000
Jan 17 09:14:40 N1 kernel: FS:  0000000000000000(0000) GS:ffff88885f540000(0000) knlGS:0000000000000000
Jan 17 09:14:40 N1 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jan 17 09:14:40 N1 kernel: CR2: 00007ffe4d36ef80 CR3: 000000000420a005 CR4: 0000000000770ee0
Jan 17 09:14:40 N1 kernel: PKRU: 55555554
Jan 17 09:14:40 N1 kernel: Call Trace:
Jan 17 09:14:40 N1 kernel: <TASK>
Jan 17 09:14:40 N1 kernel: unregister_irq_proc+0xe0/0x100
Jan 17 09:14:40 N1 kernel: free_desc+0x1b/0x54
Jan 17 09:14:40 N1 kernel: irq_free_descs+0x41/0x75
Jan 17 09:14:40 N1 kernel: __msi_domain_free_irqs+0x69/0x9a
Jan 17 09:14:40 N1 kernel: msi_domain_free_irqs_descs_locked+0x18/0x3e
Jan 17 09:14:40 N1 kernel: pci_msi_teardown_msi_irqs+0x2a/0x40
Jan 17 09:14:40 N1 kernel: free_msi_irqs+0xe/0x2e
Jan 17 09:14:40 N1 kernel: pci_disable_msix+0xcd/0xe7
Jan 17 09:14:40 N1 kernel: igc_reset_interrupt_capability+0x20/0x5b [igc]
Jan 17 09:14:40 N1 kernel: __igc_open+0x136/0x454 [igc]
Jan 17 09:14:40 N1 kernel: ? aer_isr+0x173/0x173
Jan 17 09:14:40 N1 kernel: igc_io_resume+0x2a/0x56 [igc]
Jan 17 09:14:40 N1 kernel: report_resume+0x53/0x74
Jan 17 09:14:40 N1 kernel: pci_walk_bus+0x79/0x82
Jan 17 09:14:40 N1 kernel: ? aer_dev_correctable_show+0x98/0x98
Jan 17 09:14:40 N1 kernel: pcie_do_recovery+0x132/0x1a6
Jan 17 09:14:40 N1 kernel: aer_process_err_devices+0xbe/0xd7
Jan 17 09:14:40 N1 kernel: aer_isr+0x143/0x173
Jan 17 09:14:40 N1 kernel: ? __schedule+0x59e/0x5f6
Jan 17 09:14:40 N1 kernel: ? irq_finalize_oneshot+0xaa/0xaa
Jan 17 09:14:40 N1 kernel: irq_thread_fn+0x1b/0x45
Jan 17 09:14:40 N1 kernel: irq_thread+0x11a/0x182
Jan 17 09:14:40 N1 kernel: ? irq_forced_thread_fn+0x6b/0x6b
Jan 17 09:14:40 N1 kernel: ? free_percpu_irq+0x53/0x53
Jan 17 09:14:40 N1 kernel: kthread+0xe4/0xef
Jan 17 09:14:40 N1 kernel: ? kthread_complete_and_exit+0x1b/0x1b
Jan 17 09:14:40 N1 kernel: ret_from_fork+0x1f/0x30
Jan 17 09:14:40 N1 kernel: </TASK>
Jan 17 09:14:40 N1 kernel: ---[ end trace 0000000000000000 ]---
Jan 17 09:14:40 N1 kernel: ------------[ cut here ]------------
Jan 17 09:14:40 N1 kernel: remove_proc_entry: removing non-empty directory 'irq/176', leaking at least 'eth0-TxRx-2'
Jan 17 09:14:40 N1 kernel: WARNING: CPU: 5 PID: 183 at fs/proc/generic.c:718 remove_proc_entry+0x13e/0x16a
Jan 17 09:14:40 N1 kernel: Modules linked in: xt_connmark xt_comment iptable_raw xt_mark xt_CHECKSUM ipt_REJECT nf_reject_ipv4 ip6table_mangle ip6table_nat iptable_mangle vhost_net tun vhost vhost_iotlb tap xt_nat xt_tcpudp veth xt_conntrack nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo xt_addrtype br_netfilter xfs md_mod nct6775 nct6775_core hwmon_vid efivarfs iptable_nat xt_MASQUERADE nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 wireguard curve25519_x86_64 libcurve25519_generic libchacha20poly1305 chacha_x86_64 poly1305_x86_64 ip6_udp_tunnel udp_tunnel libchacha ip6table_filter ip6_tables iptable_filter ip_tables x_tables af_packet 8021q garp mrp bridge stp llc bonding tls wmi_bmof x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel aesni_intel crypto_simd cryptd rapl intel_cstate intel_uncore i915 iosf_mbi drm_buddy i2c_algo_bit ttm drm_display_helper drm_kms_helper ahci i2c_i801 igc i2c_smbus drm libahci nvme
Jan 17 09:14:40 N1 kernel: intel_gtt cp210x agpgart input_leds joydev led_class usbserial nvme_core i2c_core syscopyarea sysfillrect vmd sysimgblt fb_sys_fops thermal fan wmi video backlight tpm_crb tpm_tis tpm_tis_core tpm acpi_tad acpi_pad button unix
Jan 17 09:14:40 N1 kernel: CPU: 5 PID: 183 Comm: irq/125-aerdrv Tainted: G        W         5.19.17-Unraid #2
Jan 17 09:14:40 N1 kernel: Hardware name: ASUS System Product Name/ROG STRIX B660-I GAMING WIFI, BIOS 2212 12/13/2022
Jan 17 09:14:40 N1 kernel: RIP: 0010:remove_proc_entry+0x13e/0x16a
Jan 17 09:14:40 N1 kernel: Code: 53 78 48 c7 c6 c0 b8 e2 81 48 c7 c7 13 7b 0e 82 48 8b 8b a0 00 00 00 4c 8b 80 a0 00 00 00 48 8b 92 a0 00 00 00 e8 c8 0b 59 00 <0f> 0b 48 89 df e8 75 fc ff ff 48 8b 44 24 10 65 48 2b 04 25 28 00
Jan 17 09:14:40 N1 kernel: RSP: 0018:ffffc90004817bb0 EFLAGS: 00010282
Jan 17 09:14:40 N1 kernel: RAX: 0000000000000000 RBX: ffff88815cc0f840 RCX: 0000000000000027
Jan 17 09:14:40 N1 kernel: RDX: 0000000000000001 RSI: ffffffff820d7be1 RDI: 00000000ffffffff
Jan 17 09:14:40 N1 kernel: RBP: ffffc90004817bee R08: 0000000000000000 R09: ffffffff82244bd0
Jan 17 09:14:40 N1 kernel: R10: 00007fffffffffff R11: ffffc90004817bf1 R12: ffff8881001bfe40
Jan 17 09:14:40 N1 kernel: R13: 00000000000000b1 R14: ffffffff82227bc0 R15: ffff888101732000
Jan 17 09:14:40 N1 kernel: FS:  0000000000000000(0000) GS:ffff88885f540000(0000) knlGS:0000000000000000
Jan 17 09:14:40 N1 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jan 17 09:14:40 N1 kernel: CR2: 00007ffe4d36ef80 CR3: 000000000420a005 CR4: 0000000000770ee0
Jan 17 09:14:40 N1 kernel: PKRU: 55555554
Jan 17 09:14:40 N1 kernel: Call Trace:
Jan 17 09:14:40 N1 kernel: <TASK>
Jan 17 09:14:40 N1 kernel: unregister_irq_proc+0xe0/0x100
Jan 17 09:14:40 N1 kernel: free_desc+0x1b/0x54
Jan 17 09:14:40 N1 kernel: irq_free_descs+0x41/0x75
Jan 17 09:14:40 N1 kernel: __msi_domain_free_irqs+0x69/0x9a
Jan 17 09:14:40 N1 kernel: msi_domain_free_irqs_descs_locked+0x18/0x3e
Jan 17 09:14:40 N1 kernel: pci_msi_teardown_msi_irqs+0x2a/0x40
Jan 17 09:14:40 N1 kernel: free_msi_irqs+0xe/0x2e
Jan 17 09:14:40 N1 kernel: pci_disable_msix+0xcd/0xe7
Jan 17 09:14:40 N1 kernel: igc_reset_interrupt_capability+0x20/0x5b [igc]
Jan 17 09:14:40 N1 kernel: __igc_open+0x136/0x454 [igc]
Jan 17 09:14:40 N1 kernel: ? aer_isr+0x173/0x173
Jan 17 09:14:40 N1 kernel: igc_io_resume+0x2a/0x56 [igc]
Jan 17 09:14:40 N1 kernel: report_resume+0x53/0x74
Jan 17 09:14:40 N1 kernel: pci_walk_bus+0x79/0x82
Jan 17 09:14:40 N1 kernel: ? aer_dev_correctable_show+0x98/0x98
Jan 17 09:14:40 N1 kernel: pcie_do_recovery+0x132/0x1a6
Jan 17 09:14:40 N1 kernel: aer_process_err_devices+0xbe/0xd7
Jan 17 09:14:40 N1 kernel: aer_isr+0x143/0x173
Jan 17 09:14:40 N1 kernel: ? __schedule+0x59e/0x5f6
Jan 17 09:14:40 N1 kernel: ? irq_finalize_oneshot+0xaa/0xaa
Jan 17 09:14:40 N1 kernel: irq_thread_fn+0x1b/0x45
Jan 17 09:14:40 N1 kernel: irq_thread+0x11a/0x182
Jan 17 09:14:40 N1 kernel: ? irq_forced_thread_fn+0x6b/0x6b
Jan 17 09:14:40 N1 kernel: ? free_percpu_irq+0x53/0x53
Jan 17 09:14:40 N1 kernel: kthread+0xe4/0xef
Jan 17 09:14:40 N1 kernel: ? kthread_complete_and_exit+0x1b/0x1b
Jan 17 09:14:40 N1 kernel: ret_from_fork+0x1f/0x30
Jan 17 09:14:40 N1 kernel: </TASK>
Jan 17 09:14:40 N1 kernel: ---[ end trace 0000000000000000 ]---
Jan 17 09:14:40 N1 kernel: ------------[ cut here ]------------
Jan 17 09:14:40 N1 kernel: remove_proc_entry: removing non-empty directory 'irq/177', leaking at least 'eth0-TxRx-3'
Jan 17 09:14:40 N1 kernel: WARNING: CPU: 5 PID: 183 at fs/proc/generic.c:718 remove_proc_entry+0x13e/0x16a
Jan 17 09:14:40 N1 kernel: Modules linked in: xt_connmark xt_comment iptable_raw xt_mark xt_CHECKSUM ipt_REJECT nf_reject_ipv4 ip6table_mangle ip6table_nat iptable_mangle vhost_net tun vhost vhost_iotlb tap xt_nat xt_tcpudp veth xt_conntrack nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo xt_addrtype br_netfilter xfs md_mod nct6775 nct6775_core hwmon_vid efivarfs iptable_nat xt_MASQUERADE nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 wireguard curve25519_x86_64 libcurve25519_generic libchacha20poly1305 chacha_x86_64 poly1305_x86_64 ip6_udp_tunnel udp_tunnel libchacha ip6table_filter ip6_tables iptable_filter ip_tables x_tables af_packet 8021q garp mrp bridge stp llc bonding tls wmi_bmof x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel aesni_intel crypto_simd cryptd rapl intel_cstate intel_uncore i915 iosf_mbi drm_buddy i2c_algo_bit ttm drm_display_helper drm_kms_helper ahci i2c_i801 igc i2c_smbus drm libahci nvme
Jan 17 09:14:40 N1 kernel: intel_gtt cp210x agpgart input_leds joydev led_class usbserial nvme_core i2c_core syscopyarea sysfillrect vmd sysimgblt fb_sys_fops thermal fan wmi video backlight tpm_crb tpm_tis tpm_tis_core tpm acpi_tad acpi_pad button unix
Jan 17 09:14:40 N1 kernel: CPU: 5 PID: 183 Comm: irq/125-aerdrv Tainted: G        W         5.19.17-Unraid #2
Jan 17 09:14:40 N1 kernel: Hardware name: ASUS System Product Name/ROG STRIX B660-I GAMING WIFI, BIOS 2212 12/13/2022
Jan 17 09:14:40 N1 kernel: RIP: 0010:remove_proc_entry+0x13e/0x16a
Jan 17 09:14:40 N1 kernel: Code: 53 78 48 c7 c6 c0 b8 e2 81 48 c7 c7 13 7b 0e 82 48 8b 8b a0 00 00 00 4c 8b 80 a0 00 00 00 48 8b 92 a0 00 00 00 e8 c8 0b 59 00 <0f> 0b 48 89 df e8 75 fc ff ff 48 8b 44 24 10 65 48 2b 04 25 28 00
Jan 17 09:14:40 N1 kernel: RSP: 0018:ffffc90004817bb0 EFLAGS: 00010282
Jan 17 09:14:40 N1 kernel: RAX: 0000000000000000 RBX: ffff88815cc0ff00 RCX: 0000000000000027
Jan 17 09:14:40 N1 kernel: RDX: 0000000000000001 RSI: ffffffff820d7be1 RDI: 00000000ffffffff
Jan 17 09:14:40 N1 kernel: RBP: ffffc90004817bee R08: 0000000000000000 R09: ffffffff82244bd0
Jan 17 09:14:40 N1 kernel: R10: 00007fffffffffff R11: ffffc90004817bf1 R12: ffff8881001bfe40
Jan 17 09:14:40 N1 kernel: R13: 00000000000000b2 R14: ffffffff82227bc0 R15: ffff888101732000
Jan 17 09:14:40 N1 kernel: FS:  0000000000000000(0000) GS:ffff88885f540000(0000) knlGS:0000000000000000
Jan 17 09:14:40 N1 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jan 17 09:14:40 N1 kernel: CR2: 00007ffe4d36ef80 CR3: 000000000420a005 CR4: 0000000000770ee0
Jan 17 09:14:40 N1 kernel: PKRU: 55555554
Jan 17 09:14:40 N1 kernel: Call Trace:
Jan 17 09:14:40 N1 kernel: <TASK>
Jan 17 09:14:40 N1 kernel: unregister_irq_proc+0xe0/0x100
Jan 17 09:14:40 N1 kernel: free_desc+0x1b/0x54
Jan 17 09:14:40 N1 kernel: irq_free_descs+0x41/0x75
Jan 17 09:14:40 N1 kernel: __msi_domain_free_irqs+0x69/0x9a
Jan 17 09:14:40 N1 kernel: msi_domain_free_irqs_descs_locked+0x18/0x3e
Jan 17 09:14:40 N1 kernel: pci_msi_teardown_msi_irqs+0x2a/0x40
Jan 17 09:14:40 N1 kernel: free_msi_irqs+0xe/0x2e
Jan 17 09:14:40 N1 kernel: pci_disable_msix+0xcd/0xe7
Jan 17 09:14:40 N1 kernel: igc_reset_interrupt_capability+0x20/0x5b [igc]
Jan 17 09:14:40 N1 kernel: __igc_open+0x136/0x454 [igc]
Jan 17 09:14:40 N1 kernel: ? aer_isr+0x173/0x173
Jan 17 09:14:40 N1 kernel: igc_io_resume+0x2a/0x56 [igc]
Jan 17 09:14:40 N1 kernel: report_resume+0x53/0x74
Jan 17 09:14:40 N1 kernel: pci_walk_bus+0x79/0x82
Jan 17 09:14:40 N1 kernel: ? aer_dev_correctable_show+0x98/0x98
Jan 17 09:14:40 N1 kernel: pcie_do_recovery+0x132/0x1a6
Jan 17 09:14:40 N1 kernel: aer_process_err_devices+0xbe/0xd7
Jan 17 09:14:40 N1 kernel: aer_isr+0x143/0x173
Jan 17 09:14:40 N1 kernel: ? __schedule+0x59e/0x5f6
Jan 17 09:14:40 N1 kernel: ? irq_finalize_oneshot+0xaa/0xaa
Jan 17 09:14:40 N1 kernel: irq_thread_fn+0x1b/0x45
Jan 17 09:14:40 N1 kernel: irq_thread+0x11a/0x182
Jan 17 09:14:40 N1 kernel: ? irq_forced_thread_fn+0x6b/0x6b
Jan 17 09:14:40 N1 kernel: ? free_percpu_irq+0x53/0x53
Jan 17 09:14:40 N1 kernel: kthread+0xe4/0xef
Jan 17 09:14:40 N1 kernel: ? kthread_complete_and_exit+0x1b/0x1b
Jan 17 09:14:40 N1 kernel: ret_from_fork+0x1f/0x30
Jan 17 09:14:40 N1 kernel: </TASK>
Jan 17 09:14:40 N1 kernel: ---[ end trace 0000000000000000 ]---
Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2: AER: device recovery successful
Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2: AER: Multiple Corrected error received: 0000:00:1c.2
Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2:   device [8086:7aba] error status/mask=00003140/00002000
Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2:    [ 6] BadTLP                
Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2:    [ 8] Rollover              
Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2:    [12] Timeout               
Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2: AER: Multiple Corrected error received: 0000:00:1c.2
Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2: AER: can't find device of ID00e2
Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2: AER: Multiple Corrected error received: 0000:00:1c.2
Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2: AER: can't find device of ID00e2
Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2: AER: Multiple Corrected error received: 0000:00:1c.2
Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2: AER: can't find device of ID00e2
Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2: AER: Multiple Corrected error received: 0000:00:1c.2
Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2: AER: can't find device of ID00e2
Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2: AER: Multiple Corrected error received: 0000:00:1c.2
Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2: AER: can't find device of ID00e2
Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2: AER: Multiple Corrected error received: 0000:00:1c.2
Jan 17 09:14:40 N1 kernel: pcieport 0000:00:1c.2: AER: can't find device of ID00e2
Jan 17 09:15:50 N1  login: pam_unix(login:session): session opened for user root(uid=0) by LOGIN(uid=0)
Jan 17 09:15:50 N1  login: ROOT LOGIN ON tty1
Jan 17 09:17:01 N1 root: ACPI action up is not defined
Jan 17 09:17:02 N1 root: ACPI action left is not defined
Jan 17 09:17:02 N1 root: ACPI action left is not defined
Jan 17 09:17:02 N1 root: ACPI action left is not defined
Jan 17 09:17:02 N1 root: ACPI action left is not defined
Jan 17 09:17:02 N1 root: ACPI action left is not defined
Jan 17 09:17:03 N1 root: ACPI action left is not defined
Jan 17 09:17:03 N1 root: ACPI action left is not defined
 

 

syslog2

Edited by j5i7
Link to comment
2 hours ago, j5i7 said:

I do have the pcie slot on my motherboard free

I overlooked that your motherboard supports two onboard M.2 NVMe drives, so I had assumed the slot was in use.

 

2 hours ago, j5i7 said:

or hard to say)?

Yes 

Because the NIC driver crashed, the adapter won't work again until a reboot, even if you unplug and replug the cable. It is also not surprising that this generates PCIe AER errors.
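
For anyone reading the AER lines in the syslog above, the `error status/mask=00003140/00002000` value can be decoded by hand. The following is a minimal Python sketch, not a kernel tool: the bit positions are those of the PCIe AER Correctable Error Status register, the short labels are the ones the Linux kernel is assumed to print, and the function names `parse_status_mask` and `decode_correctable` are made up for illustration.

```python
# Bit positions of the PCIe AER Correctable Error Status register.
# The short labels are assumed to match what the Linux kernel logs.
CORRECTABLE_BITS = {
    0: "RxErr",
    6: "BadTLP",
    7: "BadDLLP",
    8: "Rollover",
    12: "Timeout",
    13: "NonFatalErr",
    14: "CorrIntErr",
    15: "HeaderOF",
}


def parse_status_mask(text):
    """Split a 'status/mask' hex pair as it appears in the syslog line."""
    status_hex, mask_hex = text.split("/")
    return int(status_hex, 16), int(mask_hex, 16)


def decode_correctable(status, mask=0):
    """Return (bit, name, masked) for every bit set in status."""
    decoded = []
    for bit in range(32):
        if status & (1 << bit):
            name = CORRECTABLE_BITS.get(bit, "Unknown")
            decoded.append((bit, name, bool(mask & (1 << bit))))
    return decoded


# Value copied from the pcieport 0000:00:1c.2 line in the syslog.
status, mask = parse_status_mask("00003140/00002000")
for bit, name, masked in decode_correctable(status, mask):
    print(f"[{bit:2d}] {name}{' (masked)' if masked else ''}")
```

Bits 6, 8 and 12 match the BadTLP, Rollover and Timeout entries in the log. Bit 13 is also set in the status word but is covered by the mask, which would explain why it is not listed there. All three reported errors are link-level retry conditions, so they fit with the link to the NIC dropping rather than pointing at a separate fault.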

 

I found a similar report, though it has no confirmed fix. Please also try disabling Docker / VMs.

 

** The OP of that report changed from Realtek to Intel and still had the problem. **

 

 

Edited by Vr2Io
Link to comment
  • 3 weeks later...

Not a big update, but I went ~20 days without an issue, then had it happen again. The only correlation I can fathom is that I turned on downloads overnight and it happened that same night. However, I haven't had any issues when streaming countless hours of TV off my NAS. Anyway, rather than throwing alternative hardware at it for the moment, I just created a script that scans the syslog for the Ethernet crash message and reboots the server if it is detected. Hopefully some driver update down the line fixes my issue, but for my current use case, it's not the end of the world for the server to gracefully restart every few weeks.

  • Like 2
Link to comment
  • 10 months later...

@j5i7- Would you mind sharing your script please? I am having similar issues and am considering purchasing a pcie nic for testing. In the meantime, your script could be a big help. I am hesitant to create a new post for my issue as it seems redundant.

Link to comment
1 hour ago, MACGoof said:

@j5i7- Would you mind sharing your script please? I am having similar issues and am considering purchasing a pcie nic for testing. In the meantime, your script could be a big help. I am hesitant to create a new post for my issue as it seems redundant.


I’ll share it when I can. Unfortunately the timing is bad: I’m in the middle of a move, my server is packed away, and I don’t think I have a copy of the script elsewhere.

 

In general, though, I searched for an existing script that monitored the syslog, then modified it to look for the Ethernet disconnected message and restart the server when it is found. I set the script to run every hour and added logging, etc. I think it has triggered about twice in the last six months or so. If I don’t post it in the coming week, remind me.
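
Until the original is posted, the approach described above can be sketched roughly as follows. This is not j5i7's script, only a minimal Python illustration of the same idea. It assumes Python 3 is available on the server, that the syslog lives at `/var/log/syslog` and starts fresh on every boot (otherwise the old message would trigger a reboot loop), and that plain `reboot` is an acceptable restart command. The watchdog log path and the `--run` flag are hypothetical. The search string is the `igc` message from the first post.

```python
#!/usr/bin/env python3
"""Reboot the server if the NIC-detached message shows up in the syslog (sketch)."""
import logging
import subprocess
import sys
from pathlib import Path

# Assumed locations and command; adjust for your system.
SYSLOG_PATH = Path("/var/log/syslog")
WATCHDOG_LOG = Path("/var/log/nic_watchdog.log")  # hypothetical log file
REBOOT_CMD = ["reboot"]  # assumption: a plain reboot is good enough

# Message taken from the syslog excerpt in the first post.
CRASH_MARKER = "PCIe link lost, device now detached"


def find_crash_line(lines, marker=CRASH_MARKER):
    """Return the first line containing the marker, or None if absent."""
    for line in lines:
        if marker in line:
            return line.rstrip("\n")
    return None


def main():
    logging.basicConfig(
        filename=str(WATCHDOG_LOG),
        level=logging.INFO,
        format="%(asctime)s %(message)s",
    )
    try:
        with SYSLOG_PATH.open(errors="replace") as handle:
            hit = find_crash_line(handle)
    except OSError as exc:
        logging.error("cannot read %s: %s", SYSLOG_PATH, exc)
        return 1
    if hit is None:
        logging.info("no NIC crash message found")
        return 0
    logging.warning("NIC crash detected, rebooting: %s", hit)
    subprocess.run(REBOOT_CMD, check=False)
    return 0


# Only act when explicitly asked, so importing or test-running is harmless.
if __name__ == "__main__" and "--run" in sys.argv[1:]:
    sys.exit(main())
```

To match the hourly schedule mentioned above, the script would be run from a scheduler, for example a cron entry along the lines of `0 * * * * python3 /path/to/nic_watchdog.py --run` (the path is a placeholder). Requiring the explicit `--run` flag is a deliberate choice so that an accidental invocation never reboots the machine.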

  • Like 1
Link to comment
