System reboots / hangs / no network access overnight



Hello,

 

I'm a bit desperate and need your assessment: is this a hardware or a software problem?

 

The UnRAID system shows the following behavior:

 

- overnight it becomes inaccessible over both the network and the console

- I power the system off and on again, and UnRAID starts normally

- the parity check starts; with 18 TB it needs more than one day

- meanwhile the system works normally, and I can copy data to and from the shares

- the next day, I'm back at step 1

 

The system runs only shares and one VM.

 

I removed the add-on plugin for parity checks, hoping it was the cause, but nothing changed: same error.

 

Syslog to USB is enabled, but I see nothing in it that helps me.

 

From last Friday to Monday (when I turned the system off):

Feb 24 09:49:33 ArchivUnraid webGUI: Successful login user root from 192.168.0.71
Feb 24 09:49:50 ArchivUnraid kernel: mdcmd (36): check correct
Feb 24 09:49:50 ArchivUnraid kernel: md: recovery thread: check P Q ...
Feb 24 10:06:05 ArchivUnraid  smbd[4305]: [2023/02/24 10:06:05.537812,  0] ../../source3/smbd/open.c:3392(smbd_calculate_access_mask_fsp)
Feb 24 10:06:05 ArchivUnraid  smbd[4305]:   smbd_calculate_access_mask_fsp: Access denied on file .: rejected by share access mask[0x001F00A9] orig[0x00100180] mapped[0x00100180] reject[0x00000100]
Feb 24 13:03:14 ArchivUnraid webGUI: Successful login user root from 192.168.0.70
Feb 24 14:40:39 ArchivUnraid  smbd[30243]: [2023/02/24 14:40:39.695022,  0] ../../source3/smbd/open.c:3392(smbd_calculate_access_mask_fsp)
Feb 24 14:40:39 ArchivUnraid  smbd[30243]:   smbd_calculate_access_mask_fsp: Access denied on file .: rejected by share access mask[0x001F00A9] orig[0x00100180] mapped[0x00100180] reject[0x00000100]
Feb 24 20:00:01 ArchivUnraid kernel: mdcmd (37): check NOCORRECT
Feb 24 20:00:01 ArchivUnraid kernel: 
Feb 25 03:40:01 ArchivUnraid root: mover: started
Feb 25 03:40:01 ArchivUnraid root: mover: finished

 

 

Typically the last entry before a reboot is:

 smbd[4897]: [2023/02/16 15:24:59.929550,  0] ../../source3/smbd/open.c:3392(smbd_calculate_access_mask_fsp)
Feb 16 15:24:59 ArchivUnraid  smbd[4897]:   smbd_calculate_access_mask_fsp: Access denied on file .: rejected by share access mask[0x001F00A9] orig[0x00100180] mapped[0x00100180] reject[0x00000100]
Feb 17 03:40:01 ArchivUnraid  crond[1566]: exit status 1 from user root /usr/local/sbin/mover &> /dev/null

 

Please help. 😕

Denny

archivunraid-diagnostics-20230301-1157.zip syslog.txt


New week, old problems.

After disabling C-states, UnRAID finished its parity check without problems.

Then I started the next archive job (a Python script inside the VM), and as a result I found the system inaccessible today. The syslog says (excerpt):

 

Quote

[...]

Mar  3 12:23:36 ArchivUnraid kernel: kernel tried to execute NX-protected page - exploit attempt? (uid: 0)
Mar  3 12:23:36 ArchivUnraid kernel: BUG: unable to handle page fault for address: ffff88816d86a5b0
Mar  3 12:23:36 ArchivUnraid kernel: #PF: supervisor instruction fetch in kernel mode
Mar  3 12:23:36 ArchivUnraid kernel: #PF: error_code(0x0011) - permissions violation
Mar  3 12:23:36 ArchivUnraid kernel: PGD 2a01067 P4D 2a01067 PUD 1544d2063 PMD 16d849063 PTE 800000016d86a163
Mar  3 12:23:36 ArchivUnraid kernel: Oops: 0011 [#1] PREEMPT SMP NOPTI
Mar  3 12:23:36 ArchivUnraid kernel: CPU: 1 PID: 4182 Comm: CPU 2/KVM Not tainted 5.19.17-Unraid #2
Mar  3 12:23:36 ArchivUnraid kernel: Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./X570D4U-2L2T, BIOS P1.40 05/19/2021
Mar  3 12:23:36 ArchivUnraid kernel: RIP: 0010:0xffff88816d86a5b0
Mar  3 12:23:36 ArchivUnraid kernel: Code: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 <00> 90 46 01 00 c9 ff ff 00 00 00 00 00 00 00 00 80 92 75 05 81 88
Mar  3 12:23:36 ArchivUnraid kernel: RSP: 0018:ffffc9000160fd50 EFLAGS: 00010002
Mar  3 12:23:36 ArchivUnraid kernel: RAX: ffff888105758fc0 RBX: ffffffff810d3c48 RCX: ffff88816d86a5b0
Mar  3 12:23:36 ArchivUnraid kernel: RDX: ffffffff820ca4f1 RSI: ffffffff8214b5df RDI: ffffffff82100309
Mar  3 12:23:36 ArchivUnraid kernel: RBP: ffff88842e06ce80 R08: 0000000000000000 R09: 0000000000000000
Mar  3 12:23:36 ArchivUnraid kernel: R10: 0000000000000000 R11: 0000000000000000 R12: ffff888105758fc0
Mar  3 12:23:36 ArchivUnraid kernel: R13: 0000000000000000 R14: 0000000000000000 R15: 000000006d86a501
Mar  3 12:23:36 ArchivUnraid kernel: FS:  00001526c01ff6c0(0000) GS:ffff88842e040000(0000) knlGS:0000000000000000
Mar  3 12:23:36 ArchivUnraid kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar  3 12:23:36 ArchivUnraid kernel: CR2: ffff88816d86a5b0 CR3: 000000016c724000 CR4: 0000000000350ee0
Mar  3 12:23:36 ArchivUnraid kernel: Call Trace:
Mar  3 12:23:36 ArchivUnraid kernel: <TASK>
Mar  3 12:23:36 ArchivUnraid kernel: ? svm_vcpu_enter_exit+0x1e/0xc0 [kvm_amd]
Mar  3 12:23:36 ArchivUnraid kernel: ? svm_vcpu_run+0x2bd/0x5e1 [kvm_amd]
Mar  3 12:23:36 ArchivUnraid kernel: ? kvm_arch_vcpu_ioctl_run+0x1117/0x1506 [kvm]
Mar  3 12:23:36 ArchivUnraid kernel: ? try_to_wake_up+0x20e/0x248
Mar  3 12:23:36 ArchivUnraid kernel: ? kvm_vcpu_ioctl+0x192/0x5a4 [kvm]
Mar  3 12:23:36 ArchivUnraid kernel: ? futex_wake+0x11f/0x149
Mar  3 12:23:36 ArchivUnraid kernel: ? __seccomp_filter+0x89/0x313
Mar  3 12:23:36 ArchivUnraid kernel: ? vfs_ioctl+0x1e/0x2f
Mar  3 12:23:36 ArchivUnraid kernel: ? __do_sys_ioctl+0x52/0x78
Mar  3 12:23:36 ArchivUnraid kernel: ? do_syscall_64+0x6b/0x81
Mar  3 12:23:36 ArchivUnraid kernel: ? entry_SYSCALL_64_after_hwframe+0x63/0xcd
Mar  3 12:23:36 ArchivUnraid kernel: </TASK>
Mar  3 12:23:36 ArchivUnraid kernel: Modules linked in: af_packet xt_nat veth nf_conntrack_netlink nfnetlink xfrm_user xt_addrtype br_netfilter xt_CHECKSUM xt_MASQUERADE xt_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp ip6table_mangle ip6table_nat iptable_mangle iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 vhost_net tun vhost vhost_iotlb tap xfs nfsd auth_rpcgss oid_registry lockd grace sunrpc md_mod ip6table_filter ip6_tables iptable_filter ip_tables x_tables bridge stp llc bonding tls ixgbe xfrm_algo mdio igb amdgpu ast gpu_sched drm_vram_helper drm_display_helper drm_ttm_helper ttm ipmi_ssif amd64_edac edac_mce_amd edac_core kvm_amd wmi_bmof kvm crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel drm_kms_helper aesni_intel crypto_simd cryptd rapl drm k10temp input_leds agpgart i2c_piix4 i2c_algo_bit joydev smartpqi led_class ahci ccp syscopyarea sysfillrect i2c_core sysimgblt libahci fb_sys_fops scsi_transport_sas acpi_ipmi wmi ipmi_si video backlight button acpi_cpufreq unix
Mar  3 12:23:36 ArchivUnraid kernel: [last unloaded: xfrm_algo]
Mar  3 12:23:36 ArchivUnraid kernel: CR2: ffff88816d86a5b0
Mar  3 12:23:36 ArchivUnraid kernel: ---[ end trace 0000000000000000 ]---
Mar  3 12:23:36 ArchivUnraid kernel: RIP: 0010:0xffff88816d86a5b0
Mar  3 12:23:36 ArchivUnraid kernel: Code: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 <00> 90 46 01 00 c9 ff ff 00 00 00 00 00 00 00 00 80 92 75 05 81 88
Mar  3 12:23:36 ArchivUnraid kernel: RSP: 0018:ffffc9000160fd50 EFLAGS: 00010002
Mar  3 12:23:36 ArchivUnraid kernel: RAX: ffff888105758fc0 RBX: ffffffff810d3c48 RCX: ffff88816d86a5b0
Mar  3 12:23:36 ArchivUnraid kernel: RDX: ffffffff820ca4f1 RSI: ffffffff8214b5df RDI: ffffffff82100309
Mar  3 12:23:36 ArchivUnraid kernel: RBP: ffff88842e06ce80 R08: 0000000000000000 R09: 0000000000000000
Mar  3 12:23:36 ArchivUnraid kernel: R10: 0000000000000000 R11: 0000000000000000 R12: ffff888105758fc0
Mar  3 12:23:36 ArchivUnraid kernel: R13: 0000000000000000 R14: 0000000000000000 R15: 000000006d86a501
Mar  3 12:23:36 ArchivUnraid kernel: FS:  00001526c01ff6c0(0000) GS:ffff88842e040000(0000) knlGS:0000000000000000
Mar  3 12:23:36 ArchivUnraid kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar  3 12:23:36 ArchivUnraid kernel: CR2: ffff88816d86a5b0 CR3: 000000016c724000 CR4: 0000000000350ee0
Mar  3 12:23:36 ArchivUnraid kernel: note: CPU 2/KVM[4182] exited with preempt_count 1
Mar  3 12:23:36 ArchivUnraid kernel: ------------[ cut here ]------------
Mar  3 12:23:36 ArchivUnraid kernel: NETDEV WATCHDOG: eth0 (ixgbe): transmit queue 5 timed out
Mar  3 12:23:36 ArchivUnraid kernel: WARNING: CPU: 0 PID: 0 at net/sched/sch_generic.c:529 dev_watchdog+0x145/0x1b3
Mar  3 12:23:36 ArchivUnraid kernel: Modules linked in: af_packet xt_nat veth nf_conntrack_netlink nfnetlink xfrm_user xt_addrtype br_netfilter xt_CHECKSUM xt_MASQUERADE xt_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp ip6table_mangle ip6table_nat iptable_mangle iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 vhost_net tun vhost vhost_iotlb tap xfs nfsd auth_rpcgss oid_registry lockd grace sunrpc md_mod ip6table_filter ip6_tables iptable_filter ip_tables x_tables bridge stp llc bonding tls ixgbe xfrm_algo mdio igb amdgpu ast gpu_sched drm_vram_helper drm_display_helper drm_ttm_helper ttm ipmi_ssif amd64_edac edac_mce_amd edac_core kvm_amd wmi_bmof kvm crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel drm_kms_helper aesni_intel crypto_simd cryptd rapl drm k10temp input_leds agpgart i2c_piix4 i2c_algo_bit joydev smartpqi led_class ahci ccp syscopyarea sysfillrect i2c_core sysimgblt libahci fb_sys_fops scsi_transport_sas acpi_ipmi wmi ipmi_si video backlight button acpi_cpufreq unix
Mar  3 12:23:36 ArchivUnraid kernel: [last unloaded: xfrm_algo]
Mar  3 12:23:36 ArchivUnraid kernel: CPU: 0 PID: 0 Comm: swapper/0 Tainted: G      D           5.19.17-Unraid #2
Mar  3 12:23:36 ArchivUnraid kernel: Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./X570D4U-2L2T, BIOS P1.40 05/19/2021
Mar  3 12:23:36 ArchivUnraid kernel: RIP: 0010:dev_watchdog+0x145/0x1b3
Mar  3 12:23:36 ArchivUnraid kernel: Code: f7 c5 00 00 75 26 4c 89 ef c6 05 f1 f7 c5 00 01 e8 d4 31 fb ff 44 89 f1 4c 89 ee 48 c7 c7 c3 6e 15 82 48 89 c2 e8 f5 af 0f 00 <0f> 0b 4c 89 ef e8 1c fe ff ff 48 8b 83 88 fc ff ff 4c 89 ef 44 89
Mar  3 12:23:36 ArchivUnraid kernel: RSP: 0018:ffffc90000003eb0 EFLAGS: 00010282
Mar  3 12:23:36 ArchivUnraid kernel: RAX: 0000000000000000 RBX: ffff888106e0c448 RCX: 0000000000000000
Mar  3 12:23:36 ArchivUnraid kernel: RDX: 0000000000000103 RSI: 00000000000000f6 RDI: 00000000ffffffff
Mar  3 12:23:36 ArchivUnraid kernel: RBP: 0000000000000005 R08: 0000000000000000 R09: ffffc90003094220
Mar  3 12:23:36 ArchivUnraid kernel: R10: 0000000000aaaaaa R11: 0000000000000001 R12: ffff888106e0c39c
Mar  3 12:23:36 ArchivUnraid kernel: R13: ffff888106e0c000 R14: 0000000000000005 R15: ffffffff8172073a
Mar  3 12:23:36 ArchivUnraid kernel: FS:  0000000000000000(0000) GS:ffff88842e000000(0000) knlGS:0000000000000000
Mar  3 12:23:36 ArchivUnraid kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar  3 12:23:36 ArchivUnraid kernel: CR2: 00005562f9107050 CR3: 000000016c724000 CR4: 0000000000350ef0
Mar  3 12:23:36 ArchivUnraid kernel: Call Trace:
Mar  3 12:23:36 ArchivUnraid kernel: <IRQ>
Mar  3 12:23:36 ArchivUnraid kernel: ? netif_tx_lock+0x1e/0x1e
Mar  3 12:23:36 ArchivUnraid kernel: call_timer_fn+0x6f/0x10d
Mar  3 12:23:36 ArchivUnraid kernel: __run_timers+0x144/0x184
Mar  3 12:23:36 ArchivUnraid kernel: ? update_process_times+0x7a/0x81
Mar  3 12:23:36 ArchivUnraid kernel: ? tick_sched_timer+0x43/0x71
Mar  3 12:23:36 ArchivUnraid kernel: ? _raw_spin_lock_irq+0x19/0x22
Mar  3 12:23:36 ArchivUnraid kernel: run_timer_softirq+0x2b/0x43
Mar  3 12:23:36 ArchivUnraid kernel: __do_softirq+0x129/0x288
Mar  3 12:23:36 ArchivUnraid kernel: __irq_exit_rcu+0x79/0xb8
Mar  3 12:23:36 ArchivUnraid kernel: sysvec_apic_timer_interrupt+0x85/0xa6
Mar  3 12:23:36 ArchivUnraid kernel: </IRQ>
Mar  3 12:23:36 ArchivUnraid kernel: <TASK>
Mar  3 12:23:36 ArchivUnraid kernel: asm_sysvec_apic_timer_interrupt+0x16/0x20
Mar  3 12:23:36 ArchivUnraid kernel: RIP: 0010:native_safe_halt+0x7/0xc

[...]

Mar  6 09:27:00 ArchivUnraid kernel: RIP: 0010:smp_call_function_many_cond+0x266/0x27d
Mar  6 09:27:00 ArchivUnraid kernel: Code: 48 89 de e8 a7 6b 36 00 3b 05 d0 17 2a 01 89 c7 73 1c 48 63 c7 48 8b 55 00 48 03 14 c5 e0 6a 16 82 8b 42 08 a8 01 74 04 f3 90 <eb> f5 eb d2 48 83 c4 38 5b 5d 41 5c 41 5d 41 5e 41 5f e9 cd 81 b0
Mar  6 09:27:00 ArchivUnraid kernel: RSP: 0018:ffffc90016fdfc38 EFLAGS: 00000202
Mar  6 09:27:00 ArchivUnraid kernel: RAX: 0000000000000011 RBX: ffff88842e3ed288 RCX: 0000000000000001
Mar  6 09:27:00 ArchivUnraid kernel: RDX: ffff88842e072320 RSI: 0000000000000000 RDI: 0000000000000001
Mar  6 09:27:00 ArchivUnraid kernel: RBP: ffff88842e3ed280 R08: 0000000000000000 R09: 0000000000000000
Mar  6 09:27:00 ArchivUnraid kernel: R10: 000000000000023a R11: 0000000000000000 R12: 0000000000000001
Mar  6 09:27:00 ArchivUnraid kernel: R13: 0000000000000000 R14: ffffffff8105ce9e R15: 0000000000000003
Mar  6 09:27:00 ArchivUnraid kernel: ? leave_mm+0x34/0x34
Mar  6 09:27:00 ArchivUnraid kernel: ? smp_call_function_many_cond+0x244/0x27d
Mar  6 09:27:00 ArchivUnraid kernel: on_each_cpu_cond_mask+0x42/0x69
Mar  6 09:27:00 ArchivUnraid kernel: ? leave_mm+0x34/0x34
Mar  6 09:27:00 ArchivUnraid kernel: __flush_tlb_multi+0x5/0xb
Mar  6 09:27:00 ArchivUnraid kernel: flush_tlb_mm_range+0xc3/0x111
Mar  6 09:27:00 ArchivUnraid kernel: tlb_flush_mmu_tlbonly+0x6c/0x94
Mar  6 09:27:00 ArchivUnraid kernel: tlb_flush_mmu+0x15/0x99
Mar  6 09:27:00 ArchivUnraid kernel: tlb_finish_mmu+0x2c/0x5b
Mar  6 09:27:00 ArchivUnraid kernel: zap_page_range+0xae/0xd6
Mar  6 09:27:00 ArchivUnraid kernel: ? hrtimer_init_sleeper+0x41/0x41
Mar  6 09:27:00 ArchivUnraid kernel: ? find_vma+0x53/0x60
Mar  6 09:27:00 ArchivUnraid kernel: do_madvise+0x685/0xa04
Mar  6 09:27:00 ArchivUnraid kernel: ? __seccomp_filter+0x89/0x313
Mar  6 09:27:00 ArchivUnraid kernel: __x64_sys_madvise+0x28/0x2f
Mar  6 09:27:00 ArchivUnraid kernel: do_syscall_64+0x6b/0x81
Mar  6 09:27:00 ArchivUnraid kernel: entry_SYSCALL_64_after_hwframe+0x63/0xcd
Mar  6 09:27:00 ArchivUnraid kernel: RIP: 0033:0x152744978947
Mar  6 09:27:00 ArchivUnraid kernel: Code: ff ff ff ff c3 66 0f 1f 44 00 00 48 8b 15 b1 94 0d 00 f7 d8 64 89 02 b8 ff ff ff ff eb bc 0f 1f 44 00 00 b8 1c 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 89 94 0d 00 f7 d8 64 89 01 48
Mar  6 09:27:00 ArchivUnraid kernel: RSP: 002b:000015269effde18 EFLAGS: 00000206 ORIG_RAX: 000000000000001c
Mar  6 09:27:00 ArchivUnraid kernel: RAX: ffffffffffffffda RBX: 000015269edff000 RCX: 0000152744978947
Mar  6 09:27:00 ArchivUnraid kernel: RDX: 0000000000000004 RSI: 00000000001fa000 RDI: 000015269edff000
Mar  6 09:27:00 ArchivUnraid kernel: RBP: 0000000000201000 R08: 000015269effe850 R09: 0000152742b82898
Mar  6 09:27:00 ArchivUnraid kernel: R10: 0000000000000008 R11: 0000000000000206 R12: fffffffffffff0f8
Mar  6 09:27:00 ArchivUnraid kernel: R13: 0000000000000000 R14: 00001526a19db9c0 R15: 000015269edff000
Mar  6 09:27:00 ArchivUnraid kernel: </TASK>
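
For a syslog this long, it can help to filter out everything except the lines that mark a kernel fault. The following is a minimal Python sketch, not an Unraid tool: the function name `find_kernel_faults` and the marker list are assumptions picked from the messages quoted in this thread, and the timestamp pattern assumes the classic syslog prefix seen above.

```python
import re

# Substrings that mark a kernel-level fault in the excerpts quoted in this thread.
# This list is an assumption for illustration; extend it as needed.
MARKERS = ("BUG:", "Oops:", "NETDEV WATCHDOG", "NX-protected page")

# Classic syslog prefix: "Mar  3 12:23:36 host rest-of-line"
LINE_RE = re.compile(r"^(\w{3}\s+\d+\s+\d\d:\d\d:\d\d)\s+(\S+)\s+(.*)$")


def find_kernel_faults(lines):
    """Return (timestamp, message) pairs for lines containing a fault marker."""
    hits = []
    for line in lines:
        match = LINE_RE.match(line)
        if not match:
            continue  # skip blank or malformed lines
        timestamp, _host, message = match.groups()
        if any(marker in message for marker in MARKERS):
            hits.append((timestamp, message))
    return hits
```

Passing an open file, for example `find_kernel_faults(open("syslog.txt", errors="replace"))`, returns the timestamps and messages of the matching lines, which makes it easier to see whether each hang is preceded by the same fault.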

 


By the way, what does this smbd error mean, and where does it come from?

 

Quote

Mar  3 11:22:53 ArchivUnraid  smbd[19607]:   smbd_calculate_access_mask_fsp: Access denied on file 0012345-2019_20-1234: rejected by share access mask[0x001F00A9] orig[0x00100180] mapped[0x00100180] reject[0x00000100]
Mar  3 11:22:53 ArchivUnraid  smbd[19607]: [2023/03/03 11:22:53.319428,  0] ../../source3/smbd/open.c:3392(smbd_calculate_access_mask_fsp)
Mar  3 11:22:53 ArchivUnraid  smbd[19607]:   smbd_calculate_access_mask_fsp: Access denied on file 0012345-2019_20-1234: rejected by share access mask[0x001F00A9] orig[0x00100180] mapped[0x00100180] reject[0x00000100]
Mar  3 12:23:36 ArchivUnraid kernel: kernel tried to execute NX-protected page - exploit attempt? (uid: 0)
Mar  3 12:23:36 ArchivUnraid kernel: BUG: unable to handle page fault for address: ffff88816d86a5b0
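
On the smbd message: the hex values are SMB access masks, and they can be decoded with the standard Windows access-right bit definitions. The small Python sketch below (the helper name `decode_mask` is made up for illustration) shows that the rejected bit 0x00000100 is FILE_WRITE_ATTRIBUTES and that the share mask 0x001F00A9 contains no write rights. That suggests a client asked for write-attribute access over a connection that only has read access to the share. This reading is an interpretation of the bits, not something the log states, and nothing in the log ties it to the kernel fault that follows about an hour later.

```python
# Standard Windows/SMB access-right bits (file-specific and standard rights).
ACCESS_BITS = {
    0x000001: "FILE_READ_DATA",
    0x000002: "FILE_WRITE_DATA",
    0x000004: "FILE_APPEND_DATA",
    0x000008: "FILE_READ_EA",
    0x000010: "FILE_WRITE_EA",
    0x000020: "FILE_EXECUTE",
    0x000040: "FILE_DELETE_CHILD",
    0x000080: "FILE_READ_ATTRIBUTES",
    0x000100: "FILE_WRITE_ATTRIBUTES",
    0x010000: "DELETE",
    0x020000: "READ_CONTROL",
    0x040000: "WRITE_DAC",
    0x080000: "WRITE_OWNER",
    0x100000: "SYNCHRONIZE",
}


def decode_mask(mask):
    """Return the names of all known access-right bits set in mask, lowest bit first."""
    return [name for bit, name in sorted(ACCESS_BITS.items()) if mask & bit]


if __name__ == "__main__":
    # Values copied from the smbd log line quoted above.
    for label, value in [("share", 0x001F00A9), ("orig", 0x00100180), ("reject", 0x00000100)]:
        print(label, hex(value), decode_mask(value))
```

The rejected bits are simply the requested bits that the share mask does not grant, i.e. `0x00100180 & ~0x001F00A9`, which is 0x100.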

 


That looks more like a hardware problem. One thing you can try is to boot the server in safe mode with all Docker containers and VMs disabled and let it run as a basic NAS for a few days. If it still crashes, it's likely a hardware problem; if it doesn't, start turning the other services back on one by one.

  • 8 months later...
On 3/6/2023 at 3:41 AM, Denny77 said:

New week, old problems.

After disabling C-states, UnRAID finished its parity check without problems.

Then I started the next archive job (a Python script inside the VM), and as a result I found the system inaccessible today. The syslog says (excerpt):


Did you find the cause of this and a solution? I'm seeing very similar error messages.
