April 26, 20251 yr i am using unraid 7.1 rc1 with qtb1 cpu(10900es, uhd630 igpu). rtx3080laptop and dg1 gpu. under normal situation, all gpu (630, 3080,dg1) can be seen in system devices and can be used by docker or vm. uhd630 and 3080 are used by tdarr-node docker to transcode, dg1 are used by fnos vm. but uhd630 could be lost for no reason, this happened under unraid 6.12 and unraid 7.0 tower-diagnostics-20250426-1650.zip
April 26, 20251 yr Community Expert 20 minutes ago, allenchou1994 said: i am using unraid 7.1 rc1 with qtb1 cpu(10900es, uhd630 igpu). rtx3080laptop and dg1 gpu. under normal situation, all gpu (630, 3080,dg1) can be seen in system devices and can be used by docker or vm. uhd630 and 3080 are used by tdarr-node docker to transcode, dg1 are used by fnos vm. but uhd630 could be lost for no reason, this happened under unraid 6.12 and unraid 7.0 tower-diagnostics-20250426-1650.zip 344.4 kB · 0 downloads The GPU is handing for some reason. Suggest you update BIOS as old had may have issues with newer kernels. Hardware name: ASUS System Product Name/ROG STRIX Z490-H GAMING, BIOS 0403 03/23/2020 i915 0000:00:02.0: [drm] vf#0:0[1491975] context reset due to GPU hang Apr 26 15:30:53 Tower kernel: vfio-pci 0000:04:00.0: vgaarb: VGA decodes changed: olddecodes=io+mem,decodes=io+mem:owns=io Apr 26 15:31:01 Tower kernel: ------------[ cut here ]------------ Apr 26 15:31:01 Tower kernel: pci 0000:00:02.0: [drm] pm_runtime_get_sync() failed: -13 Apr 26 15:31:01 Tower kernel: WARNING: CPU: 4 PID: 1491939 at drivers/gpu/drm/i915/intel_runtime_pm.c:170 __intel_runtime_pm_get.isra.0+0x62/0x80 [i915] Apr 26 15:31:01 Tower kernel: Modules linked in: xt_mark nf_conntrack_netlink xt_nat veth xt_CHECKSUM ipt_REJECT nf_reject_ipv4 xt_tcpudp ip6table_mangle iptable_mangle vhost_net vhost vhost_iotlb tap xt_conntrack xt_MASQUERADE xfrm_user xfrm_algo xt_addrtype nvidia_uvm(PO) nfsd auth_rpcgss lockd grace sunrpc md_mod algif_hash algif_skcipher af_alg cmac zfs(PO) spl(O) bnep tun nf_tables nfnetlink ip6table_nat iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 ntfs3 tcp_diag inet_diag nct6775 nct6775_core hwmon_vid corefreqk(O) ip6table_filter ip6_tables iptable_filter ip_tables x_tables efivarfs af_packet cfg80211 8021q garp mrp bridge stp llc bonding tls mei_gsc xe drm_gpuvm drm_exec gpu_sched drm_suballoc_helper intel_rapl_common x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel nvidia_drm(PO) nvidia_modeset(PO) i915 kvm nvidia(PO) btusb btrtl btbcm crct10dif_pclmul crc32_pclmul btintel crc32c_intel ghash_clmulni_intel sha512_ssse3 sha256_ssse3 sha1_ssse3 aesni_intel bluetooth iosf_mbi drm_buddy crypto_simd Apr 26 15:31:01 Tower kernel: i2c_algo_bit cryptd drm_ttm_helper drm_display_helper ttm drm_kms_helper rapl intel_cstate mei_hdcp mei_pxp wmi_bmof intel_wmi_thunderbolt nvme drm intel_uncore r8125(O) igc mei_me nvme_core i2c_i801 intel_gtt mei i2c_smbus rfkill ahci agpgart ecdh_generic libahci input_leds joydev ecc led_class i2c_core thermal fan video wmi backlight acpi_tad acpi_pad button Apr 26 15:31:01 Tower kernel: CPU: 4 UID: 99 PID: 1491939 Comm: tdarr-ffmpeg Tainted: P U W O 6.12.23-Unraid #1 Apr 26 15:31:01 Tower kernel: Tainted: [P]=PROPRIETARY_MODULE, =USER, [W]=WARN, [O]=OOT_MODULE Apr 26 15:31:01 Tower kernel: Hardware name: ASUS System Product Name/ROG STRIX Z490-H GAMING, BIOS 0403 03/23/2020 Apr 26 15:31:01 Tower kernel: RIP: 0010:__intel_runtime_pm_get.isra.0+0x62/0x80 [i915] Apr 26 15:31:01 Tower kernel: Code: 71 19 00 01 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 d3 8a 68 db 44 89 e1 4c 89 ea 48 c7 c7 82 72 47 a6 48 89 c6 e8 0e af f5 da <0f> 0b 40 0f b6 f5 48 89 df 5b 5d 41 5c 41 5d e9 ea fe ff ff 66 2e Apr 26 15:31:01 Tower kernel: RSP: 0018:ffffc90001387cf8 EFLAGS: 00010282 Apr 26 15:31:01 Tower kernel: RAX: 0000000000000000 RBX: ffff888105bea060 RCX: 0000000000000027 Apr 26 15:31:01 Tower kernel: RDX: 0000000000000002 RSI: ffffffff82341452 RDI: 00000000ffffffff Apr 26 15:31:01 Tower kernel: RBP: 0000000000000001 R08: 0000000000000000 R09: ffffffff82da3e50 Apr 26 15:31:01 Tower kernel: R10: 0000000000000002 R11: 0000000000000008 R12: 00000000fffffff3 Apr 26 15:31:01 Tower kernel: R13: ffff888101d28c50 R14: ffff888100582ac0 R15: 0000000000000001 Apr 26 15:31:01 Tower kernel: FS: 0000000000000000(0000) GS:ffff889040100000(0000) knlGS:0000000000000000 Apr 26 15:31:01 Tower kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 Apr 26 15:31:01 Tower kernel: CR2: 000014d298703a24 CR3: 0000000005418002 CR4: 00000000003726f0 Apr 26 15:31:01 Tower kernel: Call Trace: Apr 26 15:31:01 Tower kernel: <TASK> Apr 26 15:31:01 Tower kernel: intel_runtime_pm_get+0xf/0x20 [i915] Apr 26 15:31:01 Tower kernel: i915_driver_release+0x22/0x80 [i915] Apr 26 15:31:01 Tower kernel: drm_dev_put+0x39/0x70 [drm] Apr 26 15:31:01 Tower kernel: singleton_release+0x1c/0x30 [i915] Apr 26 15:31:01 Tower kernel: __fput+0x106/0x1d0 Apr 26 15:31:01 Tower kernel: task_work_run+0x67/0x80 Apr 26 15:31:01 Tower kernel: do_exit+0x36c/0x8c0 Apr 26 15:31:01 Tower kernel: ? __pfx_futex_wake_mark+0x10/0x10 Apr 26 15:31:01 Tower kernel: do_group_exit+0x79/0x80 Apr 26 15:31:01 Tower kernel: get_signal+0x61a/0x660 Apr 26 15:31:01 Tower kernel: ? __seccomp_filter+0x83/0x380 Apr 26 15:31:01 Tower kernel: ? _raw_spin_unlock+0x14/0x30 Apr 26 15:31:01 Tower kernel: arch_do_signal_or_restart+0x2a/0x1f0 Apr 26 15:31:01 Tower kernel: ? do_futex+0xe6/0x170 Apr 26 15:31:01 Tower kernel: ? __do_sys_futex+0x11f/0x150 Apr 26 15:31:01 Tower kernel: ? syscall_trace_enter+0x61/0x130 Apr 26 15:31:01 Tower kernel: syscall_exit_to_user_mode+0x4f/0x80 Apr 26 15:31:01 Tower kernel: do_syscall_64+0x82/0xe0 Apr 26 15:31:01 Tower kernel: entry_SYSCALL_64_after_hwframe+0x76/0x7e Apr 26 15:31:01 Tower kernel: RIP: 0033:0x14d295491117 Apr 26 15:31:01 Tower kernel: Code: Unable to access opcode bytes at 0x14d2954910ed. Apr 26 15:31:01 Tower kernel: RSP: 002b:000014d286dfb200 EFLAGS: 00000246 ORIG_RAX: 00000000000000ca Apr 26 15:31:01 Tower kernel: RAX: fffffffffffffe00 RBX: 0000000000000000 RCX: 000014d295491117 Apr 26 15:31:01 Tower kernel: RDX: 0000000000000000 RSI: 0000000000000189 RDI: 00005613498784e4 Apr 26 15:31:01 Tower kernel: RBP: 00005613498784b8 R08: 0000000000000000 R09: 00000000ffffffff Apr 26 15:31:01 Tower kernel: R10: 0000000000000000 R11: 0000000000000246 R12: 0000000000000000 Apr 26 15:31:01 Tower kernel: R13: 0000000000000000 R14: 0000000000000007 R15: 00005613498784e4 Apr 26 15:31:01 Tower kernel: </TASK> Apr 26 15:31:01 Tower kernel: ---[ end trace 0000000000000000 ]---
April 26, 20251 yr Author 6 hours ago, SimonF said: The GPU is handing for some reason. Suggest you update BIOS as old had may have issues with newer kernels. Hardware name: ASUS System Product Name/ROG STRIX Z490-H GAMING, BIOS 0403 03/23/2020 i915 0000:00:02.0: [drm] vf#0:0[1491975] context reset due to GPU hang Apr 26 15:30:53 Tower kernel: vfio-pci 0000:04:00.0: vgaarb: VGA decodes changed: olddecodes=io+mem,decodes=io+mem:owns=io Apr 26 15:31:01 Tower kernel: ------------[ cut here ]------------ Apr 26 15:31:01 Tower kernel: pci 0000:00:02.0: [drm] pm_runtime_get_sync() failed: -13 Apr 26 15:31:01 Tower kernel: WARNING: CPU: 4 PID: 1491939 at drivers/gpu/drm/i915/intel_runtime_pm.c:170 __intel_runtime_pm_get.isra.0+0x62/0x80 [i915] Apr 26 15:31:01 Tower kernel: Modules linked in: xt_mark nf_conntrack_netlink xt_nat veth xt_CHECKSUM ipt_REJECT nf_reject_ipv4 xt_tcpudp ip6table_mangle iptable_mangle vhost_net vhost vhost_iotlb tap xt_conntrack xt_MASQUERADE xfrm_user xfrm_algo xt_addrtype nvidia_uvm(PO) nfsd auth_rpcgss lockd grace sunrpc md_mod algif_hash algif_skcipher af_alg cmac zfs(PO) spl(O) bnep tun nf_tables nfnetlink ip6table_nat iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 ntfs3 tcp_diag inet_diag nct6775 nct6775_core hwmon_vid corefreqk(O) ip6table_filter ip6_tables iptable_filter ip_tables x_tables efivarfs af_packet cfg80211 8021q garp mrp bridge stp llc bonding tls mei_gsc xe drm_gpuvm drm_exec gpu_sched drm_suballoc_helper intel_rapl_common x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel nvidia_drm(PO) nvidia_modeset(PO) i915 kvm nvidia(PO) btusb btrtl btbcm crct10dif_pclmul crc32_pclmul btintel crc32c_intel ghash_clmulni_intel sha512_ssse3 sha256_ssse3 sha1_ssse3 aesni_intel bluetooth iosf_mbi drm_buddy crypto_simd Apr 26 15:31:01 Tower kernel: i2c_algo_bit cryptd drm_ttm_helper drm_display_helper ttm drm_kms_helper rapl intel_cstate mei_hdcp mei_pxp wmi_bmof intel_wmi_thunderbolt nvme drm intel_uncore r8125(O) igc mei_me nvme_core i2c_i801 intel_gtt mei i2c_smbus rfkill ahci agpgart ecdh_generic libahci input_leds joydev ecc led_class i2c_core thermal fan video wmi backlight acpi_tad acpi_pad button Apr 26 15:31:01 Tower kernel: CPU: 4 UID: 99 PID: 1491939 Comm: tdarr-ffmpeg Tainted: P U W O 6.12.23-Unraid #1 Apr 26 15:31:01 Tower kernel: Tainted: [P]=PROPRIETARY_MODULE, =USER, [W]=WARN, [O]=OOT_MODULE Apr 26 15:31:01 Tower kernel: Hardware name: ASUS System Product Name/ROG STRIX Z490-H GAMING, BIOS 0403 03/23/2020 Apr 26 15:31:01 Tower kernel: RIP: 0010:__intel_runtime_pm_get.isra.0+0x62/0x80 [i915] Apr 26 15:31:01 Tower kernel: Code: 71 19 00 01 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 d3 8a 68 db 44 89 e1 4c 89 ea 48 c7 c7 82 72 47 a6 48 89 c6 e8 0e af f5 da <0f> 0b 40 0f b6 f5 48 89 df 5b 5d 41 5c 41 5d e9 ea fe ff ff 66 2e Apr 26 15:31:01 Tower kernel: RSP: 0018:ffffc90001387cf8 EFLAGS: 00010282 Apr 26 15:31:01 Tower kernel: RAX: 0000000000000000 RBX: ffff888105bea060 RCX: 0000000000000027 Apr 26 15:31:01 Tower kernel: RDX: 0000000000000002 RSI: ffffffff82341452 RDI: 00000000ffffffff Apr 26 15:31:01 Tower kernel: RBP: 0000000000000001 R08: 0000000000000000 R09: ffffffff82da3e50 Apr 26 15:31:01 Tower kernel: R10: 0000000000000002 R11: 0000000000000008 R12: 00000000fffffff3 Apr 26 15:31:01 Tower kernel: R13: ffff888101d28c50 R14: ffff888100582ac0 R15: 0000000000000001 Apr 26 15:31:01 Tower kernel: FS: 0000000000000000(0000) GS:ffff889040100000(0000) knlGS:0000000000000000 Apr 26 15:31:01 Tower kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 Apr 26 15:31:01 Tower kernel: CR2: 000014d298703a24 CR3: 0000000005418002 CR4: 00000000003726f0 Apr 26 15:31:01 Tower kernel: Call Trace: Apr 26 15:31:01 Tower kernel: <TASK> Apr 26 15:31:01 Tower kernel: intel_runtime_pm_get+0xf/0x20 [i915] Apr 26 15:31:01 Tower kernel: i915_driver_release+0x22/0x80 [i915] Apr 26 15:31:01 Tower kernel: drm_dev_put+0x39/0x70 [drm] Apr 26 15:31:01 Tower kernel: singleton_release+0x1c/0x30 [i915] Apr 26 15:31:01 Tower kernel: __fput+0x106/0x1d0 Apr 26 15:31:01 Tower kernel: task_work_run+0x67/0x80 Apr 26 15:31:01 Tower kernel: do_exit+0x36c/0x8c0 Apr 26 15:31:01 Tower kernel: ? __pfx_futex_wake_mark+0x10/0x10 Apr 26 15:31:01 Tower kernel: do_group_exit+0x79/0x80 Apr 26 15:31:01 Tower kernel: get_signal+0x61a/0x660 Apr 26 15:31:01 Tower kernel: ? __seccomp_filter+0x83/0x380 Apr 26 15:31:01 Tower kernel: ? _raw_spin_unlock+0x14/0x30 Apr 26 15:31:01 Tower kernel: arch_do_signal_or_restart+0x2a/0x1f0 Apr 26 15:31:01 Tower kernel: ? do_futex+0xe6/0x170 Apr 26 15:31:01 Tower kernel: ? __do_sys_futex+0x11f/0x150 Apr 26 15:31:01 Tower kernel: ? syscall_trace_enter+0x61/0x130 Apr 26 15:31:01 Tower kernel: syscall_exit_to_user_mode+0x4f/0x80 Apr 26 15:31:01 Tower kernel: do_syscall_64+0x82/0xe0 Apr 26 15:31:01 Tower kernel: entry_SYSCALL_64_after_hwframe+0x76/0x7e Apr 26 15:31:01 Tower kernel: RIP: 0033:0x14d295491117 Apr 26 15:31:01 Tower kernel: Code: Unable to access opcode bytes at 0x14d2954910ed. Apr 26 15:31:01 Tower kernel: RSP: 002b:000014d286dfb200 EFLAGS: 00000246 ORIG_RAX: 00000000000000ca Apr 26 15:31:01 Tower kernel: RAX: fffffffffffffe00 RBX: 0000000000000000 RCX: 000014d295491117 Apr 26 15:31:01 Tower kernel: RDX: 0000000000000000 RSI: 0000000000000189 RDI: 00005613498784e4 Apr 26 15:31:01 Tower kernel: RBP: 00005613498784b8 R08: 0000000000000000 R09: 00000000ffffffff Apr 26 15:31:01 Tower kernel: R10: 0000000000000000 R11: 0000000000000246 R12: 0000000000000000 Apr 26 15:31:01 Tower kernel: R13: 0000000000000000 R14: 0000000000000007 R15: 00005613498784e4 Apr 26 15:31:01 Tower kernel: </TASK> Apr 26 15:31:01 Tower kernel: ---[ end trace 0000000000000000 ]--- updated bios, now can't log in system, safe mode doesn't help A324CF2C-FEE1-49A9-9CBF-D8381157F0DB.heic Edited April 26, 20251 yr by allenchou1994
April 26, 20251 yr Community Expert 38 minutes ago, allenchou1994 said: updated bios, now can't log in system, safe mode doesn't help A324CF2C-FEE1-49A9-9CBF-D8381157F0DB.heic 2.01 MB · 2 downloads I would rename vfio-pci.cfg on the flash as now may be incorrect. Plug usb into another machine and this file is in the config directory.
April 26, 20251 yr Community Expert Create a 2nd USB drive with Stock stable Unraid 7.0.1 to see if it boots ok.
April 26, 20251 yr Author 1 hour ago, SimonF said: Create a 2nd USB drive with Stock stable Unraid 7.0.1 to see if it boots ok. stil doesn't work
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.