Skip to content
View in the app

A better way to browse. Learn more.

Unraid

A full-screen app on your home screen with push notifications, badges and more.

To install this app on iOS and iPadOS
  1. Tap the Share icon in Safari
  2. Scroll the menu and tap Add to Home Screen.
  3. Tap Add in the top-right corner.
To install this app on Android
  1. Tap the 3-dot menu (⋮) in the top-right corner of the browser.
  2. Tap Add to Home screen or Install app.
  3. Confirm by tapping Install.

igpu got lost

Featured Replies

i am using unraid 7.1 rc1 with qtb1 cpu(10900es, uhd630 igpu). rtx3080laptop and dg1 gpu.

under normal situation, all gpu (630, 3080,dg1) can be seen in system devices and can be used by docker or vm.

uhd630 and 3080 are used by tdarr-node docker to transcode, dg1 are used by fnos vm.

but uhd630 could be lost for no reason, this happened under unraid 6.12 and unraid 7.0

tower-diagnostics-20250426-1650.zip

  • Community Expert
20 minutes ago, allenchou1994 said:

i am using unraid 7.1 rc1 with qtb1 cpu(10900es, uhd630 igpu). rtx3080laptop and dg1 gpu.

under normal situation, all gpu (630, 3080,dg1) can be seen in system devices and can be used by docker or vm.

uhd630 and 3080 are used by tdarr-node docker to transcode, dg1 are used by fnos vm.

but uhd630 could be lost for no reason, this happened under unraid 6.12 and unraid 7.0

tower-diagnostics-20250426-1650.zip 344.4 kB · 0 downloads

The GPU is handing for some reason. Suggest you update BIOS as old had may have issues with newer kernels.

 

Hardware name: ASUS System Product Name/ROG STRIX Z490-H GAMING, BIOS 0403 03/23/2020

 

i915 0000:00:02.0: [drm] vf#0:0[1491975] context reset due to GPU hang
Apr 26 15:30:53 Tower kernel: vfio-pci 0000:04:00.0: vgaarb: VGA decodes changed: olddecodes=io+mem,decodes=io+mem:owns=io
Apr 26 15:31:01 Tower kernel: ------------[ cut here ]------------
Apr 26 15:31:01 Tower kernel: pci 0000:00:02.0: [drm] pm_runtime_get_sync() failed: -13
Apr 26 15:31:01 Tower kernel: WARNING: CPU: 4 PID: 1491939 at drivers/gpu/drm/i915/intel_runtime_pm.c:170 __intel_runtime_pm_get.isra.0+0x62/0x80 [i915]
Apr 26 15:31:01 Tower kernel: Modules linked in: xt_mark nf_conntrack_netlink xt_nat veth xt_CHECKSUM ipt_REJECT nf_reject_ipv4 xt_tcpudp ip6table_mangle iptable_mangle vhost_net vhost vhost_iotlb tap xt_conntrack xt_MASQUERADE xfrm_user xfrm_algo xt_addrtype nvidia_uvm(PO) nfsd auth_rpcgss lockd grace sunrpc md_mod algif_hash algif_skcipher af_alg cmac zfs(PO) spl(O) bnep tun nf_tables nfnetlink ip6table_nat iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 ntfs3 tcp_diag inet_diag nct6775 nct6775_core hwmon_vid corefreqk(O) ip6table_filter ip6_tables iptable_filter ip_tables x_tables efivarfs af_packet cfg80211 8021q garp mrp bridge stp llc bonding tls mei_gsc xe drm_gpuvm drm_exec gpu_sched drm_suballoc_helper intel_rapl_common x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel nvidia_drm(PO) nvidia_modeset(PO) i915 kvm nvidia(PO) btusb btrtl btbcm crct10dif_pclmul crc32_pclmul btintel crc32c_intel ghash_clmulni_intel sha512_ssse3 sha256_ssse3 sha1_ssse3 aesni_intel bluetooth iosf_mbi drm_buddy crypto_simd
Apr 26 15:31:01 Tower kernel: i2c_algo_bit cryptd drm_ttm_helper drm_display_helper ttm drm_kms_helper rapl intel_cstate mei_hdcp mei_pxp wmi_bmof intel_wmi_thunderbolt nvme drm intel_uncore r8125(O) igc mei_me nvme_core i2c_i801 intel_gtt mei i2c_smbus rfkill ahci agpgart ecdh_generic libahci input_leds joydev ecc led_class i2c_core thermal fan video wmi backlight acpi_tad acpi_pad button
Apr 26 15:31:01 Tower kernel: CPU: 4 UID: 99 PID: 1491939 Comm: tdarr-ffmpeg Tainted: P     U  W  O       6.12.23-Unraid #1
Apr 26 15:31:01 Tower kernel: Tainted: [P]=PROPRIETARY_MODULE, =USER, [W]=WARN, [O]=OOT_MODULE
Apr 26 15:31:01 Tower kernel: Hardware name: ASUS System Product Name/ROG STRIX Z490-H GAMING, BIOS 0403 03/23/2020
Apr 26 15:31:01 Tower kernel: RIP: 0010:__intel_runtime_pm_get.isra.0+0x62/0x80 [i915]
Apr 26 15:31:01 Tower kernel: Code: 71 19 00 01 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 d3 8a 68 db 44 89 e1 4c 89 ea 48 c7 c7 82 72 47 a6 48 89 c6 e8 0e af f5 da <0f> 0b 40 0f b6 f5 48 89 df 5b 5d 41 5c 41 5d e9 ea fe ff ff 66 2e
Apr 26 15:31:01 Tower kernel: RSP: 0018:ffffc90001387cf8 EFLAGS: 00010282
Apr 26 15:31:01 Tower kernel: RAX: 0000000000000000 RBX: ffff888105bea060 RCX: 0000000000000027
Apr 26 15:31:01 Tower kernel: RDX: 0000000000000002 RSI: ffffffff82341452 RDI: 00000000ffffffff
Apr 26 15:31:01 Tower kernel: RBP: 0000000000000001 R08: 0000000000000000 R09: ffffffff82da3e50
Apr 26 15:31:01 Tower kernel: R10: 0000000000000002 R11: 0000000000000008 R12: 00000000fffffff3
Apr 26 15:31:01 Tower kernel: R13: ffff888101d28c50 R14: ffff888100582ac0 R15: 0000000000000001
Apr 26 15:31:01 Tower kernel: FS:  0000000000000000(0000) GS:ffff889040100000(0000) knlGS:0000000000000000
Apr 26 15:31:01 Tower kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Apr 26 15:31:01 Tower kernel: CR2: 000014d298703a24 CR3: 0000000005418002 CR4: 00000000003726f0
Apr 26 15:31:01 Tower kernel: Call Trace:
Apr 26 15:31:01 Tower kernel: <TASK>
Apr 26 15:31:01 Tower kernel: intel_runtime_pm_get+0xf/0x20 [i915]
Apr 26 15:31:01 Tower kernel: i915_driver_release+0x22/0x80 [i915]
Apr 26 15:31:01 Tower kernel: drm_dev_put+0x39/0x70 [drm]
Apr 26 15:31:01 Tower kernel: singleton_release+0x1c/0x30 [i915]
Apr 26 15:31:01 Tower kernel: __fput+0x106/0x1d0
Apr 26 15:31:01 Tower kernel: task_work_run+0x67/0x80
Apr 26 15:31:01 Tower kernel: do_exit+0x36c/0x8c0
Apr 26 15:31:01 Tower kernel: ? __pfx_futex_wake_mark+0x10/0x10
Apr 26 15:31:01 Tower kernel: do_group_exit+0x79/0x80
Apr 26 15:31:01 Tower kernel: get_signal+0x61a/0x660
Apr 26 15:31:01 Tower kernel: ? __seccomp_filter+0x83/0x380
Apr 26 15:31:01 Tower kernel: ? _raw_spin_unlock+0x14/0x30
Apr 26 15:31:01 Tower kernel: arch_do_signal_or_restart+0x2a/0x1f0
Apr 26 15:31:01 Tower kernel: ? do_futex+0xe6/0x170
Apr 26 15:31:01 Tower kernel: ? __do_sys_futex+0x11f/0x150
Apr 26 15:31:01 Tower kernel: ? syscall_trace_enter+0x61/0x130
Apr 26 15:31:01 Tower kernel: syscall_exit_to_user_mode+0x4f/0x80
Apr 26 15:31:01 Tower kernel: do_syscall_64+0x82/0xe0
Apr 26 15:31:01 Tower kernel: entry_SYSCALL_64_after_hwframe+0x76/0x7e
Apr 26 15:31:01 Tower kernel: RIP: 0033:0x14d295491117
Apr 26 15:31:01 Tower kernel: Code: Unable to access opcode bytes at 0x14d2954910ed.
Apr 26 15:31:01 Tower kernel: RSP: 002b:000014d286dfb200 EFLAGS: 00000246 ORIG_RAX: 00000000000000ca
Apr 26 15:31:01 Tower kernel: RAX: fffffffffffffe00 RBX: 0000000000000000 RCX: 000014d295491117
Apr 26 15:31:01 Tower kernel: RDX: 0000000000000000 RSI: 0000000000000189 RDI: 00005613498784e4
Apr 26 15:31:01 Tower kernel: RBP: 00005613498784b8 R08: 0000000000000000 R09: 00000000ffffffff
Apr 26 15:31:01 Tower kernel: R10: 0000000000000000 R11: 0000000000000246 R12: 0000000000000000
Apr 26 15:31:01 Tower kernel: R13: 0000000000000000 R14: 0000000000000007 R15: 00005613498784e4
Apr 26 15:31:01 Tower kernel: </TASK>
Apr 26 15:31:01 Tower kernel: ---[ end trace 0000000000000000 ]---

  • Author
6 hours ago, SimonF said:

The GPU is handing for some reason. Suggest you update BIOS as old had may have issues with newer kernels.

 

Hardware name: ASUS System Product Name/ROG STRIX Z490-H GAMING, BIOS 0403 03/23/2020

 

i915 0000:00:02.0: [drm] vf#0:0[1491975] context reset due to GPU hang
Apr 26 15:30:53 Tower kernel: vfio-pci 0000:04:00.0: vgaarb: VGA decodes changed: olddecodes=io+mem,decodes=io+mem:owns=io
Apr 26 15:31:01 Tower kernel: ------------[ cut here ]------------
Apr 26 15:31:01 Tower kernel: pci 0000:00:02.0: [drm] pm_runtime_get_sync() failed: -13
Apr 26 15:31:01 Tower kernel: WARNING: CPU: 4 PID: 1491939 at drivers/gpu/drm/i915/intel_runtime_pm.c:170 __intel_runtime_pm_get.isra.0+0x62/0x80 [i915]
Apr 26 15:31:01 Tower kernel: Modules linked in: xt_mark nf_conntrack_netlink xt_nat veth xt_CHECKSUM ipt_REJECT nf_reject_ipv4 xt_tcpudp ip6table_mangle iptable_mangle vhost_net vhost vhost_iotlb tap xt_conntrack xt_MASQUERADE xfrm_user xfrm_algo xt_addrtype nvidia_uvm(PO) nfsd auth_rpcgss lockd grace sunrpc md_mod algif_hash algif_skcipher af_alg cmac zfs(PO) spl(O) bnep tun nf_tables nfnetlink ip6table_nat iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 ntfs3 tcp_diag inet_diag nct6775 nct6775_core hwmon_vid corefreqk(O) ip6table_filter ip6_tables iptable_filter ip_tables x_tables efivarfs af_packet cfg80211 8021q garp mrp bridge stp llc bonding tls mei_gsc xe drm_gpuvm drm_exec gpu_sched drm_suballoc_helper intel_rapl_common x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel nvidia_drm(PO) nvidia_modeset(PO) i915 kvm nvidia(PO) btusb btrtl btbcm crct10dif_pclmul crc32_pclmul btintel crc32c_intel ghash_clmulni_intel sha512_ssse3 sha256_ssse3 sha1_ssse3 aesni_intel bluetooth iosf_mbi drm_buddy crypto_simd
Apr 26 15:31:01 Tower kernel: i2c_algo_bit cryptd drm_ttm_helper drm_display_helper ttm drm_kms_helper rapl intel_cstate mei_hdcp mei_pxp wmi_bmof intel_wmi_thunderbolt nvme drm intel_uncore r8125(O) igc mei_me nvme_core i2c_i801 intel_gtt mei i2c_smbus rfkill ahci agpgart ecdh_generic libahci input_leds joydev ecc led_class i2c_core thermal fan video wmi backlight acpi_tad acpi_pad button
Apr 26 15:31:01 Tower kernel: CPU: 4 UID: 99 PID: 1491939 Comm: tdarr-ffmpeg Tainted: P     U  W  O       6.12.23-Unraid #1
Apr 26 15:31:01 Tower kernel: Tainted: [P]=PROPRIETARY_MODULE, =USER, [W]=WARN, [O]=OOT_MODULE
Apr 26 15:31:01 Tower kernel: Hardware name: ASUS System Product Name/ROG STRIX Z490-H GAMING, BIOS 0403 03/23/2020
Apr 26 15:31:01 Tower kernel: RIP: 0010:__intel_runtime_pm_get.isra.0+0x62/0x80 [i915]
Apr 26 15:31:01 Tower kernel: Code: 71 19 00 01 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 d3 8a 68 db 44 89 e1 4c 89 ea 48 c7 c7 82 72 47 a6 48 89 c6 e8 0e af f5 da <0f> 0b 40 0f b6 f5 48 89 df 5b 5d 41 5c 41 5d e9 ea fe ff ff 66 2e
Apr 26 15:31:01 Tower kernel: RSP: 0018:ffffc90001387cf8 EFLAGS: 00010282
Apr 26 15:31:01 Tower kernel: RAX: 0000000000000000 RBX: ffff888105bea060 RCX: 0000000000000027
Apr 26 15:31:01 Tower kernel: RDX: 0000000000000002 RSI: ffffffff82341452 RDI: 00000000ffffffff
Apr 26 15:31:01 Tower kernel: RBP: 0000000000000001 R08: 0000000000000000 R09: ffffffff82da3e50
Apr 26 15:31:01 Tower kernel: R10: 0000000000000002 R11: 0000000000000008 R12: 00000000fffffff3
Apr 26 15:31:01 Tower kernel: R13: ffff888101d28c50 R14: ffff888100582ac0 R15: 0000000000000001
Apr 26 15:31:01 Tower kernel: FS:  0000000000000000(0000) GS:ffff889040100000(0000) knlGS:0000000000000000
Apr 26 15:31:01 Tower kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Apr 26 15:31:01 Tower kernel: CR2: 000014d298703a24 CR3: 0000000005418002 CR4: 00000000003726f0
Apr 26 15:31:01 Tower kernel: Call Trace:
Apr 26 15:31:01 Tower kernel: <TASK>
Apr 26 15:31:01 Tower kernel: intel_runtime_pm_get+0xf/0x20 [i915]
Apr 26 15:31:01 Tower kernel: i915_driver_release+0x22/0x80 [i915]
Apr 26 15:31:01 Tower kernel: drm_dev_put+0x39/0x70 [drm]
Apr 26 15:31:01 Tower kernel: singleton_release+0x1c/0x30 [i915]
Apr 26 15:31:01 Tower kernel: __fput+0x106/0x1d0
Apr 26 15:31:01 Tower kernel: task_work_run+0x67/0x80
Apr 26 15:31:01 Tower kernel: do_exit+0x36c/0x8c0
Apr 26 15:31:01 Tower kernel: ? __pfx_futex_wake_mark+0x10/0x10
Apr 26 15:31:01 Tower kernel: do_group_exit+0x79/0x80
Apr 26 15:31:01 Tower kernel: get_signal+0x61a/0x660
Apr 26 15:31:01 Tower kernel: ? __seccomp_filter+0x83/0x380
Apr 26 15:31:01 Tower kernel: ? _raw_spin_unlock+0x14/0x30
Apr 26 15:31:01 Tower kernel: arch_do_signal_or_restart+0x2a/0x1f0
Apr 26 15:31:01 Tower kernel: ? do_futex+0xe6/0x170
Apr 26 15:31:01 Tower kernel: ? __do_sys_futex+0x11f/0x150
Apr 26 15:31:01 Tower kernel: ? syscall_trace_enter+0x61/0x130
Apr 26 15:31:01 Tower kernel: syscall_exit_to_user_mode+0x4f/0x80
Apr 26 15:31:01 Tower kernel: do_syscall_64+0x82/0xe0
Apr 26 15:31:01 Tower kernel: entry_SYSCALL_64_after_hwframe+0x76/0x7e
Apr 26 15:31:01 Tower kernel: RIP: 0033:0x14d295491117
Apr 26 15:31:01 Tower kernel: Code: Unable to access opcode bytes at 0x14d2954910ed.
Apr 26 15:31:01 Tower kernel: RSP: 002b:000014d286dfb200 EFLAGS: 00000246 ORIG_RAX: 00000000000000ca
Apr 26 15:31:01 Tower kernel: RAX: fffffffffffffe00 RBX: 0000000000000000 RCX: 000014d295491117
Apr 26 15:31:01 Tower kernel: RDX: 0000000000000000 RSI: 0000000000000189 RDI: 00005613498784e4
Apr 26 15:31:01 Tower kernel: RBP: 00005613498784b8 R08: 0000000000000000 R09: 00000000ffffffff
Apr 26 15:31:01 Tower kernel: R10: 0000000000000000 R11: 0000000000000246 R12: 0000000000000000
Apr 26 15:31:01 Tower kernel: R13: 0000000000000000 R14: 0000000000000007 R15: 00005613498784e4
Apr 26 15:31:01 Tower kernel: </TASK>
Apr 26 15:31:01 Tower kernel: ---[ end trace 0000000000000000 ]---

updated bios, now can't log in system, safe mode doesn't help

 

 

A324CF2C-FEE1-49A9-9CBF-D8381157F0DB.heic

Edited by allenchou1994

  • Author

changed vfio-pci.cfg  file name,  but didn't help

image.thumb.png.ff28cd52bc231a0c3cf52d2f9a9a0ef0.png

  • Community Expert

Create a 2nd USB drive with Stock stable Unraid 7.0.1 to see if it boots ok. 

  • Author
1 hour ago, SimonF said:

Create a 2nd USB drive with Stock stable Unraid 7.0.1 to see if it boots ok. 

stil doesn't work

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

Account

Navigation

Search

Search

Configure browser push notifications

Chrome (Android)
  1. Tap the lock icon next to the address bar.
  2. Tap Permissions → Notifications.
  3. Adjust your preference.
Chrome (Desktop)
  1. Click the padlock icon in the address bar.
  2. Select Site settings.
  3. Find Notifications and adjust your preference.