Jump to content

[Support] ich777 - AMD Vendor Reset, CoralTPU, hpsahba,...


Recommended Posts

2 hours ago, ich777 said:

Please uninstall the plugin, then remove the .conf file that you've created with:

rm -f /boot/config/modprobe.d/qnap_ec.conf

then install the plugin again, reboot and see if it is working.

I think it worked - after rebooting, I can see the "Airflow" widget on the dashboard, and the Dynamix Fan Auto Control app sees the qnap_ec pwm fan controllers right away, I didn't need to fiddle with the terminal to get them to show up.

 

Thank you!

 

If I may ask, what changes did you make to fix this? And is this a global fix for all TS-464s in the qnap-ec plugin?

Link to comment
8 hours ago, asbath said:

If I may ask, what changes did you make to fix this?

I only made a little change to the module itself.

When the module/driver is run on Unraid it set's the detect_chip automatically to off/false.

 

8 hours ago, asbath said:

And is this a global fix for all TS-464s in the qnap-ec plugin?

This is a global fix for all users and will be working on 6.12.10+ (except for the closed beta so far but I will be recompiling the driver for the latest closed beta too).

  • Thanks 1
Link to comment
11 hours ago, ich777 said:

Please uninstall the plugin, then remove the .conf file that you've created with:

rm -f /boot/config/modprobe.d/qnap_ec.conf

then install the plugin again, reboot and see if it is working.

 

Done. Seems to be functioning. I'll keep an eye on it but should be good I assume! Thanks.

  • Like 1
Link to comment
30 minutes ago, shanelord said:

I'll keep an eye on it but should be good I assume!

I just switched my build toolchain over to my repository, you now shouldn't need to put that one file into place (Unraid 6.12.10+).

If Unraid is detected it will automatically disable the chip check.

Link to comment
1 hour ago, animeking1987 said:

My AMD Ryzen 7 5700G with Radeon Graphics @ 3800 MHz is not showing on GPU statics. I have installed the AMD plugin  but it is not showing at all

Can you please post your full Diagnostics you syslog shows too less information about whats going on.

Link to comment
33 minutes ago, animeking1987 said:

Your iGPU is not enabled, please make sure to enable it in the BIOS but this is most certainly caused because you also got a Nvidia GPU in your system:

01:00.0 VGA compatible controller [0300]: NVIDIA Corporation GP107 [GeForce GTX 1050 Ti] [10de:1c82] (rev a1)
	Subsystem: Micro-Star International Co., Ltd. [MSI] GP107 [GeForce GTX 1050 Ti] [1462:3351]
	Kernel driver in use: nvidia
	Kernel modules: nvidia_drm, nvidia
01:00.1 Audio device [0403]: NVIDIA Corporation GP107GL High Definition Audio Controller [10de:0fb9] (rev a1)
	Subsystem: Micro-Star International Co., Ltd. [MSI] GP107GL High Definition Audio Controller [1462:3351]

 

Please make sure that you iGPU is set to the primary video output and also enable Multi Monitor support to activate the iGPU even when a dGPU is installed (at least on most BIOS versions that's the option to keep the iGPU enable even when a dGPU is installed in the system).

 

BTW, this has nothing to do with the RadeonTOP plugin.

Link to comment
1 hour ago, ich777 said:

Your iGPU is not enabled, please make sure to enable it in the BIOS but this is most certainly caused because you also got a Nvidia GPU in your system:

01:00.0 VGA compatible controller [0300]: NVIDIA Corporation GP107 [GeForce GTX 1050 Ti] [10de:1c82] (rev a1)
	Subsystem: Micro-Star International Co., Ltd. [MSI] GP107 [GeForce GTX 1050 Ti] [1462:3351]
	Kernel driver in use: nvidia
	Kernel modules: nvidia_drm, nvidia
01:00.1 Audio device [0403]: NVIDIA Corporation GP107GL High Definition Audio Controller [10de:0fb9] (rev a1)
	Subsystem: Micro-Star International Co., Ltd. [MSI] GP107GL High Definition Audio Controller [1462:3351]

 

Please make sure that you iGPU is set to the primary video output and also enable Multi Monitor support to activate the iGPU even when a dGPU is installed (at least on most BIOS versions that's the option to keep the iGPU enable even when a dGPU is installed in the system).

 

BTW, this has nothing to do with the RadeonTOP plugin.

So I can't use both the GPU and IGPU.  So set IGPU to primary use on the bios? 

Link to comment
5 hours ago, ich777 said:

Your iGPU is not enabled, please make sure to enable it in the BIOS but this is most certainly caused because you also got a Nvidia GPU in your system:

01:00.0 VGA compatible controller [0300]: NVIDIA Corporation GP107 [GeForce GTX 1050 Ti] [10de:1c82] (rev a1)
	Subsystem: Micro-Star International Co., Ltd. [MSI] GP107 [GeForce GTX 1050 Ti] [1462:3351]
	Kernel driver in use: nvidia
	Kernel modules: nvidia_drm, nvidia
01:00.1 Audio device [0403]: NVIDIA Corporation GP107GL High Definition Audio Controller [10de:0fb9] (rev a1)
	Subsystem: Micro-Star International Co., Ltd. [MSI] GP107GL High Definition Audio Controller [1462:3351]

 

Please make sure that you iGPU is set to the primary video output and also enable Multi Monitor support to activate the iGPU even when a dGPU is installed (at least on most BIOS versions that's the option to keep the iGPU enable even when a dGPU is installed in the system).

 

BTW, this has nothing to do with the RadeonTOP plugin.

This worked. Enabled in the bios and now its active.

  • Like 2
Link to comment
  • 2 weeks later...

Hello Everyone

I need to know, my Spec 11900T using UHD750, Is now need using modprobe.d i915 to below command?

/boot/config/modprobe.d/i915.conf

options i915 force_probe=4c8a
options i915 enable_guc=2

or just only need install intel gpu top plugin then will work fine?

 

Link to comment
  • 2 weeks later...

Hey, 

I have the driver installed with a Radeon 6900XT and noticing these errors, checking the temps using gpu stats it sitting at 40c and I havent been able to find much regarding the other errors listed, on top my unraid server seem to hard lock up, every few days. not sure what the cause is, just assuming these errors might be related to that. Still currently trying to get a copy of the logs when the hard crash occurs. In the mean time anyone able to shed light on the issue below?
 

May 28 21:33:28 Tower kernel: amdgpu 0000:0a:00.0: amdgpu: Fail to disable thermal alert!
May 28 21:33:28 Tower kernel: [drm:amdgpu_device_ip_suspend_phase2 [amdgpu]] *ERROR* suspend of IP block <smu> failed -22
May 28 21:33:28 Tower kernel: amdgpu 0000:0a:00.0: amdgpu: free PSP TMR buffer
May 28 21:33:29 Tower kernel: [drm] PCIE GART of 512M enabled (table at 0x00000083FEB00000).
May 28 21:33:29 Tower kernel: [drm] PSP is resuming...
May 28 21:33:29 Tower kernel: [drm] reserve 0xa00000 from 0x83fd000000 for PSP TMR
May 28 21:33:29 Tower kernel: amdgpu 0000:0a:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
May 28 21:33:29 Tower kernel: amdgpu 0000:0a:00.0: amdgpu: SMU is resuming...
May 28 21:33:29 Tower kernel: amdgpu 0000:0a:00.0: amdgpu: smu driver if version = 0x00000040, smu fw if version = 0x00000041, smu fw program = 0, version = 0x003a5800 (58.88.0)
May 28 21:33:29 Tower kernel: amdgpu 0000:0a:00.0: amdgpu: SMU driver if version not matched
May 28 21:33:29 Tower kernel: amdgpu 0000:0a:00.0: amdgpu: dpm has been enabled
May 28 21:33:29 Tower kernel: amdgpu 0000:0a:00.0: amdgpu: SMU is resumed successfully!
May 28 21:33:29 Tower kernel: [drm] DMUB hardware initialized: version=0x02020020
May 28 21:33:29 Tower kernel: [drm] kiq ring mec 2 pipe 1 q 0
May 28 21:33:29 Tower kernel: [drm] VCN decode and encode initialized successfully(under DPG Mode).
May 28 21:33:29 Tower kernel: [drm] JPEG decode initialized successfully.
May 28 21:33:29 Tower kernel: amdgpu 0000:0a:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0
May 28 21:33:29 Tower kernel: amdgpu 0000:0a:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
May 28 21:33:29 Tower kernel: amdgpu 0000:0a:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
May 28 21:33:29 Tower kernel: amdgpu 0000:0a:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 5 on hub 0
May 28 21:33:29 Tower kernel: amdgpu 0000:0a:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 6 on hub 0
May 28 21:33:29 Tower kernel: amdgpu 0000:0a:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 7 on hub 0
May 28 21:33:29 Tower kernel: amdgpu 0000:0a:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 8 on hub 0
May 28 21:33:29 Tower kernel: amdgpu 0000:0a:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 9 on hub 0
May 28 21:33:29 Tower kernel: amdgpu 0000:0a:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 10 on hub 0
May 28 21:33:29 Tower kernel: amdgpu 0000:0a:00.0: amdgpu: ring kiq_2.1.0 uses VM inv eng 11 on hub 0
May 28 21:33:29 Tower kernel: amdgpu 0000:0a:00.0: amdgpu: ring sdma0 uses VM inv eng 12 on hub 0
May 28 21:33:29 Tower kernel: amdgpu 0000:0a:00.0: amdgpu: ring sdma1 uses VM inv eng 13 on hub 0
May 28 21:33:29 Tower kernel: amdgpu 0000:0a:00.0: amdgpu: ring sdma2 uses VM inv eng 14 on hub 0
May 28 21:33:29 Tower kernel: amdgpu 0000:0a:00.0: amdgpu: ring sdma3 uses VM inv eng 15 on hub 0
May 28 21:33:29 Tower kernel: amdgpu 0000:0a:00.0: amdgpu: ring vcn_dec_0 uses VM inv eng 0 on hub 1
May 28 21:33:29 Tower kernel: amdgpu 0000:0a:00.0: amdgpu: ring vcn_enc_0.0 uses VM inv eng 1 on hub 1
May 28 21:33:29 Tower kernel: amdgpu 0000:0a:00.0: amdgpu: ring vcn_enc_0.1 uses VM inv eng 4 on hub 1
May 28 21:33:29 Tower kernel: amdgpu 0000:0a:00.0: amdgpu: ring vcn_dec_1 uses VM inv eng 5 on hub 1
May 28 21:33:29 Tower kernel: amdgpu 0000:0a:00.0: amdgpu: ring vcn_enc_1.0 uses VM inv eng 6 on hub 1
May 28 21:33:29 Tower kernel: amdgpu 0000:0a:00.0: amdgpu: ring vcn_enc_1.1 uses VM inv eng 7 on hub 1
May 28 21:33:29 Tower kernel: amdgpu 0000:0a:00.0: amdgpu: ring jpeg_dec uses VM inv eng 8 on hub 1
May 28 21:33:29 Tower kernel: amdgpu 0000:0a:00.0: [drm] Cannot find any crtc or sizes

 

Link to comment
16 minutes ago, Salty2011 said:

I have the driver installed with a Radeon 6900XT and noticing these errors, checking the temps using gpu stats it sitting at 40c and I havent been able to find much regarding the other errors listed, on top my unraid server seem to hard lock up, every few days.

My plugin is not strictly speaking a driver, it is just a binary that reads the stats from your GPU.

 

Do you have the GPU passed through to a VM? Diagnostics would also help to investigate further.

Link to comment
6 minutes ago, ich777 said:

My plugin is not strictly speaking a driver, it is just a binary that reads the stats from your GPU.

 

Do you have the GPU passed through to a VM? Diagnostics would also help to investigate further.


Dont have the GPU passed through to any VM, but do have it passed into the emby container, also that does seem to detect it. Did test using plex / unmanic and that works without issue and see load on the card using radeontop (so suspect its just the emby container)

 

Ive attached the diagnostic zip.

 

tower-diagnostics-20240528-2203.zip

Link to comment
36 minutes ago, Salty2011 said:

Dont have the GPU passed through to any VM, but do have it passed into the emby container, also that does seem to detect it.

May I ask why you even have two GPUs in your Server?

As far as I can see your GTX1080 should be sufficient and the AMD GPU doesn't make any sense to me if you haven't passed through one of either GPUs to a VM.

 

Anyways, make sure to disable C-States in the BIOS because it can cause trouble with AMD CPUs and PCIe devices (and other weird behavior).

Link to comment
9 hours ago, ich777 said:

May I ask why you even have two GPUs in your Server?

As far as I can see your GTX1080 should be sufficient and the AMD GPU doesn't make any sense to me if you haven't passed through one of either GPUs to a VM.

 

Anyways, make sure to disable C-States in the BIOS because it can cause trouble with AMD CPUs and PCIe devices (and other weird behavior).

thanks @ich777 ill take a look at disabling the c-states.

 

As for the duel gpu's, this is server I built up with parts I had laying around. I opted to put a second gpu in the rig as I wanted to test / poc moving to a virtualized gaming rig. There are two approaches i was going to look at trying to achieve this, either using Steam headless / Games On Whales (Wolf) or a VM with GPU pass through and just have sunshine stream installed.

 

end goal is to completely consolidate nas/daily pc and home assistant into the one box. However im quickly finding that for transcoding and game streaming, and literally everything nvidia seems to the better option to use for allot of this. Was kinda surprised given from a linux desktop standpoint nvidia has always been a royal pain. (i noticed the transcode performance of the 1080 was equal to the 6900xt but at way less power draw)

Link to comment
4 hours ago, Salty2011 said:

Steam headless / Games On Whales (Wolf)

There are now many things available it seems, don't forget Gaming on Whales (GoW). :D

 

4 hours ago, Salty2011 said:

6900xt

I would always recommend, at least at the time of writing, a Nvidia card since they are working both in VMs and in Docker container on Unraid and doesn't cause a crash like most of the AMD cards do.

 

For the crashing I really can't help much since this can be a Hardware compatibility issue or also a Firmware issue, either from the card itself or the Motherboard.

Link to comment
2 hours ago, ich777 said:

There are now many things available it seems, don't forget Gaming on Whales (GoW). :D

 

I would always recommend, at least at the time of writing, a Nvidia card since they are working both in VMs and in Docker container on Unraid and doesn't cause a crash like most of the AMD cards do.

 

For the crashing I really can't help much since this can be a Hardware compatibility issue or also a Firmware issue, either from the card itself or the Motherboard.

yeah thats fair. ill try the c-states suggestion and see if helps. if not will take out the AMD card. the use cases for it arent a must and ill wait till the next version of nvidia cards to come out and either get something on sale or the new option.

  • Like 1
Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

×
×
  • Create New...