[Plugin] Nvidia-Driver


ich777

Recommended Posts

2 minutes ago, ich777 said:

That makes no difference for what you are using it, please also check if you have an option for resizable BAR and enable it too.

 

From what I see in your logs the driver is initialized successfully and after you put a load on it it fails and the card falls from the bus.

Can you try to put the card in another slot and check if the power cables are attached properly?

 

If possible please try to install the card in a desktop PC, install the drivers and put a 3D load on it if it works flawlessly.

yes i have the resizable BAR enable...

maybe this is a temperature issue? i have an 80Cº limit on trex but maybe the gpu get more hot. 4 days after start mining i get this problem. what you think?

Link to comment
5 minutes ago, luixmod said:

maybe this is a temperature issue? i have an 80Cº limit on trex but maybe the gpu get more hot. 4 days after start mining i get this problem. what you think?

Maybe, but I can't tell for sure, I can only tell you that it drops from the bus, is this a founders edition card? I won't recommend to run a card always at 80C 24/7.

Link to comment
10 minutes ago, GrahamsCrackers said:

I'm trying to add an EVGA XC3 3070 to my unraid server. I can download the Nvidia-Driver from community apps, but when I attempt to open it in plugins, it just sits on a blank screen and CPU 8 goes to 100%. I tried on unraid 6.9.2 and just upgraded to 6.10.0-rc2.

Can you post your Diagnostics please?

Link to comment
8 hours ago, GrahamsCrackers said:

I just restarted and pulled a new one.

I see nothing suspicious from the logs, it should all work fine, the card is recognized correctly:

0a:00.0 VGA compatible controller [0300]: NVIDIA Corporation GA104 [GeForce RTX 3070 Lite Hash Rate] [10de:2488] (rev a1)
	Subsystem: eVga.com. Corp. Device [3842:4755]
	Kernel driver in use: nvidia
	Kernel modules: nvidia_drm, nvidia
0a:00.1 Audio device [0403]: NVIDIA Corporation GA104 High Definition Audio Controller [10de:228b] (rev a1)
	Subsystem: eVga.com. Corp. Device [3842:4755]

 

Also from your syslog:

Dec 23 14:36:47 Tower root: 
Dec 23 14:36:47 Tower root: +==============================================================================
Dec 23 14:36:47 Tower root: | Installing new package /boot/config/plugins/nvidia-driver/nvidia-driver-2021.09.17.txz
Dec 23 14:36:47 Tower root: +==============================================================================
Dec 23 14:36:47 Tower root: 
Dec 23 14:36:47 Tower root: Verifying package nvidia-driver-2021.09.17.txz.
Dec 23 14:36:47 Tower root: Installing package nvidia-driver-2021.09.17.txz:
Dec 23 14:36:47 Tower root: PACKAGE DESCRIPTION:
Dec 23 14:36:47 Tower root: Package nvidia-driver-2021.09.17.txz installed.
Dec 23 14:36:47 Tower root: plugin: creating: /usr/local/emhttp/plugins/nvidia-driver/README.md - from INLINE content
Dec 23 14:36:47 Tower root: plugin: running: anonymous
Dec 23 14:36:48 Tower root: 
Dec 23 14:36:48 Tower root: --------------------Nvidia driver v495.46 found locally---------------------
Dec 23 14:36:48 Tower root: 
Dec 23 14:36:48 Tower root: -----------------Installing Nvidia Driver Package v495.46-------------------
Dec 23 14:37:12 Tower kernel: nvidia: module license 'NVIDIA' taints kernel.
Dec 23 14:37:12 Tower kernel: Disabling lock debugging due to kernel taint
Dec 23 14:37:12 Tower kernel: nvidia-nvlink: Nvlink Core is being initialized, major device number 242
Dec 23 14:37:12 Tower kernel: 
Dec 23 14:37:12 Tower kernel: nvidia 0000:0a:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=none:owns=io+mem
Dec 23 14:37:12 Tower kernel: NVRM: loading NVIDIA UNIX x86_64 Kernel Module  495.46  Wed Oct 27 16:31:33 UTC 2021
Dec 23 14:37:12 Tower root: 
Dec 23 14:37:12 Tower root: --------------Installation of Nvidia driver v495.46 successful--------------
Dec 23 14:37:12 Tower root: plugin: nvidia-driver.plg installed

 

And there is no other error in there.

 

Can you please try to open up a terminal from unRAID itself and post the output from:

nvidia-smi

 

If you are experiencing any issues when issuing this command or it takes a long time to execute please pull the Diagnostics again and post them here but first I have some recommendations:

  • From what I see you are booting with UEFI, try to boot with Legacy Mode
  • Make sure that Above 4G decoding and Resizable BAR support is enabled in the BIOS
  • Set PCIe ACS override to Downstream in the VM Manager
    grafik.png.c226ef332161913e013fad59a3435d97.png
  • Make sure that AMD-V and AMD-Vi is both enabled
Link to comment
On 12/23/2021 at 8:57 PM, ich777 said:

Can you please post your Diagnostics?

I think this is maybe an issue with the plexpass container from @binhex.

I now saw multiple people report that a reinstallation from the container helped and that hardware transcoding was working after that.

 

Sorry for the delay, I had not seen the answer, I am attaching diagnostics, do you intend to reinstall plex?

 

Edited by Fastbobo
Link to comment
12 hours ago, ich777 said:

Yes, this solved most of the issues, see first/second/third page of this thread and search for @binhex on the site.

 

I just tried to remove Plex from the terminal and reinstall it, unfortunately nothing has changed, it continues to use only the CPU, I also tried to disable the integrated gpu from the bios and leave only PCIE but even so it uses only CPU.

I do not know what to do...

Link to comment
7 minutes ago, Fastbobo said:

I just tried to remove Plex from the terminal and reinstall it, unfortunately nothing has changed, it continues to use only the CPU, I also tried to disable the integrated gpu from the bios and leave only PCIE but even so it uses only CPU.

I do not know what to do...

Please post a few screenshots from what you did so far (container templates & Plex settings) please, this would be a good start, have you enabled HW transcoding in the Plex settings and also created a Plex Token?

Link to comment
27 minutes ago, ich777 said:

Please post a few screenshots from what you did so far (container templates & Plex settings) please, this would be a good start, have you enabled HW transcoding in the Plex settings and also created a Plex Token?

 

Ok, I attach screeshot, what do you mean by Plex Token?

Immagine 2021-12-28 213621.png

Immagine 2021-12-28 213715.png

Immagine 2021-12-28 213729.png

Immagine 2021-12-28 213519_LI.jpg

Immagine 2021-12-28 214041.png

Link to comment
36 minutes ago, Fastbobo said:

attach screenshot,

You do not create a variable called Extra Parameters.  It is already part of the template.  You must turn on Advanced View (Basic/Advanced View toggle) in the container template to see it.

 

image.png.cab38f0775174f317ef6e46df1aa6084.png

 

Since I have an Intel CPU with an iGPU I use that for transcoding as seen below.  You need the --runtime-=nvidia in Extra Parameters.

 

image.png.6670414171259179ad5f92801ffec5b3.png

  • Like 2
Link to comment
7 minutes ago, BradleyL said:

Hello, struggling to get this installed with a P2000. It is supported by the latest driver so really not sure what the issue is atm.

You have bound the card to VFIO from what I see in the screenshot and also in the Diagnostics.

If you bind it to VFIO the plugin can't see the card, please unbind it from VFIO and reboot your server. After that it should see the card, if not please post your Diagnostics again.

 

 

26:00.0 VGA compatible controller [0300]: NVIDIA Corporation GP106GL [Quadro P2000] [10de:1c30] (rev a1)
	Subsystem: Dell GP106GL [Quadro P2000] [1028:11b3]
	Kernel driver in use: vfio-pci
	Kernel modules: nvidia_drm, nvidia
26:00.1 Audio device [0403]: NVIDIA Corporation GP106 High Definition Audio Controller [10de:10f1] (rev a1)
	Subsystem: Dell GP106 High Definition Audio Controller [1028:11b3]
	Kernel driver in use: vfio-pci

(you can see here that it uses the vfio-pci kernel module not the nvidia one because it's bound to VFIO)

Link to comment
11 minutes ago, ich777 said:

You have bound the card to VFIO from what I see in the screenshot and also in the Diagnostics.

If you bind it to VFIO the plugin can't see the card, please unbind it from VFIO and reboot your server. After that it should see the card, if not please post your Diagnostics again.

 

 

26:00.0 VGA compatible controller [0300]: NVIDIA Corporation GP106GL [Quadro P2000] [10de:1c30] (rev a1)
	Subsystem: Dell GP106GL [Quadro P2000] [1028:11b3]
	Kernel driver in use: vfio-pci
	Kernel modules: nvidia_drm, nvidia
26:00.1 Audio device [0403]: NVIDIA Corporation GP106 High Definition Audio Controller [10de:10f1] (rev a1)
	Subsystem: Dell GP106 High Definition Audio Controller [1028:11b3]
	Kernel driver in use: vfio-pci

(you can see here that it uses the vfio-pci kernel module not the nvidia one because it's bound to VFIO)


Cheers that was the issue

  • Like 1
Link to comment

I have a brand new 3060 that I just put into my new unraid server. Had an older 6GB 1060 that was in there and it worked okay, but wanted more power. Unraid doesn't detect the 3060.

Any thoughts?

Nvidia drivers updated and v495.46 is loaded. Nvidia driver plugin shows this error.

 

"NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running."

 

This is a supermicro server mobo.

3060 was in a Windows 10 pc for a week or 2 so I know it works.

 

Link to comment
12 minutes ago, mgadbois said:

"NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running."

Without the Diagnostics I really can't tell anything.

 

Have you turned on Above 4G decoding in the BIOS also enable resizable BAR support if you have this option in your BIOS. Most of the times it seems like it's incompatibility between the hardware (Motherboard & Graphics Card) or a BIOS issue of some kind.

Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.