[Plugin] Nvidia-Driver


ich777


2 hours ago, Tithonius said:

welp we crashed again with the usb in a usb 3 slot... time to add a pci usb card...

Have you tried removing the Nvidia Driver plugin to see whether the crashes also happen without it installed? (After removing it, you have to reboot.)

 

If you've already done this, I would strongly recommend creating a post in the General subforums here on the forums.

Link to comment
1 hour ago, tjb_altf4 said:

6.10.3

I've now tried it on my test server and it is working flawlessly (please ignore that no card is detected, I don't have one in that system currently):
grafik.thumb.png.20355c707826e8b1940ceceb369150f0.png

 

The Latest Versions indicator is empty because the latest Production Branch, New Feature Branch and Open Source drivers are simply not available for this old Unraid version. Once a new Unraid version is released, I stop compiling drivers for the previous version and start fresh with the latest available Nvidia drivers for the current Unraid version.

 

Hope that makes sense to you.

Link to comment

Signed up just so I could post here for others' reference, since I've seen multiple people ask and get conflicting answers:
I was able to get a Tesla P4 working with Docker containers such as Plex using the default Nvidia Driver plugin. I was running Unraid 6.11.5 with driver 530.30.02.

 

One issue I had was that the official Plex docker did not properly accept hardware transcoding. I was able to remedy this by switching to the linuxserver docker for Plex. I was also able to do hardware transcoding with the P4 in Jellyfin.

 

If you are having issues getting hardware transcoding in plex working, test with a different plex docker.

Edited by teslap4
typo
Link to comment

Okay, I'm back, and GOOD NEWS EVERYONE! I fixed the random hard crashes that I've been having! It seems that Unraid didn't like the "mismatched" RAM in my system (same model number, same timings, the works, but 2 of my 4 sticks had different PCB layouts). That being said, I'm not out of the woods yet. I had a good run of about 10 days in a row with no crashes, and now I'm back to square one, before all this started.

 

So now I am back to the issue where, occasionally, I get home, hit refresh on my web UI, and am met with a 500 Internal Server Error. But now I have proper logging set up and was able to capture the error in a syslog. I think this shows what's going on (hopefully).

 

Would love if you guys could take a look. It does look to me like an nvidia issue, hence why I posted here. 

 

Thanks again.

syslog-192.168.1.10.log

Edited by Tithonius
Link to comment
46 minutes ago, Tithonius said:


Would love if you guys could take a look. It does look to me like an nvidia issue, hence why I posted here. 

 

Thanks again.

syslog-192.168.1.10.log 879.46 kB · 1 download

 

I'm no pro on this, but that kernel dump looks to me like a thermal failure. Your card (or motherboard?) overheated and got a bad memory call. The nvidia-smi called by your dashboard is what crashed, and then nginx died trying to query it.
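For anyone who wants to check a syslog like this themselves, here is a minimal sketch that pulls out only the lines mentioning the Nvidia kernel driver (which tags its kernel messages with NVRM) or nvidia-smi. The function name and the default log path are assumptions for illustration; pass the path of your own captured log as the first argument.

```shell
#!/bin/sh
# Print only the syslog lines that mention the Nvidia kernel driver (NVRM)
# or nvidia-smi. The default path is an assumption; pass your own log file.
nvidia_log_lines() {
    log_file="${1:-/var/log/syslog}"
    if [ ! -r "$log_file" ]; then
        echo "cannot read $log_file" >&2
        return 1
    fi
    # -i: ignore case, -E: extended regular expression
    grep -iE 'NVRM|nvidia-smi' "$log_file"
}
```

Calling `nvidia_log_lines syslog-192.168.1.10.log` on the attached file would list just those lines, which makes it easier to see what happened right before the web UI stopped responding.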

Link to comment
19 minutes ago, hexfury said:

 

I'm no pro on this, but that kernel dump looks to me like a thermal failure. Your card (or motherboard?) overheated and got a bad memory call. The nvidia-smi called by your dashboard is what crashed, and then nginx died trying to query it.

 

This is definitely not a thermal crash. The card runs at a nice cool 45 °C all the time, even under full transcoding load. Also, to be clear, the server didn't crash, just the GUI.
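A temperature claim like this can be double-checked from the command line. The sketch below assumes the driver is installed and nvidia-smi is on the PATH; the function name is made up for illustration, while the `--query-gpu=temperature.gpu` and `--format=csv,noheader,nounits` options are standard nvidia-smi query options.

```shell
#!/bin/sh
# Print the current GPU temperature in degrees Celsius, one line per GPU.
# Returns 1 if nvidia-smi is not available (for example, driver not installed).
gpu_temperature() {
    if ! command -v nvidia-smi >/dev/null 2>&1; then
        echo "nvidia-smi not found; is the driver installed?" >&2
        return 1
    fi
    nvidia-smi --query-gpu=temperature.gpu --format=csv,noheader,nounits
}
```

Running `gpu_temperature` in a loop during a transcode and appending the output to a file on the flash drive would leave a record to compare against the time of the next GUI failure.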

 

I'm not overclocking anything, so the motherboard should easily be within thermal limits (it's just an i3-10100), and the CPU and the rest have never had a thermal crash or any heat-related issue before.

Edited by Tithonius
Link to comment
13 hours ago, Tithonius said:

Would love if you guys could take a look. It does look to me like an nvidia issue, hence why I posted here. 

Definitely an issue with your Nvidia GPU.

I really can't say if it's a hardware compatibility issue or if the card is simply faulty in certain scenarios.

 

May I ask why you are using the Nvidia GPU when you have a capable iGPU for transcoding? Sure, it's a bit slower in tdarr, but it gets the job done, and in most cases the quality is better than with NVENC.

 

I would recommend that you uninstall the GPU Statistics plugin first and see if that makes a difference.

Link to comment

I wish to say thank you for this awesome plugin, and all of the support you give to the Unraid community daily.

 

I am resurrecting an old server I found down in the spare parts in my basement.  An older Intel i-530 Clarkdale with 4GB of DRAM.  Currently running Unraid 5.0.4, but I am thinking of updating. 

 

I'm trying to add a Nvidia GPU to crunch through my collection of 4K movies, but for some strange reason, it isn't recognizing this GeForce 256.  Any ideas?

 

Oh... and Happy April Fools Day.  😁

GeForce 256.jpg

Link to comment

Hello,

my PC crashed while installing this plugin and now I'm stuck.

After a reboot of my PC I tried installing again but nothing seemed to happen in the INSTALL PLUGIN window after this:

 

plugin: installing: https://github.com/ich777/unraid-nvidia-driver/raw/master/nvidia-driver.plg
plugin: downloading https://github.com/ich777/unraid-nvidia-driver/raw/master/nvidia-driver.plg
plugin: downloading: https://github.com/ich777/unraid-nvidia-driver/raw/master/nvidia-driver.plg ... done
plugin: downloading: https://github.com/ich777/unraid-nvidia-driver/raw/master/packages/nvidia-driver-2023.03.02.txz ... done

+==============================================================================
| Skipping package nvidia-driver-2023.03.02 (already installed)
+==============================================================================

 

I checked the logs and decided to remove these folders:

/boot/config/plugins/nvidia-driver and
/usr/local/emhttp/plugins/nvidia-driver

 

Now the INSTALL PLUGIN window shows this:

plugin: installing: https://github.com/ich777/unraid-nvidia-driver/raw/master/nvidia-driver.plg
plugin: downloading https://github.com/ich777/unraid-nvidia-driver/raw/master/nvidia-driver.plg
plugin: downloading: https://github.com/ich777/unraid-nvidia-driver/raw/master/nvidia-driver.plg ... done
plugin: downloading: https://github.com/ich777/unraid-nvidia-driver/raw/master/packages/nvidia-driver-2023.03.02.txz ... done

+==============================================================================
| Skipping package nvidia-driver-2023.03.02 (already installed)
+==============================================================================


+==============================================================================
| WARNING - WARNING - WARNING - WARNING - WARNING - WARNING - WARNING - WARNING
|
| Don't close this window with the red 'X' in the top right corner until the 'DONE' button is displayed!
|
| WARNING - WARNING - WARNING - WARNING - WARNING - WARNING - WARNING - WARNING
+==============================================================================

-----------------Downloading Nvidia Driver Package v515.76------------------
----------This could take some time, please don't close this window!------------

 

But in the logs it seems stuck again:

Apr  2 00:12:45 Stereo emhttpd: cmd: /usr/local/emhttp/plugins/community.applications/scripts/pluginInstall.php install https://github.com/ich777/unraid-nvidia-driver/raw/master/nvidia-driver.plg
Apr  2 00:12:46 Stereo root: plugin: running: anonymous
Apr  2 00:12:46 Stereo root: plugin: creating: /boot/config/plugins/nvidia-driver/nvidia-driver-2023.03.02.txz - downloading from URL https://github.com/ich777/unraid-nvidia-driver/raw/master/packages/nvidia-driver-2023.03.02.txz
Apr  2 00:12:46 Stereo root: plugin: checking: /boot/config/plugins/nvidia-driver/nvidia-driver-2023.03.02.txz - MD5
Apr  2 00:12:46 Stereo root: plugin: running: /boot/config/plugins/nvidia-driver/nvidia-driver-2023.03.02.txz
Apr  2 00:12:46 Stereo root: plugin: creating: /usr/local/emhttp/plugins/nvidia-driver/README.md - from INLINE content
Apr  2 00:12:46 Stereo root: plugin: running: anonymous

It has been over an hour now.

 

When I try nvidia-smi, this is the result:

root@Stereo:~# nvidia-smi
bash: nvidia-smi: command not found

 

I have no idea what to do, is there a way to completely reset the installation process?

 

The part of my syslog beginning with the install process:

Apr  1 23:29:20 Stereo emhttpd: cmd: /usr/local/emhttp/plugins/community.applications/scripts/pluginInstall.php install https://github.com/ich777/unraid-nvidia-driver/raw/master/nvidia-driver.plg
Apr  1 23:29:21 Stereo root: plugin: running: anonymous
Apr  1 23:29:21 Stereo root: plugin: creating: /boot/config/plugins/nvidia-driver/nvidia-driver-2023.03.02.txz - downloading from URL https://github.com/ich777/unraid-nvidia-driver/raw/master/packages/nvidia-driver-2023.03.02.txz
Apr  1 23:29:21 Stereo root: plugin: checking: /boot/config/plugins/nvidia-driver/nvidia-driver-2023.03.02.txz - MD5
Apr  1 23:29:21 Stereo root: plugin: running: /boot/config/plugins/nvidia-driver/nvidia-driver-2023.03.02.txz
Apr  1 23:29:21 Stereo root: plugin: creating: /usr/local/emhttp/plugins/nvidia-driver/README.md - from INLINE content
Apr  1 23:29:21 Stereo root: plugin: running: anonymous
Apr  1 23:32:36 Stereo emhttpd: cmd: /usr/local/emhttp/plugins/community.applications/scripts/pluginInstall.php install https://github.com/ich777/unraid-nvidia-driver/raw/master/nvidia-driver.plg
Apr  1 23:32:36 Stereo root: plugin: running: anonymous
Apr  1 23:32:36 Stereo root: plugin: skipping: /boot/config/plugins/nvidia-driver/nvidia-driver-2023.03.02.txz already exists
Apr  1 23:32:36 Stereo root: plugin: running: /boot/config/plugins/nvidia-driver/nvidia-driver-2023.03.02.txz
Apr  1 23:32:36 Stereo root: plugin: skipping: /usr/local/emhttp/plugins/nvidia-driver/README.md already exists
Apr  1 23:32:36 Stereo root: plugin: running: anonymous
Apr  1 23:47:11 Stereo emhttpd: cmd: /usr/local/emhttp/plugins/community.applications/scripts/pluginInstall.php install https://github.com/ich777/unraid-nvidia-driver/raw/master/nvidia-driver.plg
Apr  1 23:47:12 Stereo root: plugin: running: anonymous
Apr  1 23:47:12 Stereo root: plugin: skipping: /boot/config/plugins/nvidia-driver/nvidia-driver-2023.03.02.txz already exists
Apr  1 23:47:12 Stereo root: plugin: running: /boot/config/plugins/nvidia-driver/nvidia-driver-2023.03.02.txz
Apr  1 23:47:12 Stereo root: plugin: skipping: /usr/local/emhttp/plugins/nvidia-driver/README.md already exists
Apr  1 23:47:12 Stereo root: plugin: running: anonymous
Apr  1 23:48:28 Stereo emhttpd: cmd: /usr/local/emhttp/plugins/community.applications/scripts/pluginInstall.php install https://github.com/ich777/unraid-nvidia-driver/raw/master/nvidia-driver.plg
Apr  1 23:48:29 Stereo root: plugin: running: anonymous
Apr  1 23:48:29 Stereo root: plugin: creating: /boot/config/plugins/nvidia-driver/nvidia-driver-2023.03.02.txz - downloading from URL https://github.com/ich777/unraid-nvidia-driver/raw/master/packages/nvidia-driver-2023.03.02.txz
Apr  1 23:48:29 Stereo root: plugin: checking: /boot/config/plugins/nvidia-driver/nvidia-driver-2023.03.02.txz - MD5
Apr  1 23:48:29 Stereo root: plugin: running: /boot/config/plugins/nvidia-driver/nvidia-driver-2023.03.02.txz
Apr  1 23:48:29 Stereo root: plugin: skipping: /usr/local/emhttp/plugins/nvidia-driver/README.md already exists
Apr  1 23:48:29 Stereo root: plugin: running: anonymous
Apr  2 00:12:45 Stereo emhttpd: cmd: /usr/local/emhttp/plugins/community.applications/scripts/pluginInstall.php install https://github.com/ich777/unraid-nvidia-driver/raw/master/nvidia-driver.plg
Apr  2 00:12:46 Stereo root: plugin: running: anonymous
Apr  2 00:12:46 Stereo root: plugin: creating: /boot/config/plugins/nvidia-driver/nvidia-driver-2023.03.02.txz - downloading from URL https://github.com/ich777/unraid-nvidia-driver/raw/master/packages/nvidia-driver-2023.03.02.txz
Apr  2 00:12:46 Stereo root: plugin: checking: /boot/config/plugins/nvidia-driver/nvidia-driver-2023.03.02.txz - MD5
Apr  2 00:12:46 Stereo root: plugin: running: /boot/config/plugins/nvidia-driver/nvidia-driver-2023.03.02.txz
Apr  2 00:12:46 Stereo root: plugin: creating: /usr/local/emhttp/plugins/nvidia-driver/README.md - from INLINE content
Apr  2 00:12:46 Stereo root: plugin: running: anonymous

 

Edited by marcoroegner
Link to comment
6 hours ago, marcoroegner said:

my PC crashed while installing this plugin and now I'm stuck.

Please post your Diagnostics.

 

It would be really interesting to know why it crashed.

 

Remove the plugin, reboot, and then install it again to solve the issues you describe above.

 

But I would really recommend that you post your Diagnostics first, so that I can look into why it crashed and make sure this doesn't happen again.

Link to comment

I'm sorry, by "my PC crashed" I meant that the desktop PC from which I started the install crashed, not the Unraid server.

 

I managed to figure out my problem. I was running OS v. 6.10.3. After an upgrade to OS v. 6.11.5, the install process ran smoothly and the plugin is now installed.
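In case it helps someone else, here is a rough sketch for checking the running OS version before installing. It assumes that the version is stored in /etc/unraid-version as a line like version="6.11.5" and that `sort -V` is available; both function names are made up for illustration, and which minimum version to compare against is up to you.

```shell
#!/bin/sh
# Print the version string from a file that contains a line like version="6.11.5".
# The default path is an assumption about where Unraid stores its version.
current_unraid_version() {
    sed -n 's/^version="\(.*\)"$/\1/p' "${1:-/etc/unraid-version}"
}

# Succeed when version $1 is greater than or equal to version $2.
version_at_least() {
    # sort -V orders version strings numerically per component (6.9 before 6.10)
    lowest=$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)
    [ "$lowest" = "$2" ]
}
```

For example, `version_at_least "$(current_unraid_version)" 6.11.5 || echo "consider upgrading first"` would print the hint on my old 6.10.3 install.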

 

Thank you for your reply and help!

Link to comment
1 hour ago, marcoroegner said:

I'm sorry, with "my PC crashed" I meant the Desktop PC from which I started the install crashed. Not the Unraid Server.

This is most likely a really weird coincidence of some sort.

I can't imagine why the PC from which you started the installation would crash.

Link to comment

It happens from time to time. I'm running Fedora Desktop and I am very happy with it... It just freezes around once a week. It didn't bother me enough to search for the problem, so far.

 

Since the Install Window says not to close it under any circumstances, I freaked out a little 😅

Link to comment
2 hours ago, marcoroegner said:

Since the Install Window says not to close it under any circumstances, I freaked out a little 😅

Next time this happens, simply remove the nvidia-driver.plg file and the nvidia-driver folder from the /boot/config/plugins directory, reboot your server, and install the plugin again from the CA App.
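From a terminal, the cleanup could look roughly like this. It is only a sketch: the function name is made up for illustration, and the optional argument exists purely so that it can be tried on a copy of the directory first. Only the two items named above are removed, nothing else in the plugins directory is touched.

```shell
#!/bin/sh
# Remove the leftover Nvidia Driver plugin files so the plugin can be
# installed cleanly again. Defaults to /boot/config/plugins.
reset_nvidia_plugin() {
    plugin_dir="${1:-/boot/config/plugins}"
    rm -f  "$plugin_dir/nvidia-driver.plg"   # the plugin definition file
    rm -rf "$plugin_dir/nvidia-driver"       # the downloaded driver packages
    echo "Removed plugin files from $plugin_dir; reboot, then reinstall from the CA App."
}
```

After running `reset_nvidia_plugin`, the reboot and the reinstall from the CA App are still manual steps.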

Link to comment
47 minutes ago, joykingdom said:

Can anyone help me?

This is most certainly caused by a bug in your BIOS. Have you checked whether you are on the latest BIOS version?

 

Why did you create the nvidia.conf file in your modprobe.d folder:

options nvidia NVreg_OpenRmEnableUnsupportedGpus=1

You aren't even running the open source driver.
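To find out which file sets this option, something along these lines should do. It is a sketch only: the function name and the default directory are assumptions, so point it at wherever your modprobe.d folder actually lives.

```shell
#!/bin/sh
# List the files under a modprobe.d directory that set the
# NVreg_OpenRmEnableUnsupportedGpus option. Returns non-zero if none do.
find_unsupported_gpu_option() {
    conf_dir="${1:-/etc/modprobe.d}"
    # -r: search recursively, -l: print only the names of matching files
    grep -rl 'NVreg_OpenRmEnableUnsupportedGpus' "$conf_dir" 2>/dev/null
}
```

If `find_unsupported_gpu_option` prints a file name, that file can be removed or the line commented out before the next reboot.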

 

Try booting with CSM (Legacy Mode) and see if that solves the issue, but this is most certainly a bug in your BIOS.

Also make sure that you've enabled Above 4G Decoding and Resizable BAR Support in your BIOS.

Link to comment
13 hours ago, ich777 said:

This is most certainly caused by a bug in your BIOS, have you yet checked if you are on the latest BIOS version?

 


I deleted nvidia.conf.

I enabled Above 4G Decoding and Resizable BAR Support in the BIOS, but that still did not solve it.

With CSM enabled, Unraid cannot boot.

The error log is growing fast. Is there anything I can do?

Link to comment
22 minutes ago, joykingdom said:

The error log is growing fast. Is there anything I can do?

Are you on the latest BIOS? Can you maybe contact the manufacturer about that? This is something that I can't fix. As said above, this is most certainly a bug in your BIOS.

 

In which PCIe slot do you have the card installed?

 

Have you tried booting in Legacy Mode yet?

 

EDIT: Wait, now that I see it, this is a mobile T400 chip. What card is this exactly?

I'm not 100% sure whether the default Nvidia driver supports mobile chips, and even if it does, it seems that you need a modified driver for this card, because Nvidia is most certainly not selling PCIe GPUs with mobile chips.

Link to comment
On 4/5/2023 at 4:10 PM, ich777 said:


EDIT: Wait, now that I see it, this is a mobile T400 chip, what card is this exactly.

I'm not 100% sure if the default Nvidia driver supports mobile chips and even if it does it seems that you need a modified driver for this card because Nvidia is most certainly not selling PCIe GPUs with mobile chips.

Apart from the persistent error log entries, the Nvidia T400 works fine with the driver installed by this plugin, in both Emby and Plex.

Link to comment
