Big-G

Members
  • Posts

    10
  • Joined

  • Last visited

Recent Profile Visitors

The recent visitors block is disabled and is not being shown to other users.

Big-G's Achievements

Noob

Noob (1/14)

1

Reputation

  1. In the terminal what do you get when you execute "modprobe nvidia"?
  2. Further to the previous comments, once the plugin is installed successfully, leaving it on Latest causes failures with server reboot, however if you set it to the version then it works without issue after reboot:
  3. Also cleared and tried this manually with the amended script while having second GPU vfio bound, and it worked 100%: mkdir -p /tmp/nvdrv && cd /tmp/nvdrv wget https://github.com/ich777/unraid-nvidia-driver/releases/download/5.15.43-Unraid/nvidia-515.43.04-5.15.43-Unraid-1.txz installpkg nvidia-515.43.04-5.15.43-Unraid-1.txz depmod -a modprobe nvidia rm -rf /tmp/nvdrv nvidia-smi Sat May 28 11:00:42 2022 +-----------------------------------------------------------------------------+ | NVIDIA-SMI 515.43.04 Driver Version: 515.43.04 CUDA Version: 11.7 | |-------------------------------+----------------------+----------------------+ | GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC | | Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. | | | | MIG M. | |===============================+======================+======================| | 0 NVIDIA GeForce ... Off | 00000000:01:00.0 Off | N/A | | 0% 44C P0 39W / 180W | 0MiB / 8192MiB | 1% Default | | | | N/A | +-------------------------------+----------------------+----------------------+ +-----------------------------------------------------------------------------+ | Processes: | | GPU GI CI PID Type Process name GPU Memory | | ID ID Usage | |=============================================================================| | No running processes found | +-----------------------------------------------------------------------------+ Previous Issue seems to be between the driver version downloaded and the modprobe, however changing to correct version fixes the issue.
  4. Following the success above, we to CA and installed plugin again:
  5. Amended the script to reflect version deficit: /tmp/nvdrv# mkdir -p /tmp/nvdrv && cd /tmp/nvdrv wget https://github.com/ich777/unraid-nvidia-driver/releases/download/5.15.40-Unraid/nvidia-515.43.04-5.15.43-Unraid-1.txz installpkg nvidia-515.43.04-5.15.43-Unraid-1.txz depmod -a modprobe nvidia rm -rf /tmp/nvdrv nvidia-smi Results: Sat May 28 10:31:33 2022 +-----------------------------------------------------------------------------+ | NVIDIA-SMI 515.43.04 Driver Version: 515.43.04 CUDA Version: 11.7 | |-------------------------------+----------------------+----------------------+ | GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC | | Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. | | | | MIG M. | |===============================+======================+======================| | 0 NVIDIA GeForce ... Off | 00000000:01:00.0 Off | N/A | | 0% 44C P0 39W / 180W | 0MiB / 8192MiB | 0% Default | | | | N/A | +-------------------------------+----------------------+----------------------+ | 1 NVIDIA GeForce ... Off | 00000000:03:00.0 Off | N/A | | 35% 32C P0 N/A / 19W | 0MiB / 2048MiB | 0% Default | | | | N/A | +-------------------------------+----------------------+----------------------+ +-----------------------------------------------------------------------------+ | Processes: | | GPU GI CI PID Type Process name GPU Memory | | ID ID Usage | |=============================================================================| | No running processes found | +-----------------------------------------------------------------------------+
  6. Morning, just upgraded to 6.10.2 and my issue is back again, I repeated the steps I followed previously to get it working however to negative results. I have just tried your suggested steps above, results as follows: /tmp/nvdrv# mkdir -p /tmp/nvdrv && cd /tmp/nvdrv wget https://github.com/ich777/unraid-nvidia-driver/releases/download/5.15.40-Unraid/nvidia-515.43.04-5.15.40-Unraid-1.txz installpkg nvidia-515.43.04-5.15.40-Unraid-1.txz depmod -a modprobe nvidia rm -rf /tmp/nvdrv nvidia-smi --2022-05-28 10:18:06-- https://github.com/ich777/unraid-nvidia-driver/releases/download/5.15.40-Unraid/nvidia-515.43.04-5.15.40-Unraid-1.txz Resolving github.com (github.com)... 140.82.121.3 Connecting to github.com (github.com)|140.82.121.3|:443... connected. HTTP request sent, awaiting response... 302 Found Location: https://objects.githubusercontent.com/github-production-release-asset-2e65be/306724515/91c671a8-ea29-453f-8603-2b55d8950db6?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIAIWNJYAX4CSVEH53A%2F20220528%2Fus-east-1%2Fs3%2Faws4_request&X-Amz-Date=20220528T091807Z&X-Amz-Expires=300&X-Amz-Signature=af4ced2956ac30529332c4a4049c19707e39ae3bc79c3d5116b925b9b047b4cf&X-Amz-SignedHeaders=host&actor_id=0&key_id=0&repo_id=306724515&response-content-disposition=attachment%3B filename%3Dnvidia-515.43.04-5.15.40-Unraid-1.txz&response-content-type=application%2Foctet-stream [following] --2022-05-28 10:18:07-- https://objects.githubusercontent.com/github-production-release-asset-2e65be/306724515/91c671a8-ea29-453f-8603-2b55d8950db6?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIAIWNJYAX4CSVEH53A%2F20220528%2Fus-east-1%2Fs3%2Faws4_request&X-Amz-Date=20220528T091807Z&X-Amz-Expires=300&X-Amz-Signature=af4ced2956ac30529332c4a4049c19707e39ae3bc79c3d5116b925b9b047b4cf&X-Amz-SignedHeaders=host&actor_id=0&key_id=0&repo_id=306724515&response-content-disposition=attachment%3B filename%3Dnvidia-515.43.04-5.15.40-Unraid-1.txz&response-content-type=application%2Foctet-stream Resolving objects.githubusercontent.com (objects.githubusercontent.com)... 185.199.109.133, 185.199.111.133, 185.199.108.133, ... Connecting to objects.githubusercontent.com (objects.githubusercontent.com)|185.199.109.133|:443... connected. HTTP request sent, awaiting response... 200 OK Length: 257129976 (245M) [application/octet-stream] Saving to: ‘nvidia-515.43.04-5.15.40-Unraid-1.txz’ nvidia-515.43.04-5.15.40-Unraid-1.txz 100%[========================================================================================================================================>] 245.22M 63.6MB/s in 3.9s 2022-05-28 10:18:11 (62.7 MB/s) - ‘nvidia-515.43.04-5.15.40-Unraid-1.txz’ saved [257129976/257129976] Verifying package nvidia-515.43.04-5.15.40-Unraid-1.txz. Installing package nvidia-515.43.04-5.15.40-Unraid-1.txz: PACKAGE DESCRIPTION: Package nvidia-515.43.04-5.15.40-Unraid-1.txz installed. modprobe: FATAL: Module nvidia not found in directory /lib/modules/5.15.43-Unraid NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.
  7. Everything is working now, but if this may help solve a potential issue for others, I am willing to give it a try. Will I need to unbind (vfio) my other GPU again or leave it as is? (Before I do the above steps)
  8. Attached tower-diagnostics-20220519-2243.zip
  9. Hi, I believe I have figured out my issue, after 3 attempts (6 reboots) at following the below steps: Remove the plugin from failed installations plugins tab Execute command "rm -rf /boot/config/plugins/nvidia*" (without double quotes) Reboot Pull a fresh copy from the CA App I was consistently met with the screen shot below and never moving beyond this point when installing the plugin. What I then tried was the following: In my server I have multiple Nvidia GPU's one of which was bound (vfio) for passthrough, I unbound this card, and then followed the steps above and the plugin and driver installed without any issue, dockers using acceleration immediately started working however VM did not. I then rebound the GPU (vfio) and rebooted and all dockers and VM's returned to the working state that had existed in 6.9.2. I hope this is helpful, but definitely solved the issue for me on 6.10.0.
  10. Hi, I am having similar issues to the above with a slight difference, my installation seems to hang up on the driver installation, before 6.10.0 this would not take more than a minute give or take, I left this open for more than 20 mins and it did not progress. Steps I have tried to resolve: 1. Reboot Server 2. Remove plugin from failed 3. Executed command "rm -rf /boot/config/plugins/nvidia*" 4. Reboot Server 5. Reinstall plugin from CA 6. Stalled at installing driver 1. Reboot Server 2. Plugin not in failed 3. Execute command "rm -rf /boot/config/plugins/nvidia*" 4. Reboot Server 5. Execute commands: mkdir -p /boot/config/plugins/nvidia-driver echo 'first_installation=true driver_version=latest local_version=none disable_xconfig=false update_check=true' > "/boot/config/plugins/nvidia-driver/settings.cfg" 6. Reinstall plugin from CA 7. Stalled at installing driver GPU's are visible under devices and cant see anything obvious in the log that references an issue, not sure where to go from here now? Please help.