Skip to content
View in the app

A better way to browse. Learn more.

Unraid

A full-screen app on your home screen with push notifications, badges and more.

To install this app on iOS and iPadOS
  1. Tap the Share icon in Safari
  2. Scroll the menu and tap Add to Home Screen.
  3. Tap Add in the top-right corner.
To install this app on Android
  1. Tap the 3-dot menu (⋮) in the top-right corner of the browser.
  2. Tap Add to Home screen or Install app.
  3. Confirm by tapping Install.

Pass through 2x RTX Titans (with nvlink) to single VM

Featured Replies

I cant seem to find anything regarding NVLink and passthrough to a VM. Is it possible, or is there such a tutorial?

  • Author

here is what I am trying now: (causes a kernel panic when trying to start vm)

 

image.png.92d0b27d4ae6ca2afa74a5003342bba5.png

 

image.png.d1ac0e33ebe8c9c964692c05128d96c8.png

image.png.0011e4bf46c399b2fbccc0779d763d03.png

image.thumb.png.f8d3ce46ccc6e43b536e8ec3e58f44bb.png

image.thumb.png.b29a43a9112142762caac166b4e0339a.png

Edited by TheSkaz

  • Author

Does this mean anything useful in regards to my issue?

 

Sep 17 08:44:57 Tower kernel: vfio-pci 0000:01:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=io+mem:owns=none
Sep 17 08:44:57 Tower kernel: Linux agpgart interface v0.103
Sep 17 08:44:57 Tower kernel: xhci_hcd 0000:01:00.2: remove, state 4
Sep 17 08:44:57 Tower kernel: usb usb2: USB disconnect, device number 1
Sep 17 08:44:57 Tower kernel: xhci_hcd 0000:01:00.2: USB bus 2 deregistered
Sep 17 08:44:57 Tower kernel: xhci_hcd 0000:01:00.2: remove, state 4
Sep 17 08:44:57 Tower kernel: usb usb1: USB disconnect, device number 1
Sep 17 08:44:57 Tower kernel: xhci_hcd 0000:01:00.2: USB bus 1 deregistered
Sep 17 08:44:57 Tower kernel: vfio-pci 0000:50:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=io+mem:owns=none
Sep 17 08:44:57 Tower kernel: xhci_hcd 0000:50:00.2: remove, state 4
Sep 17 08:44:57 Tower kernel: usb usb16: USB disconnect, device number 1
Sep 17 08:44:57 Tower kernel: xhci_hcd 0000:50:00.2: USB bus 16 deregistered
Sep 17 08:44:57 Tower kernel: xhci_hcd 0000:50:00.2: remove, state 4
Sep 17 08:44:57 Tower kernel: usb usb15: USB disconnect, device number 1
Sep 17 08:44:57 Tower kernel: xhci_hcd 0000:50:00.2: USB bus 15 deregistered
Sep 17 08:44:57 Tower kernel: nvidia: loading out-of-tree module taints kernel.
Sep 17 08:44:57 Tower kernel: nvidia: loading out-of-tree module taints kernel.
Sep 17 08:44:57 Tower kernel: nvidia: module license 'NVIDIA' taints kernel.
Sep 17 08:44:57 Tower kernel: nvidia: module license 'NVIDIA' taints kernel.
Sep 17 08:44:57 Tower kernel: Disabling lock debugging due to kernel taint
Sep 17 08:44:57 Tower kernel: nvidia-nvlink: Nvlink Core is being initialized, major device number 247
Sep 17 08:44:57 Tower kernel: vfio-pci 0000:01:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=io+mem:owns=none
Sep 17 08:44:57 Tower kernel: vfio-pci 0000:50:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=io+mem:owns=none
Sep 17 08:44:57 Tower kernel: nvidia 0000:4e:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=none:owns=none
Sep 17 08:44:57 Tower kernel: NVRM: The NVIDIA probe routine was not called for 2 device(s).
Sep 17 08:44:57 Tower kernel: NVRM: This can occur when a driver such as: 
Sep 17 08:44:57 Tower kernel: NVRM: nouveau, rivafb, nvidiafb or rivatv 
Sep 17 08:44:57 Tower kernel: NVRM: was loaded and obtained ownership of the NVIDIA device(s).
Sep 17 08:44:57 Tower kernel: NVRM: Try unloading the conflicting kernel module (and/or
Sep 17 08:44:57 Tower kernel: NVRM: reconfigure your kernel without the conflicting
Sep 17 08:44:57 Tower kernel: NVRM: driver(s)), then try loading the NVIDIA kernel module
Sep 17 08:44:57 Tower kernel: NVRM: again.
Sep 17 08:44:57 Tower kernel: NVRM: loading NVIDIA UNIX x86_64 Kernel Module  440.100  Fri May 29 08:45:51 UTC 2020

Edited by TheSkaz

  • Author

I have the VM up and able to boot with both gpus showing. in the VM logs for the machine, I am getting hundreds of these:

 

2020-09-22T06:21:28.221139Z qemu-system-x86_64: vfio_region_write(0000:01:00.0:region1+0x801b8, 0x0,8) failed: Device or resource busy

 

that is my primary video card for the system and 1 of the 2 gpus for the VM. anything that attempts to use the gpus freezes. 

  • Author

googled the error and found that running:

echo 0 > /sys/class/vtconsole/vtcon0/bind
echo 0 > /sys/class/vtconsole/vtcon1/bind
echo efi-framebuffer.0 > /sys/bus/platform/drivers/efi-framebuffer/unbind

 

works. NVLink seems to work too

  • 1 year later...

I've got the same problem. The two gpu's show up in device manager, but I can't see an option to enable nvlink in nvida's control panel? How did you fix this?

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

Account

Navigation

Search

Search

Configure browser push notifications

Chrome (Android)
  1. Tap the lock icon next to the address bar.
  2. Tap Permissions → Notifications.
  3. Adjust your preference.
Chrome (Desktop)
  1. Click the padlock icon in the address bar.
  2. Select Site settings.
  3. Find Notifications and adjust your preference.