armbrust

Members

Joined
March 11, 2008
Last visited
June 22, 2020

Rookie

Current rank (2/14)

Posts

56
Reputation
Neutral

0

Gender
Undisclosed

armbrust started following 6.7.2 Starting VM with passthrough GPU crashes unraid
- July 5, 2019
6.7.2 Starting VM with passthrough GPU crashes unraid

armbrust posted a report in Stable Releases

Upgrading from 6.6.7 to any of 6.7.0, 6.7.1, or 6.7.2 produces the same issue. Everything works correctly except starting a VM that has a GPU passed through: when this VM starts, the system crashes.

I've attached diagnostics from both versions (6.6.7 and 6.7.2), captured just before starting the VM. Nothing was changed in the configuration between runs. Another VM, which has nothing passed through, runs fine in both cases. Also attached is the XML config of the problem VM.

I tailed the syslog in both versions when starting the VM, and they look the same. Both show some sort of DMA fault, but in 6.6.7 the VM works fine.

This is a tail of the syslog when starting the problem VM in 6.7.2:

Jul 5 09:33:23 Tower kernel: vfio-pci 0000:0a:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=io+mem:owns=io+mem
Jul 5 09:33:23 Tower kernel: br0: port 3(vnet2) entered blocking state
Jul 5 09:33:23 Tower kernel: br0: port 3(vnet2) entered disabled state
Jul 5 09:33:23 Tower kernel: device vnet2 entered promiscuous mode
Jul 5 09:33:23 Tower kernel: br0: port 3(vnet2) entered blocking state
Jul 5 09:33:23 Tower kernel: br0: port 3(vnet2) entered forwarding state
Jul 5 09:33:24 Tower avahi-daemon[7313]: Joining mDNS multicast group on interface vnet2.IPv6 with address fe80::fc54:ff:fe13:8859.
Jul 5 09:33:24 Tower avahi-daemon[7313]: New relevant interface vnet2.IPv6 for mDNS.
Jul 5 09:33:24 Tower avahi-daemon[7313]: Registering new address record for fe80::fc54:ff:fe13:8859 on vnet2.*.
Jul 5 09:33:24 Tower kernel: vfio_ecap_init: 0000:0a:00.0 hiding ecap 0x19@0x900
Jul 5 09:33:25 Tower kernel: vfio-pci 0000:00:1a.7: enabling device (0000 -> 0002)
Jul 5 09:33:25 Tower kernel: vfio_cap_init: 0000:00:1a.7 hiding cap 0xa
Jul 5 09:33:28 Tower kernel: vfio-pci 0000:0a:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=io+mem:owns=io+mem
Jul 5 09:33:30 Tower kernel: DMAR: DRHD: handling fault status reg 2
Jul 5 09:33:30 Tower kernel: DMAR: [DMA Read] Request device [00:1a.7] fault addr eb000 [fault reason 06] PTE Read access is not set
Jul 5 09:33:30 Tower nginx: 2019/07/05 09:33:30 [crit] 7479#7479: *2093 connect() to unix:/var/tmp/Letsencrypt.sock failed (2: No such file or directory) while connecting to upstream, client: 192.168.1.101, server: , request: "GET /dockerterminal/Letsencrypt/ws HTTP/1.1", upstream: "http://unix:/var/tmp/Letsencrypt.sock:/ws", host: "tower"

Here is the tail of the syslog on startup of the same VM in 6.6.7 for comparison:

Jul 5 09:51:28 Tower kernel: vfio-pci 0000:0a:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=io+mem:owns=io+mem
Jul 5 09:51:28 Tower kernel: br0: port 3(vnet2) entered blocking state
Jul 5 09:51:28 Tower kernel: br0: port 3(vnet2) entered disabled state
Jul 5 09:51:28 Tower kernel: device vnet2 entered promiscuous mode
Jul 5 09:51:28 Tower kernel: br0: port 3(vnet2) entered blocking state
Jul 5 09:51:28 Tower kernel: br0: port 3(vnet2) entered forwarding state
Jul 5 09:51:29 Tower avahi-daemon[6629]: Joining mDNS multicast group on interface vnet2.IPv6 with address fe80::fc54:ff:fe13:8859.
Jul 5 09:51:29 Tower avahi-daemon[6629]: New relevant interface vnet2.IPv6 for mDNS.
Jul 5 09:51:29 Tower avahi-daemon[6629]: Registering new address record for fe80::fc54:ff:fe13:8859 on vnet2.*.
Jul 5 09:51:29 Tower kernel: vfio_ecap_init: 0000:0a:00.0 hiding ecap 0x19@0x900
Jul 5 09:51:30 Tower kernel: vfio-pci 0000:00:1a.7: enabling device (0000 -> 0002)
Jul 5 09:51:30 Tower kernel: vfio_cap_init: 0000:00:1a.7 hiding cap 0xa
Jul 5 09:51:32 Tower kernel: vfio-pci 0000:0a:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=io+mem:owns=io+mem
Jul 5 09:51:34 Tower kernel: DMAR: DRHD: handling fault status reg 2
Jul 5 09:51:34 Tower kernel: DMAR: [DMA Read] Request device [00:1a.7] fault addr eb000 [fault reason 06] PTE Read access is not set

I've tried with and without "iommu=pt" in the syslinux config. Anybody have any ideas?

Thanks

tower-diagnostics-6.7.2-20190705-1324.zip
tower-diagnostics-6.6.7-20190705-0918.zip
Problem VM Config.xml
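For anyone wanting to confirm that two syslog tails like the ones above really match, here is a minimal Python sketch. It strips the parts that are expected to differ between boots (the timestamp and the process ID in brackets) and reports lines that appear in only one tail. The function names `normalize` and `compare_tails` are made up for this illustration and are not part of any Unraid tooling.

```python
import re

# Leading syslog timestamp such as "Jul 5 09:33:23 ".
TIMESTAMP = re.compile(r"^[A-Z][a-z]{2}\s+\d{1,2}\s+\d{2}:\d{2}:\d{2}\s+")
# Process ID suffix such as the "[7313]" in "avahi-daemon[7313]".
PID = re.compile(r"\[\d+\]")


def normalize(line):
    """Remove the timestamp and mask the PID so lines from different boots compare equal."""
    line = TIMESTAMP.sub("", line.strip())
    return PID.sub("[PID]", line)


def compare_tails(old_lines, new_lines):
    """Return (only_in_old, only_in_new) after normalization, keeping original order."""
    old = [normalize(l) for l in old_lines if l.strip()]
    new = [normalize(l) for l in new_lines if l.strip()]
    only_old = [l for l in old if l not in new]
    only_new = [l for l in new if l not in old]
    return only_old, only_new


if __name__ == "__main__":
    # Two lines per version, copied from the tails above.
    tail_667 = [
        "Jul 5 09:51:29 Tower avahi-daemon[6629]: New relevant interface vnet2.IPv6 for mDNS.",
        "Jul 5 09:51:34 Tower kernel: DMAR: DRHD: handling fault status reg 2",
    ]
    tail_672 = [
        "Jul 5 09:33:24 Tower avahi-daemon[7313]: New relevant interface vnet2.IPv6 for mDNS.",
        "Jul 5 09:33:30 Tower kernel: DMAR: DRHD: handling fault status reg 2",
    ]
    print(compare_tails(tail_667, tail_672))
```

Run against the full tails, this kind of comparison should leave only the unrelated nginx line from the 6.7.2 tail, which is consistent with the observation that the kernel messages look the same in both versions.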
- July 5, 2019
- 5 comments
Unraid OS version 6.7.1 available

armbrust replied to limetech's topic in Announcements

Thanks for the reply, but unfortunately no luck. This is the syslog at the time of VM start:

Jun 25 21:44:17 Tower kernel: vfio-pci 0000:0a:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=io+mem:owns=io+mem
Jun 25 21:44:17 Tower kernel: br0: port 3(vnet2) entered blocking state
Jun 25 21:44:17 Tower kernel: br0: port 3(vnet2) entered disabled state
Jun 25 21:44:17 Tower kernel: device vnet2 entered promiscuous mode
Jun 25 21:44:17 Tower kernel: br0: port 3(vnet2) entered blocking state
Jun 25 21:44:17 Tower kernel: br0: port 3(vnet2) entered forwarding state
Jun 25 21:44:18 Tower kernel: vfio_ecap_init: 0000:0a:00.0 hiding ecap 0x19@0x900
Jun 25 21:44:18 Tower avahi-daemon[7180]: Joining mDNS multicast group on interface vnet2.IPv6 with address fe80::fc54:ff:fe13:8859.
Jun 25 21:44:18 Tower avahi-daemon[7180]: New relevant interface vnet2.IPv6 for mDNS.
Jun 25 21:44:18 Tower avahi-daemon[7180]: Registering new address record for fe80::fc54:ff:fe13:8859 on vnet2.*.
Jun 25 21:44:19 Tower kernel: vfio-pci 0000:00:1a.7: enabling device (0000 -> 0002)
Jun 25 21:44:19 Tower kernel: vfio_cap_init: 0000:00:1a.7 hiding cap 0xa
Jun 25 21:44:22 Tower kernel: vfio-pci 0000:0a:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=io+mem:owns=io+mem
Jun 25 21:44:24 Tower kernel: DMAR: DRHD: handling fault status reg 2
Jun 25 21:44:24 Tower kernel: DMAR: [DMA Read] Request device [00:1a.7] fault addr eb000 [fault reason 06] PTE Read access is not set
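The DMAR line at the end of this log names the device that triggered the fault. As a rough sketch, the Python below pulls the fields out of such a line and checks whether the faulting device is one that vfio-pci also reports in the same log. The regular expressions are written against the exact message layout shown in this post and are an assumption; other kernel versions may format these lines differently. The helper names are hypothetical.

```python
import re

# Layout assumed from the log in this post, e.g.
# "DMAR: [DMA Read] Request device [00:1a.7] fault addr eb000 [fault reason 06] PTE Read access is not set"
DMAR_FAULT = re.compile(
    r"DMAR: \[DMA (?P<op>Read|Write)\] Request device "
    r"\[(?P<device>[0-9a-f]{2}:[0-9a-f]{2}\.[0-7])\] "
    r"fault addr (?P<addr>[0-9a-f]+) "
    r"\[fault reason (?P<reason>\d+)\] (?P<text>.*)$"
)

# "vfio-pci 0000:00:1a.7: ..." -> captures "00:1a.7" (the PCI domain prefix is dropped).
VFIO_DEVICE = re.compile(
    r"vfio-pci (?:[0-9a-f]{4}:)?([0-9a-f]{2}:[0-9a-f]{2}\.[0-7]):"
)


def parse_dmar_fault(line):
    """Return the fault fields as a dict, or None if the line is not a DMAR fault."""
    match = DMAR_FAULT.search(line)
    return match.groupdict() if match else None


def faulting_passthrough_devices(lines):
    """Return the set of devices that both appear under vfio-pci and raise a DMAR fault."""
    vfio = set()
    faults = set()
    for line in lines:
        vfio.update(VFIO_DEVICE.findall(line))
        fault = parse_dmar_fault(line)
        if fault:
            faults.add(fault["device"])
    return vfio & faults


if __name__ == "__main__":
    log = [
        "Jun 25 21:44:19 Tower kernel: vfio-pci 0000:00:1a.7: enabling device (0000 -> 0002)",
        "Jun 25 21:44:22 Tower kernel: vfio-pci 0000:0a:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=io+mem:owns=io+mem",
        "Jun 25 21:44:24 Tower kernel: DMAR: [DMA Read] Request device [00:1a.7] fault addr eb000 [fault reason 06] PTE Read access is not set",
    ]
    print(faulting_passthrough_devices(log))
```

On the log above this picks out 00:1a.7, the device vfio-pci enables just before the fault, rather than the GPU at 0a:00.0. That only narrows down which passed-through device is involved; it does not explain why the same fault is harmless on 6.6.7.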
- June 26, 2019
- 42 replies
Unraid OS version 6.7.1 available

armbrust replied to limetech's topic in Announcements

Hi, I tried upgrading from 6.6.7 and experienced a hard crash when the VMs were started. For a few seconds the GUI was available, but then the system rebooted and came back with the VM manager disabled. When I enabled the VM manager, the system crashed (no web GUI, and it didn't respond to pings) and didn't come back. I rebooted into safe mode, captured a diagnostic file, and reverted to 6.6.7. Back on 6.6.7, all is well. The same thing happened when trying to upgrade to 6.7.0 from 6.6.7. The attached diagnostic file was captured in 6.7.1 safe mode.

Thanks for any advice. I'm hoping it's not that my hardware is too out of date!

Edit: more info: It seems to be a problem with one particular VM, which has a GPU and a USB controller passed through. When this VM starts, the system dies. Here are the lines recorded in the syslog when I started the VM and the system froze:

Jun 25 14:21:40 Tower kernel: vfio-pci 0000:0a:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=io+mem:owns=io+mem
Jun 25 14:21:40 Tower kernel: br0: port 2(vnet0) entered blocking state
Jun 25 14:21:40 Tower kernel: br0: port 2(vnet0) entered disabled state
Jun 25 14:21:40 Tower kernel: device vnet0 entered promiscuous mode
Jun 25 14:21:40 Tower kernel: br0: port 2(vnet0) entered blocking state
Jun 25 14:21:40 Tower kernel: br0: port 2(vnet0) entered forwarding state
Jun 25 14:21:42 Tower kernel: vfio_ecap_init: 0000:0a:00.0 hiding ecap 0x19@0x900
Jun 25 14:21:42 Tower avahi-daemon[7190]: Joining mDNS multicast group on interface vnet0.IPv6 with address fe80::fc54:ff:feb3:33ee.
Jun 25 14:21:42 Tower avahi-daemon[7190]: New relevant interface vnet0.IPv6 for mDNS.
Jun 25 14:21:42 Tower avahi-daemon[7190]: Registering new address record for fe80::fc54:ff:feb3:33ee on vnet0.*.
Jun 25 14:21:42 Tower kernel: vfio-pci 0000:00:1a.7: enabling device (0000 -> 0002)
Jun 25 14:21:42 Tower kernel: vfio_cap_init: 0000:00:1a.7 hiding cap 0xa
Jun 25 14:21:45 Tower kernel: vfio-pci 0000:0a:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=io+mem:owns=io+mem
Jun 25 14:21:47 Tower kernel: DMAR: DRHD: handling fault status reg 2
Jun 25 14:21:47 Tower kernel: DMAR: [DMA Read] Request device [00:1a.7] fault addr eb000 [fault reason 06] PTE Read access is not set

tower-diagnostics-20190625-1740.zip
- June 25, 2019
- 42 replies
