Execut1ve Posted October 13, 2021
My Windows 10 VM has lately started pausing itself for no apparent reason. Diagnostics attached. From some cursory research, it looks like it may be related to drive space, but I have only one drive in my system (aside from the parity drive) and it shows plenty of free space, so I'm not sure how to resolve it. Any help appreciated!
Attachment: remoteunraid-diagnostics-20211013-1051.zip
JorgeB Posted October 13, 2021
Check that sleep/hibernation is disabled in the VM.
Execut1ve (Author) Posted October 13, 2021
Hibernation is turned off, and I believe sleep is as well. However, I can't get the VM to run long enough to check before it pauses.
JorgeB Posted October 13, 2021
Disk1 has enough space, but is the vdisk large enough? You can try expanding it.
Execut1ve (Author) Posted October 13, 2021
I just tried extending the vdisk... I was able to add space, but the VM still pauses. If I try to force it to resume, I get:
    Execution error
    internal error: unable to execute QEMU command 'cont': Resetting the Virtual Machine is required
JorgeB Posted October 14, 2021
Sorry, no more ideas.
ghost82 Posted October 14, 2021
Quote:
    2021-10-13T14:32:15.160406Z qemu-system-x86_64: vfio: Cannot reset device 0000:04:00.0, depends on group 21 which is not owned.
    2021-10-13T14:32:18.265387Z qemu-system-x86_64: vfio: Cannot reset device 0000:04:00.0, depends on group 21 which is not owned.
Quote:
    04:00.0 VGA compatible controller [0300]: NVIDIA Corporation GM206 [GeForce GTX 960] [10de:1401] (rev a1)
        Subsystem: eVga.com. Corp. GM206 [GeForce GTX 960] [3842:2968]
        Kernel driver in use: vfio-pci
        Kernel modules: nvidia_drm, nvidia
    04:00.1 Audio device [0403]: NVIDIA Corporation GM206 High Definition Audio Controller [10de:0fba] (rev a1)
        Subsystem: eVga.com. Corp. Device [3842:2968]
        Kernel driver in use: vfio-pci
Both functions are bound to vfio-pci, but in your XML only 04:00.0 is passed through. The pausing could be related to the GPU not resetting properly. Try replacing this in your XML:
    <hostdev mode='subsystem' type='pci' managed='yes'>
      <driver name='vfio'/>
      <source>
        <address domain='0x0000' bus='0x04' slot='0x00' function='0x0'/>
      </source>
      <address type='pci' domain='0x0000' bus='0x00' slot='0x06' function='0x0'/>
    </hostdev>
With this:
    <hostdev mode='subsystem' type='pci' managed='yes'>
      <driver name='vfio'/>
      <source>
        <address domain='0x0000' bus='0x04' slot='0x00' function='0x0'/>
      </source>
      <address type='pci' domain='0x0000' bus='0x00' slot='0x06' function='0x0' multifunction='on'/>
    </hostdev>
    <hostdev mode='subsystem' type='pci' managed='yes'>
      <driver name='vfio'/>
      <source>
        <address domain='0x0000' bus='0x04' slot='0x00' function='0x1'/>
      </source>
      <address type='pci' domain='0x0000' bus='0x00' slot='0x06' function='0x1'/>
    </hostdev>
Moreover, you have both VNC and GPU passthrough configured. I'm not sure that can be done: some report that it works, others that it doesn't.
I would also delete this:
    <graphics type='vnc' port='-1' autoport='yes' websocket='-1' listen='0.0.0.0' keymap='en-us'>
      <listen type='address' address='0.0.0.0'/>
    </graphics>
    <video>
      <model type='qxl' ram='65536' vram='65536' vgamem='16384' heads='1' primary='yes'/>
      <address type='pci' domain='0x0000' bus='0x00' slot='0x02' function='0x0'/>
    </video>
If you want VNC, install a VNC server inside the Windows OS or use Remote Desktop. Since you have a br0 network, the VM is reachable from the same LAN.
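Editor's sketch: the check ghost82 did by eye (comparing the host's PCI functions against the hostdev entries in the VM XML) can be roughly automated. The snippet below is only an illustration under stated assumptions: the function names, EXAMPLE_XML, and the HOST_FUNCTIONS list are hypothetical, and the host function list is typed in by hand from the lspci output quoted above rather than read from the system.

```python
import xml.etree.ElementTree as ET


def passed_through_functions(domain_xml):
    """Return the set of (domain, bus, slot, function) tuples that the
    libvirt domain XML passes through as PCI hostdevs."""
    root = ET.fromstring(domain_xml)
    found = set()
    for hostdev in root.iter("hostdev"):
        if hostdev.get("type") != "pci":
            continue
        # The host-side address lives under <source>; the sibling <address>
        # directly under <hostdev> is the guest-side address.
        addr = hostdev.find("./source/address")
        if addr is None:
            continue
        found.add(tuple(int(addr.get(key, "0"), 16)
                        for key in ("domain", "bus", "slot", "function")))
    return found


def missing_siblings(domain_xml, host_functions):
    """List host functions that share a slot with a passed-through device
    but are not themselves passed through."""
    passed = passed_through_functions(domain_xml)
    passed_slots = {func[:3] for func in passed}
    return sorted(func for func in host_functions
                  if func[:3] in passed_slots and func not in passed)


# Hypothetical input: a trimmed domain XML with only 04:00.0 passed through.
EXAMPLE_XML = """
<domain type='kvm'>
  <devices>
    <hostdev mode='subsystem' type='pci' managed='yes'>
      <driver name='vfio'/>
      <source>
        <address domain='0x0000' bus='0x04' slot='0x00' function='0x0'/>
      </source>
      <address type='pci' domain='0x0000' bus='0x00' slot='0x06' function='0x0'/>
    </hostdev>
  </devices>
</domain>
"""

# Assumption: copied by hand from lspci (04:00.0 VGA, 04:00.1 audio).
HOST_FUNCTIONS = [(0, 4, 0, 0), (0, 4, 0, 1)]

print(missing_siblings(EXAMPLE_XML, HOST_FUNCTIONS))  # [(0, 4, 0, 1)]
```

A non-empty result only shows that a sibling function (here the GPU's audio device) is not in the XML; whether that is what causes the pausing is a separate question.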
Execut1ve (Author) Posted October 14, 2021
I actually got the VM to work without pausing by adding a USB keyboard and mouse and passing them through to the VM. I have no idea why that would work, but the VM did originally have those items. I had removed them when I physically relocated my server, but now the VM is working after re-adding them.
I typically access the VM via VNC. I usually don't have any issues with it, except for the occasional disappearing mouse cursor.
Execut1ve (Author) Posted October 15, 2021
It seems it was too good to be true... the pausing behavior has returned. Based on the syslog, it does seem to be related to an issue with one of the graphics cards:
    Oct 15 10:12:11 RemoteUnraid kernel: pcieport 0000:00:07.0: AER: Multiple Uncorrected (Fatal) error received: 0000:00:00.0
    Oct 15 10:12:11 RemoteUnraid kernel: vfio-pci 0000:06:00.0: AER: PCIe Bus Error: severity=Uncorrected (Fatal), type=Inaccessible, (Unregistered Agent ID)
    Oct 15 10:12:12 RemoteUnraid kernel: pcieport 0000:00:07.0: AER: Root Port link has been reset
    Oct 15 10:12:12 RemoteUnraid kernel: pcieport 0000:00:07.0: AER: device recovery successful
I'm going to try reseating all the cards and risers and see if that helps.
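Editor's sketch: for anyone wanting to spot this pattern in their own syslog, a small filter can count fatal AER reports per PCI address. This is a rough illustration only. The regular expression is an assumption fitted to the four lines quoted above, not a general parser for kernel AER messages, and the names AER_RE, fatal_aer_counts, and SAMPLE are hypothetical.

```python
import re
from collections import Counter

# Assumed line shape: "... kernel: <driver> <dddd:bb:ss.f>: AER: <message>"
AER_RE = re.compile(
    r"kernel: (?P<driver>\S+) "
    r"(?P<addr>[0-9a-f]{4}:[0-9a-f]{2}:[0-9a-f]{2}\.[0-7]): "
    r"AER: (?P<msg>.*)"
)


def fatal_aer_counts(lines):
    """Count 'Uncorrected (Fatal)' AER messages per reporting PCI address."""
    counts = Counter()
    for line in lines:
        match = AER_RE.search(line)
        if match and "Uncorrected (Fatal)" in match.group("msg"):
            counts[match.group("addr")] += 1
    return dict(counts)


# The four syslog lines from the post above.
SAMPLE = [
    "Oct 15 10:12:11 RemoteUnraid kernel: pcieport 0000:00:07.0: AER: Multiple Uncorrected (Fatal) error received: 0000:00:00.0",
    "Oct 15 10:12:11 RemoteUnraid kernel: vfio-pci 0000:06:00.0: AER: PCIe Bus Error: severity=Uncorrected (Fatal), type=Inaccessible, (Unregistered Agent ID)",
    "Oct 15 10:12:12 RemoteUnraid kernel: pcieport 0000:00:07.0: AER: Root Port link has been reset",
    "Oct 15 10:12:12 RemoteUnraid kernel: pcieport 0000:00:07.0: AER: device recovery successful",
]

print(fatal_aer_counts(SAMPLE))
```

On the sample lines this reports one fatal message each for the root port 0000:00:07.0 and the vfio-pci device 0000:06:00.0; the reset and recovery lines are ignored.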
Execut1ve (Author) Posted October 18, 2021
It looks like reseating everything did the trick. The VM has been running steadily ever since!
velvetforce Posted November 15, 2023
Extending the vdisk size worked for me.