October 23, 20241 yr The server is running and the array start after some settings change. The problem is the VMs won't start anymore. Here are the details. Last time the server was rebooted is 9 days ago and was following an upgrade to 6.12.13. Everything was working fine. No configuration change or upgrade since then. I tried to access my smb shares tonight and the server was not responding. Web ui was responding. The last time it occured, two custom backup scripts where stuck at "running". They run each morning at 3AM from the "user script" add-on. The problem was solved last time by stopping the scripts and rebooting the server. Rebooted the server from the web ui. The array startup was not finishing after 15 minutes. Stopped docker manager and vms manager and restarted the server from the web ui. The array successfully started. Restarted the vm manager. VM section of the web ui says "Libvirt service failed to start". VM settings the path is /mnt/user/system/libvirt/libvirt.img. Opened a ssh session and have been able to successfully copy the file, so it is present and the content is readable. Procduced the diagnostic kit and created this post. UPDATE : This forum is self-healing, like a visit to the doctor right when we enter the hospital... After posting this, about 45 minutes after the server was rebooted, I wanted to stop the vm manager until an answer is received and found out there was new informations in the "libvirt volume infos" section. Got the the vm section and the message disapeared. There was not reboot between the time I have seen the libvirt error and the moment the vms are now functionnal. I leave the informations in case this would solve an intermitent issue. diag - srvr-virt-diagnostics-20241023-1909.zip Edited October 23, 20241 yr by sbeaudoin
October 24, 20241 yr Community Expert Solution There's data corruption on the backup pool, you should run a scrub.
October 24, 20241 yr Author I think you are right. I asked for a ZFS status and it is taking a realy long time. It is possibly the delay I had between the reboot and the time the problem was solved. I will try to do the scrub, see if I need to use my cold spare and see if there is still a delay. Edited October 24, 20241 yr by sbeaudoin
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.