May 22, 2024 For the last few days I can reboot the server and it works fine for a few hours, but eventually I cannot access the web interface until I do a hard reboot. I tried hooking up a keyboard, monitor, and mouse, but after a lockup the screen doesn't wake. I also replaced the flash drive, because one time during a lockup the screen said it couldn't see the flash drive or that the flash was corrupt. Logs attached. tower-diagnostics-20240522-1357.zip Edited May 22, 2024 by mike_2246
May 22, 2024 Community Expert

Reviewing the diagnostics. Usually, a system freeze like this is caused by bad system RAM. Please run a memory test.

It looks like you are choosing to boot in "GUI" mode with an AMD processor. Warning: do you have an onboard iGPU or a graphics card that is not passed through to a VM? Per the cmdline in the diagnostics: "BOOT_IMAGE=/bzimage initrd=/bzroot,/bzroot-gui". If there is no iGPU, booting the GUI will kill the system, as there is no graphics card for 3D acceleration ...

I do see you have an NVIDIA 1050 per the diagnostics, and quite a few plugins...

May 22 13:55:01 Tower root: --------------------Nvidia driver v550.67 found locally---------------------
May 22 13:55:02 Tower root:
May 22 13:55:02 Tower root: --------------Installation of Nvidia driver v550.67 successful--------------
May 22 13:55:02 Tower root: plugin: nvidia-driver.plg installed

Some BIOS IOMMU settings may need to be looked at, but I am not seeing anything out of the ordinary or any error logged. At the beginning of the log you have IOMMU caps and no DMAR remapping, so you have IOMMU on but are not using the memory remapping. This will be needed later for PCIe passthrough.

I would have you run Unraid headless. On the Unraid OS/software side I am not seeing any faults. I suspect system RAM.

The diagnostics look to be sanitized. Normally, if there is a system crash, we would have a previous syslog, among other logs...

Please run a memory test and/or try with one RAM stick, headless, to rule out hardware issues.
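The boot-mode check described above (looking for "bzroot-gui" in the kernel command line) can be scripted. This is only a minimal sketch, not an official Unraid tool: the function names `initrd_images` and `boots_gui` are made up for illustration, and it assumes GUI mode is identified solely by an initrd entry named `bzroot-gui`, as in the cmdline quoted from the diagnostics.

```python
def initrd_images(cmdline: str) -> list[str]:
    """Return the initrd images listed in a kernel command line string."""
    images = []
    for token in cmdline.split():
        key, sep, value = token.partition("=")
        # Only "initrd=..." tokens matter; images are comma-separated.
        if key == "initrd" and sep:
            images.extend(part for part in value.split(",") if part)
    return images


def boots_gui(cmdline: str) -> bool:
    """Assumption: GUI mode means an initrd image whose file name is bzroot-gui."""
    return any(img.rsplit("/", 1)[-1] == "bzroot-gui"
               for img in initrd_images(cmdline))


if __name__ == "__main__":
    # Command line quoted in the diagnostics above.
    sample = "BOOT_IMAGE=/bzimage initrd=/bzroot,/bzroot-gui"
    print("initrd images:", initrd_images(sample))
    print("GUI boot:", boots_gui(sample))
```

On a running Linux system the same string can be read from /proc/cmdline and passed to `boots_gui`.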
May 22, 2024 Author Can the mem test be started from Unraid, or do I need a bootable diagnostic USB? 99% of the time it runs headless, and it has been fine for a long time. I only hooked up a monitor to see if I could spot any errors, since the web UI wasn't loading. Since replacing the USB boot drive and updating to the latest .10, I believe it has stayed up longer than it had been. I don't think any dockers or VMs are using the 1050; Plex uses the 2200, and I haven't run tdarr in about a year. Edited May 22, 2024 by mike_2246