Strange WebGui behavior then becomes unusable


Recommended Posts

syslog.zip

My "onprem" unraid server is acting strangely.  Seems every day or two the webgui will start showing strange data and fail to update fields.

  • Strange data: shows 3 of 8 threads pegged at 100%, while htop shows no activity.
  • Fail to load: unattached devices section of Main, docker, shares - all missing data or don't load at all.

If I click around the webgui a few times it generally crashes the gui completely and the webpage won't load anymore ("internal server error", or "Gateway Time-out").  Meanwhile SSH works and SMB shares still respond as normal, except for "diagnostics" or any of the shutdown/restart commands.  Server broadcasts that it is collecting diagnostic data, or shutting down, but nothing actually happens: image.png.2306e50a0ab231075bca8445a1e67492.png

Only way to get the server back to normal temporarily is to hard-reset and face the dreaded parity check.

I was able to manually copy the syslog.txt file (attached).  I'm not running any VMs, just three containers (pihole, zoneminder, plex).

Thanks so much for the help!

 

Hardware is quite old, I can try to get more specific if needed:

i7 3770

32GB RAM

Array: 4 x HDDs, 3 x SSDs (Cache)

1 x Unattached Device (USB data traveler)

 

image.png

Edited by The_N4RF
Link to comment

Your symptoms suggest a flash problem so switching the port may be the solution.

 

Your "system" shares, appdata, domains, system, have files on the array. You probably created dockers/VMs before installing cache so they got created on the array. Best if they are all on cache and set to stay on cache. Your system share specifically is set to get moved to the array. Mover can't move open files so Docker and VM Services would have to be disabled to get them moved.

 

Do you understand the Use cache settings?

Link to comment

Ok, usb flash drive is moved to USB2 port and all of the data for appdata, system, and domains are hosted only on the Cache.  Thanks for the help, we'll see how it goes!  

I did notice this week that if I left my Win10 VM running it seemed to be fine.  But if I stopped or force-stopped the VM I would find the unraid server unusable soon after.  This has only been tested twice though.

Link to comment

Failed again (VM was stopped).  Will try again and leave the VM running.  

-webgui showing several CPUs maxed, but htop is showing idle.

image.thumb.png.f60141f09dcc6873a5584b0cd848afda.png

 

-Unassigned Devices won't load.  (should be a 2.5" USB harddrive here)

image.thumb.png.e303ed2a9b9a81c4d3e727e8a1f817b4.png

 

These are the initial display errors.  Eventually the page will not load at all anymore.

Link to comment

Yup, worked for 2 days with the VM running.  All I did was stop the VM, then 8hrs later I find the unraid server effectively crashed.

image.png.6b17519bed3d23b36fd3f0bf97224d9e.png

Nonsense CPU load reporting.  WebGUI eventually stops responding.  SSH connects, most commands seem to work but will not shutdown/restart.

 

Next step?  It seems to be isolated to the VM.  What changes when I stop the VM?

Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.