Server randomly stops working



Hi guys, I need your help. My flash drive recently died on me, but I managed to recover the config folder and replace the drive.

 

Since then the following has happened 3 times. I don't know if it's related to the flash drive, but I wanted to point out that it was the last change I made.

1) The Unraid GUI is not accessible

2) All dockers and other services become inaccessible

3) Using the CLI directly with a keyboard/mouse attached is very slow too; even just logging in takes ages

4) I tried to run the diagnostics command. It says it is saving the diagnostics, but after more than 30 minutes it was still saying that

5) Another time I tried a normal reboot from the CLI so as not to hard shut down the server. It says it is starting the shutdown/reboot procedure and stays like that

6) The only thing that solves the issue is a hard reboot, and then all is back to normal until it does it again. It happens randomly, 12-24 hours apart

 

The last time it happened was last night. I had just played a full movie through Plex (in Docker) and watched all of it on my TV. A few minutes later I got an email notification that the server was down (I have a downtime checker that points to one of my reverse proxy sites hosted on the server). The time before this it happened at around 3 am local time, and I checked that nothing is scheduled to run at that time, as a first step in figuring out why.

 

Any ideas what I can do? I'm attaching the diagnostics file I ran just now, after the last reboot finished.

 

TIA

alphaserver-diagnostics-20201025-0542.zip


So the server went from lasting a few hours to almost a week with no issues. Earlier this morning (my time), while it was not in use and just running in the background, it happened again. I was notified by the uptime monitor that the server was not available. I went to its screen physically and took a photo of what was on it. It was unresponsive to keyboard/mouse input and I was unable to reach it over the network. I did a reboot and also extracted the syslog, which does not show that anything happened: the last entries are the normal scheduled tasks from early morning and then the restart procedure. I hope the photo of the server's screen helps in some way to solve the issue.
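When a syslog shows nothing at the moment of a hang, the silence itself can at least narrow down when the freeze started. Below is a rough Python sketch, not a tested tool: it assumes the classic "Mon DD HH:MM:SS" timestamp at the start of each line, and the names `parse_ts` and `find_gaps`, the year 2020 and the 5 minute threshold are all made up for illustration.

```python
from datetime import datetime, timedelta


def parse_ts(line, year=2020):
    """Parse the leading 'Mon DD HH:MM:SS' of a syslog line.

    Classic syslog lines carry no year, so one is assumed (2020 here).
    Returns None for lines that do not start with a timestamp.
    """
    try:
        return datetime.strptime(f"{year} {line[:15]}", "%Y %b %d %H:%M:%S")
    except ValueError:
        return None


def find_gaps(lines, min_gap=timedelta(minutes=5), year=2020):
    """Return (last_entry_before, first_entry_after) pairs for every
    silence of at least min_gap between consecutive timestamped lines."""
    gaps = []
    prev = None
    for line in lines:
        ts = parse_ts(line, year)
        if ts is None:
            continue  # skip lines without a parseable timestamp
        if prev is not None and ts - prev >= min_gap:
            gaps.append((prev, ts))
        prev = ts
    return gaps
```

Feeding it the lines of the extracted log (for example `find_gaps(open("syslog-192.168.0.21.log", errors="replace"))`) would list each long silence; the first timestamp of a pair is the last thing logged before the server went quiet.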

IMG_1889.JPG

syslog-192.168.0.21.log

  • 2 weeks later...

So a quick update here. The server was fine for a few days again. Last night (my time) I got a notification that the server was offline, meaning no internet/network connection.

 

The syslog file went crazy, writing entries every second both before and after it happened. The network downtime was around

Nov 10 00:20:00 > Nov 10 00:30:00

(approximate, since notifications are delayed and not instant).

 

I'm attaching this (sorry, huge) syslog file. Any insight, please?
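With a log this large, counting entries per minute around the downtime window is one quick way to see where the flood starts and stops. This is only a sketch under assumptions: it expects the usual "Mon DD HH:MM:SS" prefix on each line, the year 2020 is assumed because syslog lines carry none, and `entries_per_minute` is a hypothetical helper name.

```python
from collections import Counter
from datetime import datetime


def entries_per_minute(lines, start, end, year=2020):
    """Count syslog lines per minute for timestamps in [start, end].

    The year is an assumption, since classic syslog lines omit it.
    Lines without a parseable leading timestamp are ignored.
    """
    counts = Counter()
    for line in lines:
        try:
            ts = datetime.strptime(f"{year} {line[:15]}", "%Y %b %d %H:%M:%S")
        except ValueError:
            continue
        if start <= ts <= end:
            # Truncate to the minute so all entries in that minute share a key
            counts[ts.replace(second=0)] += 1
    return counts
```

Called with `start=datetime(2020, 11, 10, 0, 20)` and `end=datetime(2020, 11, 10, 0, 30)` on the lines of the log, it would give one count per minute of the window. Widening the window by an hour on each side should show whether the once-a-second entries began well before the network dropped.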

 

BTW my machine is a Dell R710 with an HBA, 48GB of RAM and 2 CPUs, which are mostly idle, and the RAM is barely used. I have a few containers running. The 2 VMs are currently shut down as I was testing the waters, and Pi-hole has also been shut down since, so all dockers are running in bridge/host mode and none are on the custom br0 network.

syslog-192.168.0.21.log
