Random Crashes - Completely Inaccessible



Attached is the syslog from my box. It looks like the CPU overheated and then some threads may have crashed, but I'd appreciate someone with more experience taking a look and telling me what they think. The crashes seem random: I can get one a day or several, it varies. They do appear to be load related, as the more activity the machine sees, the more likely it is to crash. I recently added the LetsEncrypt docker and started using Nginx to reverse proxy all of my other containers so I don't have to punch a bunch of holes in my firewall for remote access. Before LetsEncrypt, the last docker I added was Plex, which I planned to try out before switching away from raw file shares with a Kodi PC connecting from another room.

 

I'm running 6.5.3 Basic on an i7-3770S CPU with 24 GB of memory and 16 TB across 4 disks, no cache drive. The machine is headless, so I can't provide what was on the display at the time of the event(s). I tried plugging in a monitor over HDMI after the fact, but the machine didn't recognize it and no signal was sent to the monitor. Perhaps I should temporarily keep a monitor on the floor beside it until this is resolved (unless there's enough info in the syslog and diagnostics dump)?

 

I've also attached a diagnostics report taken from my box AFTER restarting it, since it can't be gathered during the crash event. I'm happy to provide any additional information that would help figure out why this keeps happening, so don't hesitate to ask.

syslog-1533257925

unraid-diagnostics-20180803-1158.zip

Link to comment

Am I correct in thinking the throttling is the cause of the crashing? If the thermal events aren't the reason for the crashing I really don't care if it throttles here and there. I recently installed Plex and I'm not surprised to see this given the increased demand on the CPU. 
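One way to see how closely the throttling lines up with the crashes is to pull the throttle messages out of the saved syslog and check how many there are and when the last one was logged. The following is a minimal sketch, not a definitive tool. It assumes the kernel logs throttling in the form `CPU0: Core temperature above threshold, cpu clock throttled`, which should be checked against the actual syslog, and the function name `summarize_throttling` is made up for illustration.

```python
import re
import sys
from collections import Counter

# Assumed message format for Linux kernel thermal throttle events;
# verify it against your own syslog before relying on the results.
THROTTLE_RE = re.compile(
    r"CPU(?P<cpu>\d+): (?P<scope>Core|Package) temperature above threshold"
)


def summarize_throttling(lines):
    """Count throttle messages per CPU and remember the last one seen."""
    per_cpu = Counter()
    last_event = None
    for line in lines:
        match = THROTTLE_RE.search(line)
        if match:
            per_cpu[int(match.group("cpu"))] += 1
            last_event = line.rstrip("\n")
    return per_cpu, last_event


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python throttle_summary.py <path to saved syslog>
    with open(sys.argv[1], errors="replace") as handle:
        counts, last = summarize_throttling(handle)
    for cpu, count in sorted(counts.items()):
        print(f"CPU{cpu}: {count} throttle messages")
    print("Last throttle message:", last)
```

If the last throttle message sits well before the final line of the syslog, the throttling may be a symptom of load rather than the direct cause of the crash. That is only a hint, not proof either way.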

 

I do plan to take the box apart, clean it, and put fresh thermal paste on the CPU, but I have to wait until Monday for the paste to arrive from Amazon (I don't have any local shops). I've also ordered an Evo 850 SSD to use as a cache drive to help alleviate all the parity writes from docker containers.

Link to comment

If the CPU becomes too hot, it will shut down completely. Others may jump in at this point, but I have never heard of CPU paste just going bad unless the seal was disturbed. Most of the time, CPU overheating is caused by clogged fins on the cooler, a defective CPU fan, or air intakes and exhausts plugged with crap.

Link to comment
