Fatal_Flaw Posted September 12, 2015 Share Posted September 12, 2015 Over the past couple of weeks, my unraid machine has become unresponsive every 2 to 3 days. It starts by being unable to access the unraid shares. Then shortly after that, the unraid web interface and SSH sessions becomes unresponsive. The odd thing is that the VM running on the cache discs still functions when this happens. When the web interface and SSH are unresponsive, the only way to restart is a hard shutdown. If I get to the web interface before it becomes unresponsive, I can shut down the VM properly. However, trying to stop the array causes it to hang when it says "Sync filesystems". I then have to do a hard shutdown. In the couple of times when I caught it after the shares became inaccessible but before the web interface does, I was able to download a syslog. I've attached them to this post. When I don't get there in time and the web interface is unresponsive, I am unable to get a syslog file because it's cleared on reboot. If anyone knows how to preserve the syslog when this happens, please let me know. filebox-syslog-20150908-1918.zip filebox-syslog-20150910-1944.zip Quote Link to comment
BRiT Posted September 12, 2015 Share Posted September 12, 2015 Post diagnostics. Quote Link to comment
Fatal_Flaw Posted September 12, 2015 Author Share Posted September 12, 2015 I've attached the diagnostics. filebox-diagnostics-20150912-1327.zip Quote Link to comment
Fatal_Flaw Posted September 15, 2015 Author Share Posted September 15, 2015 Hey BRiT (or anyone else), any thoughts on the diagnostics or anything I should try or look at? Thanks Quote Link to comment
m4f1050 Posted September 16, 2015 Share Posted September 16, 2015 Mine works fine, except for emhttp. And I can't kill it and restart it, it ALWAYS gives me segmentation fault. Why can't this be resolved? Why does it have to do "segmentation fault" and not behave like any other program/service? Quote Link to comment
Fatal_Flaw Posted October 3, 2015 Author Share Posted October 3, 2015 I've still been unable to solve this issue. I'm having to hard reboot it every 24-48 hours. I've moved the one virtual hard disk file from my VM off of the data array and onto the cache drive. That didn't change anything. I ran extended SMART tests on all of my drives and they all came back without errors. Does anyone have any ideas what else I could try? At this point I don't know what else to check. Quote Link to comment
Russ Uno Posted October 4, 2015 Share Posted October 4, 2015 Just a thought but have you tried upgrading to v6.1.3? It looks like you are running on v6.1.2. Quote Link to comment
rara1234 Posted October 4, 2015 Share Posted October 4, 2015 i've had this once since moving to an HP Gen8 - of course the causes of your and my problem are just as likely to be different - but i was wondering if anyone had a way to capture the syslog if the system is unresponsive? In my case, i can see via the ILO that the whole OS has stopped responding, so i can't even log in to the console. Quote Link to comment
m4f1050 Posted October 5, 2015 Share Posted October 5, 2015 What Russ Uno is trying to say is replace the bz files to 6.1.3 and give it a try. Quote Link to comment
splnut Posted October 6, 2015 Share Posted October 6, 2015 I was having a similar issue with unraid becoming unreachable randomly also. So far it's been stable without running any virtual machines. Sorry I can't offer anything about your issue, besides maybe leaving the vms shut down for a couple days to see what happens. Quote Link to comment
tonifarres13 Posted October 6, 2015 Share Posted October 6, 2015 I had a similar issue with emhttp crashing randomly. I solved running the VMs with fixed memory (initial memory = max memory). I think there is an issue with the memory ballooning driver in windows vms. Since I changed this configuration the system it's been stable. Quote Link to comment
RobJ Posted October 6, 2015 Share Posted October 6, 2015 I've still been unable to solve this issue. I'm having to hard reboot it every 24-48 hours. I've moved the one virtual hard disk file from my VM off of the data array and onto the cache drive. That didn't change anything. I ran extended SMART tests on all of my drives and they all came back without errors. Does anyone have any ideas what else I could try? At this point I don't know what else to check. You are thinking it's a hardware issue, but it's most likely a software problem. It *could* be hardware, memory or heat, I would run a very long Memtest, and make sure the CPU and bridge chipsets on the motherboard aren't getting too hot. I don't think there's any way the drives could be involved with a hard crash, no matter what went wrong with them. But it's more likely a software issue, especially if you are running VM's or plugins. Quote Link to comment
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.