enigma27 Posted November 17, 2019 (edited)

Hi all. Hope someone can help, as I seem to be out of my depth on this one.

I was running a successful Unraid server on an HP MicroServer Gen8 with no problems. I decided to upgrade my server, so I purchased the following:

Ryzen 1700
16 GB RAM
ASUS Prime X370 motherboard (BIOS up to date)
256 GB NVMe (for cache, which I have never had before)
Nvidia GTX 760 (temporary GPU)

I proceeded to build the machine, plugged in the USB key and the 4 drives my array was on in the old machine, booted everything up, and all seemed to be fine. I then set up the cache drive, moved some folders onto it, and left it at that.

This is where the problems started. The server started to randomly freeze, and only a hard reboot would bring it back up. I checked the logs and found some errors, including an error with CPU thread 11. I thought there was a hardware issue, so I chucked a freshly formatted HD in and installed Windows 10 to check for any hardware problems. I spent 5 hours with the machine in Windows 10 benchmarking the CPU and GPU and found no issues whatsoever. I also ran a memtest, and again no issues were found.

Next I started Unraid in safe mode with no plugins loaded and, what do you know, after 2 hours of tinkering there were no crashes, even running 8 Docker containers. So next I decided to reboot into normal mode and delete the plugins, which I managed to do, but before I could reboot, the server crashed again.

So far I have only been able to run the server in safe mode without crashes. I have just rebooted the machine without any plugins to see how long it lasts this time around.
Some of the other errors I have found in the log:

Nov 17 13:33:40 Tower ntpd[1957]: kernel reports TIME_ERROR: 0x41: Clock Unsynchronized

There was also something about an upstream timeout on certain plugins (before I removed them). I also noticed this morning, before I used safe mode and removed the plugins, that a couple of cores were stuck at 100% with overall usage at 26%, which made loading GUI pages really slow.

The only part that isn't new is the power supply, which came from an old machine, though I wouldn't call it old. It's a Corsair TX650W. I know a lot of people say that could be the issue, but there were no problems running in a Windows environment, and I would have thought that would have stressed the system more than Unraid would.

Any ideas how I start to diagnose this issue, as the logs don't seem to show much around the time of the crashes? I have attached some log files from last night.

tower-diagnostics-20191117-1101.zip
tower-diagnostics-20191116-1812.zip

Edited November 17, 2019 by enigma27
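One way to start narrowing down what the log says around a crash is to filter the syslog for suspicious lines in the minutes before the freeze. The following is only a minimal sketch, not an Unraid tool: the keyword list, the 10-minute window, the default year and the function names are all assumptions made for illustration, and it only understands the classic syslog line format seen in the entry quoted above.

```python
import re
from datetime import datetime, timedelta

# Matches the classic syslog prefix, e.g.
# "Nov 17 13:33:40 Tower ntpd[1957]: message text"
LINE_RE = re.compile(r"^(\w{3}\s+\d{1,2} \d{2}:\d{2}:\d{2}) (\S+) ([^:]+): (.*)$")

# Hypothetical keyword list; adjust it to whatever your own log shows.
KEYWORDS = ("error", "timeout", "call trace")


def parse_line(line, year=2019):
    """Return (timestamp, process, message), or None if the line doesn't match."""
    match = LINE_RE.match(line.rstrip("\n"))
    if match is None:
        return None
    stamp, _host, process, message = match.groups()
    # Syslog timestamps carry no year, so one is supplied explicitly (assumption).
    when = datetime.strptime(f"{year} {stamp}", "%Y %b %d %H:%M:%S")
    return when, process.strip(), message


def suspicious_lines(lines, crash_time, window_minutes=10, year=2019):
    """Collect keyword hits logged in the window leading up to crash_time."""
    start = crash_time - timedelta(minutes=window_minutes)
    hits = []
    for line in lines:
        parsed = parse_line(line, year)
        if parsed is None:
            continue  # skip lines that are not in the expected format
        when, process, message = parsed
        in_window = start <= when <= crash_time
        if in_window and any(key in message.lower() for key in KEYWORDS):
            hits.append((when, process, message))
    return hits


if __name__ == "__main__":
    # The one log line quoted in the post, with an assumed crash time shortly after.
    log = [
        "Nov 17 13:33:40 Tower ntpd[1957]: kernel reports TIME_ERROR: "
        "0x41: Clock Unsynchronized",
    ]
    for when, process, message in suspicious_lines(log, datetime(2019, 11, 17, 13, 40)):
        print(when, process, message)
```

To run it against a real log, pass the lines of the syslog text file extracted from a diagnostics zip (the exact file name inside the zip is not given in the post, so check your own archive) together with the approximate time of the freeze.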
enigma27 Posted November 17, 2019

OK, so an update: after deleting all of the plugins and rebooting the server, it has now been up for 1 hr 22 mins with no crash.
John_M Posted November 17, 2019

In the BIOS make sure the Power Supply Idle Control setting is changed to Typical Current Idle, not the default Low Current Idle. It can be tricky to find, so look for Advanced -> AMD CBS -> Power Supply Idle Control.
enigma27 Posted November 17, 2019

1 hour ago, John_M said:
In the BIOS make sure the Power Supply Idle Control setting is changed to Typical Current Idle, not the default Low Current Idle. It can be tricky to find, so look for Advanced -> AMD CBS -> Power Supply Idle Control.

Thanks, I can see that setting and it's set to Auto. Also, it has now been 2 hrs 52 mins without a crash with no plugins installed.
enigma27 Posted November 18, 2019

Hi all. As an update, my server has been running fine for 24 hours with only Community Applications and Fix Common Problems installed, so it's definitely a plugin problem. Thinking it may be core temp not playing well with AMD CPU.
John_M Posted November 18, 2019

21 minutes ago, enigma27 said:
Thinking it may be core temp not playing well with AMD CPU

What do you mean by "core temp"?
enigma27 Posted November 18, 2019

The system temp app you can get to read CPU temps. I found an error at startup saying that it could not find a compatible device, but it was still loading drivers.
John_M Posted November 18, 2019

1 minute ago, enigma27 said:
The system temp app you can get to read CPU temps.

Can you be more specific? Do you mean the Dynamix System Temperature plugin? Or something else?
enigma27 Posted November 18, 2019

System Temperature plugin - yes, this one. I have not loaded it back onto my server and have had no crashes so far.
John_M Posted November 18, 2019

OK. You might find it's better supported when you upgrade to Unraid version 6.8.
Tower_Of_Power Posted December 5, 2019

Been trying to troubleshoot my Ryzen 1700 build as well. Wish you luck. I'm gonna try the Power Supply Idle Control setting and hopefully get good results.