T_Matz Posted January 21 Author Share Posted January 21 Sounds good! I'll report back in 2 days when I start plugins Quote Link to comment
T_Matz Posted January 23 Author Share Posted January 23 i have almost half of my plugins started and the server is not having any issues. I am going to let the server run for 2 days and start the remaining plugins 2 at a time until they are all started Quote Link to comment
T_Matz Posted January 24 Author Share Posted January 24 Today was attempting to install another docker and the Webui froze and was unresponsive for 10 mins but unraid connect was available. From unraid connect i stopped the array, and it was able to bring back the webui. tower-diagnostics-20240124-1141.zip Quote Link to comment
T_Matz Posted January 25 Author Share Posted January 25 (edited) Server lasted for some days but it went down unresponsive today. narrowed it down to two plugins: CA Auto Turbo write mode, Dynamix chace directories. Also I had photo prisim indexing my photos. It hung while i did that. syslog tower-diagnostics-20240125-1541.zip Edited January 25 by T_Matz Quote Link to comment
JorgeB Posted January 26 Share Posted January 26 Still nothing relevant logged that I can see. Quote Link to comment
dxglaw Posted January 26 Share Posted January 26 A few months ago, my server also randomly became unresponsive after I upgraded the HW. I spent a lot of time trying starting/stopping dockers, plugins, and VMs. There was no luck. Finally, I tried memory check and saw memory errors during hours' test. After changing the memory, the issue is gone. Quote Link to comment
T_Matz Posted January 26 Author Share Posted January 26 I'll run a memory check again. I ran the memory check in the bios and it came up with nothing Quote Link to comment
T_Matz Posted January 26 Author Share Posted January 26 Can I stress test my CPU through Unraid? I've noticed that when the server has files written to the disks or larger tasks being done, the CPU cores will show high utilization on all cores, even those passed to VMs. I want to see if the CPU is stable as well as an extended mem test. If there isn't a stress test on Unraid, is it a stupid idea to put Windows on an external hard drive and run prime95 to ensure I don't have issues with the CPU? Quote Link to comment
dxglaw Posted January 29 Share Posted January 29 On 1/26/2024 at 10:05 PM, T_Matz said: I'll run a memory check again. I ran the memory check in the bios and it came up with nothing How long did you run the mem test in BIOS? You can use mem test provided in Unraid's boot menu. Usually this test runs for a few hours. Quote Link to comment
T_Matz Posted January 29 Author Share Posted January 29 I ran it for 2 hours I'm going to run a test tomorrow for at least 24 hours. I've been able to recreate a crash now a couple of times. Any massive file transfers, or recently I had photo prisim indexing my photos and every time my cpu cores are almost all maxed and it will crash. I want to run prime 95 as well to test the CPU bios settings. I'm going to boot to a windows thumb drive and run it to see how it does as well Quote Link to comment
T_Matz Posted February 6 Author Share Posted February 6 I downloaded MemTest 86 v10.6 and ran my memory through numerous passes and not a single error. I tested all of the hard drives and NVMe drives and they all pass. Unraid crashed again last night. I had Sonarr getting a show had about 60 different episodes and the system crashed again. I can basically repeat this crash now when ever i want. If i have the system moving a lot of data or indexing things the CPU is all over the place and mostly pinged at 100% utilization until it crashes. I have a spare hard drive from a previous build that I will use to load windows onto and run prime95 on the server. Should I disconnect all the other drives that I use for Unraid? Quote Link to comment
JorgeB Posted February 6 Share Posted February 6 On 1/17/2024 at 4:22 PM, JorgeB said: you can try is to boot the server in safe mode with all docker/VMs disabled, let it run as a basic NAS for a few days, if it still crashes it's likely a hardware problem, if it doesn't start turning on the other services one by one. Not sure if you already did this, if not it's worth a try. Quote Link to comment
T_Matz Posted February 6 Author Share Posted February 6 yes i did this. I booted into safe mode and slowly brought services online. The system will remain stable if i just let it run but if I try to do any intensive task it crashes. For instance i had sonarr que 60 episodes for download and it crashed. I was having photo prisim index photos and the system crashed. had radarr que movies for download and it crashed. Quote Link to comment
T_Matz Posted February 6 Author Share Posted February 6 but if i just let the system run and dont have it do any intensive task and only download 1 movie or show at a time the system will remain stable for days without issue Quote Link to comment
Solution trurl Posted February 6 Solution Share Posted February 6 On 1/29/2024 at 9:38 AM, T_Matz said: every time my cpu cores are almost all maxed and it will crash Are you sure you don't have a cooling problem? Quote Link to comment
T_Matz Posted February 6 Author Share Posted February 6 I dont believe so, but my plan is to plug in an unused hard drive with windows 10 on it and run prime 95 to really test the CPU and cooling to see whats going on with temps and the CPU. It really seem like the only culprit, everything else has been tested and seems to be working correctly. The CPU is a 12700k, cooler is a Noctua NH-U12S. Should i disconnect my other hard drives used for unraid before doing this? I will watch the temps on the CPU and ive never seen it go above 70c even during those max loads. The CPU on the unraid dashboard just shows very unusual behavior during those tasks described above. Normally my cpu sits around 4-8% load with a temp of 28-32c but when i do those tasks all or most of the CPU cores will sit at 80-100% even the ones that I have isolated for VMs. The server becomes sluggish, if you are watching anything on plex at that time will buffer constantly and then the whole server will lock up shortly after. Quote Link to comment
trurl Posted February 6 Share Posted February 6 5 hours ago, T_Matz said: Should i disconnect my other hard drives used for unraid before doing this? Might be a good idea so you don't have any accident with Windows trying to use the disks. Quote Link to comment
T_Matz Posted February 7 Author Share Posted February 7 Sounds good, going to do that over the weekend. Since I'm doing that work I'll repaste the CPU while I'm at it and report back after. Quote Link to comment
T_Matz Posted February 7 Author Share Posted February 7 Quick update, so I went back into the bios after work and went to the EZ mode and noticed "AI overclock" was enabled. I disabled it and the system seemed to run stable but was only at 3600mhz instead of what is normal for 4900-5200mhz. so I decided to triy a photo prisim index and the system did not lock up. So either you were correct that there was a thermal issue and the system froze or the intel turbo was just unstable. I'm still going to repaste and do a prime 95 testing. I will update again later Quote Link to comment
T_Matz Posted February 12 Author Share Posted February 12 I believe the bios "AI Optomized" settings were creating an unsafe overclock. I tested the system with the setting off and have had zero issues. It seems the AI setting in the bios were stating my cooling options were more than what i had, and were boosting the CPU frequencies very high and creating unsafe temps. Quote Link to comment
trurl Posted February 12 Share Posted February 12 You really shouldn't overclock a server anyway. Data is more important than any marginal speed improvements, and I/O is going to be the limiting factor anyway, not the CPU. Quote Link to comment
T_Matz Posted February 12 Author Share Posted February 12 I agree, it think it was a left over setting when i had the system set up as my daily driver and not running as a server. Quote Link to comment
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.