Jurak Posted February 14, 2022 Share Posted February 14, 2022 My server keeps crashing its happening about every other day. When it crashes the only way to get it back is to power off the server and start it back up. I have tried to SSH into it when this happens but I'm unable to connect its basically completely down. I know by restarting the server it clears the logs. So is there a way to have the logs saved to the flash drive while its running to have the logs when it crashes to provide to you guys to look at? Attached is the logs after starting the server back up just in case there is any info in there. I am also connecting a monitor to it to see if it shows anything when this happens. Thanks. heimdall-diagnostics-20220214-1217.zip Quote Link to comment
JorgeB Posted February 14, 2022 Share Posted February 14, 2022 4 minutes ago, Jurak said: So is there a way to have the logs saved to the flash drive while its running to have the logs when it crashes to provide to you guys to look at? Yes, you can enable the syslog server, but before that check this. 1 Quote Link to comment
Jurak Posted February 14, 2022 Author Share Posted February 14, 2022 29 minutes ago, JorgeB said: Yes, you can enable the syslog server, but before that check this. Thanks I have enabled the syslog server. I have also followed the other article you posted. Changed the c states and the power supply settings. The memory was already set to what is listed in the article. If it crashes again, I will post the logs. Thanks again. Quote Link to comment
JorgeB Posted February 15, 2022 Share Posted February 15, 2022 11 hours ago, Jurak said: The memory was already set to what is listed in the article. It wasn't in the diags posted. 1 Quote Link to comment
ChatNoir Posted February 15, 2022 Share Posted February 15, 2022 You have both single and dual rank DIMMs, so dual rank settings probably apply. 1 Quote Link to comment
dalben Posted February 15, 2022 Share Posted February 15, 2022 Try removing the CoreFrequency plugin. I had a similar issue and removing it solved my problems. corefreq.plg - 2021.07.13 (Up to date) 1 Quote Link to comment
Jurak Posted February 15, 2022 Author Share Posted February 15, 2022 8 hours ago, JorgeB said: It wasn't in the diags posted. Per the bios its not set at 3200 any more so yeah lol. 3 hours ago, dalben said: Try removing the CoreFrequency plugin. I had a similar issue and removing it solved my problems. Thanks I have removed that. 4 hours ago, ChatNoir said: You have both single and dual rank DIMMs, so dual rank settings probably apply. Yeah its set at the dual rank speed at least that is what the bios stated last time I was in there. Quote Link to comment
superloopy1 Posted February 15, 2022 Share Posted February 15, 2022 just to say that my server is also hanging moreorless every day, needing reboot, parity etc. Mine seems to be a panic in Samba though, just in case yours is similar, not trying to hijack the thread, i've raised my own. 1 Quote Link to comment
Jurak Posted February 16, 2022 Author Share Posted February 16, 2022 (edited) Ok crashed during the night. Here is the syslog and the last diagnostics the was on the flash drive. I hope there is something in there to help with this. syslog heimdall-diagnostics-20220214-1217.zip also the pic of the screen is attached Edit to add pic Edited February 16, 2022 by Jurak Quote Link to comment
JorgeB Posted February 16, 2022 Share Posted February 16, 2022 Nothing about the crash in the syslog, this usually points to a hardware issue, and not saying that is the problem but RAM is still overclocked. Quote Link to comment
Jurak Posted February 16, 2022 Author Share Posted February 16, 2022 30 minutes ago, JorgeB said: not saying that is the problem but RAM is still overclocked. Damn ok I will check that again. Thanks Quote Link to comment
Jurak Posted February 16, 2022 Author Share Posted February 16, 2022 Doing a memtest. Its still running but it looks like one of ram sticks is going bad. Many errors around the same area. Quote Link to comment
Jurak Posted February 16, 2022 Author Share Posted February 16, 2022 Yep memtest finished. One ram stick is bad with over 25 errors on it. So pulled that stick and booting back up. Its gskill ram and they have a limited lifetime warranty so guess I will start an RMA. Quote Link to comment
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.