frodr Posted June 7, 2021 (edited) Hi, I was asked to install mcelog (already installed) and post diagnostics here. I hope someone can shed some light on what is going on. // frode maxx-diagnostics-20210607-1051.zip
frodr Posted June 8, 2021 Syslog says:
06:08:08 Maxx root: mcelog: warning: 8 bytes ignored in each record
Jun 7 06:08:08 Maxx root: mcelog: consider an update
frodr Posted July 3, 2021 I have been trying intensively to solve this problem for a week, but with no luck. Before I throw out the motherboard, I have one question: Is there an Unraid setting that restarts the PC when it reaches a certain temperature?
FreeMan Posted July 4, 2021 Have you notified whoever asked you to post diagnostics that they are up here? Maybe describe the issues you're having in more detail and someone else may be able to take a look. Most modern CPUs will throttle back if they get too hot, and will probably shut the computer down if temps continue to go up. You'd probably need to look at the docs for your motherboard to determine whether it has that feature and where in the BIOS settings it may be. The Parity Check Tuning plugin can be set to pause a parity check or disk rebuild if disk temps get too hot, but it won't shut down the whole server.
itimpi Posted July 4, 2021 6 hours ago, FreeMan said: "The Parity Check Tuning plugin can be set to pause a parity check or disk rebuild if disk temps get too hot, but it won't shut down the whole server." The more recent versions of the plugin DO have an option to shut down the server if disks overheat. There is, though, no option to do this on something like the CPU overheating.
frodr Posted July 4, 2021 Thanks for the feedback. The diagnostics are in the first post, but they only cover the time since the restart. The server is liquid cooled. The highest CPU temperature I have seen is 60C. The max working temp is +90C according to Intel. Asus does not list a max temp for the motherboard; the reboot happens at about 55C. // Frode
frodr Posted July 4, 2021 I removed the Parity Check Tuning plugin just to be on the safe side. No change, it rebooted. I have tested the same load with the chassis open and the fans at full speed. Then there was no reboot, so heat, not load, is causing the reboot. I suspected the power supply, a Corsair AX1600i. I tested a new PSU of the same type, no change. This PSU has an over-temperature protection feature, but it is set at 120C, so it should not be the cause. IPMI/BMC log says: [log excerpt missing] which to me can imply that the cause of the reboot is outside the IPMI/motherboard, and may be the OS or the PSU. It is possible I need a new motherboard. What I really want to avoid is ending up with the same reboot on a new motherboard.
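Since the diagnostics only cover the time after the restart, one way to see what the sensors read just before a reboot is to log temperatures to storage that survives it. The following is a minimal sketch, not a tested Unraid tool. It assumes Python 3 is available on the server, that the board's sensors are exposed through the standard Linux hwmon sysfs files (values in millidegrees Celsius), and that `LOG_PATH` points at persistent storage; the path shown and the function names are hypothetical placeholders.

```python
import glob
import os
import time
from datetime import datetime, timezone

# Standard Linux hwmon layout; which sensors appear depends on loaded drivers.
HWMON_GLOB = "/sys/class/hwmon/hwmon*/temp*_input"
# Hypothetical location: replace with a path that survives a reboot.
LOG_PATH = "/boot/logs/temps.csv"


def read_temps(pattern=HWMON_GLOB):
    """Return {sensor_path: degrees_celsius} for every readable sensor."""
    temps = {}
    for path in sorted(glob.glob(pattern)):
        try:
            with open(path) as f:
                # hwmon reports millidegrees Celsius as an integer.
                temps[path] = int(f.read().strip()) / 1000.0
        except (OSError, ValueError):
            continue  # sensor unreadable or returned non-numeric data
    return temps


def append_sample(log_path, temps, now=None):
    """Append one CSV row per sensor and force it to disk."""
    now = now or datetime.now(timezone.utc)
    with open(log_path, "a") as f:
        for path, celsius in temps.items():
            f.write(f"{now.isoformat()},{path},{celsius:.1f}\n")
        f.flush()
        os.fsync(f.fileno())  # make sure the row is written before a sudden reboot


def main(interval_s=5):
    """Sample all sensors every interval_s seconds until interrupted."""
    while True:
        append_sample(LOG_PATH, read_temps())
        time.sleep(interval_s)

# Call main() to start logging; stop it with Ctrl+C.
```

After the next unexpected reboot, the last rows in the file show which sensor, if any, was climbing just before the restart. The `fsync` call trades some write overhead for a better chance that the final sample reaches the disk.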