stephack Posted May 29, 2022 I've been troubleshooting various errors for a while. As part of that, I've started fresh with 6.10. I'm still getting occasional MCE errors, but now it doesn't give me the details. I've reinstalled the Fix Common Problems plugin and installed Perl as well as MCE logs, but I still don't get the details I used to get, only an alert that there was an MCE event. Any suggestions would be appreciated.
jmztaylor Posted May 29, 2022 Need diagnostics posted
stephack Posted May 29, 2022 (edited) 1 hour ago, jmztaylor said: "Need diagnostics posted" I've posted them in the past, but that's not what I'm looking for. Normally when I get MCE events in the log, I can run the Fix Common Problems scan, which would then pump the actual error details into the log for troubleshooting. Since going to Unraid 6.10, they are no longer displayed. Is that expected, and if so, how do I view the actual Machine Check errors myself? Edited May 29, 2022 by stephack
stephack Posted May 29, 2022 Hopefully someone like @Squid could provide some insight into this for me. It's been a pretty frustrating and difficult process troubleshooting the various errors I'm receiving, but not being able to see the details now makes it pretty much impossible. Should I revert to 6.9 so at least I can see the detailed errors? What's changed regarding MCE logs in 6.10? Again, I don't need help troubleshooting the errors (yet, anyway). I just want to see what they actually are.
Squid Posted May 29, 2022 You need to post your diagnostics. It's probably happening before mcelog is getting installed.
stephack Posted May 29, 2022 (edited) Here's the diags as well: Edited June 3, 2022 by stephack
stephack Posted May 29, 2022 2 minutes ago, Squid said: "It's probably happening before mcelog is getting installed" I posted the diags above, but I probably have a misconception as to how the MCE events work. The server has been up for some time before the errors occur. Are you saying that the event that pops up in the log could have occurred long before it was reported? The real problem with troubleshooting these errors is that they occur randomly and can take more than 24 hours to recur. I have been making small s/w and h/w changes to find the culprit for a few weeks now. In desperation I upgraded to 6.10 but couldn't see the MCE errors, so I reverted to 6.9.2. I then decided to build a new trial USB with 6.10 and test. So far I've seemingly gotten fewer MCE events, but I still can't see the actual errors to determine whether they are the same ones or different. I'm really frustrated at this point and considering giving up on Unraid. This thread is my last attempt to stick with it.
jmztaylor Posted May 29, 2022 Have you tried running /usr/sbin/mcelog > mcelog.txt and checking out that file?
stephack Posted May 29, 2022 10 minutes ago, jmztaylor said: "/usr/sbin/mcelog > mcelog.txt" Nothing there. Unfortunately there is no /usr/sbin/mcelog, so the command yields an empty file.
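Editor's note: the empty file appears because the shell creates mcelog.txt for the redirect before it discovers that the binary is missing. A minimal sketch of a guard, assuming nothing beyond the path mentioned above (the function name `run_mcelog` and its two optional arguments are hypothetical, not part of Unraid or mcelog):

```shell
#!/bin/bash
# Hypothetical helper: only redirect mcelog output if the binary exists.
# $1 = path to the mcelog binary (defaults to the path quoted in the thread)
# $2 = output file (defaults to mcelog.txt)
run_mcelog() {
    local bin="${1:-/usr/sbin/mcelog}"
    local out="${2:-mcelog.txt}"

    # Without this check, "missing-binary > file" still creates an empty file.
    if [ ! -x "$bin" ]; then
        echo "mcelog not found or not executable at: $bin" >&2
        return 1
    fi

    "$bin" > "$out"
}
```

Calling `run_mcelog` with no arguments would then report the missing binary on stderr instead of silently leaving an empty mcelog.txt behind.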
stephack Posted June 1, 2022 On 5/29/2022 at 4:08 PM, Squid said: "You need to post your diagnostics." @Squid I posted my diags as requested. Any thoughts?
Squid Posted June 2, 2022 Yeah, it's nothing and can be ignored. But a BIOS update, if available, might get rid of it:
May 28 13:52:41 Tower mcelog: failed to prefill DIMM database from DMI data
May 28 13:52:41 Tower mcelog: Kernel does not support page offline interface
stephack Posted June 2, 2022 Thanks Squid. I searched the diagnostics for "mcelog" but it didn't find anything. Next time I'll simply open the syslog.txt file and look for the details there. Appreciate your guidance as usual.
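Editor's note: the lookup described here can be done with a plain grep over the syslog file. A minimal sketch, assuming the log is a text file such as the syslog.txt from the diagnostics (the function name `show_mce_lines`, the default path /var/log/syslog, and the exact search patterns are assumptions for illustration, not something confirmed in this thread):

```shell
#!/bin/bash
# Hypothetical helper: print log lines that mention mcelog or machine check events.
# $1 = path to a syslog text file (default is an assumed live-log location)
show_mce_lines() {
    local logfile="${1:-/var/log/syslog}"

    # -i: case-insensitive, -E: extended regex with alternation
    grep -iE 'mcelog|machine check' "$logfile"
}

# Example call (path is illustrative):
#   show_mce_lines syslog.txt
```

Lines like the two Squid quoted ("Tower mcelog: ...") would match the first pattern; grep exits non-zero when nothing matches, which makes an empty result easy to tell apart from a missing file.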