Machine check error event after server crash


Go to solution Solved by Squid,

Recommended Posts

Hello,

 

My server crashed at about 16:35 yesterday, and when it came back up the fix common problem plugin reported I had a machine check error event. I would appreciate help in finding out what caused it as I assumed whatever it was is the same problem that caused the server to crash. I've attached my diagnostics, and I also have the syslog if that would be helpful.

 

The crash also caused a parity check, which is expected, and several of my docker containers became corrupted and had to be reinstalled. Still working through that part as the parity check continues for about another day yet.

 

Thank you for any help!

punisher-diagnostics-20221120-0957.zip

Link to comment
  • Solution

Top of my head, the mce is because of the microcode update being issued to the CPU and is probably no issue

Nov 19 16:35:03 Punisher kernel: microcode: CPU0: patch_level=0x08701021
Nov 19 16:35:03 Punisher kernel: microcode: CPU1: patch_level=0x08701021
Nov 19 16:35:03 Punisher kernel: microcode: CPU2: patch_level=0x08701021
Nov 19 16:35:03 Punisher kernel: microcode: CPU3: patch_level=0x08701021
Nov 19 16:35:03 Punisher kernel: microcode: CPU4: patch_level=0x08701021
Nov 19 16:35:03 Punisher kernel: microcode: CPU5: patch_level=0x08701021
Nov 19 16:35:03 Punisher kernel: microcode: CPU6: patch_level=0x08701021
Nov 19 16:35:03 Punisher kernel: microcode: CPU7: patch_level=0x08701021
Nov 19 16:35:03 Punisher kernel: microcode: CPU8: patch_level=0x08701021
Nov 19 16:35:03 Punisher kernel: microcode: CPU9: patch_level=0x08701021
Nov 19 16:35:03 Punisher kernel: microcode: CPU10: patch_level=0x08701021
Nov 19 16:35:03 Punisher kernel: microcode: CPU11: patch_level=0x08701021
Nov 19 16:35:03 Punisher kernel: microcode: Microcode Update Driver: v2.2.
Nov 19 16:35:03 Punisher kernel: IPI shorthand broadcast: enabled
Nov 19 16:35:03 Punisher kernel: sched_clock: Marking stable (11946833916, 343487151)->(12438976477, -148655410)
Nov 19 16:35:03 Punisher kernel: mce: [Hardware Error]: Machine check events logged
Nov 19 16:35:03 Punisher kernel: registered taskstats version 1
Nov 19 16:35:03 Punisher kernel: mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 27: faa000000000080b
Nov 19 16:35:03 Punisher kernel: mce: [Hardware Error]: TSC 0 MISC d012000800000000 SYND 5d000000 IPID 1002e00000500 
Nov 19 16:35:03 Punisher kernel: mce: [Hardware Error]: PROCESSOR 2:870f10 TIME 1668897207 SOCKET 0 APIC 0 microcode 8701021

 

  • Like 1
Link to comment

Ok, sounds good! I'll go ahead and set Fix Common Problems to ignore. I also noticed it had an MCE that has come up in other forum threads a few times and the general consensus is that it too can be safely ignored:

 

mcelog: ERROR: AMD Processor family 23: mcelog does not support this processor.  Please use the edac_mce_amd module instead.

 

Thank you for taking a look, much appreciated!

Edited by Walker843
Added extra words for clarification
Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.