November 20, 20223 yr Hello, My server crashed at about 16:35 yesterday, and when it came back up the fix common problem plugin reported I had a machine check error event. I would appreciate help in finding out what caused it as I assumed whatever it was is the same problem that caused the server to crash. I've attached my diagnostics, and I also have the syslog if that would be helpful. The crash also caused a parity check, which is expected, and several of my docker containers became corrupted and had to be reinstalled. Still working through that part as the parity check continues for about another day yet. Thank you for any help! punisher-diagnostics-20221120-0957.zip
November 20, 20223 yr Solution Top of my head, the mce is because of the microcode update being issued to the CPU and is probably no issue Nov 19 16:35:03 Punisher kernel: microcode: CPU0: patch_level=0x08701021 Nov 19 16:35:03 Punisher kernel: microcode: CPU1: patch_level=0x08701021 Nov 19 16:35:03 Punisher kernel: microcode: CPU2: patch_level=0x08701021 Nov 19 16:35:03 Punisher kernel: microcode: CPU3: patch_level=0x08701021 Nov 19 16:35:03 Punisher kernel: microcode: CPU4: patch_level=0x08701021 Nov 19 16:35:03 Punisher kernel: microcode: CPU5: patch_level=0x08701021 Nov 19 16:35:03 Punisher kernel: microcode: CPU6: patch_level=0x08701021 Nov 19 16:35:03 Punisher kernel: microcode: CPU7: patch_level=0x08701021 Nov 19 16:35:03 Punisher kernel: microcode: CPU8: patch_level=0x08701021 Nov 19 16:35:03 Punisher kernel: microcode: CPU9: patch_level=0x08701021 Nov 19 16:35:03 Punisher kernel: microcode: CPU10: patch_level=0x08701021 Nov 19 16:35:03 Punisher kernel: microcode: CPU11: patch_level=0x08701021 Nov 19 16:35:03 Punisher kernel: microcode: Microcode Update Driver: v2.2. Nov 19 16:35:03 Punisher kernel: IPI shorthand broadcast: enabled Nov 19 16:35:03 Punisher kernel: sched_clock: Marking stable (11946833916, 343487151)->(12438976477, -148655410) Nov 19 16:35:03 Punisher kernel: mce: [Hardware Error]: Machine check events logged Nov 19 16:35:03 Punisher kernel: registered taskstats version 1 Nov 19 16:35:03 Punisher kernel: mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 27: faa000000000080b Nov 19 16:35:03 Punisher kernel: mce: [Hardware Error]: TSC 0 MISC d012000800000000 SYND 5d000000 IPID 1002e00000500 Nov 19 16:35:03 Punisher kernel: mce: [Hardware Error]: PROCESSOR 2:870f10 TIME 1668897207 SOCKET 0 APIC 0 microcode 8701021
November 20, 20223 yr Author Ok, sounds good! I'll go ahead and set Fix Common Problems to ignore. I also noticed it had an MCE that has come up in other forum threads a few times and the general consensus is that it too can be safely ignored: mcelog: ERROR: AMD Processor family 23: mcelog does not support this processor. Please use the edac_mce_amd module instead. Thank you for taking a look, much appreciated! Edited November 20, 20223 yr by Walker843 Added extra words for clarification
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.