Hardware Failure (mcelog) notice


uaktags

Unraid 6.9.0-beta30
Machinist X99 ZX-DU99D4 motherboard (new board; the old one was an ASMB-935... the error only started with the new X99 board)
Dual Xeon E5-2643 v3
64 GB RAM

 

/var/log# grep -rn mce
dmesg:257:[    0.379352] mce: CPU0: Thermal monitoring enabled (TM1)
dmesg:271:[    0.380274] mce: [Hardware Error]: Machine check events logged
dmesg:272:[    0.380276] mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 20: c800a94000200e0f
dmesg:273:[    0.380278] mce: [Hardware Error]: TSC 0 MISC 800000
dmesg:274:[    0.380280] mce: [Hardware Error]: PROCESSOR 0:306f2 TIME 1605623282 SOCKET 0 APIC 0 microcode 43
syslog:259:Nov 17 09:28:38 Tower kernel: mce: CPU0: Thermal monitoring enabled (TM1)
syslog:273:Nov 17 09:28:38 Tower kernel: mce: [Hardware Error]: Machine check events logged
syslog:274:Nov 17 09:28:38 Tower kernel: mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 20: c800a94000200e0f
syslog:275:Nov 17 09:28:38 Tower kernel: mce: [Hardware Error]: TSC 0 MISC 800000
syslog:276:Nov 17 09:28:38 Tower kernel: mce: [Hardware Error]: PROCESSOR 0:306f2 TIME 1605623282 SOCKET 0 APIC 0 microcode 43
syslog:1887:Nov 17 09:29:36 Tower nerdpack: Installing mcelog-161 package...
syslog:1889:Nov 17 09:29:36 Tower root: Installing mcelog-161 package...
syslog:5144:Nov 17 09:41:14 Tower root: mcelog: warning: 8 bytes ignored in each record
syslog:5145:Nov 17 09:41:14 Tower root: mcelog: consider an update


Apart from this hardware error, everything is working as I'd expect: Docker containers are running, all cores are being utilized, and nothing appears degraded in performance or usage.
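
For anyone who wants to read the raw value on the "Bank 20" line themselves, here is a minimal Python sketch that pulls the flag bits out of c800a94000200e0f. It assumes the value is a standard IA32_MCi_STATUS register and uses the architectural bit positions from the Intel SDM; the function and dictionary names are my own, and it makes no attempt to interpret the bank number or the error code, which are platform-specific.

```python
# Architectural flag bits of IA32_MCi_STATUS (assumed Intel SDM layout).
FLAG_BITS = {
    "VAL": 63,    # the register holds a valid error record
    "OVER": 62,   # an earlier error in this bank was overwritten
    "UC": 61,     # the error was not corrected by hardware
    "EN": 60,     # reporting for this error was enabled
    "MISCV": 59,  # the matching MISC register is valid
    "ADDRV": 58,  # the matching ADDR register is valid
    "PCC": 57,    # processor context may be corrupt
}


def decode_mci_status(status: int) -> dict:
    """Split a 64-bit status value into flags and the two 16-bit code fields."""
    fields = {name: bool((status >> bit) & 1) for name, bit in FLAG_BITS.items()}
    fields["mca_error_code"] = status & 0xFFFF              # bits 15:0
    fields["model_specific_code"] = (status >> 16) & 0xFFFF  # bits 31:16
    return fields


# Value copied from the "Machine Check: 0 Bank 20" line in the log above.
status = int("c800a94000200e0f", 16)
decoded = decode_mci_status(status)

for name, value in decoded.items():
    shown = hex(value) if not isinstance(value, bool) else value
    print(f"{name:20} {shown}")
```

With UC and PCC both clear, the record looks like a corrected error rather than a fatal one, which would fit with the system running normally. OVER being set suggests more than one event was logged in that bank.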

 

tower-diagnostics-20201117-1004.zip

  • 4 months later...

Hi,
I got the same error message with Unraid 6.9.2 (kernel 5.10.28-Unraid).
After searching around a bit, it looks like we can ignore this message,
as stated by aegl at the end of this thread: https://github.com/andikleen/mcelog/issues/1
 

Quote

The message is not serious. It just means that you are running on a kernel that added some fields to the “struct mce” and the version of mcelog you have doesn’t know what they are. If the messages bother you, then build a new mcelog binary from the git sources on kernel.org
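
To illustrate why the "8 bytes ignored in each record" warning is harmless, here is a toy Python sketch of the situation the quote describes. The record layouts are hypothetical and are NOT the real "struct mce"; the names OLD_RECORD, NEW_RECORD and read_records are made up for this example. It only shows the general idea: an older reader parses the fields it knows about and skips the extra bytes a newer writer appended.

```python
import struct

# Hypothetical layouts, not the real struct mce.
# The "old" reader knows two 64-bit fields; the "new" writer appends a third.
OLD_RECORD = struct.Struct("<QQ")   # status, misc
NEW_RECORD = struct.Struct("<QQQ")  # status, misc, one newer field


def read_records(buffer: bytes, record_len: int) -> list:
    """Parse records of record_len bytes, keeping only the fields the old layout knows."""
    ignored = record_len - OLD_RECORD.size
    if ignored > 0:
        print(f"warning: {ignored} bytes ignored in each record")
    records = []
    for offset in range(0, len(buffer), record_len):
        # Read the known prefix of each record; the trailing bytes are skipped.
        records.append(OLD_RECORD.unpack_from(buffer, offset))
    return records


# One record as the newer writer would emit it (third value is arbitrary).
buffer = NEW_RECORD.pack(0xC800A94000200E0F, 0x800000, 0x1234)
records = read_records(buffer, NEW_RECORD.size)
print([tuple(hex(field) for field in record) for record in records])
```

The fields the old reader understands still come through intact; only the newer field is dropped, which matches the quote's point that the message is not serious.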

 

Edited by prune
added unraid version number