uaktags
Posted November 17, 2020

Unraid 6.9.0 Beta30
Machinist x99 zx-du99d4 mobo (new mobo, old one was an ASMB-935...error started with only the new x99)
dual 2643v3
64GB ram

/var/log# grep -rn mce
dmesg:257:[ 0.379352] mce: CPU0: Thermal monitoring enabled (TM1)
dmesg:271:[ 0.380274] mce: [Hardware Error]: Machine check events logged
dmesg:272:[ 0.380276] mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 20: c800a94000200e0f
dmesg:273:[ 0.380278] mce: [Hardware Error]: TSC 0 MISC 800000
dmesg:274:[ 0.380280] mce: [Hardware Error]: PROCESSOR 0:306f2 TIME 1605623282 SOCKET 0 APIC 0 microcode 43
syslog:259:Nov 17 09:28:38 Tower kernel: mce: CPU0: Thermal monitoring enabled (TM1)
syslog:273:Nov 17 09:28:38 Tower kernel: mce: [Hardware Error]: Machine check events logged
syslog:274:Nov 17 09:28:38 Tower kernel: mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 20: c800a94000200e0f
syslog:275:Nov 17 09:28:38 Tower kernel: mce: [Hardware Error]: TSC 0 MISC 800000
syslog:276:Nov 17 09:28:38 Tower kernel: mce: [Hardware Error]: PROCESSOR 0:306f2 TIME 1605623282 SOCKET 0 APIC 0 microcode 43
syslog:1887:Nov 17 09:29:36 Tower nerdpack: Installing mcelog-161 package...
syslog:1889:Nov 17 09:29:36 Tower root: Installing mcelog-161 package...
syslog:5144:Nov 17 09:41:14 Tower root: mcelog: warning: 8 bytes ignored in each record
syslog:5145:Nov 17 09:41:14 Tower root: mcelog: consider an update

Outside of this HWError, everything else is working as I'd expect it. Dockers are running, all cores are being utilized, nothing appears degraded in perf/usage.

tower-diagnostics-20201117-1004.zip
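For anyone wanting to read the raw status value in the "Machine Check: 0 Bank 20" line, here is a minimal sketch that splits it into the flag bits and error-code fields. It assumes the IA32_MCi_STATUS layout as described in Intel's machine-check architecture documentation (VAL at bit 63 down to PCC at bit 57, MCA error code in bits 15:0, model-specific code in bits 31:16); the function name `decode_mci_status` is made up for illustration, and the bit positions should be checked against the manual for your CPU.

```python
# Assumed flag bit positions in IA32_MCi_STATUS (verify against the Intel SDM).
FLAG_BITS = {
    "VAL": 63,    # register contains a valid error
    "OVER": 62,   # an earlier error was overwritten
    "UC": 61,     # error was NOT corrected by hardware
    "EN": 60,     # error reporting was enabled for this error
    "MISCV": 59,  # the MISC register holds valid data
    "ADDRV": 58,  # the ADDR register holds valid data
    "PCC": 57,    # processor context may be corrupt
}


def decode_mci_status(status):
    """Split a 64-bit machine-check status value into flags and code fields."""
    flags = {name: bool((status >> bit) & 1) for name, bit in FLAG_BITS.items()}
    return {
        "flags": flags,
        "mca_error_code": status & 0xFFFF,
        "model_specific_code": (status >> 16) & 0xFFFF,
    }


# Status value copied from the log lines in the post above.
decoded = decode_mci_status(0xC800A94000200E0F)
for name, is_set in decoded["flags"].items():
    print(f"{name:5s} = {int(is_set)}")
print(f"MCA error code      = {decoded['mca_error_code']:#06x}")
print(f"model-specific code = {decoded['model_specific_code']:#06x}")
```

Under those assumptions the value from the log has VAL, OVER and MISCV set while UC and PCC are clear, which would fit a hardware-corrected event and a system that otherwise keeps running normally. This sketch does not interpret the error code itself; that is what mcelog is for.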
uaktags
Posted November 19, 2020

Guess I'll just update that I haven't seen the error since posting.
uaktags
Posted November 26, 2020

Nope, seems to have popped back up again. Strange.
prune
Posted April 15, 2021 (edited)

Hi, I got the same error message with Unraid 6.9.2 (5.10.28-Unraid). After searching around a bit, it looks like we can ignore this message, as stated by aegl at the end of this thread: https://github.com/andikleen/mcelog/issues/1

Quote:
The message is not serious. It just means that you are running on a kernel that added some fields to the “struct mce” and the version of mcelog you have doesn’t know what they are. If the messages bother you, then build a new mcelog binary from the git sources on kernel.org

Edited April 15, 2021 by prune: added unraid version number
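To illustrate why a reader built against an older record layout can safely skip the extra bytes, here is a rough sketch. This is not mcelog's actual code: the two-field layout, the 24-byte record length and the name `parse_records` are all hypothetical, chosen only so that the difference comes out to 8 bytes like in the warning.

```python
import struct

# Hypothetical layout known to the older tool: two little-endian 64-bit fields.
KNOWN_FORMAT = "<QQ"
KNOWN_SIZE = struct.calcsize(KNOWN_FORMAT)  # 16 bytes


def parse_records(buf, record_len):
    """Parse fixed-size records, skipping trailing bytes the tool does not know."""
    if record_len < KNOWN_SIZE:
        raise ValueError("record shorter than the known layout")
    ignored = record_len - KNOWN_SIZE
    records = []
    # Step through the buffer one full record at a time; only the known
    # prefix of each record is unpacked, the rest is left untouched.
    for offset in range(0, len(buf) - record_len + 1, record_len):
        records.append(struct.unpack_from(KNOWN_FORMAT, buf, offset))
    return records, ignored


# A made-up "newer kernel" record: the two known fields plus one extra 8-byte field.
sample = struct.pack("<QQQ", 0xC800A94000200E0F, 0x800000, 0) * 2
records, ignored = parse_records(sample, record_len=24)
if ignored:
    print(f"warning: {ignored} bytes ignored in each record")
print(records)
```

The known fields still decode correctly because the new fields were appended at the end of each record, so the only thing lost is whatever the newer kernel stored in those extra bytes.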