Jump to content

disconcerting hardware error in log


bobbintb

Recommended Posts

One of the CA plugins was warning me about hardware errors I looked in the log and saw this:

ErrorWarningSystemArrayLogin


Apr 24 21:44:18 Tower root: CPUID Vendor Intel Family 6 Model 60
Apr 24 21:44:18 Tower root: mcelog: Too many trigger children running already
Apr 24 21:44:18 Tower root: Hardware event. This is not a software error.
Apr 24 21:44:18 Tower root: MCE 5
Apr 24 21:44:18 Tower root: CPU 5 THERMAL EVENT TSC 1051e7f0aca 
Apr 24 21:44:18 Tower root: TIME 1493091778 Mon Apr 24 21:42:58 2017
Apr 24 21:44:18 Tower root: Processor 5 below trip temperature. Throttling disabled
Apr 24 21:44:18 Tower root: STATUS 88020282 MCGSTATUS 0
Apr 24 21:44:18 Tower root: MCGCAP c09 APICID 3 SOCKETID 0 
Apr 24 21:44:18 Tower root: CPUID Vendor Intel Family 6 Model 60
Apr 24 21:44:18 Tower root: <27>Apr 24 21:44:18 mcelog: CPU 0 on socket 0 received unknown error
Apr 24 21:44:18 Tower root: <27>Apr 24 21:44:18 mcelog: CPU 0 on socket 0 received unknown error
Apr 24 21:44:18 Tower root: <27>Apr 24 21:44:18 mcelog: Location: CPU 0 on socket 0
Apr 24 21:44:18 Tower root: <27>Apr 24 21:44:18 mcelog: Location: CPU 0 on socket 0
Apr 24 21:47:57 Tower kernel: CPU1: Package temperature above threshold, cpu clock throttled (total events = 112306)
Apr 24 21:47:57 Tower kernel: CPU2: Package temperature above threshold, cpu clock throttled (total events = 112306)
Apr 24 21:47:57 Tower kernel: CPU0: Package temperature above threshold, cpu clock throttled (total events = 112306)
Apr 24 21:47:57 Tower kernel: CPU0: Package temperature/speed normal
Apr 24 21:47:57 Tower kernel: CPU1: Package temperature/speed normal
Apr 24 21:47:57 Tower kernel: CPU2: Package temperature/speed normal
Apr 24 21:47:57 Tower kernel: CPU3: Package temperature/speed normal
Apr 24 21:47:57 Tower kernel: CPU4: Package temperature/speed normal
Apr 24 21:47:57 Tower kernel: CPU5: Package temperature/speed normal
Apr 24 21:47:58 Tower kernel: CPU6: Package temperature/speed normal
Apr 24 21:47:58 Tower kernel: CPU5: Core temperature/speed normal
Apr 24 21:47:58 Tower kernel: CPU1: Core temperature/speed normal
Apr 24 21:47:58 Tower kernel: CPU7: Package temperature/speed normal
Apr 24 21:47:58 Tower kernel: mce_notify_irq: 1 callbacks suppressed
Apr 24 21:47:58 Tower kernel: mce: [Hardware Error]: Machine check events logged
Apr 24 21:48:04 Tower emhttp: cmd: /usr/local/emhttp/plugins/dynamix.plugin.manager/scripts/plugin install https://raw.github.com/bergware/dynamix/master/unRAIDv6/dynamix.system.temp.plg
Apr 24 21:48:05 Tower root: plugin: running: anonymous
Apr 24 21:48:05 Tower root: plugin: creating: /boot/config/plugins/dynamix.system.temp/dynamix.system.temp.txz - downloading from URL https://raw.githubusercontent.com/bergware/dynamix/master/archive/dynamix.system.temp.txz
Apr 24 21:48:05 Tower root: plugin: checking: /boot/config/plugins/dynamix.system.temp/dynamix.system.temp.txz - MD5
Apr 24 21:48:05 Tower root: plugin: running: /boot/config/plugins/dynamix.system.temp/dynamix.system.temp.txz
Apr 24 21:48:05 Tower root: plugin: creating: /usr/sbin/sensors-detect - downloading from URL https://raw.githubusercontent.com/bergware/dynamix/master/archive/sensors-detect
Apr 24 21:48:06 Tower root: plugin: setting: /usr/sbin/sensors-detect - mode to 0755
Apr 24 21:48:06 Tower root: plugin: running: anonymous
Apr 24 21:48:06 Tower kernel: nct6775: Found NCT6776D/F or compatible chip at 0x2e:0x290
Apr 24 21:52:57 Tower kernel: CPU2: Package temperature above threshold, cpu clock throttled (total events = 196784)
Apr 24 21:52:57 Tower kernel: CPU1: Package temperature above threshold, cpu clock throttled (total events = 196784)
Apr 24 21:52:57 Tower kernel: CPU0: Package temperature above threshold, cpu clock throttled (total events = 196784)
Apr 24 21:52:57 Tower kernel: CPU3: Package temperature above threshold, cpu clock throttled (total events = 196785)
Apr 24 21:52:57 Tower kernel: CPU3: Package temperature/speed normal
Apr 24 21:52:57 Tower kernel: CPU4: Package temperature/speed normal
Apr 24 21:52:57 Tower kernel: CPU5: Package temperature above threshold, cpu clock throttled (total events = 196808)
Apr 24 21:52:57 Tower kernel: CPU5: Package temperature/speed normal
Apr 24 21:52:58 Tower kernel: CPU6: Package temperature above threshold, cpu clock throttled (total events = 196828)
Apr 24 21:52:58 Tower kernel: CPU6: Package temperature/speed normal
Apr 24 21:52:58 Tower kernel: CPU5: Core temperature above threshold, cpu clock throttled (total events = 142286)
Apr 24 21:52:58 Tower kernel: CPU1: Core temperature above threshold, cpu clock throttled (total events = 142286)
Apr 24 21:52:58 Tower kernel: CPU7: Package temperature above threshold, cpu clock throttled (total events = 196842)
Apr 24 21:52:58 Tower kernel: mce: [Hardware Error]: Machine check events logged
Apr 24 21:52:58 Tower kernel: mce: [Hardware Error]: Machine check events logged
Apr 24 21:52:58 Tower kernel: CPU1: Core temperature/speed normal
Apr 24 21:52:58 Tower kernel: CPU5: Core temperature/speed normal
Apr 24 21:52:58 Tower kernel: CPU7: Package temperature/speed normal
Apr 24 21:56:41 Tower sshd[10067]: Accepted password for root from 192.168.1.130 port 53883 ssh2
Apr 24 21:57:57 Tower kernel: CPU2: Package temperature above threshold, cpu clock throttled (total events = 272273)
Apr 24 21:57:57 Tower kernel: CPU1: Package temperature above threshold, cpu clock throttled (total events = 272273)
Apr 24 21:57:57 Tower kernel: CPU0: Package temperature above threshold, cpu clock throttled (total events = 272273)
Apr 24 21:57:57 Tower kernel: CPU2: Package temperature/speed normal
Apr 24 21:57:57 Tower kernel: CPU1: Package temperature/speed normal
Apr 24 21:57:57 Tower kernel: CPU0: Package temperature/speed normal
Apr 24 21:57:57 Tower kernel: CPU3: Package temperature above threshold, cpu clock throttled (total events = 272273)
Apr 24 21:57:57 Tower kernel: CPU3: Package temperature/speed normal
Apr 24 21:57:57 Tower kernel: CPU4: Package temperature above threshold, cpu clock throttled (total events = 272291)
Apr 24 21:57:57 Tower kernel: CPU5: Package temperature above threshold, cpu clock throttled (total events = 272314)
Apr 24 21:57:57 Tower kernel: CPU5: Package temperature/speed normal
Apr 24 21:57:58 Tower kernel: CPU6: Package temperature/speed normal
Apr 24 21:57:58 Tower kernel: CPU1: Core temperature above threshold, cpu clock throttled (total events = 203272)
Apr 24 21:57:58 Tower kernel: CPU5: Core temperature above threshold, cpu clock throttled (total events = 203272)
Apr 24 21:57:58 Tower kernel: CPU7: Package temperature above threshold, cpu clock throttled (total events = 272339)
Apr 24 21:57:58 Tower kernel: mce_notify_irq: 1 callbacks suppressed
Apr 24 21:57:58 Tower kernel: mce: [Hardware Error]: Machine check events logged
Apr 24 21:57:58 Tower kernel: mce: [Hardware Error]: Machine check events logged
Apr 24 21:57:58 Tower kernel: CPU1: Core temperature/speed normal
Apr 24 21:57:58 Tower kernel: CPU5: Core temperature/speed normal
Apr 24 21:57:58 Tower kernel: CPU7: Package temperature/speed normal
Apr 24 22:02:57 Tower kernel: CPU2: Package temperature/speed normal
Apr 24 22:02:57 Tower kernel: CPU0: Package temperature/speed normal
Apr 24 22:02:57 Tower kernel: CPU1: Package temperature/speed normal
Apr 24 22:02:57 Tower kernel: CPU3: Package temperature above threshold, cpu clock throttled (total events = 352796)
Apr 24 22:02:57 Tower kernel: CPU3: Package temperature/speed normal
Apr 24 22:02:57 Tower kernel: CPU4: Package temperature above threshold, cpu clock throttled (total events = 352804)
Apr 24 22:02:57 Tower kernel: CPU4: Package temperature/speed normal
Apr 24 22:02:57 Tower kernel: CPU5: Package temperature above threshold, cpu clock throttled (total events = 352814)
Apr 24 22:02:58 Tower kernel: CPU6: Package temperature/speed normal
Apr 24 22:02:58 Tower kernel: CPU7: Package temperature above threshold, cpu clock throttled (total events = 352843)
Apr 24 22:02:58 Tower kernel: CPU7: Package temperature/speed normal

I've always gotten those cpu clock threshold errors for some reason, even though the CPU seems fine. It says 39 degrees in the bottom right of the UnRAID webui. But those "mce: [Hardware Error]: Machine check events logged" errors are new. I'm a little more concerned now and I don't know what they mean. Any help is appreciated.

Link to comment

Archived

This topic is now archived and is closed to further replies.

×
×
  • Create New...