bobbintb Posted April 25, 2017 Share Posted April 25, 2017 One of the CA plugins was warning me about hardware errors I looked in the log and saw this: ErrorWarningSystemArrayLogin Apr 24 21:44:18 Tower root: CPUID Vendor Intel Family 6 Model 60 Apr 24 21:44:18 Tower root: mcelog: Too many trigger children running already Apr 24 21:44:18 Tower root: Hardware event. This is not a software error. Apr 24 21:44:18 Tower root: MCE 5 Apr 24 21:44:18 Tower root: CPU 5 THERMAL EVENT TSC 1051e7f0aca Apr 24 21:44:18 Tower root: TIME 1493091778 Mon Apr 24 21:42:58 2017 Apr 24 21:44:18 Tower root: Processor 5 below trip temperature. Throttling disabled Apr 24 21:44:18 Tower root: STATUS 88020282 MCGSTATUS 0 Apr 24 21:44:18 Tower root: MCGCAP c09 APICID 3 SOCKETID 0 Apr 24 21:44:18 Tower root: CPUID Vendor Intel Family 6 Model 60 Apr 24 21:44:18 Tower root: <27>Apr 24 21:44:18 mcelog: CPU 0 on socket 0 received unknown error Apr 24 21:44:18 Tower root: <27>Apr 24 21:44:18 mcelog: CPU 0 on socket 0 received unknown error Apr 24 21:44:18 Tower root: <27>Apr 24 21:44:18 mcelog: Location: CPU 0 on socket 0 Apr 24 21:44:18 Tower root: <27>Apr 24 21:44:18 mcelog: Location: CPU 0 on socket 0 Apr 24 21:47:57 Tower kernel: CPU1: Package temperature above threshold, cpu clock throttled (total events = 112306) Apr 24 21:47:57 Tower kernel: CPU2: Package temperature above threshold, cpu clock throttled (total events = 112306) Apr 24 21:47:57 Tower kernel: CPU0: Package temperature above threshold, cpu clock throttled (total events = 112306) Apr 24 21:47:57 Tower kernel: CPU0: Package temperature/speed normal Apr 24 21:47:57 Tower kernel: CPU1: Package temperature/speed normal Apr 24 21:47:57 Tower kernel: CPU2: Package temperature/speed normal Apr 24 21:47:57 Tower kernel: CPU3: Package temperature/speed normal Apr 24 21:47:57 Tower kernel: CPU4: Package temperature/speed normal Apr 24 21:47:57 Tower kernel: CPU5: Package temperature/speed normal Apr 24 21:47:58 Tower kernel: CPU6: Package temperature/speed normal Apr 24 21:47:58 Tower kernel: CPU5: Core temperature/speed normal Apr 24 21:47:58 Tower kernel: CPU1: Core temperature/speed normal Apr 24 21:47:58 Tower kernel: CPU7: Package temperature/speed normal Apr 24 21:47:58 Tower kernel: mce_notify_irq: 1 callbacks suppressed Apr 24 21:47:58 Tower kernel: mce: [Hardware Error]: Machine check events logged Apr 24 21:48:04 Tower emhttp: cmd: /usr/local/emhttp/plugins/dynamix.plugin.manager/scripts/plugin install https://raw.github.com/bergware/dynamix/master/unRAIDv6/dynamix.system.temp.plg Apr 24 21:48:05 Tower root: plugin: running: anonymous Apr 24 21:48:05 Tower root: plugin: creating: /boot/config/plugins/dynamix.system.temp/dynamix.system.temp.txz - downloading from URL https://raw.githubusercontent.com/bergware/dynamix/master/archive/dynamix.system.temp.txz Apr 24 21:48:05 Tower root: plugin: checking: /boot/config/plugins/dynamix.system.temp/dynamix.system.temp.txz - MD5 Apr 24 21:48:05 Tower root: plugin: running: /boot/config/plugins/dynamix.system.temp/dynamix.system.temp.txz Apr 24 21:48:05 Tower root: plugin: creating: /usr/sbin/sensors-detect - downloading from URL https://raw.githubusercontent.com/bergware/dynamix/master/archive/sensors-detect Apr 24 21:48:06 Tower root: plugin: setting: /usr/sbin/sensors-detect - mode to 0755 Apr 24 21:48:06 Tower root: plugin: running: anonymous Apr 24 21:48:06 Tower kernel: nct6775: Found NCT6776D/F or compatible chip at 0x2e:0x290 Apr 24 21:52:57 Tower kernel: CPU2: Package temperature above threshold, cpu clock throttled (total events = 196784) Apr 24 21:52:57 Tower kernel: CPU1: Package temperature above threshold, cpu clock throttled (total events = 196784) Apr 24 21:52:57 Tower kernel: CPU0: Package temperature above threshold, cpu clock throttled (total events = 196784) Apr 24 21:52:57 Tower kernel: CPU3: Package temperature above threshold, cpu clock throttled (total events = 196785) Apr 24 21:52:57 Tower kernel: CPU3: Package temperature/speed normal Apr 24 21:52:57 Tower kernel: CPU4: Package temperature/speed normal Apr 24 21:52:57 Tower kernel: CPU5: Package temperature above threshold, cpu clock throttled (total events = 196808) Apr 24 21:52:57 Tower kernel: CPU5: Package temperature/speed normal Apr 24 21:52:58 Tower kernel: CPU6: Package temperature above threshold, cpu clock throttled (total events = 196828) Apr 24 21:52:58 Tower kernel: CPU6: Package temperature/speed normal Apr 24 21:52:58 Tower kernel: CPU5: Core temperature above threshold, cpu clock throttled (total events = 142286) Apr 24 21:52:58 Tower kernel: CPU1: Core temperature above threshold, cpu clock throttled (total events = 142286) Apr 24 21:52:58 Tower kernel: CPU7: Package temperature above threshold, cpu clock throttled (total events = 196842) Apr 24 21:52:58 Tower kernel: mce: [Hardware Error]: Machine check events logged Apr 24 21:52:58 Tower kernel: mce: [Hardware Error]: Machine check events logged Apr 24 21:52:58 Tower kernel: CPU1: Core temperature/speed normal Apr 24 21:52:58 Tower kernel: CPU5: Core temperature/speed normal Apr 24 21:52:58 Tower kernel: CPU7: Package temperature/speed normal Apr 24 21:56:41 Tower sshd[10067]: Accepted password for root from 192.168.1.130 port 53883 ssh2 Apr 24 21:57:57 Tower kernel: CPU2: Package temperature above threshold, cpu clock throttled (total events = 272273) Apr 24 21:57:57 Tower kernel: CPU1: Package temperature above threshold, cpu clock throttled (total events = 272273) Apr 24 21:57:57 Tower kernel: CPU0: Package temperature above threshold, cpu clock throttled (total events = 272273) Apr 24 21:57:57 Tower kernel: CPU2: Package temperature/speed normal Apr 24 21:57:57 Tower kernel: CPU1: Package temperature/speed normal Apr 24 21:57:57 Tower kernel: CPU0: Package temperature/speed normal Apr 24 21:57:57 Tower kernel: CPU3: Package temperature above threshold, cpu clock throttled (total events = 272273) Apr 24 21:57:57 Tower kernel: CPU3: Package temperature/speed normal Apr 24 21:57:57 Tower kernel: CPU4: Package temperature above threshold, cpu clock throttled (total events = 272291) Apr 24 21:57:57 Tower kernel: CPU5: Package temperature above threshold, cpu clock throttled (total events = 272314) Apr 24 21:57:57 Tower kernel: CPU5: Package temperature/speed normal Apr 24 21:57:58 Tower kernel: CPU6: Package temperature/speed normal Apr 24 21:57:58 Tower kernel: CPU1: Core temperature above threshold, cpu clock throttled (total events = 203272) Apr 24 21:57:58 Tower kernel: CPU5: Core temperature above threshold, cpu clock throttled (total events = 203272) Apr 24 21:57:58 Tower kernel: CPU7: Package temperature above threshold, cpu clock throttled (total events = 272339) Apr 24 21:57:58 Tower kernel: mce_notify_irq: 1 callbacks suppressed Apr 24 21:57:58 Tower kernel: mce: [Hardware Error]: Machine check events logged Apr 24 21:57:58 Tower kernel: mce: [Hardware Error]: Machine check events logged Apr 24 21:57:58 Tower kernel: CPU1: Core temperature/speed normal Apr 24 21:57:58 Tower kernel: CPU5: Core temperature/speed normal Apr 24 21:57:58 Tower kernel: CPU7: Package temperature/speed normal Apr 24 22:02:57 Tower kernel: CPU2: Package temperature/speed normal Apr 24 22:02:57 Tower kernel: CPU0: Package temperature/speed normal Apr 24 22:02:57 Tower kernel: CPU1: Package temperature/speed normal Apr 24 22:02:57 Tower kernel: CPU3: Package temperature above threshold, cpu clock throttled (total events = 352796) Apr 24 22:02:57 Tower kernel: CPU3: Package temperature/speed normal Apr 24 22:02:57 Tower kernel: CPU4: Package temperature above threshold, cpu clock throttled (total events = 352804) Apr 24 22:02:57 Tower kernel: CPU4: Package temperature/speed normal Apr 24 22:02:57 Tower kernel: CPU5: Package temperature above threshold, cpu clock throttled (total events = 352814) Apr 24 22:02:58 Tower kernel: CPU6: Package temperature/speed normal Apr 24 22:02:58 Tower kernel: CPU7: Package temperature above threshold, cpu clock throttled (total events = 352843) Apr 24 22:02:58 Tower kernel: CPU7: Package temperature/speed normal I've always gotten those cpu clock threshold errors for some reason, even though the CPU seems fine. It says 39 degrees in the bottom right of the UnRAID webui. But those "mce: [Hardware Error]: Machine check events logged" errors are new. I'm a little more concerned now and I don't know what they mean. Any help is appreciated. Link to comment
Frank1940 Posted April 25, 2017 Share Posted April 25, 2017 Post up your complete diagnostics file. Open up you case, check that all fans are working and the fan, intakes and heatsink cooling fins are clean. Link to comment
Recommended Posts
Archived
This topic is now archived and is closed to further replies.