January 21, 20188 yr Was casually browsing my logs this morning and noticed this error pop up Jan 21 03:57:59 Dirge kernel: mce: [Hardware Error]: Machine check events logged Jan 21 03:57:59 Dirge kernel: [Hardware Error]: Corrected error, no action required. Jan 21 03:57:59 Dirge kernel: [Hardware Error]: CPU:0 (15:2:0) MC4_STATUS[-|CE|MiscV|-|AddrV|-|CECC]: 0x9c1f4850001c017b Jan 21 03:57:59 Dirge kernel: [Hardware Error]: Error Addr: 0x000000060a428784 Jan 21 03:57:59 Dirge kernel: [Hardware Error]: MC4 Error (node 0): L3 data cache ECC error. Jan 21 03:57:59 Dirge kernel: [Hardware Error]: cache level: L3/GEN, tx: GEN, mem-tx: EV CPU temps are within norms so I am a little concerned. Should I be? dirge-diagnostics-20180121-0931.zip Edited January 21, 20188 yr by Darqfallen Diagnostics Attached
January 21, 20188 yr Most likely a bad CPU. It's L3 cache reported an ECC error, which was corrected, but all mce errors should be taken seriously
January 21, 20188 yr Then I would suspect one of: - faulty processor - incompatibility with the dynamic power-control of the current kernel - unclean power from the PSU
January 21, 20188 yr Author Processor has been running for 12 months+ with no issue so far, so I don't think its a faulty processor. Perhaps the PSU has gone bad, I will wait to see if more errors creep up. Since I've only updated to 6.4 four days ago I'm leaning towards: 3 minutes ago, pwm said: - incompatibility with the dynamic power-control of the current kernel I will monitor and keep you guys updated. Edited January 21, 20188 yr by Darqfallen
Archived
This topic is now archived and is closed to further replies.