metaldrummer610
Posted October 16, 2019

Hey everyone! I scanned my system with Fix Common Problems and it came back with "Machine Check Events found on your system" :(. I ran mcelog, but it didn't output anything. However, when I grabbed my diagnostics zip, I found more information about the error in my syslog:

Oct 16 16:40:00 Tower root: Fix Common Problems Version 2019.10.13a
Oct 16 16:40:07 Tower root: Fix Common Problems: Error: Machine Check Events detected on your server
Oct 16 16:40:07 Tower root: Hardware event. This is not a software error.
Oct 16 16:40:07 Tower root: MCE 0
Oct 16 16:40:07 Tower root: CPU 1 BANK 8 TSC 1fafd8dea800c
Oct 16 16:40:07 Tower root: TIME 1571221316 Wed Oct 16 06:21:56 2019
Oct 16 16:40:07 Tower root: MCG status:
Oct 16 16:40:07 Tower root: MCi status:
Oct 16 16:40:07 Tower root: Corrected error
Oct 16 16:40:07 Tower root: Error enabled
Oct 16 16:40:07 Tower root: MCA: MEMORY CONTROLLER GEN_CHANNELunspecified_ERR
Oct 16 16:40:07 Tower root: Transaction: Generic undefined request
Oct 16 16:40:07 Tower root: STATUS 900000400009008f MCGSTATUS 0
Oct 16 16:40:07 Tower root: MCGCAP 1000c18 APICID 40 SOCKETID 1
Oct 16 16:40:07 Tower root: MICROCODE 3b
Oct 16 16:40:07 Tower root: CPUID Vendor Intel Family 6 Model 47

I can't tell whether this is saying that one of the CPUs is going bad or that some of the RAM is going bad. Has anyone seen this before?

Another note of interest (and probably related): my server has completely frozen a few times with a kernel panic. I had a feeling it might be a hardware error, and this might confirm my suspicion...

In case it helps, here are the specs of my server: Dell R910 with 4 x 10-core CPUs and 192GB of RAM.

Thanks!

Attachment: tower-diagnostics-20191016-2044.zip
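For anyone who wants to cross-check the mcelog text against the raw STATUS value in the log, here is a minimal Python sketch that unpacks it. It assumes the architectural IA32_MCi_STATUS bit layout from the Intel SDM (VAL, OVER, UC, EN, MISCV, ADDRV, PCC, corrected error count, model-specific code, MCA error code) and the compound "memory controller" error-code format `0000 0000 1MMM CCCC`. The function name `decode_mci_status` and the dictionary keys are made up for illustration and are not part of mcelog or Unraid.

```python
# Hypothetical helper: decodes the architectural fields of an Intel
# IA32_MCi_STATUS value. Not part of mcelog; names are illustrative only.

# MMM field of a memory-controller MCA error code (0000 0000 1MMM CCCC).
MEM_TRANSACTIONS = {
    0b000: "generic undefined request",
    0b001: "memory read",
    0b010: "memory write",
    0b011: "address/command",
    0b100: "memory scrubbing",
}


def decode_mci_status(status: int) -> dict:
    """Split a 64-bit MCi_STATUS value into its architectural fields."""
    mca_code = status & 0xFFFF
    info = {
        "valid": bool((status >> 63) & 1),            # VAL
        "overflow": bool((status >> 62) & 1),         # OVER
        "uncorrected": bool((status >> 61) & 1),      # UC (0 means corrected)
        "enabled": bool((status >> 60) & 1),          # EN
        "misc_valid": bool((status >> 59) & 1),       # MISCV
        "addr_valid": bool((status >> 58) & 1),       # ADDRV
        "context_corrupt": bool((status >> 57) & 1),  # PCC
        # Bits 52:38; only meaningful when MCG_CAP bit 10 (CMCI_P) is set.
        "corrected_count": (status >> 38) & 0x7FFF,
        "model_specific_code": (status >> 16) & 0xFFFF,
        "mca_code": mca_code,
    }
    # Memory-controller errors have the pattern 0000 0000 1MMM CCCC.
    if (mca_code & 0xFF80) == 0x0080:
        transaction = (mca_code >> 4) & 0x7
        channel = mca_code & 0xF
        info["memory_transaction"] = MEM_TRANSACTIONS.get(transaction, "reserved")
        # CCCC == 1111 means the channel is not specified.
        info["memory_channel"] = None if channel == 0xF else channel
    return info


if __name__ == "__main__":
    # STATUS value copied from the syslog excerpt above.
    for key, value in decode_mci_status(0x900000400009008F).items():
        print(f"{key}: {value}")
```

Run against `900000400009008f`, the decoded fields agree with what mcelog printed: the record is valid, enabled, and corrected (UC clear), with a memory-controller error code, a generic undefined request, and no channel specified. ADDRV is also clear in this value, so the record itself carries no memory address.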