Jump to content

Fix Common Problems Stating I have hardware errors.


Recommended Posts

Bad memory

Mar 15 11:20:16 PlexServer kernel: EDAC sbridge MC0: HANDLING MCE MEMORY ERROR
Mar 15 11:20:16 PlexServer kernel: EDAC sbridge MC0: CPU 0: Machine Check Event: 0 Bank 9: 8c000047000800c1
Mar 15 11:20:16 PlexServer kernel: EDAC sbridge MC0: TSC c4dba98dd7d33 
Mar 15 11:20:16 PlexServer kernel: EDAC sbridge MC0: ADDR 548773000 
Mar 15 11:20:16 PlexServer kernel: EDAC sbridge MC0: MISC 900080008000e8c 
Mar 15 11:20:16 PlexServer kernel: EDAC sbridge MC0: PROCESSOR 0:206d7 TIME 1584292816 SOCKET 0 APIC 0
Mar 15 11:20:16 PlexServer kernel: EDAC MC0: 1 CE memory scrubbing error on CPU_SrcID#0_Ha#0_Chan#1_DIMM#0 (channel:1 slot:0 page:0x548773 offset:0x0 grain:32 syndrome:0x0 -  area:DRAM err_code:0008:00c1 socket:0 ha:0 channel_mask:2 rank:0)

You system event log may have more information on which stick in particular over the description of channel 1, dimm 0

Link to comment

@Jurak

  • From plexserver-diagnostics.zip ==> logs ==> syslog
Quote

Mar 14 19:36:24 PlexServer kernel: mce: [Hardware Error]: Machine check events logged
Mar 14 19:36:24 PlexServer kernel: EDAC sbridge MC0: HANDLING MCE MEMORY ERROR
Mar 14 19:36:24 PlexServer kernel: EDAC sbridge MC0: CPU 0: Machine Check Event: 0 Bank 5: 8c00004000010091
Mar 14 19:36:24 PlexServer kernel: EDAC sbridge MC0: TSC be6f927362a25 
Mar 14 19:36:24 PlexServer kernel: EDAC sbridge MC0: ADDR 548773c80 
Mar 14 19:36:24 PlexServer kernel: EDAC sbridge MC0: MISC 204050d086 
Mar 14 19:36:24 PlexServer kernel: EDAC sbridge MC0: PROCESSOR 0:206d7 TIME 1584236184 SOCKET 0 APIC 0
Mar 14 19:36:24 PlexServer kernel: EDAC MC0: 1 CE memory read error on CPU_SrcID#0_Ha#0_Chan#1_DIMM#0 (channel:1 slot:0 page:0x548773 offset:0xc80 grain:32 syndrome:0x0 -  area:DRAM err_code:0001:0091 socket:0 ha:0 channel_mask:2 rank:0)

Looks like memory, attach peripherals and run memtest at boot, if you're unfamiliar there is an image of the boot screen here. You can also check the system event logs in the BIOS for more information. Here is a recent discussion on the matter

Link to comment
13 minutes ago, Dissones4U said:

Damn! well I'm getting a litter better at reading these things anyway... 

Any memory error that's explicitly mentioned in a syslog means that its ECC (otherwise it couldn't detect it).  And unless you're either lucky (or very unlucky depending upon your viewpoint), Memtest won't catch the problem as it'll be corrected.  Only if the stick is bad enough that it doesn't catch it will memtest show anything

  • Thanks 1
Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

×
×
  • Create New...