Jump to content

Server Detected Hardware Errors


fiore00713

Recommended Posts

Jun  6 04:02:44 unRAID kernel: EDAC MC1: 2 CE memory read error on CPU_SrcID#1_Ha#0_Chan#0_DIMM#0 (channel:0 slot:0 page:0x1538d57 offset:0xe00 grain:32 syndrome:0x0 -  OVERFLOW area:DRAM err_code:0001:0090 socket:1 ha:0 channel_mask:1 rank:1)

Memory errors (and a lot of them)

Link to comment
51 minutes ago, Squid said:

Jun  6 04:02:44 unRAID kernel: EDAC MC1: 2 CE memory read error on CPU_SrcID#1_Ha#0_Chan#0_DIMM#0 (channel:0 slot:0 page:0x1538d57 offset:0xe00 grain:32 syndrome:0x0 -  OVERFLOW area:DRAM err_code:0001:0090 socket:1 ha:0 channel_mask:1 rank:1)

 Memory errors (and a lot of them)

Taking a quick breeze through syslog it looks like it's isolated to Chan#0_DIMM#0, would you agree?

I'll try reseating that stick today and see what happens, worse case I can pickup a new stick.

I appreciate your quick response!

Link to comment
2 hours ago, fiore00713 said:

I upgraded my processors early last week

Since the CPU socket has a fairly direct connection to the memory nowadays, if reseating the DIMM doesn't change anything, I'd swap the DIMM to another socket and see if the errors follow the DIMM or stay with that socket. I think it's possible a pin not making good connection to the CPU could theoretically be at fault, but I'm not a board engineer so I can't be positive.

Link to comment

Archived

This topic is now archived and is closed to further replies.

×
×
  • Create New...