June 9, 2019
Woke up to a sad error from Fix Common Problems this morning stating that my server had detected hardware errors. Full disclosure: I upgraded my processors early last week. I'm pretty sure I ran Fix Common Problems afterward and it didn't report any trouble, but it's been a long week and I could be mistaken. Any assistance would be appreciated.
unraid-diagnostics-20190609-1412.zip
June 9, 2019
Jun 6 04:02:44 unRAID kernel: EDAC MC1: 2 CE memory read error on CPU_SrcID#1_Ha#0_Chan#0_DIMM#0 (channel:0 slot:0 page:0x1538d57 offset:0xe00 grain:32 syndrome:0x0 - OVERFLOW area:DRAM err_code:0001:0090 socket:1 ha:0 channel_mask:1 rank:1)
Memory errors (and a lot of them)
Edited June 9, 2019 by Squid
June 9, 2019 Author
51 minutes ago, Squid said:
Jun 6 04:02:44 unRAID kernel: EDAC MC1: 2 CE memory read error on CPU_SrcID#1_Ha#0_Chan#0_DIMM#0 (channel:0 slot:0 page:0x1538d57 offset:0xe00 grain:32 syndrome:0x0 - OVERFLOW area:DRAM err_code:0001:0090 socket:1 ha:0 channel_mask:1 rank:1)
Memory errors (and a lot of them)
Taking a quick breeze through the syslog, it looks like the errors are isolated to Chan#0_DIMM#0. Would you agree? I'll try reseating that stick today and see what happens; worst case, I can pick up a new stick. I appreciate your quick response!
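If anyone wants to check the "isolated to one DIMM" guess without eyeballing the whole log, here is a rough sketch that tallies corrected-error (CE) counts per DIMM label. It assumes every EDAC line looks like the one quoted above; the function name `tally_ce_errors` and the regex are made up for illustration and are not part of Unraid or Fix Common Problems.

```python
import re
from collections import Counter

# Assumed line shape, taken from the syslog line quoted in this thread:
#   "... EDAC MC1: 2 CE memory read error on CPU_SrcID#1_Ha#0_Chan#0_DIMM#0 (channel:0 ..."
# Captures: memory controller, CE count, DIMM label.
EDAC_CE = re.compile(r"EDAC (MC\d+): (\d+) CE .*? on (\S+) \(")


def tally_ce_errors(lines):
    """Sum the reported CE counts per (controller, DIMM label) pair."""
    counts = Counter()
    for line in lines:
        match = EDAC_CE.search(line)
        if match:
            controller = match.group(1)
            reported = int(match.group(2))
            dimm = match.group(3)
            counts[(controller, dimm)] += reported
    return counts


if __name__ == "__main__":
    sample = (
        "Jun 6 04:02:44 unRAID kernel: EDAC MC1: 2 CE memory read error on "
        "CPU_SrcID#1_Ha#0_Chan#0_DIMM#0 (channel:0 slot:0 page:0x1538d57 "
        "offset:0xe00 grain:32 syndrome:0x0 - OVERFLOW area:DRAM "
        "err_code:0001:0090 socket:1 ha:0 channel_mask:1 rank:1)"
    )
    # Replace [sample] with the lines of your own syslog file.
    for (controller, dimm), total in tally_ce_errors([sample]).most_common():
        print(controller, dimm, total)
```

If only one DIMM label shows up in the output, that supports the single-stick theory. Note that lines marked OVERFLOW may under-report the real error count, so treat the totals as a lower bound.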
June 9, 2019
2 hours ago, fiore00713 said:
I upgraded my processors early last week
Since the CPU socket has a fairly direct connection to the memory nowadays, if reseating the DIMM doesn't change anything, I'd move the DIMM to another DIMM slot and see whether the errors follow the DIMM or stay with the original slot. I think it's possible that a pin not making a good connection to the CPU could theoretically be at fault, but I'm not a board engineer, so I can't be positive.