fiore00713 Posted June 9, 2019
Woke up to a sad error from Fix Common Problems this morning stating that my server had detected hardware errors. Full disclosure: I upgraded my processors early last week, though I'm pretty sure I ran Fix Common Problems and it didn't flag anything. It's been a long week, so I could be mistaken. Any assistance would be appreciated.
unraid-diagnostics-20190609-1412.zip
Squid Posted June 9, 2019
Jun 6 04:02:44 unRAID kernel: EDAC MC1: 2 CE memory read error on CPU_SrcID#1_Ha#0_Chan#0_DIMM#0 (channel:0 slot:0 page:0x1538d57 offset:0xe00 grain:32 syndrome:0x0 - OVERFLOW area:DRAM err_code:0001:0090 socket:1 ha:0 channel_mask:1 rank:1)
Memory errors (and a lot of them)
fiore00713 Posted June 9, 2019
51 minutes ago, Squid said:
Jun 6 04:02:44 unRAID kernel: EDAC MC1: 2 CE memory read error on CPU_SrcID#1_Ha#0_Chan#0_DIMM#0 (channel:0 slot:0 page:0x1538d57 offset:0xe00 grain:32 syndrome:0x0 - OVERFLOW area:DRAM err_code:0001:0090 socket:1 ha:0 channel_mask:1 rank:1)
Memory errors (and a lot of them)

Taking a quick look through the syslog, it looks like it's isolated to Chan#0_DIMM#0. Would you agree? I'll try reseating that stick today and see what happens; worst case, I can pick up a new stick. I appreciate your quick response!
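One way to check whether the errors really are isolated to a single DIMM is to tally the EDAC lines in the syslog by DIMM label. The following is a minimal sketch that assumes every EDAC line follows the same layout as the one quoted above; the regular expression and the function name `count_edac_errors` are illustrative assumptions, not part of Unraid or of the kernel's tooling.

```python
import re
from collections import Counter

# Assumed layout, based only on the single syslog line quoted in this thread:
#   "... EDAC MC<n>: <count> <CE|UE> <description> on <dimm label> (...)"
EDAC_RE = re.compile(
    r"EDAC MC(?P<mc>\d+): (?P<count>\d+) (?P<kind>CE|UE) .*? on (?P<label>\S+)"
)


def count_edac_errors(lines):
    """Sum reported error counts per (kind, DIMM label) pair."""
    totals = Counter()
    for line in lines:
        match = EDAC_RE.search(line)
        if match:
            key = (match.group("kind"), match.group("label"))
            totals[key] += int(match.group("count"))
    return totals


# The line quoted in the thread, used here as sample input.
sample = (
    "Jun 6 04:02:44 unRAID kernel: EDAC MC1: 2 CE memory read error on "
    "CPU_SrcID#1_Ha#0_Chan#0_DIMM#0 (channel:0 slot:0 page:0x1538d57 "
    "offset:0xe00 grain:32 syndrome:0x0 - OVERFLOW area:DRAM "
    "err_code:0001:0090 socket:1 ha:0 channel_mask:1 rank:1)"
)

totals = count_edac_errors([sample])
for (kind, label), count in totals.most_common():
    print(f"{kind} {label}: {count}")
```

To run this against a real log, pass the open syslog file from the diagnostics archive as `lines`. If only one label appears in the output, the errors are confined to that DIMM position. Note that the "OVERFLOW" flag in the quoted line suggests the reported counts may understate the true number of errors.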
JonathanM Posted June 9, 2019
2 hours ago, fiore00713 said: I upgraded my processors early last week

Since the CPU socket has a fairly direct connection to the memory nowadays, if reseating the DIMM doesn't change anything, I'd swap the DIMM to another socket and see whether the errors follow the DIMM or stay with that socket. I think a pin not making good contact with the CPU could theoretically be at fault, but I'm not a board engineer, so I can't be positive.
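The swap test described above comes down to a small decision rule, sketched here under stated assumptions. The function name `classify_after_swap` and the destination slot label are hypothetical; only the `CPU_SrcID#1_Ha#0_Chan#0_DIMM#0` label comes from the syslog in this thread.

```python
def classify_after_swap(original_slot, new_slot, erroring_slots_after):
    """Interpret which slots report errors after moving the suspect DIMM.

    original_slot: label of the slot that reported errors before the swap.
    new_slot: label of the slot the suspect DIMM was moved to.
    erroring_slots_after: labels seen in EDAC errors after the swap.
    """
    after = set(erroring_slots_after)
    if not after:
        # Nothing logged yet: reseating may have helped, or more runtime is needed.
        return "none"
    follows_dimm = new_slot in after
    stays_with_slot = original_slot in after
    if follows_dimm and not stays_with_slot:
        return "dimm"  # errors moved with the stick: suspect the DIMM
    if stays_with_slot and not follows_dimm:
        return "slot"  # errors stayed put: suspect the slot, socket pins, or CPU
    return "inconclusive"


old = "CPU_SrcID#1_Ha#0_Chan#0_DIMM#0"  # label from the quoted syslog line
new = "CPU_SrcID#1_Ha#0_Chan#1_DIMM#0"  # hypothetical destination slot

print(classify_after_swap(old, new, [new]))
```

A result of "slot" would point toward the board or the CPU seating rather than the memory stick, which matters here because the processors were replaced shortly before the errors were noticed.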