DIMM going bad? Which one?


zybron

Recommended Posts

I'm seeing the following in my log periodically:

 

Sep 29 10:31:51 Tower kernel: mce: [Hardware Error]: Machine check events logged
Sep 29 10:31:51 Tower kernel: EDAC sbridge MC0: HANDLING MCE MEMORY ERROR
Sep 29 10:31:51 Tower kernel: EDAC sbridge MC0: CPU 0: Machine Check Event: 0 Bank 9: 8c000041000800c0
Sep 29 10:31:51 Tower kernel: EDAC sbridge MC0: TSC 12475bcf2a26c 
Sep 29 10:31:51 Tower kernel: EDAC sbridge MC0: ADDR 54c5c4000 
Sep 29 10:31:51 Tower kernel: EDAC sbridge MC0: MISC 90002000200028c 
Sep 29 10:31:51 Tower kernel: EDAC sbridge MC0: PROCESSOR 0:306e4 TIME 1569767511 SOCKET 0 APIC 0
Sep 29 10:31:51 Tower kernel: EDAC MC0: 1 CE memory scrubbing error on CPU_SrcID#0_Ha#0_Chan#0_DIMM#0 (channel:0 slot:0 page:0x54c5c4 offset:0x0 grain:32 syndrome:0x0 -  area:DRAM err_code:0008:00c0 socket:0 ha:0 channel_mask:1 rank:0)

In my system profiler, I have this entry for one of the DIMM modules:

Memory Device	
Total Width:	72 bits
Data Width:	64 bits
Size:	8192 MB
Form Factor:	DIMM
Set:	None
Locator:	P1_DIMMD1
Bank Locator:	Node0_Bank0
Type:	DDR3
Type Detail:	Registered (Buffered)
Speed:	1600 MT/s
Manufacturer:	Kingston
Serial Number:	CF0F9841
Asset Tag:	Dimm9_AssetTag
Part Number:	9965433-180.A
Rank:	1
Configured Memory Speed:	1600 MT/s

Does the Dimm9_AssetTag correspond to the Bank 9 reference in the log?

 

Just in case, I have also attached diagnostics if that is useful.

tower-diagnostics-20190929-1433.zip

Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.