Advice needed on potential memory failure


Fix Common Problems alerted me to a previous MCE (machine check event). The log shows these entries repeating for about 5 seconds:

Oct 18 22:34:33 Tower kernel: mce: [Hardware Error]: Machine check events logged
Oct 18 22:34:33 Tower kernel: EDAC sbridge MC1: HANDLING MCE MEMORY ERROR
Oct 18 22:34:33 Tower kernel: EDAC MC1: 1 CE memory read error on CPU_SrcID#1_Ha#0_Chan#3_DIMM#0 (channel:3 slot:0 page:0x5ff304 offset:0xec0 grain:32 syndrome:0x0 -  area:DRAM err_code:0001:0091 socket:1 ha:0 channel_mask:8 rank:0)

The error was corrected and I have not seen it recur. Is this a one-off that only needs attention if it happens again, or should I start looking for new memory? Thanks in advance for any advice you can offer.
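
In case it helps with watching for a repeat, here is a rough sketch of how the corrected-error (CE) lines could be tallied per DIMM label. It assumes the log lines follow the exact format pasted above; the function name `count_ce_by_dimm` and the regular expression are my own illustration, not part of any Unraid or EDAC tooling.

```python
import re
from collections import Counter

# Matches EDAC corrected-error lines in the format quoted above, e.g.
# "EDAC MC1: 1 CE memory read error on CPU_SrcID#1_Ha#0_Chan#3_DIMM#0 (...)".
# The pattern is an assumption based on that single sample line.
CE_LINE = re.compile(
    r"EDAC MC(?P<mc>\d+): (?P<count>\d+) CE memory read error on (?P<label>\S+)"
)


def count_ce_by_dimm(lines):
    """Return a Counter mapping each DIMM label to its total CE count."""
    totals = Counter()
    for line in lines:
        match = CE_LINE.search(line)
        if match:
            totals[match.group("label")] += int(match.group("count"))
    return totals


# The three log lines from the post, used as sample input.
SAMPLE = [
    "Oct 18 22:34:33 Tower kernel: mce: [Hardware Error]: Machine check events logged",
    "Oct 18 22:34:33 Tower kernel: EDAC sbridge MC1: HANDLING MCE MEMORY ERROR",
    "Oct 18 22:34:33 Tower kernel: EDAC MC1: 1 CE memory read error on "
    "CPU_SrcID#1_Ha#0_Chan#3_DIMM#0 (channel:3 slot:0 page:0x5ff304 offset:0xec0 "
    "grain:32 syndrome:0x0 -  area:DRAM err_code:0001:0091 socket:1 ha:0 "
    "channel_mask:8 rank:0)",
]

if __name__ == "__main__":
    for label, total in count_ce_by_dimm(SAMPLE).items():
        print(f"{label}: {total} corrected error(s)")
```

To run it against a real log, pass an open file instead of `SAMPLE`, for example `count_ce_by_dimm(open(path))`, where `path` is wherever your syslog lives. If the same DIMM label keeps accumulating counts over time, that would point at that one module rather than a one-off.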

tower-diagnostics-20211024-1411.zip
