Interconnect Errors


I'm getting this error repeated in my syslog every minute or two. Nothing else seems to be affected and the server is rock-solid stable. I also ran an overnight memory test without issue. Any ideas?

 

Jul 15 09:11:51 Gandalf kernel: mce: [Hardware Error]: Machine check events logged
Jul 15 09:11:51 Gandalf mcelog: Running trigger `bus-error-trigger' (reporter: bus)
Jul 15 09:11:51 Gandalf mcelog: CPU 8 on socket 1 received Bus and Interconnect Errors in Other-transaction
Jul 15 09:11:51 Gandalf mcelog: Location: CPU 8 on socket 1
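
For anyone wanting to check whether the reports always point at the same CPU and socket (useful before swapping or testing CPUs one at a time), here is a rough Python sketch. It assumes the mcelog line format shown above; the regex and the `count_bus_errors` helper are illustrative names of my own, not part of mcelog, and other mcelog versions may word the message differently.

```python
import re
from collections import Counter

# Pattern assumed from the mcelog lines quoted in this post;
# it is not an official mcelog format specification.
BUS_ERROR = re.compile(
    r"mcelog: CPU (?P<cpu>\d+) on socket (?P<socket>\d+) received (?P<kind>.+)$"
)


def count_bus_errors(lines):
    """Tally mcelog error reports per (socket, cpu) pair."""
    counts = Counter()
    for line in lines:
        match = BUS_ERROR.search(line)
        if match:
            counts[(int(match["socket"]), int(match["cpu"]))] += 1
    return counts


# The four syslog lines from this post, used as sample input.
sample = [
    "Jul 15 09:11:51 Gandalf kernel: mce: [Hardware Error]: Machine check events logged",
    "Jul 15 09:11:51 Gandalf mcelog: Running trigger `bus-error-trigger' (reporter: bus)",
    "Jul 15 09:11:51 Gandalf mcelog: CPU 8 on socket 1 received Bus and Interconnect Errors in Other-transaction",
    "Jul 15 09:11:51 Gandalf mcelog: Location: CPU 8 on socket 1",
]

for (socket, cpu), n in sorted(count_bus_errors(sample).items()):
    print(f"socket {socket}, CPU {cpu}: {n} report(s)")
```

To run it against a real log, pass an open file object instead of `sample`; the path to the syslog file depends on the system. If every report lands on one socket, that would be consistent with a single suspect CPU, though it does not prove it.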


Diagnostics attached.

Thanks in advance for any insights :)

gandalf-diagnostics-20230715-1026.zip

  • 4 weeks later...
2 hours ago, JorgeB said:

Try looking at the SEL to see if there's more info, but if it's stable I would probably ignore for now, unless you have a spare CPU you can test with, or test with one at a time.

Thanks for the reply, nothing extra in the Enhanced Log. I'm going to upgrade to a pair of E5-2697s in the next month so will see what happens then.

  • 2 months later...
On 8/12/2023 at 9:48 PM, NeoDude said:

Thanks for the reply, nothing extra in the Enhanced Log. I'm going to upgrade to a pair of E5-2697s in the next month so will see what happens then.


Hey NeoDude,

I get a very similar log on my server. I have a pair of E5-2670 v3 but may look to swap them out too if you have had success with your swap?

How are the E5-2697 v2s (12 core / 2.7GHz)? I am looking at a pair of E5-2687W v4. They have a higher base clock (12 core / 3GHz), which suits my use case better.

Edited by Raider_M
  • 1 month later...
On 11/6/2023 at 1:31 AM, Raider_M said:

I get a very similar log on my server. I have a pair of E5-2670 v3 but may look to swap them out too if you have had success with your swap?

No more errors with my new CPUs :)
