Skip to content
View in the app

A better way to browse. Learn more.

Unraid

A full-screen app on your home screen with push notifications, badges and more.

To install this app on iOS and iPadOS
  1. Tap the Share icon in Safari
  2. Scroll the menu and tap Add to Home Screen.
  3. Tap Add in the top-right corner.
To install this app on Android
  1. Tap the 3-dot menu (⋮) in the top-right corner of the browser.
  2. Tap Add to Home screen or Install app.
  3. Confirm by tapping Install.

Machine Check Events hardware problem detected

Featured Replies

Hello, and thanks in advance for any help/advice you can offer. I'll try to provide as much detail as I can here.

 

I was experiencing some weird playback issues with my Plex server, so I restarted the server. Upon a clean restart, I had a disk report that it was unmountable. After some digging, it seemed to be a filesystem issue and I ran through the xfs_repair instructions here. This didn't fix the issue, so I ended up having to remove the disk from the array and rebuilding it.

 

The server went through a data rebuild and completed successfully. After that, the disks all looked fine and the array appeared to be functioning correctly. I started extended SMART tests on my smaller drives, including the disk that had originally presented a problem, and went to bed. I also ran short SMART tests on the larger disks and those seem to all have completed successfully.


When I woke up, the server had gone through what seems to have been a hard reset and was running a parity check. Fix Common Problems reported an error:

Quote

 

Machine Check Events detected on your server

 

Your server has detected hardware errors. You should install mcelog via the NerdPack plugin, post your diagnostics and ask for assistance on the unRaid forums. The output of mcelog (if installed) has been logged

 

 

So I'm posting my diagnostics file here and asking for any help you guys can offer.

 

A quick skim through my syslog and it seems like these are concerning lines:

 

Dec 3 07:42:50 dunbar kernel: mce: [Hardware Error]: Machine check events logged

Dec 3 07:42:50 dunbar kernel: mce: [Hardware Error]: CPU 3: Machine Check: 0 Bank 5: bea0000000000108

Dec 3 07:42:50 dunbar kernel: mce: [Hardware Error]: TSC 0 ADDR 1ffff81074b06 MISC d012000100000000 SYND 4d000000 IPID 500b000000000

Dec 3 07:42:50 dunbar kernel: mce: [Hardware Error]: PROCESSOR 2:800f11 TIME 1638484949 SOCKET 0 APIC 8 microcode 8001138

...

Dec 3 07:54:09 dunbar root: Fix Common Problems: Error: Machine Check Events detected on your server

Dec 3 07:54:09 dunbar root: mcelog: ERROR: AMD Processor family 23: mcelog does not support this processor. Please use the edac_mce_amd module instead.

 

 

A few searches in this forum turned up bad memory issues, so I'll run memtests as soon as the current parity check finishes, but please let me know if there's anything else I should be looking out for.

dunbar-diagnostics-20211203-1003.zip

Edited by dgallaher
cleaned up formatting

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

Account

Navigation

Search

Search

Configure browser push notifications

Chrome (Android)
  1. Tap the lock icon next to the address bar.
  2. Tap Permissions → Notifications.
  3. Adjust your preference.
Chrome (Desktop)
  1. Click the padlock icon in the address bar.
  2. Select Site settings.
  3. Find Notifications and adjust your preference.