Wurztha Posted August 20, 2020 Share Posted August 20, 2020 (edited) I have Fix Common Problems installed and it is telling me "Your server has detected hardware errors. You should install mcelog via the NerdPack plugin, post your diagnostics and ask for assistance on the unRaid forums. The output of mcelog (if installed) has been logged" I have installed mcelog from NerdPack and tried to see if I could find the issue myself but when I look in what I think is the correct place I see "mcelog: ERROR: AMD Processor family 23: mcelog does not support this processor. Please use the edac_mce_amd module instead." edac_mce_amd isn't in NerdPack as far as I can see and I searched the CA store with no results there either. I have attached my diagnostic zip if someone could please point me in the right direction, thanks! EDIT: Found these errors in logs: Aug 20 07:58:19 3950X kernel: mce: [Hardware Error]: Machine check events logged Aug 20 07:58:19 3950X kernel: mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 5: bea0000000000108 Aug 20 07:58:19 3950X kernel: mce: [Hardware Error]: TSC 0 ADDR 1f80470db9ab0 MISC d012000200000000 SYND 4d000000 IPID 500b000000000 Aug 20 07:58:19 3950X kernel: mce: [Hardware Error]: PROCESSOR 2:870f10 TIME 1597906684 SOCKET 0 APIC 0 microcode 8701013 Updated BIOS for the motherboard, error still there. Reflashed unRaid and copied config over and error hasn't returned yet! EDIT2: Error is back... Anyone care to help? EDIT3: Few more days, still nothing? 3950x-diagnostics-20200820-1013.zip Edited August 28, 2020 by Wurztha Quote Link to comment
mr2web Posted October 31, 2020 Share Posted October 31, 2020 I'm experiencing similar issues. Here what's in my log: Oct 31 02:14:13 Serverbrain3 kernel: mce: [Hardware Error]: Machine check events logged Oct 31 02:14:13 Serverbrain3 kernel: mce: [Hardware Error]: CPU 13: Machine Check: 0 Bank 5: bea0000000000108 Oct 31 02:14:13 Serverbrain3 kernel: mce: [Hardware Error]: TSC 0 ADDR 1ffff8168b28a MISC d012000100000000 SYND 4d000000 IPID 500b000000000 Oct 31 02:14:13 Serverbrain3 kernel: mce: [Hardware Error]: PROCESSOR 2:800f11 TIME 1604135631 SOCKET 0 APIC b microcode 8001138 Motherboard: Asus ROG STRIX X470-F GAMING (BIOS is up to date) CPU: AMD Ryzen 7 1700 Unraid: v6.8.3 Have u managed to get hold of the 'edac_mce_amd' module referenced by the mcelog? Quote Link to comment
Wurztha Posted October 31, 2020 Author Share Posted October 31, 2020 Hi, After a lot of messing I believe it was a motherboard firmware downgrade that fixed the issue for me. Give that a try and post back. I didn't get hold of the module. Best of luck! Quote Link to comment
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.