Today Fix Common Problems showed me the following warning: 'Your server has detected hardware errors. The output of mcelog has been logged. Post your diagnostics and ask for assistance on the Unraid forums'
Checking the syslog, these three errors seem to be the ones that repeat most often:
1st error
Nov 7 12:40:07 227CS kernel: mce: [Hardware Error]: Machine check events logged
Nov 7 12:40:07 227CS kernel: [Hardware Error]: Corrected error, no action required.
Nov 7 12:40:07 227CS kernel: [Hardware Error]: CPU:1 (19:21:2) MC16_STATUS[-|CE|-|AddrV|-|-|UECC|-|-|-]: 0x854824048b480084
Nov 7 12:40:07 227CS kernel: [Hardware Error]: Error Addr: 0x0000000000000000
Nov 7 12:40:07 227CS kernel: [Hardware Error]: IPID: 0x0000000000000000
Nov 7 12:40:07 227CS kernel: [Hardware Error]: Bank 16 is reserved.
Nov 7 12:40:07 227CS kernel: [Hardware Error]: cache level: RESV, tx: DATA
..............................
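For anyone reading the status value in the first error, here is a minimal sketch of how the flags in the MC16_STATUS line can be pulled out of the raw 64-bit number. The bit positions are my assumption, taken from the generic x86 machine-check status layout plus the AMD CECC/UECC bits, and the names `STATUS_BITS` and `decode_mc_status` are made up for illustration, so please check them against the AMD documentation before relying on this.

```python
# Assumed bit positions in the machine-check status register
# (generic x86 MCA layout plus AMD's CECC/UECC bits; not verified
# against the processor documentation for this specific CPU).
STATUS_BITS = {
    "Val": 63,    # register contains a valid error
    "Over": 62,   # an earlier error was overwritten
    "UC": 61,     # uncorrected error (clear means corrected or deferred)
    "En": 60,     # error reporting was enabled for this error
    "MiscV": 59,  # MISC register holds valid data
    "AddrV": 58,  # ADDR register holds a valid address
    "PCC": 57,    # processor context corrupt
    "CECC": 46,   # correctable ECC error
    "UECC": 45,   # uncorrectable ECC error
}


def decode_mc_status(status):
    """Return a dict mapping each flag name to True/False for this status."""
    return {name: bool((status >> bit) & 1) for name, bit in STATUS_BITS.items()}


# Status value copied from the syslog line in the first error.
flags = decode_mc_status(0x854824048B480084)
print([name for name, is_set in flags.items() if is_set])
# prints ['Val', 'AddrV', 'UECC']
```

This matches what the kernel printed in the bracketed list (AddrV and UECC set, UC clear so it is reported as "CE"), but it does not say anything about which component is actually at fault.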
2nd error
Nov 8 04:33:13 227CS nginx: 2024/11/08 04:33:13 [error] 11074#11074: *545367 upstream timed out (110: Connection timed out) while reading response header from upstream, client: 192.168.1.107, server: , request: "GET /plugins/gpustat/gpustatusmulti.php?gpus={%220A:00.0%22:{%22id%22:%220A:00.0%22,%22model%22:%22NVIDIA%20GeForce%20GTX%201650%22,%22vendor%22:%22nvidia%22,%22guid%22:%22GPU-df869a0b-80ad-c050-71e2-954b30b6ca7b%22,%22panel%22:1}} HTTP/1.1", upstream: "fastcgi://unix:/var/run/php5-fpm.sock", host: "192.168.1.103", referrer: "http://192.168.1.103/Dashboard"
.............................
3rd error
Nov 9 04:40:07 227CS root: CPU is unsupported
Nov 9 11:01:39 227CS root: Fix Common Problems Version 2024.10.02
Nov 9 11:01:48 227CS root: Fix Common Problems: Error: Machine Check Events detected on your server
Nov 9 11:01:48 227CS root: mcelog: ERROR: AMD Processor family 25: mcelog does not support this processor. Please use the edac_mce_amd module instead.
.............................
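To check whether the machine check events always come from the same CPU and bank, something like the following sketch could be used to count them in the syslog. The regular expression is only based on the single status line quoted in the first error, and `count_mce_events` is a hypothetical helper name, so other kernel versions may format the line differently.

```python
import re
from collections import Counter

# Pattern modelled on the one "MC16_STATUS" line quoted above; this is an
# assumption about the format, not a complete parser for kernel MCE output.
MCE_STATUS_RE = re.compile(
    r"CPU:(\d+) \([^)]*\) MC(\d+)_STATUS\[[^\]]*\]: (0x[0-9a-fA-F]+)"
)


def count_mce_events(syslog_text):
    """Count machine check status lines per (cpu, bank, status value)."""
    counts = Counter()
    for line in syslog_text.splitlines():
        match = MCE_STATUS_RE.search(line)
        if match:
            cpu, bank, status = match.groups()
            counts[(int(cpu), int(bank), status)] += 1
    return counts


# Sample line copied from the first error in this post.
sample = (
    "Nov 7 12:40:07 227CS kernel: [Hardware Error]: CPU:1 (19:21:2) "
    "MC16_STATUS[-|CE|-|AddrV|-|-|UECC|-|-|-]: 0x854824048b480084"
)
for (cpu, bank, status), n in count_mce_events(sample).items():
    print(f"CPU {cpu}, bank {bank}, status {status}: {n} event(s)")
```

On a real system the text would come from the syslog file instead of the sample string, for example `open("/var/log/syslog").read()`, assuming that is where the log lives.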
There are two more issues that I believe are unrelated to this alert, but I mention them in case they are connected:
- Lately the system has been unstable and has frozen several times, locking up completely with no response to the keyboard or the shutdown button, which forces me into unclean shutdowns. I have associated the freezes with the Plex docker, as they only happen with it.
- In the morning, a scheduled scrub of a ZFS pool ran and reported an uncorrectable error.
Since I know neither the severity nor the possible solution, I would appreciate some guidance.
Thanks in advance.