Fix common errors - mcelog


Hi!

 

I turned on my server today and found this message under Fix Common Problems.

"Your server has detected hardware errors. You should install mcelog via the NerdPack plugin, post your diagnostics and ask for assistance on the unRaid forums. The output of mcelog (if installed) has been logged".

I have mcelog installed via NerdPack.

I'm unsure what the issue is, but I would highly appreciate it if someone could take a look at my attached diagnostics file.

 

Running version 6.8.3

 

Thanks in advance!

tron-diagnostics-20200817-2131.zip

Link to comment
28 minutes ago, sublime24 said:

 

Your syslog is full of this:

Aug 23 11:10:24 KaChing s3_sleep: Disk activity on going: sdb
Aug 23 11:10:24 KaChing s3_sleep: Disk activity detected. Reset timers.
Aug 23 11:11:24 KaChing s3_sleep: Disk activity on going: sdb
Aug 23 11:11:24 KaChing s3_sleep: Disk activity detected. Reset timers.

I don't use that plugin but I haven't seen that in other syslogs so I assume there must be some way to get rid of it. Does the plugin have an option to enable/disable logging?
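To see which program is flooding a syslog, one rough approach is to count lines per program tag. The sketch below assumes the classic `Mon DD HH:MM:SS host tag: message` layout seen in the excerpt above. The function name `count_by_tag` is made up for illustration and is not part of any Unraid tool.

```python
import re
from collections import Counter

# Assumed layout: "Aug 23 11:10:24 KaChing s3_sleep: message"
# The optional "[pid]" suffix after the tag is tolerated but not required.
LINE_RE = re.compile(
    r"^\w{3}\s+\d+\s+[\d:]{8}\s+(\S+)\s+([^:\[\s]+)(?:\[\d+\])?:"
)

def count_by_tag(lines):
    """Count syslog lines per program tag (hypothetical helper)."""
    counts = Counter()
    for line in lines:
        match = LINE_RE.match(line)
        if match:
            counts[match.group(2)] += 1
    return counts

# The four lines quoted from the syslog above.
sample = [
    "Aug 23 11:10:24 KaChing s3_sleep: Disk activity on going: sdb",
    "Aug 23 11:10:24 KaChing s3_sleep: Disk activity detected. Reset timers.",
    "Aug 23 11:11:24 KaChing s3_sleep: Disk activity on going: sdb",
    "Aug 23 11:11:24 KaChing s3_sleep: Disk activity detected. Reset timers.",
]

# List the noisiest tags first.
for tag, n in count_by_tag(sample).most_common(3):
    print(tag, n)
```

To run it on a real log, pass an open file object instead of `sample`. Lines that do not fit the assumed layout are skipped.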

 

On 8/23/2020 at 1:32 PM, trurl said:

Have you done memtest?

 

 

Link to comment

I forgot I had that plugin configured haha, I just removed it. My server no longer sleeps as I use it for my Home Assistant DB. 

I have not run a memtest; this can be done at the Unraid boot menu?

I do see this error, could this be the issue?

image.png.cfb07b7fa09eb3146022d0a86fcbac9f.png

I noticed this hard drive is also hotter than the others. 

image.thumb.png.2533e1e4955e7c74cb9f2b36cb7b9e8f.png

Edited by sublime24
Link to comment

CRC errors are usually connection issues. A small number not increasing is nothing to worry about.

Just acknowledge the CRC error by clicking on the "thumbs down" on the Dashboard. It will warn you again if that increases.
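If you would rather track the count yourself, a minimal sketch is to read the raw `UDMA_CRC_Error_Count` value from `smartctl -A` output and compare it with an earlier reading. It assumes the usual ten-column attribute table with a plain integer in the last column. The sample text is made up for illustration and is not taken from the posted diagnostics, and the helper names are hypothetical.

```python
def crc_error_count(smartctl_output):
    """Return the raw UDMA_CRC_Error_Count, or None if the row is absent.

    Assumes rows shaped like:
    ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
    """
    for line in smartctl_output.splitlines():
        fields = line.split()
        if len(fields) >= 10 and fields[1] == "UDMA_CRC_Error_Count":
            return int(fields[9])
    return None

def crc_increased(previous, current):
    """True only when the counter has grown since the last reading."""
    return current > previous

# Illustrative sample only; not from the diagnostics in this thread.
sample = """\
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
194 Temperature_Celsius     0x0022   112   098   000    Pre-fail  Always       -       38
199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       5
"""

current = crc_error_count(sample)
print("CRC errors:", current)
print("increased since last check:", crc_increased(5, current))
```

A steady count matches the "nothing to worry about" case. A growing one usually points back at the cable or connection.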

42 minutes ago, sublime24 said:

memtest, this can be done at the Unraid boot menu

Yes. You might have to boot in non-UEFI mode to run memtest.

Link to comment
10 hours ago, nnzy1734 said:

No, I will try running it tonight to see if it produces anything. Thanks for the tip!

Your MCEs are being issued when the cores are initialized. This is fairly common and nothing to worry about.

 

2 hours ago, sublime24 said:

Your problem, @sublime24, may also be caused by running an ancient BIOS revision. Update it.

Link to comment
