Jump to content

Unraid server becomes unresponsive roughly on a weekly basis


Recommended Posts

Hi. I had an unraid server stable for several years running 7/7 - 365 . I recently upgraded to a new more powerful hardware based on an AMD CPU. I kept on getting weekly crashes when unraid becomes totally unresponsive on the network. I already did a few changes suggested on forums (Global c-state disabled, reduce frequency of RAM just in case as I am NOT doing any overclocking anyway) but the issues are stil ocurring. This is driving me nuts. That said when the server becomes totally unresponsive it still responds to me pressing the physical power shutdown and then does a graceful shutdown. Below is an extract of the syslog just when the unresponsiveness is triggered. Any help would be appreciated. 

log.txt

Link to comment

I am dealing with the same issue. In my case running an Intel-based CPU (Gigabyte Mobo with 2.5G Intel 225 NIC onboard). Same occasional crashes, same syslog error message.

 

About 14 days ago I deactivated all ASPM BIOS settings and all C-State settings. Also everything related to powertop tweaking in Unraid. Since then, no crash. You may give this a try yourself…

 

Nevertheless this should not be the final solution for me, as the server consumes more energy than necessary…but to track things down I started „from scratch“. I am abroad at the moment, so I cannot tweak anything in BIOS. I will try to re-activate things as soon as I come back.

Link to comment
14 hours ago, JorgeB said:

Problem with the NIC getting dropped, I assume this is onboard?

 

Oct 14 15:28:29 Tower kernel: igc 0000:0b:00.0 eth0: PCIe link lost, device now detached

 

Yes, this the Intel 2.5 Gb Ethernet NIC onboard the ROG STRIX X670E-E motherboard.

Link to comment
12 hours ago, JayDee73 said:

I am dealing with the same issue. In my case running an Intel-based CPU (Gigabyte Mobo with 2.5G Intel 225 NIC onboard). Same occasional crashes, same syslog error message.

 

About 14 days ago I deactivated all ASPM BIOS settings and all C-State settings. Also everything related to powertop tweaking in Unraid. Since then, no crash. You may give this a try yourself…

 

Nevertheless this should not be the final solution for me, as the server consumes more energy than necessary…but to track things down I started „from scratch“. I am abroad at the moment, so I cannot tweak anything in BIOS. I will try to re-activate things as soon as I come back.

Thank you very much. I have just deactivated  the ASPM in the BIOS (C-state was already disabled in BIOS and I have not installed powertop in Unraid). I will continue to monitor the server and revert to this forum with updates.

Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

×
×
  • Create New...