Unraid server freezes | WEBUI no longer accessible | keyboard input also not possible #Unraid_Version_6.12.8


Recommended Posts

Hi,

 

I've been struggling with freezes again for a few weeks now. I have already tested the following:

 

 

 

  • I started a memtest86 and let it run for 21 hours. 5 passes without errors. 4x32GB ECC RAM - 128GB (Successfull)
  • I have started a stress test for the CPU. It ran for just over an hour on all cores. CPU had 70 degrees. -> stress --cpu 20 (Successfull)

  • I ran scrub on my Cache Pool. 2x SK hynix Gold P31 2TB PCIe NVMe Gen3 M.2 2280 in a ZFS-Encrypted - Mirror (Successfull)

  • I ran SMART tests on both 12TB WD Red Plus WD120EFBX 256MB 3.5" (8.9cm) SATA 6Gb/s. (Successfull)

 

 

I am out of Ideas. Currently I am running another memtest86 on all RAM. I think my server has been crashing ever since I started running the Palworld server. But I don't think it's because of this Docker or does anyone has the same behavior ?

 

In that Topic you can see my BIOS settings (I removed the OC -> "Dynamic Vcore (DVID)" set to "auto" AND "PCH Core" set to "auto"):

 

What else could I do?

 

Thanks!

 

 

tower-diagnostics-20240314-2102.ziptower-syslog-20240312-1118.ziptower-syslog-previous-20240312-1117.zip

Edited by UNRA1DUser
Link to comment
31 minutes ago, JorgeB said:

Did you also test with docker service disabled? That would be a good clue if issue is hardware related or not.

So you mean disable docker service and start a Benchmark on CPU and RAM?

 

Would that stress the CPU and RAM correctly? I have 128GB RAM and 20 threads.

 

128 GB = 128000 MB / 1024 MB = 125

stress --cpu 20 --vm 125 --vm-bytes 1024M

 

Or should I also stress the Storage and IO ? Is that the right Syntax?

 

stress --cpu 20 --vm 125 --vm-bytes 1024M –hdd 20 –io 20

 

 

Besides that, my last memtest86 was running 21 hours and passed 5 times. Is that enough? Should I stop my current second memtest86 run? It´s actually running for 15 hours and has 3 passes.

Link to comment
24 minutes ago, UNRA1DUser said:

So you mean disable docker service and start a Benchmark on CPU and RAM?

No, just disable the docker service and run the server normally to see if it still freezes.

 

2 hours ago, UNRA1DUser said:

I think my server has been crashing ever since I started running the Palworld server.

Does it freeze if that container is not running?

Link to comment
Posted (edited)
24 minutes ago, JonathanM said:

No, just disable the docker service and run the server normally to see if it still freezes.

 

I can try that. Is there a problem if I stress test the Server in that mode? Otherwise there wouldn´t be any load to it. And how long should the Server run in that mode?

 

 

24 minutes ago, JonathanM said:

Does it freeze if that container is not running?

 

As I know than the Server is running and running without problems. But there is also nothing installed that is using a lot of Resources of the Server itself.

 

Last time I started the Unraid Server I stopped the Palworld Server and it runs for 4-5 hours before I restartet it by myself and tried another test (Went to sport in the mean time and forgot to start the Palworld Server again). I don´t know anymore if this was in safe mode or not.

 

As I remember correctly the freezes started after I installed Palworld. But it´s not my first time that the Server is freezing -> 

After I bought this "Adapter" and connected it to the internal USB2.0 Port those freezes were gone. But I think I also played around and removed some Dockers. So could be possible that also another Docker was the reason for that.

 

 

So should I abort my current running memtest86 and start the unraid server normally without the Palworld Docker and see if its running?

Or should I first boot Unraid into safe mode without Docker, VMs and Plugins and wait for some hours ?

Edited by UNRA1DUser
Link to comment
11 minutes ago, JorgeB said:

Long enough to see if it no longer crashed, I would take the typical time it takes to crash and double it, while using the server as a NAS only.

Than it should be something like 2-4 Hours. I will stop the memtest86 now and start the Unraid Server in safe mode without Docker, VM and Plugins. I will also start a stress test to get some load on it. Let´s see whats gonna happening.

 

 

 

 

Link to comment
Posted (edited)

I started the Unraid Server in normal mode and disabled VMs and Dockers and stopped the Array.

 

I also started a stress test. It´s running since 17:35. So about 1 hour for now. Server is still running. CPU Temp 72 C and the cpu Mhz is @ 4899.996 Mhz

 

stress --cpu 20 --vm 125 --vm-bytes 1024M

 

image.thumb.png.30b74ca566d4eacf4b92b60a3e3edb3e.png

 

 

 

I can just see a lot of Network drops. Is that Normal?

 

image.png.52faf258ca11735a0129a3bb63f6cb1c.png

 

image.thumb.png.dd64a7f5188c920623134b3c5bd91adb.png

 

 

Edited by UNRA1DUser
Link to comment
41 minutes ago, JorgeB said:

Many times that I've seen a container crashing the server.


Do you also know the reason for that? I mean I really like to host a palworld server. And it seems to work for a lot of people. I can’t understand why one container is freezing a whole server. 

Link to comment
Posted (edited)

Stress test and Server are still running. No Problems. Close to 3 hours run time. (21:25)

 

I will let it run until tomorrow morning.

 

By the way. The Server is using 190 W under Benchmark.

 

stress --cpu 20 --vm 125 --vm-bytes 1024M

Edited by UNRA1DUser
Link to comment

I stopped the Stress test yesterday 11:00 PM. So it rans about 4 hours without Errors / freezes.

 

After that the Server was still running without VMs and Dockers until now. Without freezes.

 

Uptime in Sum 8 hours. So it´s not a Hardware Issue. I will start the Docker again and Stop the Palworld Server.

Link to comment
On 3/15/2024 at 10:44 AM, UNRA1DUser said:

 I think my server has been crashing ever since I started running the Palworld server.

interesting, that you mentioned the Palworld Server, I had similiar problems since I used that container, but now 6 days after resetting my unraid, and without installing the palworld server my unraid server hasn't crashed once.

My post about this similiar issue here if you want more information: 

 

Link to comment
51 minutes ago, MassimoMx said:

interesting, that you mentioned the Palworld Server, I had similiar problems since I used that container, but now 6 days after resetting my unraid, and without installing the palworld server my unraid server hasn't crashed once.

My post about this similiar issue here if you want more information: 

 

 

That´s really crazy.

 

My Server didn´t freezed or restarted since I deleted the Palworld Docker. But I still don´t understand why :D

Link to comment
15 hours ago, UNRA1DUser said:

My Server didn´t freezed or restarted since I deleted the Palworld Docker. But I still don´t understand why :D

Did you let the container un the entire time?

I would recommend that you run daily restart from the container, the container is running since day one on my server.

Link to comment
Posted (edited)
4 hours ago, ich777 said:

Did you let the container un the entire time?

I would recommend that you run daily restart from the container, the container is running since day one on my server.

 

Yes, All my Containers gets a restart every night via the AppData Backup Plugin. But if I run the Palworld Container my Server gets freezes after 1 - 4 hours. Sometimes also longer times. But I must say, I limited the Palworld Server to 32 GB. (--memory=32G) But we are not more than 4-5 Players at ones online. And Also if no one joins the Server Unraid freezed after 1 hour.

 

Until now my server is running without any freezes or problems. I would wait some more days. If the server keeps running I will install the Palworld Docker again and see whats gonna happening.

Edited by UNRA1DUser
Link to comment
58 minutes ago, UNRA1DUser said:

But if I run the Palworld Container my Server gets freezes after 1 - 4 hours.

I can't reproduce this, my container is now running for about 10 hours and it's working fine, I restart it every 24 hours:

grafik.thumb.png.1bcc9505cb8ad12551d38a6f4e4abb1c.png

 

As you can see in the screenshot I'm running it with a 50GB cap and have no issues whatsoever.

 

Are you sure that you're CPU or RAM is not overheating? I know about OOM errors and as a result crashing servers but it's different from what you are reporting, the power supply and everything in your server is also up to the task?

Link to comment

I tested also another PSU with 750 W. Did 2 memtest86 for several hours and also run stress for some hours on CPU and RAM. Everything works fine. CPU highest temps with Turbo mode enabled are 72 C. No freezes or crashes.

 

I don´t know the temps from the RAM but I guess it would crash if the memtest86 is running and at least if the stress test for CPU and RAM is running. But everything was working without crashes.

 

 

Link to comment
  • 1 month later...

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.