Jump to content

Unraid crashes since 6.12.3. Switched back to 6.12.2. Crashes stay.


Go to solution Solved by jsspanjer,

Recommended Posts

Hi.

 

Unraid crashes since 6.12.3. Switched back to 6.12.2. Crashes stay

 

What i did thusfar:

I have followed advice here to look at my Docker Network configuration.

Switched from maclan to iplan.

I also disabled "Host access to custom networks"

I set the advanced plex configuration to skip the health check.

I looked at my go file to see if there was ipv6 settings. None exist.

I switched om my syslog server to dump syslog on the cache. I see no relevant "crash" notifications.

I see only in my "telegram bot" that Partity starts and after a while it is "cancelled".

Server reboots and we are in a loop

 

Currently i disabled docker and vm and cancelled the parity.

I attached my logs.

syslog.zip media-01-diagnostics-20230830-1703.zip

Link to comment

One thing to mention. I am not a new user. I work in IT and have used UNRAID since the "Limetech old days". My post count is low because I never used the forum to post. I always have used it as reference. Only reason for posting is that I need a pair of eyes and I really do not want corrupt data. Not everything is backed up.

Link to comment
1 hour ago, JorgeB said:

This suggests a hardware issue, start by running memtest and/or using a different PSU.

 

 

But it is a strange coincidence that it started after installing 6.12.3 . Rollback to 6.12.2 did not solve it though. 

I get that parity intensifies operation of the server. There are more services running on the machine that still work if I just cancel Parity. I only stopped them for convenience.

But.

I will do a memtest.

PSU is not something I have laying around. I could order one but that would be a shame if it was not related to the issue.

Is there a way to see if this is PSU related?

Just checked.

PSU is from 2019

Corsair RM550x 550 Watt 80 PLUS Gold Fully Modular ATX PSU ( 10 Year Warranty)

Edited by jsspanjer
Link to comment

So. Did a 10 pass memory test and test my PSU with a PSU Tester.

Nothing wrong with them. Everything tests fine. Als did an upgrade to 6.12.4 with the recommended configurations for docker.

Nothing solved.

Still a crash after i start Docker and VM's.

I have custom Docker networks on. eth0 and vlan configuration. Also reformatted my cache drives to zfs.

IMG_7918.thumb.jpeg.355eb75393214c9fd669d88027e771b0.jpegIMG_7922.thumb.jpeg.ad16c1cab82e791c95583f036abeaaf9.jpeg

Link to comment
4 hours ago, jsspanjer said:

test my PSU with a PSU Tester.

That PSU tester (I have the exact same one) only tests at idle current, it does not put any load on the supply. Similar to the memtest, it only can confirm a failure with a negative result, a pass by either memtest or that style of PSU tester just means it passed under certain conditions, not all conditions.

Link to comment
10 hours ago, JonathanM said:

That PSU tester (I have the exact same one) only tests at idle current, it does not put any load on the supply. Similar to the memtest, it only can confirm a failure with a negative result, a pass by either memtest or that style of PSU tester just means it passed under certain conditions, not all conditions.

Hi. I get that. But.

I have traversed this forum for years. I have never asked a question here. Every time someone posts a problem here, there's always someone who asks for syslog and diagnostics.

I only see the syslog downloaded once. So it is not looked at. I myself am in the IT troubleshooting business. I am doing Windows. Unraid is a hobby.

When something like this happens, I always look at wat was changed.

I had an uptime of hundreds of hours before I updated to 6.12.3.

Immediately after reboot the crashes happen.

Mind you. It still could be hardware related. But it happened directly after installing 6.12.3.

After that the crashes stayed. Degrading to 6.12.2 did not fix it. Upgrading to 6.12.4 did not fix it.

I did a couple of memtest days (as asked) and tested my PSU (as asked).

But still see 1 download of the syslog. 0 downloads of the diagnostics files

I am not ruling out hardware but I am not Unraid minded enough to see if there's anything wrong in the syslog / diagnostics.

Could please someone rule them out before I am buying new PSU?

Or if anyone is kind enough to show me (a manual) how to read them.

Edited by jsspanjer
Link to comment
6 hours ago, jsspanjer said:

I have never asked a question here. Every time someone posts a problem here, there's always someone who asks for syslog and diagnostics.

I only see the syslog downloaded once. So it is not looked at.

Not quite sure what point you are trying to make?   The diagnostics includes the current syslog from RAM.    The only time I would expect to see a need to post a syslog separately is when it is one resulting from the syslog server.

Link to comment
9 hours ago, jsspanjer said:

If I insert my Power calculations into https://outervision.com/power-supply-calculator I get a Load Wattage: 543 W

So. That means that my current PSU is not capable of running that kind of max power. 

I currently have a Corsair RM550x 550 Watt 80 PLUS Gold Fully Modular ATX PSU.

Hmmm.

 

 

Take a look at your motherboard settings to find a way to limit the TDP of the CPU to 65W (disable one ccx and enable eco mode for example or look online). It will reduce performance but decrease power consumption by quite a bit, so you can stay bellow your PSU max wattage. 

 

If you have a kill a watt or similar you can see what the system is actually pulling and see if you exceed 500W.

Link to comment
14 hours ago, SP67 said:

 

Take a look at your motherboard settings to find a way to limit the TDP of the CPU to 65W (disable one ccx and enable eco mode for example or look online). It will reduce performance but decrease power consumption by quite a bit, so you can stay bellow your PSU max wattage. 

 

If you have a kill a watt or similar you can see what the system is actually pulling and see if you exceed 500W.

I have ordered a new power supply. This time I bought the Corsair RM1200 shift. That's mostly overkill but will make sure that there's enough power.
Hopefully that will solve the issue. :D 

  • Upvote 1
Link to comment

Installed the new power supply. RM1200 (1200W).

Came to realize that the "old" power supply was 850W. It was the RM850i. So should have been sufficient.

Booted the server up. Parity check started again at sep 7, 19:20 hours.

Looked this morning and the server rebooted again at 4:20 ish.

I am not seeing anything strange in syslog. Could someone please assist in looking at it?

Latest image is from my telegram-bot notification.

One thing to mention. Disk 1 is normally not used and is seen overheating when parity occurs. 51 degrees and drops again to 49 degrees.

I have many more of the same kind. They are close together but only this one is 3 degrees warmer than the rest.

Normally I do not use this disk. It was empty. I started doing Time Machine backups to it a couple of weeks ago. This morning, after I saw that the server still reboots,  I disabled that and removed all files on it.

 

syslog.zip

SCR-20230908-idsp.png

media-01-diagnostics-20230904-0922.zip

Link to comment

So. I have an extra usb drive (backup) which i erased. Installed fresh version of Unraid 6.12.4 on it.

Only config i copied over are the minimum settings to do a parity sync to see if it stays up.

From latest flash backup:

/config/pools

/config/shares

/config/disk.cfg

/config/share.cfg

super.dat

copied my second pro.key to it.

 

Parity succeeded. Uptime 19 hours and counting.

As far as i see it. No hardware issue.

The original "master" usb is an upgraded Unraid installation from 2019.

I will slowly configure this new usb to see if the server will continue to function. 

 

 

 

Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

×
×
  • Create New...