Jump to content

[SOLVED] System crashing when doing parity scan Unraid 6.0 RC5


inky

Recommended Posts

Hey everyone,

 

I recently upgraded my rock solid system from Unraid 5. I ran 6.0 for two weeks with no issues. Migrated all my apps to dockers and two VM's have been imported. I migrated all my 11 drives to XFS. Everything was fine and totally stable till the first of the month arrived. Monthly parity check time. I'm getting constant crashes if i try to do a parity check. At which point i lose all communication to the server. GUI dead, can't ping any NICs. I can get the IPMI screen which i have attached. The only way to get it back up is a hard reset. The system runs fine as long as i don't do parity checks. If i do it crashes around 1tb usually sometimes earlier. I'm assuming this is part of the I/O issue others have been having i just want to make sure. No parity function has me worried for my data. I've attached a syslog but i can't get one once the system crashes. Hope you guys can help.

 

Please keep in mind I have run parity scan in safe mode with no addons, dockers or VM's and still get the same result.

 

Hardware:

Supermicro X8SIL-FO

Xeon X3470

32gb Reg ECC

2x AOC-SASLP-MV8 (Flashed to .21)

wtf.JPG.97fa86fd51f1a91259af058db71af536.JPG

syslog.zip

Link to comment

Looks like a machine check event, a notoriously difficult issue to solve.  I'd start by running a LONG Memtest, overnight, just to eliminate the memory.  64 bit v6 works the memory differently than 32 bit v5, *might* be a clue there.  Check for heat issues.  Perhaps after working so hard, something is getting too hot.

 

When you next try to test again, keep a tail running on the monitor and keep an eye on it, just in case there's anything unusual appearing just before it crashes.

 

You might check for any firmware and BIOS updates.

 

Syslog looked fine, no clues there.  Nice to see you had already tested in Safe Mode!

Link to comment

So i did a 12hour mem test and no issues. I had an extra 4TB figured i'd try upgrading the parity drive and see the result. No crash at all. Very odd. I'll check parity after and see if it crashes. I will then add the old parity as data if all is well

Link to comment

So parity rebuild was successful. No issues/crash. I then tried to check the newly built parity and system crashed like before after 15-20mins.  >:(

 

Also checked for heat issues and they are non existent. Everything is running cool, confirmed by both sensor readings and using an IR thermometer.

 

Trying it again now with record enabled on the IPMI window dumping to avi file.

 

 

 

Link to comment
  • 3 months later...

Archived

This topic is now archived and is closed to further replies.

×
×
  • Create New...