inky Posted June 5, 2015

Hey everyone, I recently upgraded my rock-solid system from Unraid 5. I ran 6.0 for two weeks with no issues. I migrated all my apps to Docker containers, imported two VMs, and converted all 11 of my drives to XFS. Everything was fine and totally stable until the first of the month arrived: monthly parity check time.

I'm getting constant crashes whenever I try to run a parity check. When it crashes I lose all communication with the server: the GUI is dead and I can't ping any of the NICs. I can still get the IPMI screen, which I have attached. The only way to get the server back up is a hard reset. The system runs fine as long as I don't run parity checks. If I do, it usually crashes around the 1 TB mark, sometimes earlier.

I'm assuming this is part of the I/O issue others have been having, but I want to make sure. Having no working parity check has me worried for my data. I've attached a syslog, but I can't capture one after the system crashes. Hope you guys can help. Please keep in mind that I have already run the parity check in Safe Mode, with no add-ons, Docker containers or VMs, and I still get the same result.

Hardware:
Supermicro X8SIL-FO
Xeon X3470
32 GB Reg ECC
2x AOC-SASLP-MV8 (flashed to .21)

Attachment: syslog.zip
RobJ Posted June 6, 2015

Looks like a machine check event, a notoriously difficult issue to solve. I'd start by running a LONG Memtest, overnight, just to eliminate the memory. 64-bit v6 works the memory differently than 32-bit v5, so there *might* be a clue there. Check for heat issues. Perhaps after working so hard, something is getting too hot. When you next test, keep a tail of the syslog running on the monitor and keep an eye on it, just in case anything unusual appears just before the crash. You might also check for firmware and BIOS updates. The syslog looked fine, no clues there. Nice to see you had already tested in Safe Mode!
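For anyone following the "keep a tail running" suggestion on a box that locks up hard: a tail on the monitor is lost once the screen freezes, so it can help to also copy the log to storage that survives the reset. Below is a minimal Python sketch of that idea, not an Unraid feature. The function names `copy_new_lines` and `follow` are made up for this example, and the paths in the note after the code are assumptions you should adjust for your own system.

```python
import os
import time


def copy_new_lines(src, dst):
    """Copy every complete line currently readable from src to dst.

    Both arguments are binary file objects. After copying, the data is
    flushed and fsync'ed so it is on the device before a possible crash.
    Returns the number of lines copied.
    """
    copied = 0
    while True:
        pos = src.tell()
        line = src.readline()
        if not line:
            break  # reached the current end of the log
        if not line.endswith(b"\n"):
            # Partial line: rewind and pick it up once the writer finishes it.
            src.seek(pos)
            break
        dst.write(line)
        copied += 1
    if copied:
        dst.flush()
        os.fsync(dst.fileno())
    return copied


def follow(src_path, dst_path, poll_seconds=1.0, max_polls=None):
    """Poll src_path and append new lines to dst_path.

    Runs forever when max_polls is None (stop it with Ctrl+C);
    a finite max_polls is mainly useful for testing.
    """
    with open(src_path, "rb") as src, open(dst_path, "ab") as dst:
        polls = 0
        while max_polls is None or polls < max_polls:
            copy_new_lines(src, dst)
            time.sleep(poll_seconds)
            polls += 1
```

A call such as `follow("/var/log/syslog", "/boot/syslog-copy.txt")` would mirror the live log to the flash drive, assuming the syslog lives at `/var/log/syslog` and the flash drive is mounted at `/boot`; the output file name is hypothetical. The fsync after each batch is the point of the exercise: it trades some write overhead for a better chance that the last lines before a lockup are actually on disk. A machine check can still halt the system before anything is logged, so an empty result does not rule one out.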
inky Posted June 7, 2015

So I ran a 12-hour memory test and found no issues. I had an extra 4 TB drive, so I figured I'd try upgrading the parity drive and see what happened. No crash at all. Very odd. I'll check parity afterwards and see if it crashes. If all is well, I will then add the old parity drive as a data drive.
inky Posted June 7, 2015

So the parity rebuild was successful, with no issues or crashes. I then tried to check the newly built parity and the system crashed like before, after 15-20 minutes. I also checked for heat issues and there are none. Everything is running cool, confirmed by both sensor readings and an IR thermometer. I'm trying it again now with recording enabled on the IPMI window, dumping to an AVI file.
inky Posted June 8, 2015

Still no luck. I tailed the syslog and it did not show anything. After checking parity for 1:38 min, it produced these three screenshots. Temps were all normal.
inky Posted June 10, 2015

Upgraded to RC5 and ran the same tests. Same result.
dgaschk Posted June 10, 2015

Do a New Config and assign just two disks. Attach both disks to the motherboard if possible, or otherwise to the same HBA. Build parity and test.
inky Posted September 17, 2015

The problem was constant until I installed Unraid 6.1.2. It has miraculously gone away. I ran three parity checks back to back with no issues.