-
Random reboots and MCE Hardware error
Alright, I updated the BIOS to the latest version, and starting from stock settings I set the power limit and left XMP off. No other changes in the BIOS. Running memtest now, so let's see what happens.
-
Random reboots and MCE Hardware error
Hm, OK. I've had it on Auto for well over a year now with no issues, and the CPU ratio limit has been on Auto too. I will change that first thing. Any other recommendations? I'm not sure how else to test for these hardware errors after making changes, other than letting it run for a while and waiting for a reboot to happen. It always seems stable for a while, with everything working great, and then boom, reboot.
-
-
Random reboots and MCE Hardware error
Hello, in the past month I have started getting random reboots and MCE hardware errors in my log. I'm running an Intel i9-10850K on an ASRock Z590 Pro4 with 64 GB of RAM. I have an HBA card and an Nvidia P2000 installed and nothing else. The last time I posted about this I swapped out my HBA card, thinking that was the issue since drives were disappearing. I also ran a memtest at that time and it passed. Since then the errors have continued. I have attached my diagnostics from after the latest reboot, which happened last night (Apr 8th, around 1 AM it seems). The server was nearing completion of a parity sync when the reboot occurred. I have seen this behavior at least 2 times before. I have also had the syslog server running for the past couple of weeks to see if I can catch anything. I am starting to wonder if the motherboard has gone bad somehow. The HBA card issue happened on PCIe x16 slot 1, and after replacing the HBA card with a brand new one there have been no drive issues. However, since then, one of the hardware errors/reboots was preceded by a transcoder error with my Nvidia GPU on PCIe x16 slot 2. The latest reboot does not show either of the errors I have seen before. Some help would be greatly appreciated. unraid-diagnostics-20230408-0907.zip syslog-10.10.20.11.log
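For anyone wanting to check a saved syslog like the one attached above, here is a minimal sketch of one way to pull out lines that look like machine check reports. It assumes the kernel tags these lines with `mce:`, `[Hardware Error]`, or the phrase "Machine check". The function names `find_mce_lines` and `scan_file` are made up for illustration and are not part of any Unraid tool.

```python
import re
from typing import Iterable, List

# Assumption: machine-check lines in the syslog contain one of these markers.
MCE_PATTERN = re.compile(r"\bmce:|\[Hardware Error\]|machine check", re.IGNORECASE)


def find_mce_lines(lines: Iterable[str]) -> List[str]:
    """Return every line that matches one of the assumed MCE markers."""
    return [line.rstrip("\n") for line in lines if MCE_PATTERN.search(line)]


def scan_file(path: str) -> List[str]:
    """Read a saved syslog file (hypothetical helper) and return matching lines."""
    # errors="replace" keeps the scan going if the log has stray bytes.
    with open(path, encoding="utf-8", errors="replace") as fh:
        return find_mce_lines(fh)
```

Calling `scan_file("syslog-10.10.20.11.log")` would return the matching lines in order, so the timestamps can be compared against the reboot times. This only filters text; it does not decode what the error means.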
-
Machine Check Events error
Any luck resolving this? I am getting MCE events and random reboots, but I can't figure out where the MCE log is.
-
unRAID reboots right before Parity Sync completes
Turns out it was my HBA card that was causing disks to go offline, which caused other downstream issues even when I connected the disks directly to the motherboard. I got a new HBA and parity synced everything, and it's all working perfectly. The problematic disks were not the issue.
-
unRAID reboots right before Parity Sync completes
OK, I'll try to obtain some logs via the server you mentioned. This time I am running the parity sync with the problematic parity disk removed, so let's see if that changes anything.
-
unRAID reboots right before Parity Sync completes
Hey guys, I need some help. I haven't been able to figure this out. My server has been working great for 2-3+ years and this is the first time I am encountering this issue. Recently, my HBA card started causing errors (I'm pretty sure it was the problem) and I thought some of my disks had died. I put in another HBA and all the disks were recognized with no problem. Of course a data rebuild started, and it completed successfully. Now I am trying to complete a Parity Sync, but when I come back to my server around the time it is supposed to have finished, I find that the server has restarted and the parity sync was cancelled. This is with no interaction with the server. I found an MCE hardware error in the logs after one of the reboots, so I ran a memtest and everything came out clean. I have dual parity with 14TB disks (one of which disappeared when I had the HBA problem) and 5x 12TB disks in the array (one of which also disappeared when I had the HBA problem). Both disks worked when I tried the new HBA and I had no data loss. However, the Parity Sync still does not complete and the server reboots. Some help would be greatly appreciated.
-
[PLUGIN] GPU Statistics
Hi, this is a great extension and seems to work great. I am seeing a Throttle message and I don't quite understand it. What does it mean? The GPU is passed through to Plex only. I've attached a screenshot. Thanks
flallnatural