Cryptic51 Posted February 16, 2021 (edited February 19, 2021: fix typo)
I have two Unraid servers running similar hardware. Both normally run fine, but one becomes unresponsive when the scheduled parity check executes and has to be rebooted with the reset switch. There are no obvious hardware errors and no "Fix Common Problems" errors. I've disabled and removed the VMs and Docker, but the hangs persist. Can anyone suggest a possible fix?
sunset-shimmer-diagnostics-20210216-1515.zip
Squid Posted February 16, 2021
Have you run a memtest on the system?
Cryptic51 Posted February 16, 2021 (edited February 16, 2021)
Yes, no errors detected. The mainboard firmware is the latest recommended for the installed CPU. The system runs on a UPS.
Squid Posted February 16, 2021
Then the next question would be: is the PS up to snuff?
Cryptic51 Posted February 17, 2021
The PS is a good EVGA 500. The system runs 24/7 and only hangs on a parity check. It can go 90 days with no problem, then the scheduled parity check runs and the system hangs.
Cryptic51 Posted February 19, 2021
I manually ran a parity check yesterday. I spun up all drives and started the check. The check appeared to freeze at 53%. I was able to pause the job in the GUI, but could not cancel it or perform a restart. The restart also hung while attempting a forced restart after waiting 90 seconds for processes to halt. I also just tested the flash drive; it passed several diagnostic programs.
Cryptic51 Posted February 20, 2021
New parity check. It seems to have hung at 80% this time. I've attached the tail of the system log. Is that a SAS controller error?
System Log Tail.txt
Squid Posted February 20, 2021
Reseat all the cabling to the drives and to the controller:
Feb 20 05:59:42 Sunset-Shimmer kernel: sd 12:0:7:0: Power-on or device reset occurred
Feb 20 05:59:42 Sunset-Shimmer kernel: sd 12:0:8:0: Power-on or device reset occurred
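For anyone who wants to check a longer syslog for the same symptom, here is a minimal Python sketch that counts "Power-on or device reset occurred" messages per SCSI address (host:channel:target:lun). The function name `count_resets` and the `sample` list are illustrative only; the two sample lines are the ones quoted above. If several affected devices share the same host number, that may point at the shared controller, cabling, or power rather than at a single disk, but treat that as a hint and not a diagnosis.

```python
import re
from collections import Counter

# Matches kernel messages like:
#   "... kernel: sd 12:0:7:0: Power-on or device reset occurred"
# and captures the SCSI address (host:channel:target:lun).
RESET_RE = re.compile(
    r"kernel: sd (\d+:\d+:\d+:\d+): Power-on or device reset occurred"
)


def count_resets(lines):
    """Return a Counter mapping SCSI address -> number of reset messages."""
    counts = Counter()
    for line in lines:
        match = RESET_RE.search(line)
        if match:
            counts[match.group(1)] += 1
    return counts


# The two log lines quoted in this thread, used here as sample input.
sample = [
    "Feb 20 05:59:42 Sunset-Shimmer kernel: sd 12:0:7:0: Power-on or device reset occurred",
    "Feb 20 05:59:42 Sunset-Shimmer kernel: sd 12:0:8:0: Power-on or device reset occurred",
]

if __name__ == "__main__":
    counts = count_resets(sample)
    for address, n in sorted(counts.items()):
        print(f"{address}: {n} reset(s)")
    # The first field of the address is the SCSI host number.
    hosts = sorted({address.split(":")[0] for address in counts})
    print("SCSI hosts involved:", ", ".join(hosts))
```

To run it against a real log, pass an open file instead of `sample`, for example `count_resets(open(path))` with `path` pointing at your own syslog copy.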
Cryptic51 Posted March 2, 2021
Re-seated all the cabling for the drives and controller. Moved the parity drive off the SAS controller and onto the mainboard SATA controller.
> The parity check seemed to complete, but at the end the GUI was unresponsive and the system would not shut down.
> A short push on the power button started the shutdown, but after 20 minutes it still had not completed.
> Used the reset button to restart.
> After the restart, the parity check began again at 0% complete.
> I cancelled the parity check.
I've ordered a replacement SAS card (same model) and am waiting on delivery.
sunset-shimmer-diagnostics-20210302-1448.zip
Vr2Io Posted March 2, 2021
I suggest running a long (extended) SMART self-test on every disk and checking that they all pass.
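As a rough sketch of that suggestion, the Python snippet below builds the `smartctl` commands for starting a long self-test and for reading the self-test log afterwards. It assumes smartmontools is installed; the device names in `DEVICES` are hypothetical placeholders, so substitute your own. The script only prints the commands (a dry run) and does not execute anything. Drives behind some SAS controllers may need an extra `-d` device-type option, which is not covered here.

```python
import shlex

# Hypothetical device list: replace with the disks in your own array.
DEVICES = ["/dev/sdb", "/dev/sdc", "/dev/sdd"]


def long_test_command(device):
    """Command that starts an extended (long) SMART self-test on one disk."""
    return ["smartctl", "-t", "long", device]


def selftest_log_command(device):
    """Command that prints the SMART self-test log for one disk."""
    return ["smartctl", "-l", "selftest", device]


if __name__ == "__main__":
    # Dry run: print the commands so they can be reviewed before running
    # them by hand (or via subprocess.run) as root.
    for dev in DEVICES:
        print(shlex.join(long_test_command(dev)))
    print("# once the tests have finished:")
    for dev in DEVICES:
        print(shlex.join(selftest_log_command(dev)))
```

A long test can take many hours on large disks, and the drives must stay spun up while it runs, so check the log only after the estimated completion time that `smartctl` reports when the test starts.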