jmcskis Posted February 12, 2022 I'm in the process of increasing the storage in my server. During the parity rebuild after adding the new parity drive, I lost all access to the server. I can't access the web GUI, I can't SSH in, and now it's not even showing as connected to my network. I also tried connecting a monitor and keyboard to the server, but all I get is a blank screen and no response. As a result, I'm unable to get the diagnostics or the syslog. The only indication I have that my server is even on is that the CPU and case fans are still running. All HDD activity has stopped (or at least the HDD activity LED has stopped flashing). It seems like my only option is to perform an unclean shutdown, but given this happened during a rebuild I'm terrified that this will just screw things up even more. My system only has a single parity drive. I do have the old parity drive, which I can reinsert if required. Any ideas what may have caused this? Could this be an indication of an issue with my USB drive? What steps can I take to limit data loss from this? Also, I don't know if it matters, but during the rebuild, I was also preclearing a new data drive.
JorgeB Posted February 13, 2022 You'll need to force a restart. If the parity sync didn't finish before the crash, it will just start over.
jmcskis Posted February 22, 2022 Thanks. After forcing a reboot, I was able to get the parity rebuilt and moved on to replacing data drives. Same thing again: everything froze up partway through, I forced a reboot, and got the drive replaced fine. I moved on to preclearing my next drive and what do you know... things froze up again. OK, time to check everything. I did the usual checks of all my connections, ran a check on the flash, etc. After searching a bit more, it seemed like my RAM could be at fault, so I ran a memtest and oh boy. Given that I'm getting thousands of errors, I'm guessing that I found my issue... I did pull diagnostics before my server hung up on me the last time. Not sure if there's anything in there that would tell me I have failing RAM. tower-diagnostics-20220216-1250.zip
trurl Posted February 22, 2022 Exactly zero RAM errors from memtest is the only acceptable result. You shouldn't even attempt to run a computer with bad RAM.
jmcskis Posted February 22, 2022 Yep. I called it quits right there until I could replace it.