December 31, 20232 yr I'm having an issue where randomly many drives seem to be dropping out from my array and crashing the server when it happens. I'm trying to track down the cause of the issue. There doesn't seem to be any common timeframe or element, other than each time it's happened has been while i am trying to preclear one, or both of two new drives I'm planning to use for parity drives. One of the drives has been working just fine for the last 6 months as parity two, but somewhere in my tinkering the array decided it was no longer correct and wanted to rebuild it. I had another failed drive that started the whole endeavor. Where I am: Started out with old HBA's that only supported 2TB drives. about 6 months ago I installed a newer one, but only hooked it up to the 4 drives over that size I had installed. Had some trouble needing to update the firmware there, but got it going and it was good enough for my needs. Had a drive fail a while back, so I decided to buy a lot of 10 4TB drives on the cheap, and start replacing the 2TB ones I have been using. I did some rearranging, removing the old Sun LSI HBAs and installing a second HP Port expander I had laying around. There is already one, as the drives are split between two 4U chassis. So, now the one HBA has two port expanders connected to it. one in each chassis, those connected to the drives. I had to keep one of the old HBAs installed with nothing connected to it because without it, the USB doesn't show up as one of the bootable drives in the BIOS. This all works as expected... until it randomly crashes. after 2 to 24 hours. I'm not sure if there is an issue with the drive I'm trying to clear, with some other drive, with my cables, Expanders, HBA, or maybe even my PSU is dropping out? (There is one 750W powering 15 HDD and the old supermicro opteron MB, and a second PSU in the other chassis powering the rest of the drives) I'm just not even sure where to start and with it being so random have no clue what's going on. Logs attached and hoping someone can point me in the right direction. Other than re routing the drives through an expander so I can use drives over 2TB the setup has been working fine for several years. jgl-diagnostics-20231230-0015.zip
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.