Matheew Posted August 18, 2022 Share Posted August 18, 2022 (edited) Hello unRAID Community! I've bought a brand new WD Red 12TB disk which I will use to replace my current 8TB parity drive. I've read a lot of different ways to do this but I decided to go for this method: Stop the array Install new drive. In main assign the new 12TB drive to parity slot 2 Start the array and allow parity to rebuild Stop the array In main unassign the old 8TB parity drive from parity slot 1 Start the array However, when I've assigned the new 12TB drive to parity slot 2 and started the array, things go south. The parity rebuild pretty much instantly pauses and unRAID gives me the error messages you can see in the attached file. The new 12TB drive reports errors and unRAID tells me that the new drive is in an error state. In order to get back to normal I: Cancel the paused parity build. Stop the array Remove the 12TB drive from the parity 2 slot. Start the array unRAID then tells me everything is fine: If I then repeat the process above the error occurs again in the same way, and that is where I am at right now. First thought is obviously a broken disk, but that doesn't seem too likely since the disk is brand new. So I ran an extended SMART-test on the disk yesterday and the disk reported no error at all, you can find the SMART report in the attached Diagnostics file. System: unRAID version 6.8.3 Logic 24-bay Hot Swap with IBM M1015 Thanks in advance! unraid-diagnostics-20220818-0848.zip Edited August 18, 2022 by Matheew Quote Link to comment
JorgeB Posted August 18, 2022 Share Posted August 18, 2022 Log is spammed with ssh related text, reboot to clear it add the disk to parity2 and post new diags after the problem occurs. Quote Link to comment
Kilrah Posted August 18, 2022 Share Posted August 18, 2022 (edited) Likely either the disk or the port / cable it's connected to has issues. Probably the latter since there are I/O errors in the log. 1 hour ago, Matheew said: doesn't seem too likely since the disk is brand new. "Infant mortality" is a thing on new devices, that's why it's typically recommended to leave new disks unassigned and run a preclear on them before adding them to the array just to exercise them and see if they're going to die outright or there are other communication-related issues. Edited August 18, 2022 by Kilrah Quote Link to comment
Matheew Posted August 18, 2022 Author Share Posted August 18, 2022 Thanks for the replies guys, see attached Diagnostics file for a syslog without SSH attempts. Also, I switched the HDD to another drive bay, no difference. unraid-diagnostics-20220818-1856.zip Quote Link to comment
JorgeB Posted August 19, 2022 Share Posted August 19, 2022 Update the LSI firmware to latest, if still issues try connecting that disk to the onboard SATA ports. Quote Link to comment
Matheew Posted August 21, 2022 Author Share Posted August 21, 2022 (edited) Thanks for the reply! My M1015 is now updated to firmware 20.00.07.00, the issue is still the same. I connected the HDD directly to the motherboard (obviously not a feasible long term solution) and the issue is gone. What conclusion can we draw from this? I would not rush to the conclusion that the M1015 is broken since I have four drives that have been working perfectly for more than a year. The difference here is that I've never before have connected a drive larger than 8TB to the HBA, I've googled but could not find anything that would indicate that there is a max HDD size for the card in question. Any thoughts? Edited August 21, 2022 by Matheew Quote Link to comment
JorgeB Posted August 21, 2022 Share Posted August 21, 2022 25 minutes ago, Matheew said: What conclusion can we draw from this? I would suspect some compatibility issue between the HBA and that disk model. Quote Link to comment
Frank1940 Posted August 21, 2022 Share Posted August 21, 2022 30 minutes ago, Matheew said: I connected the HDD directly to the motherboard (obviously not a feasible long term solution) ... Why not? Quote Link to comment
Matheew Posted August 21, 2022 Author Share Posted August 21, 2022 12 minutes ago, JorgeB said: I would suspect some compatibility issue between the HBA and that disk model. Hmm, it is a WD Red, so no obscure brand or anything. 6 minutes ago, Frank1940 said: Why not? A number of reasons, but the primary one being the fact that my chassi is using SAS backplanes to install disks, which is how I want it for expansion possibilities. There is no way to mount a disk permanently and connect it directly to the MB. Quote Link to comment
Frank1940 Posted August 21, 2022 Share Posted August 21, 2022 2 hours ago, Matheew said: my chassi is using SAS backplanes I seem to recall that occasionally backplanes have gone bad. (I have one of the nine hot swap backplanes in my Media server where an installed disk throws CRC errors. Now, I have not trouble shot it to determine if it is the cable or the backplane or the controller lane as I don't need it at this point...) Quote Link to comment
Matheew Posted August 21, 2022 Author Share Posted August 21, 2022 2 hours ago, Frank1940 said: I seem to recall that occasionally backplanes have gone bad. (I have one of the nine hot swap backplanes in my Media server where an installed disk throws CRC errors. Now, I have not trouble shot it to determine if it is the cable or the backplane or the controller lane as I don't need it at this point...) Well... I currently have two out of six SAS backplanes connected to the M1015 and disks connected to each of them without issue, and moving the new HDD from one backplane to another does not fix the issue. The probability that both backplanes are broken but none of the currently installed disks are affected seems very unlikely. Quote Link to comment
JorgeB Posted August 22, 2022 Share Posted August 22, 2022 20 hours ago, Matheew said: Hmm, it is a WD Red, so no obscure brand or anything. Yes, but it's a 12TB drive, not so widely used as lower capacities, it's still my best guess. Quote Link to comment
Solution Matheew Posted September 4, 2022 Author Solution Share Posted September 4, 2022 FYI - upgrading the BIOS on the HBA from 07.29.00.00 to 07.39.02.00 solved the issue. 2 Quote Link to comment
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.