rmp5s Posted February 6, 2021 Share Posted February 6, 2021 Had a drive die today. Has the red x next to it. Not sure what the best path forward is... What I'm thinking is this: I took it out of the array. Then I can let the parity rebuild while I get a new drive (that one wasn't even very old!!) or fix that one or whatever. Then I can put the new one in/put this one back in and let the parity rebuild again. One concern I have is that it's an array of four 8TB drives with an 8TB parity...rebuilding parity will probably take a looooooooooong time... Thoughts? What are "best practices" for when a drive kicks the bucket in unRAID? Thanks! Quote Link to comment
JonathanM Posted February 6, 2021 Share Posted February 6, 2021 So the drive was empty? Quote Link to comment
rmp5s Posted February 6, 2021 Author Share Posted February 6, 2021 Just now, jonathanm said: So the drive was empty? Yup. Quote Link to comment
rmp5s Posted February 6, 2021 Author Share Posted February 6, 2021 Even if it wasn't though...wouldn't letting the parity rebuild take care of that when I took the drive out? Quote Link to comment
JonathanM Posted February 6, 2021 Share Posted February 6, 2021 Just now, rmp5s said: Even if it wasn't though...wouldn't letting the parity rebuild take care of that when I took the drive out? Parity will rebuild whatever is emulated in the missing disk slot onto a drive you put back in to the slot. If you rebuild parity without that slot, all data that was there will be gone. Quote Link to comment
rmp5s Posted February 6, 2021 Author Share Posted February 6, 2021 Interesting. So, basically, if you have a drive die, you better have a spare on hand to swap in ASAP? Quote Link to comment
Hoopster Posted February 6, 2021 Share Posted February 6, 2021 3 minutes ago, rmp5s said: Interesting. So, basically, if you have a drive die, you better have a spare on hand to swap in ASAP? I have a spare drive for each of my servers sitting on a shelf ready to go when needed. They have been precleared to ensure the drives were good. Do you have a SMART test on the disk that failed? Do you know why it failed? I have had one disk failure in 9 years running unRAID on two servers. That "failure" was actually due to a cabling problem that resulted in so many errors in a very short time that the drive was marked unable to write with a red X. The drive was actually good and I rebuilt it onto itself once I fixed the cabling/SATA port issue that led to the drive "failing." You may actually have a bad drive, but if you have some SMART reports for it you might be able to determine what went wrong. My parity/rebuild process takes about 16.5 hours with 8TB parity and data drives. Quote Link to comment
JonathanM Posted February 6, 2021 Share Posted February 6, 2021 9 minutes ago, rmp5s said: Interesting. So, basically, if you have a drive die, you better have a spare on hand to swap in ASAP? Yes. If you only have 1 parity drive, you can only have 1 missing drive. Any further failures will result in complete data loss from all missing data drives, including the first failure. Quote Link to comment
rmp5s Posted February 6, 2021 Author Share Posted February 6, 2021 (edited) 12 minutes ago, Hoopster said: Do you have a SMART test on the disk that failed? Do you know why it failed? You may actually have a bad drive, but if you have some SMART reports for it you might be able to determine what went wrong. I was trying to get some info out of the thing but haven't been able to yet. I left the drive in there and just unassigned it so I can run some tests and stuff on it. I'll post my findings. 12 minutes ago, jonathanm said: Yes. If you only have 1 parity drive, you can only have 1 missing drive. Any further failures will result in complete data loss from all missing data drives, including the first failure. Right, I knew I only had 1 drive that could die. I just thought you could lose a drive, move the data onto the remaining drives, then add another drive later. Dunno if the "NAS drives" are worth it over regular drives, but I've got a new 8TB IronWolf NAS drive on the way. It was only 50 bucks more than the one it's replacing so hey...why not...lol Edited February 6, 2021 by rmp5s Quote Link to comment
JonathanM Posted February 6, 2021 Share Posted February 6, 2021 1 minute ago, rmp5s said: I just thought you could lose a drive, move the data onto the remaining drives, then add another drive later. Sure, but it's a matter of manually moving the data off the emulated missing drive to another destination. Quote Link to comment
rmp5s Posted February 6, 2021 Author Share Posted February 6, 2021 1 minute ago, jonathanm said: Sure, but it's a matter of manually moving the data off the emulated missing drive to another destination. If the drive is dead, how are you supposed to move the data off of it? Quote Link to comment
rmp5s Posted February 6, 2021 Author Share Posted February 6, 2021 18 minutes ago, Hoopster said: You may actually have a bad drive, but if you have some SMART reports for it you might be able to determine what went wrong. I kinda think it's dead...lol I can't even get Unassigned Devices to mount it...I click mount and the screen just refreshes real quick. That's it. I'll take it out and plug it into my desktop and have a look around. If it's not dead dead, I'll format it and put it back in but set it as excluded for a while...see if it dies again. I don't trust it! lol Quote Link to comment
JonathanM Posted February 6, 2021 Share Posted February 6, 2021 52 minutes ago, rmp5s said: If the drive is dead, how are you supposed to move the data off of it? When a drive fails, if parity is accurate at the point it drops, then the only indication will be the red x and the notifications. The data slot should still contain all the data as emulated by parity. Navigating to /mnt/diskX will still work, and all the files on the drive will be there and readable. The disk itself isn't being accessed, the data is being generated by all the other drives that are still working. 1 Quote Link to comment
JonathanM Posted February 6, 2021 Share Posted February 6, 2021 51 minutes ago, rmp5s said: I'll format it and put it back in Formatting while the drive is outside the array won't do anything, whatever is being emulated by the rest of the disks will be written back to the drive when you put it back in. Quote Link to comment
rmp5s Posted February 6, 2021 Author Share Posted February 6, 2021 3 minutes ago, jonathanm said: When a drive fails, if parity is accurate at the point it drops, then the only indication will be the red x and the notifications. The data slot should still contain all the data as emulated by parity. Navigating to /mnt/diskX will still work, and all the files on the drive will be there and readable. The disk itself isn't being accessed, the data is being generated by all the other drives that are still working. Ah, I see what you mean. That's pretty cool, if somewhat convoluted. 2 minutes ago, jonathanm said: Formatting while the drive is outside the array won't do anything, whatever is being emulated by the rest of the disks will be written back to the drive when you put it back in. Right, and that would be fine. There was nothing on it anyway. How can I "reset" the thing then? So I could put it back in and not have a red x anymore? I'd like to do so as a test just to see if the thing is really toast. Quote Link to comment
JonathanM Posted February 6, 2021 Share Posted February 6, 2021 https://wiki.unraid.net/Troubleshooting#Re-enable_the_drive 1 Quote Link to comment
trurl Posted February 6, 2021 Share Posted February 6, 2021 You have to rebuild it, or you have to New Config without it and rebuild parity instead. Quote Link to comment
rmp5s Posted February 6, 2021 Author Share Posted February 6, 2021 Wow...once you give a drive to unRAID, it really goes "MINE!!!" Hahaha Quote Link to comment
trurl Posted February 6, 2021 Share Posted February 6, 2021 4 minutes ago, rmp5s said: I'd like to do so as a test just to see if the thing is really toast. Wow, 17 posts before anyone asks for Diagnostics. Go to Tools - Diagnostics and attach the complete Diagnostics ZIP file to your NEXT post in this thread. Quote Link to comment
rmp5s Posted February 6, 2021 Author Share Posted February 6, 2021 I clicked both of the SMART self-test buttons for that drive when it was part of the array and neither did anything...the log and history and all that didn't do anything other than tell me to use a command and that command...well...I didn't really feel like going down that rabbit hole at the time... Are the logs stored somewhere? It runs these tests automatically periodically, doesn't it? Quote Link to comment
rmp5s Posted February 6, 2021 Author Share Posted February 6, 2021 (edited) 5 minutes ago, trurl said: Wow, 17 posts before anyone asks for Diagnostics. Go to Tools - Diagnostics and attach the complete Diagnostics ZIP file to your NEXT post in this thread. Guess that's where the SMART logs would be...lol diagnostics-20210206-1656.zip Edited February 6, 2021 by rmp5s Quote Link to comment
rmp5s Posted February 6, 2021 Author Share Posted February 6, 2021 Aww...the diagnostics aren't going to work. Disk3 is gone...I've been messing with it. Shoulda pulled these as soon as I noticed an issue. I'll remember that in the future... Quote Link to comment
rmp5s Posted February 7, 2021 Author Share Posted February 7, 2021 All the files in the "smart" folder show the same stupid thing I was seeing earlier, though. It just says to run a command. That normal? Quote Link to comment
trurl Posted February 7, 2021 Share Posted February 7, 2021 Are these SAS disks? Quote Link to comment
rmp5s Posted February 7, 2021 Author Share Posted February 7, 2021 1 hour ago, trurl said: Are these SAS disks? 3.5" SATA Quote Link to comment
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.