banterer Posted February 15, 2023 Author Share Posted February 15, 2023 So, checked all connections, moved the 2x problematic drives (3 and 4) to other bays vacated by removing unused drives, rebooted, (gui fine now), ran diags, attached. For info - all drives in the array are on a hotswap backplane, with 8 x SATA ports connected to the SAS card by 2x SAS->SATA 4 way splitters. I chose the card after a long conversation (well several) on this forum, as others had had success with them. tower-diagnostics-20230215-1623.zip Quote Link to comment
banterer Posted February 15, 2023 Author Share Posted February 15, 2023 Also, I need to urgently access my docker installs but my array won't start like this. How can I mount just the cache SSD, and 'point' docker at it? Quote Link to comment
trurl Posted February 15, 2023 Share Posted February 15, 2023 Can you see disk3 in BIOS? Quote Link to comment
banterer Posted February 15, 2023 Author Share Posted February 15, 2023 51 minutes ago, trurl said: Can you see disk3 in BIOS? It's headless, in the garage. I have a USB-> Sata I can use to try hooking it up to my Mac? Quote Link to comment
trurl Posted February 15, 2023 Share Posted February 15, 2023 If it didn't work on another system I guess maybe it doesn't work. If it did work on another system we would still have the same question. Quote Link to comment
banterer Posted February 15, 2023 Author Share Posted February 15, 2023 I guess if we can at least get disk4 up and running, my data should be safe? What should I do with that, and how can I at least mount my cache so I can access my dockers? Quote Link to comment
trurl Posted February 15, 2023 Share Posted February 15, 2023 You have to start the array to start the docker service. You can't start the array because you have a missing disk and a disabled disk, but only single parity. It would be possible to New Config/Rebuild parity without the missing disk, then the disabled disk would be accepted just as it is. But you would have no way to rebuild the missing disk. It is also possible to get Unraid to accept disk4 just as it is and rebuild disk3 instead. How well that would work depends on how out-of-sync things are. 17 hours ago, trurl said: parity rebuild had not finished when you were having problems with those 2 data disks, so not clear parity build would have been good Might even be best if both data disks were used in a New Config just as they are and try parity rebuild again. Assuming we can get disk3 to show up. Do you have backups of anything important and irreplaceable? Quote Link to comment
trurl Posted February 15, 2023 Share Posted February 15, 2023 13 minutes ago, banterer said: if we can at least get disk4 up and running, my data should be safe? According to those first diagnostics you posted, both disk3 and disk4 were 80% full. You have to have disk3 to rebuild disk4. Or, you have to have disk4 to rebuild disk3. Not sure how well rebuild would work in either case depending on how good parity was or how out-of-sync disk4 had gotten while it was disabled. Quote Link to comment
banterer Posted February 15, 2023 Author Share Posted February 15, 2023 21 minutes ago, trurl said: Do you have backups of anything important and irreplaceable? Yeah this is where I say 'no', and you tell me I'm stupid, right? The truth is a lot more complicated than that. EG right now I'm supposed to be doing stuff to something on the cloud, that this is the offsite backup for. I can't do the cloud stuff, because I can't risk it without a backup. And a dozen other complex things that would take too long to explain. What would you do, out of the two options (force accept disk4, and new config)? Quote Link to comment
banterer Posted February 15, 2023 Author Share Posted February 15, 2023 Update - I've connected 3 to my USB->SATA adapter, and it's not showing up on my Mac as a device (in disk util or /dev) so I'm guessing that's fried? Quote Link to comment
trurl Posted February 15, 2023 Share Posted February 15, 2023 1 hour ago, trurl said: backups of anything important and irreplaceable? I mention backups because, often, not everything is important and irreplaceable. So it might make sense to concentrate our efforts on things that are. Disk3 never gave a SMART report even on your first diagnostics, so couldn't really see how healthy it was. Was it an old disk? Or maybe a new disk that hadn't been tested? Did it have any data on it that it might be worth going to some extra trouble to try to recover? Quote Link to comment
trurl Posted February 15, 2023 Share Posted February 15, 2023 Do you have any spare 3 TB disks? Quote Link to comment
banterer Posted February 15, 2023 Author Share Posted February 15, 2023 1 hour ago, trurl said: Disk3 never gave a SMART report even on your first diagnostics, so couldn't really see how healthy it was. Was it an old disk? Or maybe a new disk that hadn't been tested? Did it have any data on it that it might be worth going to some extra trouble to try to recover? It was one of the older ones. But I don't really know which data was on it, as the folders are all split. Is that something I can find out somehow? Quote Link to comment
trurl Posted February 15, 2023 Share Posted February 15, 2023 4 minutes ago, banterer said: Is that something I can find out somehow? Only by elimination unless we can recover its data. We do have this much information from those first diagnostics when disk3 was apparently working well enough to read. appdata shareUseCache="prefer" # Share exists on cache, disk1, disk2 b-----s shareUseCache="no" # Share exists on disk1, disk2, disk3, disk4 c--v shareUseCache="yes" # Share exists on disk1, disk3, disk4 domains shareUseCache="prefer" # Share exists on cache d-------s shareUseCache="yes" # Share exists on disk1, disk2 isos shareUseCache="yes" # Share exists on disk1, disk2 kr shareUseCache="prefer" # Share does not exist m---a shareUseCache="no" # Share exists on disk1, disk2, disk3, disk4 system shareUseCache="prefer" # Share exists on disk1 v-s shareUseCache="prefer" # Share exists on disk1 Quote Link to comment
trurl Posted February 15, 2023 Share Posted February 15, 2023 At this point it doesn't look like we are going to rebuild disk4 so we will just have to accept its contents as they are. But we might see if the array will emulate disk3 well enough to try to recover some of disk3 data. 1 hour ago, trurl said: Do you have any spare 3 TB disks? Quote Link to comment
banterer Posted February 15, 2023 Author Share Posted February 15, 2023 Hmmm, well one is backups, which is only important if needed. The other ones are cctv (not important) and media (don't really want to lose, but not the end of the world). How can I force disk4 to do something? Quote Link to comment
trurl Posted February 15, 2023 Share Posted February 15, 2023 Just had a closer look at your most recent diagnostics. Disks 1, 2 are the only healthy disks you have in the array. That is going to make things very tricky indeed. We can't count on parity or disk4 even if their contents were good. At this point we would probably begin talking about cloning disk4 since you can't rebuild it. And we would probably give up on disk3 entirely. Why were you using all these bad disks? And looking at those first diagnostics, you were apparently ignoring this crucial warning from Fix Common Problems. Jan 29 17:18:01 Tower root: Fix Common Problems: Warning: No destination (browser / email / agents set for Warning level notifications You must setup Notifications to alert you immediately by email or other agent as soon as a problem is detected. Don't allow one problem to become multiple problems and data loss. Likely you had multiple problems to begin with. Notifications would have been screaming at you about all those bad disks. They also would have all had SMART warnings ( 👎 )on the Dashboard page. Probably parity and disk4 are showing those now if they are spunup. Quote Link to comment
trurl Posted February 15, 2023 Share Posted February 15, 2023 Parity actually looked OK in that first SMART report you posted so I guess it has gone bad since. Disk4 was already showing problems in those first diagnostics. Never got SMART report for disk3. Quote Link to comment
trurl Posted February 15, 2023 Share Posted February 15, 2023 35 minutes ago, banterer said: How can I force disk4 to do something? We can see if the array can emulate disk3 but I don't have much hope. We will have to have a spare 3TB disk that can play the role of disk 3 so we can disable it instead of disk4. Quote Link to comment
banterer Posted February 15, 2023 Author Share Posted February 15, 2023 2 minutes ago, trurl said: We can see if the array can emulate disk3 but I don't have much hope. We will have to have a spare 3TB disk that can play the role of disk 3 so we can disable it instead of disk4. Ok, so first step get another disk. I guess it can be bigger than 3TB, just at least 3TB, right? Can you list out the steps I should take? Quote Link to comment
trurl Posted February 15, 2023 Share Posted February 15, 2023 Just now, banterer said: get another disk It has to be at least as large as disk3 was, but it can't be any larger than parity. So 3TB is your only choice. Quote Link to comment
trurl Posted February 16, 2023 Share Posted February 16, 2023 8 minutes ago, banterer said: list out the steps Better if we take them a little at a time and see how things go. When do you expect to have a spare disk? Quote Link to comment
trurl Posted February 16, 2023 Share Posted February 16, 2023 7 minutes ago, trurl said: When do you expect to have a spare disk? Ultimately, you need to replace parity and disk4 also regardless of how disk3 comes out. But maybe wait and see before getting those extra disks in case it makes sense to get larger parity. Expect to work on this for a few days with no guarantees. Quote Link to comment
banterer Posted February 16, 2023 Author Share Posted February 16, 2023 Ok just ordering two more disks now. WD Red. Are the plus or pro worth going for? Quote Link to comment
trurl Posted February 16, 2023 Share Posted February 16, 2023 I'm going to summarize where we are and suggest ways to proceed in case anyone else wants to get involved. Multiple disk problems during parity rebuild. Since then, parity and disk4 have many pending sectors. Disks 1, 2 seem OK. Disk4 currently disabled, disk3 missing and presumed dead. I am thinking about trying to get the array to emulate disk3 instead of currently disabled disk4 using the usual trick of New Config/Trust parity to get all disks into the array including a spare disk3, then disable disk3. I would expect emulated disk3 to be unmountable at that point, and usually we would repair filesystem before rebuilding. But since we aren't rebuilding on top of the original, and the whole array is pretty shaky anyway, maybe do the repair after rebuild (if we can even get that far). Other things that might be considered is cloning parity and disk4 before doing anything else, but that would mean even more spare disks. I'm going to ping some of the usual suspects @JonathanM @itimpi and of course @JorgeB Way past bedtime in that part of the world so probably be some hours before any response 1 Quote Link to comment
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.