markswam

Members
  • Posts

    38
  • Joined

  • Last visited

About markswam

  • Birthday November 2

Converted

  • Gender
    Male

markswam's Achievements

Noob

Noob (1/14)

0

Reputation

  1. Update again: I managed to get to the web UI in a normal boot, so I should be able to pull down the data that I need. So at least that crisis has been averted. Please forgive me, I’m kind of a stressful person.
  2. Further update: I cannot use Unassigned Devices, because the web UI is only accessible in safe mode.
  3. Update: The web UI is accessible again, but now it's showing both Disk 5 and Disk 6 as unmountable with no filesystem. Hooray, I guess. Time to mount Disk 6 with Unassigned Devices, pull data down off of it, and then format both of those disks and start re-uploading stuff. Thankfully I had already pulled all of the data down off Disk 5...
  4. Well, I may have royally screwed up. I rebooted the machine again after checking to see what disks were in what slots, and now I can't access it via the web anymore. I can log in fine locally, but it only shows my cache and disks 1-4 mounted. I discovered that Disks 1-5 and my cache are all plugged into my motherboard, while my parity drive and Disk 6 are plugged into the external SATA card. So yes, 1,2, and 5 should all be on the same controller.
  5. sigh dammit, I rebooted after I pulled the diagnostics so I could re-seat my SATA card. Guess it's time to wait for this failure to happen again...
  6. Well, it's happened again. Here is the diagnostic zip as requested. Hopefully this can help finally figure out why this keeps happening. I literally didn't even get done moving data back onto the disk this time. tower-diagnostics-20200721-1018.zip
  7. Oh, I probably should have mentioned that earlier. I upgraded to the 6.8.3 Nvidia build after the recovery back in May, so that's up to date at least. Part of me thinks this might be the fault of the PCI-E SATA card I've got in my system (I know my Cache is plugged into that. Not sure which other drive is plugged into it, but I know there's two) since my motherboard doesn't have enough SATA ports to support the 8 drives I've got in the system. Maybe I should ditch that card entirely and just buy a new motherboard with enough SATA ports...
  8. I unfortunately don't have a diagnostic from the more recent incidents, but I did post a diagnostic the first time this failure occurred. Would that be helpful in any way, since it's likely the same failure mode?
  9. Well, it's somehow managed to happen two more times, to the same drive. And I have no idea how, since both times literally nothing got written to that drive in between recoveries. Everything was being written to the next disk in the array (disk 6), and it just suddenly failed, and disks 1, 2, and 5 all showed hundreds of thousands of errors in the "Main" tab of the GUI (but neither 1 nor 2 show up as "unmountable: no file system"). Last week, every single drive passed an extended SMART test with absolutely no problems reported, so I don't think there's a physical issue with any of the drives. I've also swapped out all of the SATA cables with brand-new ones in the last month. At this point, I'm not even losing any data, I'm just getting annoyed. Would it be prudent to go through every drive in the array, move the data off, reformat, and move the data back onto all of them? I'm completely out of ideas otherwise, and just want to stop this monthly ritual of pulling all my shit off a "corrupt" drive and reformatting it. Also: This only ever seems to happen when I invoke the mover while I have a large number of small files (think a couple thousand video frames extracted as photos by ffmpeg) on the cache drive. I have no idea why that would cause an issue, but it seems to be the only constant.
  10. Right on, thank you. I'll try doing that in the morning.
  11. Hey, me again. Sorry if I'm being irritating. I've just got one more question. I've managed to pull all of the data off of the corrupted disk and onto my local machine, so I just want to make sure I understand my next steps before continuing. Would the best order of operations be: Mount the corrupted disk using UD and delete all the data off of it Leave the corrupted disk unassigned Create a New Config (obviously being careful to get the parity drive assignment right) Allow the parity rebuild to complete Stop the array Assign the corrupted disk to its new slot Reformat the corrupted disk Start the array and proceed as normal Or am I getting that wrong/have extraneous/missed steps?
  12. So...yes, then? Just upload all data to a freshly formatted disk? I just want to make absolutely sure I understand what you're saying so I don't wind up doing something dumb.
  13. Oh, so just not relying on parity at all? Just starting from scratch (whether that be reformatting the current drive or just installing a fresh one) and uploading the data to the drive?
  14. Would it be best just to physically remove Disk 5, swap in an empty spare and let it rebuild from that? Or should I try again with the same physical disk? Also, would it be a good idea to try swapping what PCI slot I have my SATA controller plugged into, just in case that slot has suddenly gone wonky?