abhi.ko

Members
  • Posts

    318
  • Joined

  • Last visited

About abhi.ko

  • Birthday 11/13/1978

Converted

  • Gender
    Male
  • Location
    ALLEN, TX

Recent Profile Visitors

The recent visitors block is disabled and is not being shown to other users.

abhi.ko's Achievements

Contributor

Contributor (5/14)

2

Reputation

  1. A BIG thank you to everyone who helped. The issue is fixed now, I wanted to wait a month to see if anything got messed up again. Luckily no issues so far and hence I hope this is resolved and I will mark it as closed. The log filling up issue stopped happening after another reboot and the system is running much cooler, I was going to replace the existing CPU cooler and with something more beefy but did not need to. Even while the log was at 100% the system was running pretty solid, so decided to wait and see, and no issues after that reboot. I upgraded to rc3 and still everything to the best of my knowledge is running smoothly. Thanks guys again for the assist and the education.
  2. @Marzel I am having the same issue? Did you fix it and get the container started? If you did, could you please share what worked for you?
  3. hmm - interesting. This is new, the case is pretty clean and I got enough fans running inside, wonder why. Are these temp too hot? Can I adjust the threshold?
  4. Just posting an update and asking for help on another issue - so disk 19 got rebuilt to a new drive without any errors, and all the lost and found items were moved to right directories, everything looks good. Thanks @JorgeB @itimpi @trurl I got the additional cables and now all the backplanes are connected 1 tray to one molex connector on the PSU, so 4 drives to one connector, thanks @Michael_P @Vr2Io Currently rebuilding Disk 12 on to the old Disk 19 drive, but my log is filling up with call trace errors. Not sure why, latest diags posted, could someone take a look and tell me what seems to be the issue. No other consequences noticed yet, because of the error, other than the logs filling up. tower-diagnostics-20220217-2159.zip
  5. Thank you. I think I'm going to wait till this disk finishes rebuild. I am not seeing any other issues in the array (outside of the log) as I did before. Like disks with reallocated sector counts going up etc. Will rename the top folders in lost+found in the disk after it is done and then restart the array before I start rebuilding disk 12. That should clear the log file and will monitor it closely after that. Should I rebuild disk 12 onto itself, since the file system check did not create a lost+found in that disk, I am assuming that the disk itself is okay and it was the voltage fluctuations causing the issues with the disk dropping off? Is that a good plan? Or should I rebuild it into the old disk from disk 19 slot?
  6. Thank you @itimpi and @trurl. I started rebuilding the data disk (disk 19) with the new drive yesterday (2/15), and everything is going well except for the fact that I noticed today morning, the logs are getting filled up. it looks like it is mostly filled with error messages from 2/14 when the array was not even started but eh preclear was running. From what it looks to me, it seems like a backup trying to access a share that was not live during that time, the client IP in the error message below is that of my main workstation, and I have that being backed up using Paragon Backup to the server, so I am thinking that is what it is about. I might be wrong. There are some other call trace errors in there as well and I am not sure if there is still a hardware issue that is causing errors in the log since the log is not being written to now, hence this post here to seek some expert help. Feb 14 02:32:51 Tower nginx: 2022/02/14 02:32:51 [error] 11687#11687: *323602 limiting requests, excess: 20.842 by zone "authlimit", client: 10.0.0.232, server: , request: "PROPFIND /login HTTP/1.1", host: "tower" Feb 15 21:12:58 Tower kernel: Call Trace: I have posted the diags here not sure if that is helpful, since the logs have been full and not written to since yesterday night it looks like. Is there any way to clear the log file without restarting in the middle of the rebuild? Should I even try to? tower-diagnostics-20220216-0643.zip
  7. Got it. Was probably a brain fart moment, not sure what I was thinking that the lost+found was not written to parity somehow. Clear now. Oh okay, that probably was what I was thinking about.
  8. Ah...thanks for clarifying that. I was incorrectly assuming that the rebuild was going to restore the disk as it should be and a physical move of the lost+found folders won't be necessary. Yes, but would it save time from skipping preclear and starting to rebuild directly? I always understood that Unraid does its own version of stress testing on a new drive, which is not precleared, even though I have never added one without pre-clearing ever.
  9. Thank you both. So I plan to do this for now: Shutdown server Add the new 10TB drive Preclear the 10TB drive Once done, unassign disk 19 from current drive and assign new precleared drive instead Start array and rebuild Does that sound good? Or should I rebuild disk 12 to the new one? Can I re-assign without preclearing and start rebuilding immediately?
  10. Thank you. I don't have a spare hot-swap drive available in the case currently, and I would rather not wait another day or two to pre-clear the drive and then rebuild onto that with 2 disabled disks, unless you think that is the way to go. Just curious, what do we do with the lost+found on Disk 19?
  11. Thanks for reading through. To answer your question, YES, I did switch to a higher single rail amperage rated power supply (Seasonic GS-1300W) and have distributed load as efficiently as currently possible. Old wiring was all of the backplanes being connected to one molex cable. Current scenario is distributed between 2 cables back to the PSU. One cable has 4 backplanes (15 disks, 14 HDD and one SSD (cache)) and another has 2 (8 disks) , since I only had 2 cables that came with the PSU. Future (ideal) scenario is 1:1 to connections from the backplane to the PSU. Have requested more molex cables from Seasonic since I don't want to try and mix cables even from my other Seasonic PSU's, and will be re-wiring as soon as I have them.
  12. Ok, so just clarify next steps: For Disk 19: Do I just rename the numbered directories to the original share names? How do I change the disk from the current disabled status? For Disk 12: what do I do?
  13. Sorry, I honestly don't have a before frame of reference disk-wise to compare against, to know if it is actually missing anything. I took it to mean the same (i.e. repair was successful). Is there a chance that there is a still a chance of corruption? Numbered folders within lost+found but recognizable sub folders within those.
  14. Not a precleared one. But I do have a 10TB disk that is not in the case or array.
  15. Both mounted fine - still disabled and emulated. Disk 12 has no lost+found Disk 19 only has lost+found Diagnostics & screenshots below. What now please? Thanks for all the help till now. tower-diagnostics-20220213-1131.zip