djonesax

Members
  • Posts

    53
  • Joined

  • Last visited

Everything posted by djonesax

  1. @JorgeB and @trurl Thanks, it would seem that way I agree and both Disk2 and Disk3 are on the controller. Parity and Disk1 are on the motherboard and they have had issues too. Maybe it was happenstance but for months it seemed to run better after I spilt the drives evenly across the two PS rails versus all on one rail. Also it seemed to run better after taking power away from the cache disk. The PS is at minimum 500W (could be 750W, don't remember and label is hidden) with 6 HDDs and a SSD. If I had a poor performing powersupply could that cause the controller to act up?
  2. Thanks, I understand but I am looking to retire this server and didnt want to replace the disk. I also have two unused disks in the array, so if I decide to keep it, I can recreate the array and really dont need this disk. That being said, I dont actually think there is anything wrong with the disks but rather a hardware issue somewhere causing IO errors making Unraid think that the disks are bad and disabling them. Also, disks randomly have problems or drop offline at times. Likes yesterday when I would stop the array Disk2 would go missing but a reboot would bring it back. Or times when I would do a directory listing on a large directory, the server would lock up for a bit, and then all my shares would be gone, that or a bunch of files would be missing, but a reboot would fix. For some reason I ran a file system check on disk3 which fixed 6 errors and ever since I've had no disk problems and my file copies are going fine but I am now getting increasing CRC error counts on disk1. I've tried new disks, cables, different ports, redistributing power to even out the rails. The issues are so weird that I'm thinking there is a powersupply or motherboard issue and I didn't feel like going down the debugging road, of replacing parts until I fix the issue, so I just bought a Synology and hoping for a less hands on solution. Unraid itself is great but the overall experience over 10+ years for me, perhaps mostly related to hardware, hasn't been the greatest. Unraid is awesome but in my opinion, it requires someone with sysadmin and good hardware knowledge to keep it running optimally and I'm a little burnt out on it. Sorry for the rant.
  3. I have been having all sorts of strange problems with my unraid setup off and on for years now. Seemingly good drives die, I get disk errors even though smart health and diagnostics show no issues. Sometimes disks will just go missing and after a reboot everything is fine. I suspect I have some sort of a power issue that is causing IO errors making unraid think disks are bad and marking them disabled or missing. Right now I have Disk 3 missing and emulated and I have tried to rebuild it with two other drives, both of which failed. I have a synology that I want to copy all my files to but even with the Disk3 emulated, entire contents of source folders disappear hours into the copy and then the copy fails. Occasionally the shares have all disappeared too but a reboot brings them back. Needless to say I have a mess of problems and I just want to get my files off while I still can. I am guessing I have some corruption in my filesystem and wondered if I could do I file system check with a drive down, or of that would permanently mess something up. I don't want to spend another $100 on a drive for this thing as the drives that are in it are new and failing and there are two disk in it that are empty. I just want to get my files off and start over without the current disk3 in the array and try a new power supply. Also, just now, I stopped the array and disk2 is now missing along with disk3, but I know if I reboot it will be back online and showing healthy. Ideas? tower-diagnostics-20220105-1950.zip Edit**** I put the array into maintenance mode, and ran a filesystem check on the emulated drive, which I didn't know I could do. It replayed 6 transactions, and now at least some of the missing files are back.
  4. Hi, I think I may need to replace my PSU. Recently I had issues with drives dying and I have been replacing them with new drives only to find that there was nothing wrong with the old drives. I replaced cables and tried different SATA ports but had the same issues. I redistributed my 7 drives evenly across two rails versus before having 6 HDDs on one, and the cache SSD on another. This seemed to fix the issue with failing drives. The other day I had a power failure and lost another drive, the smart status was fine so I rebuilt it and it is running fine again. I want to replace the PSU for good measure. Would a 750w be good for a system with 9 drives?
  5. I just ran another short and extended test and got this, so maybe the disk is bad. SMART Extended Self-test Log Version: 1 (1 sectors) Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Short offline Completed: read failure 90% 109 132496 SMART Extended Self-test Log Version: 1 (1 sectors) Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Extended offline Completed: read failure 90% 109 132496 # 2 Short offline Completed: read failure 90% 109 132496 # 3 Short offline Completed: read failure 90% 109 132496
  6. Recently I had what I think was a powersupply problem which was causing disks to randomly go offline. I moved the disks to a different power rail and it seems to almost fix the issue. I say that because disk 1 is showing as disabled and the contents are emulated. I downloaded the smart report and it shows it as passed but the disk still is disabled. I have attached the diagnostic log and the smart report. Any ideas how to reenable the disk? Thanks, David tower-diagnostics-20210617-1621.zip tower-smart-20210617-1214.zip
  7. I did put the new cable on another port after that and have not gotten any errors since. I only got one error on the new cable on the old port, before switching, where I had gotten 9,800 before Thanks for the tips.
  8. I replaced the cable a few hours ago and still getting the errors.
  9. Unraid has been complaining a lot about CRC errors on my Samsung Evo 850 500GB SSD cache drive. I have checked that the cables are connected and secure, already. I have been using this drive for at least a year with no issues until recently when I was in the system replacing a drive. I have attached the SMART report and the unraid diagnostics. Any ideas? tower-smart-20210426-1017.zip tower-diagnostics-20210426-1413.zip
  10. The erase failed, I guess I'll toss the drive, after sledge hammering it. Here is all the tool gave me for smart data.
  11. I couldn't get smart data when it was is in unraid, maybe because it was disabled. After I put it in my windows box and deleted the volume, I was able to give it a drive letter and see it in the disk utility. At that point I could get smart data but it just said there were no errors after running an extended test for almost a full day. Do you think its OK after doing a successful full overwrite, to put it back in the array and just monitor it for errors again?
  12. I used the WD Drive Dashboard utility on it and did an extended smart test that found no errors. I am now doing a full overwrite on the drive and if all that works, I think I will just pop it back into the unraid array and see how it does.
  13. I have a disk that unraid disabled because there were 2 sector errors. I replaced the drive and the new drive is rebuilding now. My question is, can I repair this drive some how? Is there a tool that I could use on it that find the bad sectors and mark them as bad or something?
  14. Got the cache disk installed and upgraded to the latest version. That old disk must have an un-parked head because I can hear it moving around inside.
  15. for sure.... Any ideas on a free windows 10 or OSX reiserfs app to that i can use with an external hard drive enclosure to at least see what was on the cache drive?
  16. I have 500GB SSD sitting on my desk for this very purpose.
  17. The bad thing is now the cache drive not mountable and Ive been copying stuff off an old lap top all day and deleting old files. I might have gotten lucky because it looks like those other shares dont have the cashe drive enabled.
  18. So I removed the cache disk from the share, stopped and started the array and now I have my movies in the user share.
  19. I get this when i check the filesystem on the cache disk reiserfsck 3.6.27 Will read-only check consistency of the filesystem on /dev/sdb1 Will put log info to 'stdout' The problem has occurred looks like a hardware problem. If you have bad blocks, we advise you to get a new hard drive, because once you get one bad block that the disk drive internals cannot hide from your sight,the chances of getting more are generally said to become much higher (precise statistics are unknown to us), and this disk drive is probably not expensive enough for you to you to risk your time and data on it. If you don't want to follow that follow that advice then if you have just a few bad blocks, try writing to the bad blocks and see if the drive remaps the bad blocks (that means it takes a block it has in reserve and allocates it for use for of that block number). If it cannot remap the block, use badblock option (-B) with reiserfs utils to handle this block correctly. bread: Cannot read the block (2): (Input/output error).
  20. The cahe drive is online, and it being used right now for a separate file transfer to a different folder. The folder permissions seem to be screwed up for the "HighResMovies" folder in the "Movies" folder but I get access denied when I try to fix it with cmod and chown. root@Tower:~# ls -lath /mnt/cache/Movies/ /bin/ls: cannot access '/mnt/cache/Movies/HighResMovies': Permission denied total 0 drwxrwx--- 9 nobody users 200 Jan 6 2018 ../ drwxrwx--- 3 nobody users 80 Oct 11 2014 ./ ?????????? ? ? ? ? ? HighResMovies
  21. I turned the cache disk off for that share but I still get access denied.
  22. I am with you, i was just looking at that. Something has gone awry on that folder for sure. root@Tower:~# ls -lath /mnt/cache/Movies/ /bin/ls: cannot access '/mnt/cache/Movies/HighResMovies': Permission denied total 0 drwxrwx--- 9 nobody users 200 Jan 6 2018 ../ drwxrwx--- 3 nobody users 80 Oct 11 2014 ./ ?????????? ? ? ? ? ? HighResMovies root@Tower:~# chown nobody /mnt/cache/Movies/HighResMovies chown: cannot access '/mnt/cache/Movies/HighResMovies': Permission denied
  23. Its the one ending in "s" The other is "mackbook" and its exported with AFP for timemachine backups.
  24. So I found that I dont have access to the mount but the permissions look the same as the other folders. root@Tower:~# ls -lath /mnt/user/Movies/ /bin/ls: reading directory '/mnt/user/Movies/': Permission denied total 0 root@Tower:~# ls -lath /mnt/user/ total 28K drwxrwx--- 1 nobody users 216 Oct 23 13:57 Backups/ drwxrwx--- 1 nobody users 312 Oct 22 13:05 Photos/ drwxrwx--- 1 nobody users 144 Apr 30 2018 BitTorrent\ Downloads/ drwxrwx--- 1 nobody users 200 Apr 29 2018 ./ drwxrwxrwx 1 nobody users 48 Apr 29 2018 isos/ drwxrwxrwx 1 nobody users 48 Jan 6 2018 domains/ drwxrwxrwx 1 nobody users 72 Jan 6 2018 system/ drwxrwx--- 1 nobody users 592 Oct 12 2017 Software/ drwxrwxrwx 1 nobody users 48 Jan 31 2017 SFTP/ drwxrwx--- 1 nobody users 72 Mar 22 2015 TV\ Shows/ drwxrwx--- 1 nobody users 72 Mar 22 2015 Music/ drwxrwx--- 1 nobody users 80 Oct 11 2014 Movies/ drwxrwx--- 1 nobody users 328 Apr 30 2014 macbook/ -rw-rw---- 1 nobody users 6.1K Mar 31 2013 .DS_Store -rw-rw---- 1 nobody users 4.0K Mar 31 2013 ._.DS_Store drwxrwx--- 1 nobody users 72 Feb 28 2013 EBooks/ drwxrwx--- 1 nobody users 112 Sep 8 2012 Protected/ drwxrwx--- 1 nobody users 16K Sep 3 2012 Music\ Videos/ drwxr-xr-x 8 root root 160 Nov 19 2011 ../ But I can access the disk shares where the Movies are. root@Tower:~# ls -lath /mnt/disk1/Movies/ total 34K drwxrwx--- 13 nobody users 392 Jan 6 2018 ../ drwxrwx--- 304 nobody users 15K Dec 23 2014 HighResMovies/ drwxrwx--- 6 nobody users 168 Oct 11 2014 ./ drwxrwx--- 415 nobody users 20K Mar 5 2013 LowResMovies/ drwxrwx--- 8 nobody users 384 Dec 24 2012 3D/ drwxrwx--- 5 nobody users 168 Sep 3 2012 VideoCamera/ root@Tower:~# ls -lath /mnt/disk2/Movies/ total 4.0K drwxrwx--- 12 nobody users 288 Mar 22 2015 ../ drwxrwx--- 80 nobody users 3.7K Jun 18 2014 HighResMovies/ drwxrwx--- 16 nobody users 944 Mar 5 2013 LowResMovies/ drwxrwx--- 5 nobody users 136 Mar 5 2013 ./ drwxrwx--- 3 nobody users 88 Dec 24 2012 3D/ root@Tower:~# ls -lath /mnt/disk3/Movies/ total 16K -rw-rw---- 1 nobody users 6.1K Oct 23 13:59 .DS_Store drwxrwx--- 15 nobody users 360 Apr 29 2018 ../ drwxrwx--- 87 nobody users 4.2K Jun 18 2014 HighResMovies/ drwxrwx--- 3 nobody users 80 Mar 30 2014 Training/ drwxrwx--- 4 nobody users 168 Mar 30 2014 ./ -rw-rw---- 1 nobody users 4.0K Sep 29 2013 ._.DS_Store
  25. Thanks, I added it to the original post. Anything else helpful I can add?