djonesax

Members
  • Posts

    53
  • Joined

  • Last visited

Converted

  • Gender
    Undisclosed

Recent Profile Visitors

The recent visitors block is disabled and is not being shown to other users.

djonesax's Achievements

Rookie

Rookie (2/14)

1

Reputation

  1. @JorgeB and @trurl Thanks, it would seem that way I agree and both Disk2 and Disk3 are on the controller. Parity and Disk1 are on the motherboard and they have had issues too. Maybe it was happenstance but for months it seemed to run better after I spilt the drives evenly across the two PS rails versus all on one rail. Also it seemed to run better after taking power away from the cache disk. The PS is at minimum 500W (could be 750W, don't remember and label is hidden) with 6 HDDs and a SSD. If I had a poor performing powersupply could that cause the controller to act up?
  2. Thanks, I understand but I am looking to retire this server and didnt want to replace the disk. I also have two unused disks in the array, so if I decide to keep it, I can recreate the array and really dont need this disk. That being said, I dont actually think there is anything wrong with the disks but rather a hardware issue somewhere causing IO errors making Unraid think that the disks are bad and disabling them. Also, disks randomly have problems or drop offline at times. Likes yesterday when I would stop the array Disk2 would go missing but a reboot would bring it back. Or times when I would do a directory listing on a large directory, the server would lock up for a bit, and then all my shares would be gone, that or a bunch of files would be missing, but a reboot would fix. For some reason I ran a file system check on disk3 which fixed 6 errors and ever since I've had no disk problems and my file copies are going fine but I am now getting increasing CRC error counts on disk1. I've tried new disks, cables, different ports, redistributing power to even out the rails. The issues are so weird that I'm thinking there is a powersupply or motherboard issue and I didn't feel like going down the debugging road, of replacing parts until I fix the issue, so I just bought a Synology and hoping for a less hands on solution. Unraid itself is great but the overall experience over 10+ years for me, perhaps mostly related to hardware, hasn't been the greatest. Unraid is awesome but in my opinion, it requires someone with sysadmin and good hardware knowledge to keep it running optimally and I'm a little burnt out on it. Sorry for the rant.
  3. I have been having all sorts of strange problems with my unraid setup off and on for years now. Seemingly good drives die, I get disk errors even though smart health and diagnostics show no issues. Sometimes disks will just go missing and after a reboot everything is fine. I suspect I have some sort of a power issue that is causing IO errors making unraid think disks are bad and marking them disabled or missing. Right now I have Disk 3 missing and emulated and I have tried to rebuild it with two other drives, both of which failed. I have a synology that I want to copy all my files to but even with the Disk3 emulated, entire contents of source folders disappear hours into the copy and then the copy fails. Occasionally the shares have all disappeared too but a reboot brings them back. Needless to say I have a mess of problems and I just want to get my files off while I still can. I am guessing I have some corruption in my filesystem and wondered if I could do I file system check with a drive down, or of that would permanently mess something up. I don't want to spend another $100 on a drive for this thing as the drives that are in it are new and failing and there are two disk in it that are empty. I just want to get my files off and start over without the current disk3 in the array and try a new power supply. Also, just now, I stopped the array and disk2 is now missing along with disk3, but I know if I reboot it will be back online and showing healthy. Ideas? tower-diagnostics-20220105-1950.zip Edit**** I put the array into maintenance mode, and ran a filesystem check on the emulated drive, which I didn't know I could do. It replayed 6 transactions, and now at least some of the missing files are back.
  4. Hi, I think I may need to replace my PSU. Recently I had issues with drives dying and I have been replacing them with new drives only to find that there was nothing wrong with the old drives. I replaced cables and tried different SATA ports but had the same issues. I redistributed my 7 drives evenly across two rails versus before having 6 HDDs on one, and the cache SSD on another. This seemed to fix the issue with failing drives. The other day I had a power failure and lost another drive, the smart status was fine so I rebuilt it and it is running fine again. I want to replace the PSU for good measure. Would a 750w be good for a system with 9 drives?
  5. I just ran another short and extended test and got this, so maybe the disk is bad. SMART Extended Self-test Log Version: 1 (1 sectors) Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Short offline Completed: read failure 90% 109 132496 SMART Extended Self-test Log Version: 1 (1 sectors) Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Extended offline Completed: read failure 90% 109 132496 # 2 Short offline Completed: read failure 90% 109 132496 # 3 Short offline Completed: read failure 90% 109 132496
  6. Recently I had what I think was a powersupply problem which was causing disks to randomly go offline. I moved the disks to a different power rail and it seems to almost fix the issue. I say that because disk 1 is showing as disabled and the contents are emulated. I downloaded the smart report and it shows it as passed but the disk still is disabled. I have attached the diagnostic log and the smart report. Any ideas how to reenable the disk? Thanks, David tower-diagnostics-20210617-1621.zip tower-smart-20210617-1214.zip
  7. I did put the new cable on another port after that and have not gotten any errors since. I only got one error on the new cable on the old port, before switching, where I had gotten 9,800 before Thanks for the tips.
  8. I replaced the cable a few hours ago and still getting the errors.
  9. Unraid has been complaining a lot about CRC errors on my Samsung Evo 850 500GB SSD cache drive. I have checked that the cables are connected and secure, already. I have been using this drive for at least a year with no issues until recently when I was in the system replacing a drive. I have attached the SMART report and the unraid diagnostics. Any ideas? tower-smart-20210426-1017.zip tower-diagnostics-20210426-1413.zip
  10. The erase failed, I guess I'll toss the drive, after sledge hammering it. Here is all the tool gave me for smart data.
  11. I couldn't get smart data when it was is in unraid, maybe because it was disabled. After I put it in my windows box and deleted the volume, I was able to give it a drive letter and see it in the disk utility. At that point I could get smart data but it just said there were no errors after running an extended test for almost a full day. Do you think its OK after doing a successful full overwrite, to put it back in the array and just monitor it for errors again?
  12. I used the WD Drive Dashboard utility on it and did an extended smart test that found no errors. I am now doing a full overwrite on the drive and if all that works, I think I will just pop it back into the unraid array and see how it does.
  13. I have a disk that unraid disabled because there were 2 sector errors. I replaced the drive and the new drive is rebuilding now. My question is, can I repair this drive some how? Is there a tool that I could use on it that find the bad sectors and mark them as bad or something?
  14. Got the cache disk installed and upgraded to the latest version. That old disk must have an un-parked head because I can hear it moving around inside.
  15. for sure.... Any ideas on a free windows 10 or OSX reiserfs app to that i can use with an external hard drive enclosure to at least see what was on the cache drive?