February 2, 2019 Hey guys, my disk1 just went into an error state and I cannot access most of the shares anymore. Diagnostics are attached. What can I do now? knowlage-diagnostics-20190202-2356.zip Edited February 3, 2019 by Jaster
February 2, 2019 I don't see any of your hard drives in that. Check the power cables. Reseat the controller card. Etc.
February 2, 2019 Author I can't stop the array. It seems to hang during "unmounting disks". Shall I just power down? PS: The syslog shows about 60k errors. Edited February 2, 2019 by Jaster
February 2, 2019 9 minutes ago, Jaster said: Shall I just power down? Yes. P.S. I saw your syslog in those diagnostics.
February 2, 2019 Author I already had some issues with D1. Reboots and checks did not change anything. I shut down the array and replaced the disk. It came up fine and is rebuilding now... hope it lasts. I attached the most recent diagnostics. knowlage-diagnostics-20190202-2356.zip Edited February 2, 2019 by Jaster
February 2, 2019 The fact that all of them went missing makes me skeptical there is anything wrong with the original disk1. But rebuilding to a new disk is fine and even gives you another option, since that original disk can probably be read if there is any problem with the rebuild. Did you do anything other than replace the disk, such as? 37 minutes ago, trurl said: Check the power cables. Reseat the controller card. Etc.
February 2, 2019 Author 2 minutes ago, trurl said: The fact that all of them went missing makes me skeptical there is anything wrong with the original disk1. But rebuilding to a new disk is fine and even gives you another option, since that original disk can probably be read if there is any problem with the rebuild. Did you do anything other than replace the disk, such as? I checked all power cables and SAS/SATA cables. I also checked that the controller card is seated correctly, etc. It came up and started to rebuild, but after a few minutes I saw millions of errors while reading any of the disks... I could only shut it down the hard way.
February 3, 2019 So it isn't clear to me where you are now. You struck out your previous post with no explanation. Are you saying it is failing again in the same way? Are you sure you checked all the power connections at each point going back to the PSU?
February 3, 2019 You're having a problem with the HBA: Feb 2 23:35:52 Knowlage kernel: mpt2sas_cm0: SAS host is non-operational !!!! Update the firmware to the latest version: you're on FWVersion(18.00.00.00) and the latest is 20.00.07.00. Also try a different slot if one is available. If there are still issues after that, it could be a failing (or fake) HBA, or some compatibility issue with your board if this is a new config.
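The firmware comparison in the post above can be sketched in a few lines of Python. This is only an illustration: it assumes the syslog text contains a `FWVersion(...)` token in the form quoted in the post, the function names `parse_version` and `firmware_outdated` are made up here, and the "latest" value is the one stated in this thread in 2019, so it may no longer be current.

```python
import re

# Matches the token as quoted in the post, e.g. FWVersion(18.00.00.00).
FW_PATTERN = re.compile(r"FWVersion\((\d+(?:\.\d+)*)\)")


def parse_version(text):
    """Turn '18.00.00.00' into (18, 0, 0, 0) so versions compare numerically."""
    return tuple(int(part) for part in text.split("."))


def firmware_outdated(log_text, latest):
    """Return True/False if a version is found, or None if no token is present."""
    match = FW_PATTERN.search(log_text)
    if match is None:
        return None
    return parse_version(match.group(1)) < parse_version(latest)


# "20.00.07.00" is the latest version as stated in this thread (assumption: 2019 value).
print(firmware_outdated("FWVersion(18.00.00.00)", "20.00.07.00"))
```

Comparing tuples of integers avoids the pitfalls of comparing the dotted strings directly.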
February 3, 2019 Author Edit: removed the post and just left the diagnostics. knowlage-diagnostics-20190203-0702.zip Edited February 3, 2019 by Jaster
February 3, 2019 Author 8 hours ago, trurl said: So it isn't clear to me where you are now. You struck out your previous post with no explanation. Are you saying it is failing again in the same way? Are you sure you checked all the power connections at each point going back to the PSU? All cables and connections are fine. The failure is the same. Edited February 3, 2019 by Jaster
February 3, 2019 Author I flashed the controller, but I have no idea whether it did any good, as I do not want to go on with any testing without your input. Here is what happened: the new disk (8TB) goes into a failed state no matter where I attach it. The old disk (2TB) seems to be back, but I cannot put it back into the array because it says it is too small to replace the 8TB disk (which used to be the replacement). How do I proceed? knowlage-diagnostics-20190203-0931.zip
February 3, 2019 2 hours ago, Jaster said: What happened now is that disk 1 is failing again. Those diags only show a disabled disk, related to the previous issues; the disk itself looks fine. You'll need to rebuild, but update the firmware first. You can't do it on Unraid; use a DOS boot disk or a Windows desktop if you have one.
February 3, 2019 We posted at the same time. The controller is using the latest firmware, so now you'll need to rebuild the disabled disk: https://wiki.unraid.net/Troubleshooting#Re-enable_the_drive
February 3, 2019 Author Updating the controller was kind of hell... I think I'll post some guidance for others running into that task. However, the disk came up and the rebuild started. After about 15 minutes the disk got a bunch of errors and went into a failed state again. Diagnostics are attached... what can I do next? Can I somehow reuse the smaller (2TB) disk and see if it works with that one? knowlage-diagnostics-20190203-1325.zip EDIT: I tried stopping the array and ALL disks went missing. I assume the controller is trash...? knowlage-diagnostics-20190203-1332.zip Edited February 3, 2019 by Jaster
February 3, 2019 HBA problems again. Is the HBA new, used, or new from China? It could also be a compatibility issue with your board.
February 3, 2019 Author New. Unfortunately I did not pay attention at the amz checkout, so it is from China... I'm already looking for replacements, but I am really inexperienced with that. You mentioned picking an LSI chip from these: SAS2008, 2308, 3008, 9201-8i, 9211-8i, 9207-8i, 9300-8i. What about SAS2108 and 2308? Those usually come as rebranded RAID controllers, so I should be able to flash them into IT mode? I assume the card manufacturer does not matter? (There are so many out there.) P.S.: Could you explain to me how you dig into the diagnostics and where you see the issues? I'd like to be able to do that on my own, or just write a small util to do so... Edited February 3, 2019 by Jaster
February 3, 2019 9 minutes ago, Jaster said: Could you explain to me how you dig into the diagnostics and where you see the issues? I'd like to be able to do that on my own, or just write a small util to do so... Same as before. This line means the HBA stopped responding, which means Unraid will lose contact with all disks connected to it: Feb 3 13:24:58 Knowlage kernel: mpt2sas_cm0: SAS host is non-operational !!!! The SAS2008 chip is one of the best. Don't go for RAID versions like the 2108. The 2308 is also good; it is similar to the 2008 but PCIe 3.0. You either have a failing HBA, a fake one, or some compatibility issue with your board.
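For anyone who wants the "small util" mentioned in the question, here is a minimal Python sketch. It assumes the diagnostics zip has been extracted and the syslog is available as plain text. The function name `find_hba_faults` is hypothetical, and the sketch only looks for the one kernel message quoted in this thread; other fault messages would have to be added by hand.

```python
import re

# The kernel message quoted in this thread; the device name (e.g. mpt2sas_cm0)
# is captured so the affected controller can be reported.
HBA_FAULT = re.compile(r"kernel: (?P<dev>\S+): SAS host is non-operational")


def find_hba_faults(syslog_text):
    """Return a list of (line_number, device, line) for each HBA fault line."""
    hits = []
    for number, line in enumerate(syslog_text.splitlines(), start=1):
        match = HBA_FAULT.search(line)
        if match:
            hits.append((number, match.group("dev"), line.strip()))
    return hits


# The first sample line is a made-up placeholder; the second is the line from this thread.
sample = """Feb 3 13:24:57 Knowlage kernel: placeholder unrelated message
Feb 3 13:24:58 Knowlage kernel: mpt2sas_cm0: SAS host is non-operational !!!!
"""

for number, dev, line in find_hba_faults(sample):
    print(f"line {number}: {dev} stopped responding: {line}")
```

To run it against a real file, read the extracted syslog into a string (for example with `open(path).read()`, where `path` is wherever you extracted it) and pass that to `find_hba_faults`.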
February 3, 2019 Author Just ordered a 2308 manufactured by SilverStone. Thanks for your assistance!