JorgeB Posted January 9, 2018 20 minutes ago, steve1977 said: Does my GPU benefit from the "faster" lane or could I swap my GPU with the controller card? Depends on the GPU, you can always try and see if you notice any change.
steve1977 Posted January 10, 2018 Author Parity build has been completed. So, this confirms that my M1015 controller does not work properly with WD 10TB drives. Not a big deal, but worth sharing back, and maybe there will eventually be a FW update fixing this? Is it safe to say that this is not an Unraid issue, but a controller issue? With regard to the slow speed, I will probably remove one of my GPUs. I will need to find a way to pass through my GPU to a VM from the primary slot, which I haven't managed in the past. I'll give it another try though, and if this succeeds, I'll remove my second GPU and put my controller back in the faster slot.
JorgeB Posted January 10, 2018 1 hour ago, steve1977 said: So, this confirms that my M1015 controller does not work properly with WD 10TB drives. We need more people having the same issue to consider this a general problem with the controller. I have my doubts, but as more users get 10TB disks we'll see.
steve1977 Posted March 7, 2018 Author I have some new observations, so pulling this up again. The issue may not be the 10TB disk per se, but the maximum capacity of the M1015 controller? It may be limited to a certain number of TB? Unraid has been working for quite a while now without hiccups. I now tried to replace a 4TB disk with a 6TB disk (connected to the M1015). I am facing the same issue as I did when replacing the 6TB with a 10TB disk. The upgrade fails every time during parity build. I don't know for sure, but it could well be related to the M1015. Any thoughts? Related question: given that the upgrade from a 4TB to a 6TB disk fails every time, how can I protect the data? Can I just rebuild with a 4TB disk? Or is it better to start from scratch and then mount the "old" 4TB disk and copy the content back to the array?
JorgeB Posted March 7, 2018 4 minutes ago, steve1977 said: The issue may not be the 10TB disk per se, but the maximum capacity of the M1015 controller? It may be limited to a certain number of TB? Unraid has been working for quite a while now without hiccups. I now tried to replace a 4TB disk with a 6TB disk (connected to the M1015). I am facing the same issue as I did when replacing the 6TB with a 10TB disk. The upgrade fails every time during parity build. I don't know for sure, but it could well be related to the M1015. Any thoughts? There's no such limit. If it's not a different problem, you may have a bad controller. 5 minutes ago, steve1977 said: Can I just rebuild with a 4TB disk? You can try.
steve1977 Posted March 7, 2018 Author Unfortunately, rebuilding doesn't work. The "old" disk was 4TB and the attempt to upgrade to 6TB failed. Now rebuilding with another 4TB gives me "The replacement disk must be as big or bigger than the original.". That's a pity and it feels like I am somewhat "stuck". A fresh build should work though, and I should then still be able to copy the "old" disk back manually via UD. I actually already replaced the controller some time ago, but I am willing to give it another shot and buy a new one. Are the ones on eBay reliable? For example, something like https://www.ebay.de/itm/IBM-ServeRaid-M1015-SATA-SAS-HBA-Controller-RAID-6Gbps-PCIe-x8-wie-9220-8i/131847466320?hash=item1eb2b8ad50:g:rGMAAOSwQItUICB9
JorgeB Posted March 7, 2018 2 minutes ago, steve1977 said: Unfortunately, rebuilding doesn't work. The "old" disk was 4TB and the attempt to upgrade to 6TB failed. Now rebuilding with another 4TB gives me "The replacement disk must be as big or bigger than the original.". That's a pity and it feels like I am somewhat "stuck". A fresh build should work though, and I should then still be able to copy the "old" disk back manually via UD. Yes, forgot about that, you'll need same size or larger. IIRC you regularly have dropped disks on your server, so it's difficult to say what would help or not since there might be other issues.
steve1977 Posted March 7, 2018 Author Thanks. Yes, I do, though it feels like it has gotten a bit better over time. A few things about the drop-outs:
* 100% of the drop-outs happen on disks on the M1015. Disks drop from different slots, but always only those connected to the M1015.
* I recently changed mobo, CPU and RAM, so these can be ruled out.
* I changed cabling several times, so this can also be ruled out.
* I still suspect that the PSU may be an issue, but it is pricey to change. I was also thinking about adding a UPS if this could help.
* I could switch out the M1015, but have heard from "experts" that it is very unlikely to be the issue.
Any advice on how I can still troubleshoot?
JorgeB Posted March 7, 2018 I would replace the HBA and/or PSU.
JonathanM Posted March 7, 2018 How is your air circulation? Those HBAs can run hot; maybe you need better cooling directed at it?
steve1977 Posted March 8, 2018 Author Let me replace the HBA first. If this brings no change, the PSU or a UPS may be the last things to resort to. It could be air circulation. Would this “kill” the card or just lead to a temporary malfunction?
JonathanM Posted March 8, 2018 24 minutes ago, steve1977 said: It could be air circulation. Would this “kill” the card or just lead to a temporary malfunction? Yes. First indication would be errors under load that get better when the card cools down, progressing to errors almost constantly regardless of how cool it is.
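One way to check for the pattern described above (errors that appear under load and ease off once the card cools) is to count error lines in the system log per hour and compare the busy hours with parity builds or large copies. The following is a minimal Python sketch, not something posted in this thread. It assumes the log uses classic syslog timestamps such as `Mar  7 14:05:33`, and the keyword list is only a placeholder to replace with whatever text actually appears in your log when a disk drops.

```python
import re
import sys
from collections import Counter

# Classic syslog timestamp at the start of a line, e.g. "Mar  7 14:05:33".
# This format is an assumption; adjust the pattern if your log differs.
TIMESTAMP = re.compile(r"^([A-Z][a-z]{2})\s+(\d{1,2})\s+(\d{2}):\d{2}:\d{2}\s")


def errors_per_hour(lines, keywords=("i/o error", "reset")):
    """Count lines containing any keyword, grouped by (month, day, hour).

    The default keywords are placeholders, not known driver messages.
    Matching is case-insensitive; lines without a timestamp are skipped.
    """
    counts = Counter()
    for line in lines:
        match = TIMESTAMP.match(line)
        if not match:
            continue
        text = line.lower()
        if any(keyword in text for keyword in keywords):
            key = (match.group(1), int(match.group(2)), int(match.group(3)))
            counts[key] += 1
    return counts


if __name__ == "__main__":
    # Usage: python errors_per_hour.py <path-to-log>
    with open(sys.argv[1], errors="replace") as log:
        hourly = errors_per_hour(log)
    # Counter keeps insertion order, so hours print in log order.
    for (month, day, hour), count in hourly.items():
        print(f"{month} {day:2d} {hour:02d}:00  {count} matching lines")
```

The script name `errors_per_hour.py` is hypothetical. If the matching lines cluster in the hours when a parity build or large copy was running and thin out afterwards, that would be consistent with a heat or load problem rather than a disk-size limit, though it would not prove it.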
steve1977 Posted March 9, 2018 Author Just ordered a new M1015 and will give it a try. It may indeed be the case that the card broke due to overheating. This would also explain why it fails at parity rebuild (high load) and historically failed when copying large amounts of data between disks. Will keep you posted on whether the new card works better. I don’t really know how to cool it better, but at least this helps to narrow down the issue. At least for the first few weeks, the new card should work flawlessly.
JorgeB Posted March 9, 2018 There's also the possibility your current HBA isn't failing but just overheating, and if that's the case the same may happen to the replacement. You should try installing a fan near it, even if just temporarily, and test.
steve1977 Posted March 9, 2018 Author New card is on its way, so let’s see whether the parity rebuild will work. One thing that makes me think that it isn’t a broken HBA is that a new parity works with existing capacity, but not when adding larger disks (6->10 or 4->6). Anyhow, we’ll know in a few days once the new HBA card arrives.
JorgeB Posted March 9, 2018 29 minutes ago, steve1977 said: One thing that makes me think that it isn’t a broken HBA is that a new parity works with existing capacity, but not when adding larger disks (6->10 or 4->6). That's very weird.
steve1977 Posted March 9, 2018 Author Agree, and that’s why I initially thought it was an incompatibility with 10TB disks or a maximum capacity issue. Another explanation could be that all other disks are “green”, while the 10TB disk is “red” and the new 6TB disk is “blue”. Well... nothing I can try now but wait until the new M1015 arrives, and then I’ll test whether it works.
steve1977 Posted March 31, 2018 Author Hooray!!! The new M1015 seems to have solved this issue. So, the old one must indeed have overheated and been malfunctioning. Thanks for all the help on this forum! I don't know how to prevent this issue from happening again in the future besides buying a new case / cooling system. But at least I now know what the issue is and it should be fine for a few months...