KptnKMan

Everything posted by KptnKMan

  1. Pulled the 1Gb cable (the unraid end) on unraid1, and it looks like it failed over to the 10Gb. After plugging it back in, it stayed on the 10Gb as primary: I need to figure out how to configure unraid to always prioritise the 10Gb, unless it's disconnected. Any idea on that?
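     For reference, a minimal sketch of how the preferred slave can be checked and forced from the Unraid (Linux) console, assuming the bond is bond0 and the 10Gb port is eth0 (adjust the names to your own setup):
        # show which slave is currently carrying traffic, and which is set as primary
        cat /sys/class/net/bond0/bonding/active_slave
        cat /sys/class/net/bond0/bonding/primary
        # mark the 10Gb port as primary; in active-backup mode the bond should then
        # fail back to it whenever its link is up
        echo eth0 > /sys/class/net/bond0/bonding/primary
     This doesn't persist across reboots (Unraid rebuilds the bond from its own config on boot), so it's only a way to test the failback behaviour.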
  2. iperf3 confirms the 1Gb is primary:
        root@blaster:~# iperf3 -c 192.168.178.12
        Connecting to host 192.168.178.12, port 5201
        [ 5] local 192.168.178.11 port 52652 connected to 192.168.178.12 port 5201
        [ ID] Interval           Transfer     Bitrate         Retr  Cwnd
        [ 5]   0.00-1.00   sec   106 MBytes   889 Mbits/sec    0    235 KBytes
        [ 5]   1.00-2.00   sec   105 MBytes   883 Mbits/sec    0    235 KBytes
        [ 5]   2.00-3.00   sec   105 MBytes   880 Mbits/sec    0    235 KBytes
        [ 5]   3.00-4.00   sec   107 MBytes   898 Mbits/sec    0    238 KBytes
        [ 5]   4.00-5.00   sec   106 MBytes   893 Mbits/sec    0    238 KBytes
        [ 5]   5.00-6.00   sec   105 MBytes   877 Mbits/sec    0    235 KBytes
        [ 5]   6.00-7.00   sec   107 MBytes   894 Mbits/sec    0    235 KBytes
        [ 5]   7.00-8.00   sec   105 MBytes   882 Mbits/sec    0    235 KBytes
        [ 5]   8.00-9.00   sec   106 MBytes   891 Mbits/sec    0    235 KBytes
        [ 5]   9.00-10.00  sec   106 MBytes   891 Mbits/sec    0   5.66 KBytes
        - - - - - - - - - - - - - - - - - - - - - - - - -
        [ ID] Interval           Transfer     Bitrate         Retr
        [ 5]   0.00-10.00  sec  1.03 GBytes   888 Mbits/sec    0             sender
        [ 5]   0.00-10.00  sec  1.03 GBytes   886 Mbits/sec                  receiver
        iperf Done.
  3. That makes sense, and I should probably do some more reading on the subject. I will say that the active-backup bond worked just fine without the CRS309 being set up with a special bonding config... so maybe that could be it. I've just nuked the CRS309 and both unraids, and everything came up without issues first time, with the active-backup bond set up by default (only with the onboard as primary). I've just switched them around and rebooted.
     Yeah, no worries. I appreciate all you've done to help me out. I'm not really sure what to make of this, but I was also on the long-term branch until yesterday.
     Well, I'm going ahead right now with some testing. Everything seems to be behaving, and reboots come up without fuss. Also took this opportunity to move the unraid SFP+ DACs along, to give that S+RJ10 a little space. In the new config, it looks like the 1Gb is working as primary, so I will pull some cables later and see if it fails over to the 10Gb. From my observations, the standby always has no Rx traffic, and the Tx is only ever a few kbps.
     The CRS309 reset config (only set interface names):
     unraid with default config (only reordered the nics):
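     For anyone following along, a quick sketch of how the per-slave counters can be watched to tell which link is actually carrying traffic (assuming the slaves are eth0 and eth1):
        # per-interface Rx/Tx counters; the standby slave should show almost no Rx
        # and only a trickle of Tx
        ip -s link show eth0
        ip -s link show eth1
        # or watch them update live
        watch -n 1 'ip -s link show eth0; ip -s link show eth1'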
  4. Yeah, I did some reading and came to the same conclusion before I posted: the 10Gb ones NEED to be separated, but not so much for the 1Gb ones. I'll keep an eye on them regardless, but the 1Gb transceivers are barely warm to the touch.
     Well, I'm quite a daring guy so that sounds harmless. When I have time I'll do some research and see if anyone has done a teardown of the CRS309, and if not then I'll take a look. It's not reached 90°C but it's been in the mid 80s at load, so I'll investigate the cooling possibilities just the same.
     Yeah, super weird. Pulled it at the CRS309 side, should that make a difference? This whole goose chase has kinda confused me, as there doesn't seem to be a good reason for it. I might just wipe the network config by deleting the network.cfg and network-rules.cfg files and start again; not sure what else I can do on the unraid side to be honest.
     Yeah, sorry, this is my bad, I miscommunicated here. I didn't add an extra bridge, I know that would be a bad idea. There's a default configured bridge already present, and that's what I used. I meant to say: "I started by removing the interfaces from the default bridge (as that's what the error I experienced earlier about this referred to), and configuring a new bonding interface (aka bonding-unraid2), as you advised in your earlier post. Then I added that bonding-unraid2 interface to the bridge." Reading your instructions, that's exactly what I did, so that's good to know at least.
     From everything I've read and the videos I've seen, setting up an active-backup bond should be simple and easy. Story of my life. 🙄 The unraid interface even states: "Mode 1 (active-backup) is the recommended setting. Other modes allow you to set up a specific environment, but may require proper switch support. Choosing a unsupported mode can result in a disrupted communication."
     I can set the L2-MTU and L3-MTU on the link, but on the bond interface, I'm not sure why, I can't set the L3-MTU (labelled 'MTU'). You can see in the 2nd screenshot I posted earlier that the L2-MTU is greyed out and inaccessible:
     Yeah, I thought that the active-backup bond would be simple enough, especially as it's the "default" bond in unraid, and it would be nice to fail back to the onboard controller... but it's not proven so simple. Maybe I should have just gotten a 2nd DAC for each server and tried a bond across the dual 10Gb instead. Then I could have just used simpler 802.3ad link-aggregation, balance-rr or another method that requires same-speed links, like I did before when I had 1Gb cards. I guess you live and learn. Still looking to nuke the network config on both systems and start again; maybe I should do the CRS309 as well, in case I (more than likely) messed something up.
     Yeah, thanks, I appreciate that. Honestly I'm hesitant to start from the beginning seeking help in yet another forum just yet, so I'm gonna go with the scorched-earth approach first: just nuke all the configs, start again and see if that might fix things. I think I might just go step by step and see where it all falls apart.
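     The scorched-earth reset mentioned above is just deleting Unraid's stored network config from the flash drive and letting it rebuild defaults on the next boot. A rough sketch, assuming the usual /boot/config location (back the files up first):
        # from the Unraid console, with the array stopped
        cp /boot/config/network.cfg /boot/config/network.cfg.bak
        cp /boot/config/network-rules.cfg /boot/config/network-rules.cfg.bak
        rm -f /boot/config/network.cfg /boot/config/network-rules.cfg
        reboot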
  5. Ah, yeah, a couple things there. I bought 2x S+RJ10 10Gb transceivers and 2x S-RJ01 1Gb (I mistakenly stated them as S-RJ10), which are rated for much lower temperature. Currently they are next to each other, as in the diagram, but as of now they are very cool and not even nearly as lava hot as the single S+RJ10 hooked up to the media room CSS610. I'm planning to relocate the S-RJ01s soon, just haven't yet as they seem good right now. Yeah, I combed over the specs of the S+RJ10 and S-RJ01 to make sure I understood their tolerances as well. That 10Gb transceiver is very hot, burns to the touch. I might have to put a small fan near it or something.
     Not yet, I temporarily relocated cards I don't need for a while, and both are now running in the 2nd full PCIe x16 slot of each system. I'd like to get a good idea of full performance before I eventually move them, if at all. At some point I will have to, I think.
     Proper VLANs are something I have my network pre-segmented for already, so that I can do that. My IoT stuff all uses separate WLANs from my domestic household stuff, and the Home Automation cabinet is separated for this exact reason as well. I played with VLANs at home a while ago, but now I have widespread VLAN-capable hardware, so the time is getting close... very close. I try to take things "one at a time" so I know things are stable before moving onto the next radical change. I guess that's the Ops Engineer in me.
     Haha, I should probably be spending less money, not more, but this 10Gbit setup has been a pipe dream of mine for more than 8 years. I'm finally doing it, and very happy it's working as well as it has. Saying that... something weird is happening...
     Yeah, the bonds were working well... or so I thought... but I've highlighted something very frustratingly confusing by trying to properly configure the active-backup bond for both systems. First of all, I did some testing by unplugging/disabling the 10Gb link for unraid2, which seemed to fail the link over as expected, but then it refused to fail back. It refused, like, at all, causing the server to become unavailable, until I eventually rebooted the system and it reverted to the 10Gb as primary (without touching/changing anything else). This doesn't seem correct, as I shouldn't have to reboot the server to fix a failover. I could see the link was up and registering on the CRS309, but I could only see a couple Kb of Tx and nothing Rx.
     So I decided that I should properly configure the active-backup on the CRS309, especially as I can now do it on a single box. I did some reading, and watched some YouTube vids about it, then dived in. Simple, I thought... I started by removing the interfaces from the default bridge (as that's what the error I experienced earlier about this referred to), and configuring a new bridge, as you advised in your earlier post. Then I added that "bonding-unraid2" interface to the bridge.
     Now there are a few things I have run into that are frustrating me. I've been round and round it for hours now and I'm not sure what to do. I've set up both unraid systems identically now and have had varying results.
     1) If I so much as touch the bonding configuration of either bond on the CRS309, they stop working, and the entire server goes dark. I can't reach it, I have to gracefully shut it down or reboot it at the box, and upon reboot it comes up available again. Even so much as looking at the interface and clicking Ok or Apply (without changing anything) causes it to freak out and give up.
     2) I cannot set the MTU of the bonding-unraidX interface, as much as I've tried. It refuses, throws an error that the MTU could not be set, and then the server goes dark again until reboot. So at this point I've set everything back to 1500 on both unraid systems, just to try and have consistency to test and get working.
     3) My second system, unraid2, refuses to play ball 90% of the time, and boots up with a link-local address of 169.254.129.35, indicating that it cannot find the network. I double/triple/quadruple checked the unraid network config and rebooted more than a dozen times to figure out what's going on. I found that a cold boot works more often and picks up an IP of 192.168.178.12, but a reboot usually stalls for a bit right before login, and then uses 169.254.129.35. I use DHCP on my network, from my WAN router, and have never had any DHCP issues. I think something on the CRS309 is very wrong, and I'm missing something. I've tried using "arp" link monitoring with an IP, as well as MII; both seem to have the same behaviour. At this point I've shut down unraid2, as the alternative would be to delete the bonding interface and go back to what was "working".
     4) My primary system, unraid1, doesn't seem to be too bothered about any of this, and seems to be working with the bonding-unraid1 configuration on the CRS309. I haven't tested the bond failover by pulling the cable yet, as I've been busy trying to figure out why unraid2 is not picking up an IP with an identical config.
     5) I updated the CRS309 to the latest RouterOS stable 6.49, and nothing seems to have helped. Same issues. I saw some mentions of link stability in the changelogs and thought it might be worth a shot.
     At this point I'm not sure what to do. Some config screenshots (both systems networking configured identically) if that helps:
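     When unraid2 comes up on a 169.254.x.x address, this is roughly what can be checked from its local console before rebooting yet again (a sketch; br0/bond0 are the default Unraid names, adjust if yours differ):
        # link state and addresses for all interfaces at a glance
        ip -br link
        ip -br addr
        # confirm the bridge actually has carrier before blaming DHCP
        cat /sys/class/net/br0/carrier
        # watch the system log for DHCP activity while the link settles
        tail -f /var/log/syslog | grep -i dhcp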
  6. So I'm back with an update.
     TL;DR:
     1) I haven't returned the Ubiquiti transceivers yet.
     2) I finally made a quick and dirty network diagram.
     3) Finally moved my main WAN router over to the server room, centralising everything except the ISP router.
     4) Ordered, received, and set up the new 3rd CSS610 for the media setup.
     5) Connected the new CSS610 across the house with new Mikrotik S+RJ10 transceivers and CAT7A+ S/FTP.
     6) Connected the bond0 active-backup secondary 1Gb to the CRS309 using a couple new Mikrotik S-RJ10 transceivers.
     Detailed:
     1) I've been doing some testing on the Ubiquiti transceivers I purchased before. I'm planning to return them, but they seem to register now, though still no link. Gonna keep playing with them a bit more.
     2) I could have done better, but I finally got around to making a very quick diagram:
     3) Yep, I finally got the CAT7 cables run over to the server room and moved the WAN router. All working well and centralised in the same location. The ISP router runs in passthrough and is direct-connected to coaxial, so I can't move that unfortunately. Pic of the core setup:
     4+5) So with the CAT7A+ run complete with keystones, and some short CAT7 cables acquired for each end, I hooked everything up. Those S+RJ10 transceivers do get toasty though, as advised. Measured temps of 83°C, the highest I've seen so far under load, which seems kinda high, but I think it's understood that these get hot? Got the 3rd CSS610 hooked up and it is behaving normally, link up at 10Gbit, no issues encountered.
     6) Purchased a couple S-RJ10 transceivers so that I could hook those directly into the CRS309, and that seems to work well. Without needing to bridge over to the CSS610 it seems more stable also; I haven't seen any dropouts. Also looks like I've almost filled the CRS309 SFP+ ports as well, which I didn't expect when starting this.
     So all in all, I'm pretty pleased with how everything has turned out. Gonna keep tweaking as I go, and I heard on the Uncast YouTube video on 10Gb that there are apparently some tips for running 10Gb Unraid somewhere. Gonna try and find that.
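     On the temperature point, the transceiver diagnostics can be read from the RouterOS terminal, which is a simple way of keeping an eye on the S+RJ10 temps (a sketch; the port name is an example):
        # prints sfp-module-present, sfp-vendor-name, sfp-temperature, etc.
        /interface ethernet monitor sfp-sfpplus1 once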
  7. Yeah, I don't think your statements were confusing for me in that way exactly, because I think I get the L2-L3 relationship (I'm not new to networking, but with this particular use case I've been trying to be "correct"), but I guess there is no correct answer. I was mainly looking for a recommendation of a way forward to make sense of. You indicated what I shouldn't do, so I was really trying to understand what your recommendation of what I should do would be. Anyhow, thinking of it as a 20-byte IP header + payload combination makes enough sense that I can go on with that. I also found some more examples, like this article, that support that understanding as a good way to go forward.
     As much as this makes sense, the MTUs in the image raise new questions for me, like how the odd-numbered MTUs relate to 1500 and 9000 respectively. But as it is, I'm not going to worry too much about that right now. As it is now, I've set the MTU to 9000, and L2-MTU to 9800 (max L2 apparently being 10218), and the sky doesn't seem to have fallen yet. 😅 Everybody seems to be behaving, and the network appears to be performing as desired. I've also re-enabled the active-backup/10Gb-1Gb bond0, and that seems to be working without issues. 👍 I'll be monitoring the bond setup and seeing what happens.
     Next I guess will be to return the 1Gb transceivers, get 10Gb RJ45 transceivers, and get an additional CSS610 for across the house. 🤔
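     With the MTU bumped to 9000, a simple way to confirm jumbo frames really pass end-to-end is a don't-fragment ping sized just under the MTU (a sketch from one unraid box to the other; 8972 = 9000 minus the 20-byte IP header and 8-byte ICMP header):
        # only succeeds if every hop on the path accepts 9000-byte packets
        ping -M do -s 8972 -c 4 192.168.178.12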
  8. Thanks, that's great to know. I'm not planning to mix the units, so that should be good. Hopefully, using the MT-branded transceivers will ensure proper compatibility and performance. I've also read elsewhere that when using SFP+ RJ45 transceivers, they should be spaced out as well. I've been planning to do this, so ideally it shouldn't be an issue if I can just sandwich them between the existing DAC modules.
     See, I'm genuinely trying to understand, and I spent some hours yesterday diving into this to get a definitive answer of why, but I'm still not finding a clear understanding. You mentioned a few posts ago that L3-MTU and L2-MTU should not be set to the same value, but I'm struggling to understand why, because the default setting for both MTU and L2-MTU is 1500. Is that not technically the same problem, and yet the majority of networks work with that default setting? From what I have been observing on networks, and reading in places like the MT-Wiki (and online), the MTU and L2-MTU can be "smaller or equal". This would make sense from what I'm seeing, the "equal" part being interesting for me. According to the OSI model, wouldn't this mean that an L3-MTU/MTU and L2-MTU of the same value would be functional, and would encapsulate all required data and overhead, as long as I have "complete path end-to-end" control? From my understanding, my CRS309 would have that control and would translate/fragment/defragment packets from my unraids to other ports with other MTU settings? Am I right in thinking that if I set up MTU 9000 links/ports to (for example) my other switches, then I would need to make sure the MTU/MRU of those switch ports is set on that end as well to avoid issues?
     From what I understand, it gets out of hand when there are other out-of-control entities (like my WAN link to my ISP) transmitting on different MTU sizes from mine, causing packets to be dropped and return ICMP "fragmentation needed" requests. But surely that is not a concern for me, seeing as my ISP link (to my understanding, as I've never reconfigured it) would be at the default MTU 1500? Thinking about this, I just checked: it's the default MTU 1500.
     Again, I'm not challenging anyone's advice or knowledge here, I'm trying to understand. I'm just trying to understand what would be (in my case) an appropriate higher value for L2-MTU? Is there a known/typical overhead I should calculate in (like how most PPPoE connections add 8 bytes to headers, reducing payload to 1492), and how can I find information about this? I'm sure there's a rabbit hole somewhere about packet encapsulation that I have yet to misunderstand, but I'm just trying to get a grip on the broad strokes and how this applies to what I'm doing.
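     To sanity-check this on the switch itself, the current MTU and L2MTU of each port can be printed from the RouterOS terminal. My understanding (an assumption on my part, not gospel) is that L2MTU only needs headroom above the MTU for L2 extras such as VLAN tags, e.g. +4 bytes per 802.1Q tag:
        # shows the mtu and l2mtu values for the SFP+ ports
        /interface print where name~"sfpplus"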
  9. Thanks, good to know the MT transceivers would work as intended up to 30m. With that in mind, this afternoon I decided to not be lazy and finally rerun that CAT7 cable to where my WAN router is. I mentioned this earlier, and it affects how long a run the transceivers need to be capable of. So with a bit of blood and sweat, I managed to run 2x CAT7A S/FTP and 2x CAT6 U/UTP across the house. After some reworking, I got the run down to 21m. I also installed and tested 10Gb-rated keystones on the CAT7A cables (I'll get around to the CAT6 cables tomorrow), and I'll be able to 1) move my WAN router to the server room, 2) buy an additional CSS610 for a 10Gb uplink to that part of the house and 3) go with the MT transceivers, as I guess it would be safely under the 30m limit:
     Exactly I also have the default MTU from 1500 and get 10Gbit/s speeds.
     Now this has led me to an interesting issue, in that I've left everything as-is with MTU set to 9000 for the last few days, and I haven't seen any strange or adverse effects. I've been doing a lot of reading about this, and various recommendations point to setting the MTU to 9000, but I must say I've not seen any mention of discriminating between MTU and L2-MTU. Also, the detriment I've seen mentioned elsewhere is the performance hit of translation between different MTU specifications on the same network. Interestingly also, I've seen no noteworthy difference in backup times between servers; last night was about 3 minutes and 46 seconds. I also can't find errors or performance penalties in any of the router, switch or system logs. The only real difference I've seen is that iperf3 now consistently results in ~9.80Gb-9.85Gb throughput, with no retries. I downloaded iperf3 to my windows laptop (an old cygwin-compiled version v3.1.3 from 2016) and that returns a solid 1Gb speed (~931 to 939 Mbit/sec), as expected. I'm also not seeing any Rx/Tx errors or Rx/Tx drops on the sfpplus interfaces for my unraids.
     Do you have any idea where I can look, or should look, to figure out if something bad is happening? I've been reading through the MTU docs on the MT-Wiki as advised, but I'm struggling to find specific advice. The Wiki does state "Make sure that MPLS MTU is smaller or equal to L2MTU", but I don't see where "MPLS MTU" applies, and there is no specific mention of it in RouterOS, only "MTU" and "L2 MTU". The MT-Wiki also mentions, re "L3 MTU": "If router receives IP packet of size 1500, but MTU for outgoing interface is set to 1400, router will either fragment the packet (if "Don't Fragment" bit is not set in IP header) or drop the packet and send ICMP "Need Fragmentation" error back to originator (this is essential for Path MTU Discovery to work). Sometimes it can be bad idea to change IP MTU from its default 1500 bytes on router interfaces if complete path end-to-end is not in administrators control. Although IP fragmentation and end-to-end Path MTU Discovery is intended to handle this situation, if ICMP Need Fragmentation errors are filtered somewhere along the path, Path MTU Discovery will not work. There are several features in MikroTik RouterOS that can benefit from possibility to exceed standard MTU"
     Although, I'm not seeing any drops, and it's only these 2 sfpplus ports on this CRS309 device alone set to 9000. Would that break things? Also, I'm a little confused by your advice: do you mean I should reduce "MTU" back to 1500, but leave "L2-MTU" at 9000? That would make sense with the diagram from the MT-Wiki, indicating that "IP-MTU" can be less than "L2-MTU":
     With all this in mind, what should I do?
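     On "where to look": besides the RouterOS interface counters, the NIC-level statistics on the Unraid side can be dumped with ethtool (a sketch, assuming the 10Gb port is eth0):
        # steadily climbing error/drop counters here would point at a framing/MTU problem
        ethtool -S eth0 | grep -iE 'err|drop|crc'
        # kernel-level per-interface errors and drops
        ip -s link show eth0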
  10. Ok, so I went back to the drawing board on this one... and I may have turned up some results. I did some more research, and was reminded that when optimising for 10Gb, I should adjust the MTU settings of interfaces and switches, among some other tips and things. I think I may have lost the MTU config when changing to the Mellanox cards, and also forgot to review that. So I started with that, and adjusted the MTU to 9000 on both my unraid systems. This resulted in a disastrous set of performance results:
     Yikes. 🤔 But at least something changed, which in my experience prompts closer inspection. I adjusted both systems back, and saw the performance go back to what I was seeing before. Ok... So I reconfigured both systems to the higher MTU and investigated the CRS309, then adjusted the MTU of interfaces sfpplus3+4 to 9000 (actual and L2):
     Then tested again:
     Now that's more like it! 😄
     Now, as far as I understand from my experience with this, and what I've read, there can be a performance hit (due to translation) if other parts of the network are set to other MTUs. With that in mind, and given that I'm testing directly between 2 systems via the CRS309 switch, I'm going to leave this for now and do more testing from other devices to my unraid systems (like from my WAN router). I'll maybe set the MTU between all 10Gb links to the same value to prevent translation performance hits as well. I should still be getting the full 1Gb from that, I think, but I'll see what it turns up. I also have bonding disabled still at this point, so I may be able to see better bonding performance with the higher MTU. Currently all interfaces are set to 9000, but only port 1 of the Mellanox cards is plugged in and up.
     Gotta go for now, but will report back later. For now, further tests fairly consistently show I'm getting 9.85Gb to 9.88Gb, at least between my unraids. 😄 Hopefully, someone will benefit from reading this journey as well.
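     For reference, the sfpplus3/4 change on the CRS309 boils down to something like this in the RouterOS terminal (a sketch; I made the change via the GUI and set both values to 9000 here, only later giving the L2-MTU extra headroom):
        /interface ethernet set sfp-sfpplus3,sfp-sfpplus4 mtu=9000 l2mtu=9000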
  11. So I disabled all bonding on both systems and left bridging enabled, as I need that for docker and VMs. Turns out, I'm still getting about 5.4Gb using iperf3, and I'm not sure why. The onboard 1Gb is still unplugged, and the Mellanox 10Gb is set to eth0, used as the only bridge interface. Maybe the bond isn't the problem at all, but something with the bridging? This is strange.
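     If the bridge is the suspect, one thing worth confirming is that br0 and its member interface carry the expected MTU, since as I understand it a Linux bridge takes the lowest MTU of its members (a sketch):
        # -d prints bridge details; compare the mtu shown for br0 and for eth0
        ip -d link show br0
        ip link show eth0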
  12. A x4 slot will definitely have more lanes and bandwidth than a x1 slot, yeah, but for the performance I think I'd get (and as others have posted in this thread), it seems reasonable to get around 6/7Gb from a PCIe 3.0 x1 slot. I don't have thunderbolt on either of these boards (in my signature) either.
     Edit: Also, both the NVMe slots in both servers are in use for fast NVMe storage, and I'm not really looking to replace them, they work pretty well. That would compromise actually being able to transfer ~10Gb over the network, as spinning drives won't cut it. I did look into this a while ago, but I didn't find much reliable information on usable adaptors for this. Do you have any suggestions? I remember looking at a reddit thread discussing this (among a few other places) and they seemed to conclude that using USB3.1 for 10Gb networking isn't as good as it sounds. Have you had a different experience?
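     The rough arithmetic behind the ~6-7Gb expectation for a PCIe 3.0 x1 slot, as I understand it:
        # PCIe 3.0 per lane: 8 GT/s with 128b/130b encoding
        #   8 Gb/s * 128/130 ≈ 7.88 Gb/s of raw payload bandwidth
        # minus TLP/DLLP protocol overhead (very roughly 10-20%, depending on payload size)
        #   ≈ 6.3-7.1 Gb/s usable, which lines up with the 6/7Gb figures reported in the thread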
  13. Thanks, I appreciate the advice. I think I might just go ahead and replace these with 2x of the 10Gb ethernet transceivers you indicated before, as I know I'll use those before too long. The Mikrotik 10Gb ethernet transceivers are also available on eurodk; do you think these would be comparable to the fs.com modules? I assume they would be preferable for Mikrotik hardware.
     Ok, well I think I would need these for using docker (without custom config) and my VMs anyway. I'm not sure what you meant by the PCIe slots specs change? I've not had many issues using this AMD setup, I expected a lot more bugginess tbh. Currently the cards in both systems are in the second PCIe x16 slot, especially as I want to get a working baseline for some time before eventually migrating them to an x1 slot. I know now that, raw, they get close to the 9.5Gb I should expect.
     Do you mean in the "interface rules" section of the network settings? I already set that order, I think I posted the screenshot before. Is there another part? Is this correct:
     I currently HAD them set like this (below), and I set the active-backup bond0 members to eth0 and eth1. They should be functionally the same, as 10Gb port2 is unplugged, but I've set them back to the first configuration (above), prioritising both the 10Gb ports, and unplugged the onboard:
     Yeah, I'm not sure what to do about it. Nothing else changed, all the same hardware and bonding config. I'm about to finish the reconfigure of both systems to use only eth0 and reboot, so I'll test again and edit this with the results (as I'm posting from within one of the VMs now). 👍
     Edit: Well, I tested again with only eth0 in bond0, and everything else is working as expected. The speed seems slow again, not sure why, as everything is going through only the CRS309. I'll have to do some research into why the active-backup bonding configuration slows everything down. Test results:
  14. Thanks for the advice, I managed to test the cards today, but I need to do some more testing of the Ubiquiti 1Gb transceivers. So far though, I couldn't get them to be recognised at all in the CRS309, they don't seem to show up from what I can see. I'm going to try them in one of the CSS610s and see if they are recognised there at all.
     I disabled the bridge and bond0, and did another set of speed tests between servers. Looks like with bond and bridging disabled (this test was done with the array unmounted and nothing running) I can get around 9.39/9.4Gb between systems fairly reliably:
     After the speed test, I enabled the bridge, but the networking became quite unreliable and wouldn't function correctly on either server. The onboard 1Gb cards are recognised as eth0 and will not allow me to reliably use the 10Gb cards without disabling the onboard adaptors. After some frustration, it looks like I may need to...
     Yeah thanks, I upgraded the firmware of both servers' cards to the latest 2.42.5000, so that I could test some more with the latest firmware, as the cards had different old versions installed. Both cards now show identical details:
     After installing the firmware, I re-enabled bond0+bridging and tested again. It seems that I'm getting slower speeds than before, which is interesting:
     I guess I may need to decide if I want to use this bonding setup, as it seems the speed is significantly compromised by having it enabled. I guess either I'll have to disable the active-backup configuration and disable the onboard 1Gb, or try to fix it on the switch config side if that is the issue. The 1Gb transceivers not working is also an annoying issue, as I had planned that using them would solve the port forwarding issue.
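     One way to narrow down whether the bond itself is the bottleneck is to compare single-stream and multi-stream runs in each direction (a sketch; these are standard iperf3 flags):
        # single stream, 30 seconds
        iperf3 -c 192.168.178.12 -t 30
        # four parallel streams; if this reaches line rate while a single stream doesn't,
        # the limit is per-flow rather than the link or bond itself
        iperf3 -c 192.168.178.12 -t 30 -P 4
        # and the reverse direction
        iperf3 -c 192.168.178.12 -t 30 -R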
  15. Oh, something I forgot to mention above. I installed the Mellanox Firmware Tools and got the details of my cards:
     unraid1:
     unraid2:
     The cards should be identical as well, but it seems they have different firmware versions. Thing is, I've been to the Mellanox ConnectX-3 firmware download page, and I'm a little confused about which firmware to download:
     Would the different firmware versions be responsible for the performance behaviour I've been experiencing?
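     For anyone wanting to pull the same details from their own ConnectX-3, the Mellanox Firmware Tools expose them via mstflint (a sketch; the PCI address and image filename are placeholders, use lspci to find yours and download the image matching your card's PSID):
        # find the card's PCI address
        lspci | grep -i mellanox
        # query the current firmware version and PSID (the PSID selects the correct image)
        mstflint -d 03:00.0 query
        # burn a downloaded image that matches that PSID
        mstflint -d 03:00.0 -i fw-ConnectX3-rel.bin burn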
  16. I do have a few questions, maybe there are some things I'm missing. So I monitored this over the last few days, and the speed has been fairly consistent, but still not at the 9.5 that I figure it should be capable of. There are backups that run every night between servers (for VMs, docker appdata, personal files, etc) and I have a particular VM that has a primary vdisk of 200GB. That particular backup used to consistently take about 31 minutes, 19 seconds. Now the same backup runs at about 3 minutes, 44 seconds. I've not been seeing any errors or packet loss in any of the logs, but I've rerun a few tests over the last few days between servers. I'm pretty happy with it, but it definitely should be doing a touch better given the specs in both systems, so there should be no shortage of bandwidth or bottlenecks that I'm aware of:
     Yeah, it's set up as an active-backup configuration. I've still got the CRS running in RouterOS bridge mode, as I haven't yet moved my WAN router and that does my WAN routing pretty well at this point. I can't seem to set up the active-backup bond though, as the bridge mode throws an error:
     Something else I'm having trouble with: I went and purchased a couple Ubiquiti 1Gb RJ45 transceivers, but they seem to not be recognised at all by the CRS309. I was trying to test the bond by having everything on the same switch. I may have goofed. Are these not compatible:
     This is my next move, I haven't had much chance to bring everything down and play around until today. Will post back those results.
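     Rough effective-throughput arithmetic on that backup, assuming the whole 200GB vdisk is actually copied each night (it may well be sparse or incremental, so treat this as an upper bound):
        # before: 200 GB in 31m19s (1879 s) -> ~0.11 GB/s ≈ 0.85 Gbit/s (about 1Gb line rate)
        # after:  200 GB in  3m44s (224 s)  -> ~0.89 GB/s ≈ 7.1 Gbit/s
        # so the copy is now well past 1Gb but still short of the ~9.4Gb iperf3 ceiling,
        # which likely points at disk or filesystem speed rather than the network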
  17. Thanks, I'm definitely going to be updating and investigating more into these in the coming days. In recent weeks I've already been reading and watching setup guides and configuration instructions. I haven't used any Mikrotik switches before, so it's going to be lots to learn. 😁
     This is via the CRS309, using the DAC cables I got included with the Mellanox cards. Bearing in mind I haven't adjusted anything, it's all just plugged in and pushing data right now. The tests show they don't seem to be running at 100% full speed, but I think I'll watch overnight and see how the throughput goes when backups run between servers.
     The current core setup, in its current position, due to be moved at some point soon:
     The unraid servers are hooked into SFP+ ports 3 and 4. The other CSS610 on the lab/desk will be set up and uplinked in the next days.
  18. Alright... so it looks like I got it working. Reconfigured the DHCP, and the active-backup bond0 seems to be up without trouble. Some new iperf3 results between servers:
     Not quite full 10Gb, especially as they are in physical x16 ports... but pretty good. 😁
  19. Actually, yeah, I think I fixed it. I took a look in those files after I posted and just deleted the listing for eth3. Seemed to work: I think I might have fixed it by deleting the eth3 entry from the network-rules.cfg. I have the Mellanox as primary now; I need to set up the second server and do some tests.
     Oh I see, I was thinking I could use the MAC of my onboard as the bond0 MAC. Maybe that's a bad idea then, and I should just configure the Mellanox as the only card. I should probably do that, and just reconfigure the router DHCP rather than be lazy. 😄
     That sounds useful. I'll definitely read through this, thanks.
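     For context, network-rules.cfg on the flash drive is a set of udev-style persistent-name rules mapping MAC addresses to ethX names, so removing the stale eth3 line is enough. Roughly like this (a sketch with made-up MACs, not my real file):
        # /boot/config/network-rules.cfg
        SUBSYSTEM=="net", ACTION=="add", ATTR{address}=="aa:bb:cc:dd:ee:01", NAME="eth0"
        SUBSYSTEM=="net", ACTION=="add", ATTR{address}=="aa:bb:cc:dd:ee:02", NAME="eth1"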
  20. I think I may need to open a new thread for this issue. I'm trying to set up active-backup on bond0, so that the (eth1) Mellanox 10Gb port1 is primary, and the (eth0) onboard 1Gb is backup. The duplicate MAC error seems to be blocking it. From what I'm reading, I can't seem to query the bond status.
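     On querying the bond status: the kernel exposes it under /proc regardless of what the GUI is doing (a sketch):
        # shows the bonding mode, the currently active slave, and each slave's MII status and MAC
        cat /proc/net/bonding/bond0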
  21. It was on the old unmanaged Netgear switch I was using until this upgrade. I just finished getting everything installed and setting up the links, and it looks like the core CRS309+CSS610 have booted up ok and are pushing data.
     Before I get into setting them up, I seem to have run into a strange issue with the MACs of the interfaces. The bonding configuration seems to think there are duplicate MAC addresses: eth0 is the onboard 1Gb, and eth1/2/3 seem to be the Mellanox dual-port card. I think I need to delete one of the interfaces; reading up on why this might have happened. If I try to change something in the GUI I get an error: 🤔
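     A quick way to see which interfaces really share a MAC, straight from the console (a sketch; note that slaves of an already-active bond intentionally share the bond's MAC, so duplicates there are expected):
        # one line per interface including its MAC address
        ip -br link
        # or just the addresses
        cat /sys/class/net/eth*/address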
  22. Thanks, I've got everything set up... I even managed to move some hardware around, and I have both the Mellanox NICs in the 2nd PCIe x16 slot. I'll need to move them at some point, but for now I'll ideally get full speed out of them, and I'm glad I have the x1-to-x16 adaptors. 😄
     This is from the old 1Gb cards, but one is the onboard 1Gb and the other an Intel 2x 1Gb card. I had them bonded, but the performance was still lacklustre compared to what it should be. I should be getting about 3Gb/s out of the bonded link... but the 10Gb card should do better.
  23. Looks like I'm having fun today: And some current iperf3 results between servers:
  24. Hey, although I have an AMD build, what you've listed is essentially what I've been doing with my setup. I've had great results, everything is listed in my signature below.
  25. Thanks, I really appreciate you helping me out with this, and of course everyone else in this thread. It's been a long time figuring this out, getting feedback, deciding on the setup. 😅 I just ordered the eurodk kit. Once I get confirmation, I'll get the Amazon order done. Also, yeah, everything shows in stock for me. Not sure why it's not showing up in the screenshot as such.