Nooke Posted October 20, 2019 Share Posted October 20, 2019 Hi, so I have some serious differences in L3 cache performance for my win10 vm depending on cores being used. there must be something fishy... My current kernel options: append processor.max_cstate=1 nvme_core.default_ps_max_latency_us=0 kvm_amd npt=1 nested=1 amd_iommu=on isolcpus=6-11,18-23 nohz_full=6-11,18-23 rcu_nocbs=6-11,18-23 pcie_acs_override=downstream,multifunction vfio_iommu_type1.allow_unsafe_interrupts=1 initrd=/bzroot Here my current xml for 4core 8thread. Only difference for 5c/10t and 6c/12t would be the core-count, emulatorpin win10vm.xml Using unraid 6.8.0-rc3 latest windows 10 1903 as I'm still having issues with my system, appreciate any support cheers Nooke Quote Link to comment
Nooke Posted October 21, 2019 Author Share Posted October 21, 2019 1 hour ago, jordanmw said: Check this out: Thanks for sharing but I don't have a core pairing issue here. I'm using cores only from 1 numa node (eg node1, cores 6-11,18-23). Numa Node 1 is connected to my GPU aswell as my NMVe SDD (see lstopo attached) So my problem here is that with cores from 1 node depending on core assignments in win10 I get totally screwed benchmarks for L3 Cache performance in AIDA64. Quote Link to comment
jordanmw Posted October 21, 2019 Share Posted October 21, 2019 Are you using the Epyc hack for processor identification? It seems that helps some of us with TR CPUs. Quote Link to comment
Nooke Posted October 22, 2019 Author Share Posted October 22, 2019 I did - no difference. Quote Link to comment
testdasi Posted October 23, 2019 Share Posted October 23, 2019 Each core has its own L1 / L2 so it scales pretty well. In contrast, L3 is shared between groups of cores. As you can see on your numa config, each of 6-8 and 9-11 groups has its own L3 cache. So depending on how the cores are assigned (e.g. 6-9 is different from 6-7+9-10 despite both being 4 cores) AND the exact circumstances of the test run, your test results will differ. What issues do you have on your system that would lead you to think it's related to L3 performance? Quote Link to comment
Nooke Posted October 23, 2019 Author Share Posted October 23, 2019 (edited) I had some serious fps drops and stutter in win 10 (not only in gaming). felt sluggish since last win 10 update. anyways I kinda fixed it. L3 cache benchmarks looking fine now: L3 read 388GB/s L3 write 300GB/s L3 copy 367GB/s L3 latency 10.6ns (6core 12threads) that's more like the results I would have expected at the beginning. If anyone else has some issues regarding this - could share my findings. Edited October 23, 2019 by Nooke Quote Link to comment
Tritech Posted October 24, 2019 Share Posted October 24, 2019 16 hours ago, Nooke said: If anyone else has some issues regarding this - could share my findings. Please do! Quote Link to comment
Nooke Posted October 25, 2019 Author Share Posted October 25, 2019 pretty simple - I just changed the cpu topology. I used the following <cpu mode='host-passthrough' check='none'> <topology sockets='1' cores='6' threads='2'/> <cache mode='passthrough'/> and now I'm using this instead <cpu mode='host-passthrough' check='none'> <topology sockets='2' cores='3' threads='2'/> <cache mode='passthrough'/> that made my L3 cache benchmark skyrocket. and overall smoothness of the windows 10 vm is improved. kinda have the most fps ever in cs:go and world of warcraft (stable!) Quote Link to comment
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.