Hi All,
I am having issues with same setup where server keeps crashing all the time. "The Ryzen" issue where it just locks up.
Hardware
ROG Zenith II Extreme Alpha
AMD Ryzen Threadripper 3970x 32-Core
192GB DDR4 Corsair Vengeance Pro (256GB Kit, the other 64GB is in my Intel Server)
NVIDIA Quadro RTX 4000
Additional 1GB NIC Realtek RTL8111
ROG RYUJIN 360 AIO Water Cooler
ASUS Hyper M.2 X16 Gen 4 RAID Controller Card with 4 x 1TB Samsung 980 Pro Gen 4
1TB Samsung 980 Pro on the Board Gen 4
1TB Samsung 980 Gen 3 UnAssigned
1TB Samsung 980 Gen 3 as Parity
So the even when the array is not mounted or doing anything it becomes unresponsive and a hard power down to restart fixes it.
I have disabled C States, I have put in rcu_nocbs=0-63 in the config and nothing has helped so far. I have run this machine under Windows for a week and the hardware is stable, no crashes and get the full performance. So i am guessing its something to do with my BIOS or UnRAID server settings.
I am tempted to sell it and get 6 Beast Canyon NUCs. So they can each do my work in pieces.