somebuddy Posted January 26, 2020 Share Posted January 26, 2020 Hi, at first sorry for my bad english! My plan was to build a new home server for my house. I need a stable NAS solution and some vms and docker containers so i tried unraid. This ist my hardware config: CPU: Athlon 240GE Mainboard: Asus X370 Prime Pro (newest bios) RAM: 16GB DDR4 2133Mhz HDD: Some 4TB, 2TB IDE Drives (11TB sum) After a restart my server is working without problems for a few days. Sometimes there are stopped docker containers and after a few days the unraid machine freezes. No ping, no web GUI ... nothing. After a hard reset unraid tries to check the parity. While working with the gui i see sometimes BTRFS Erros / critical / missing leaf etc. Sometimes after a restart my Docker Containers and VMs are lost completely. I didn't copy much data until it works stable. So a reinstall ist possible. I hope you can halb me making this great "thing" stable. Thank you! tower-diagnostics-20200126-0129.zip Quote Link to comment
trurl Posted January 26, 2020 Share Posted January 26, 2020 2 hours ago, somebuddy said: Some 4TB, 2TB IDE Drives Are you really using IDE drives? Quote Link to comment
JorgeB Posted January 26, 2020 Share Posted January 26, 2020 Ryzen based CPUs on Linux can lock up due to issues with c-states, make sure bios is up to date, then look for "Power Supply Idle Control" (or similar) and set it to "typical current idle" (or similar), or completely disable C-sates. More info here: https://forums.unraid.net/bug-reports/prereleases/670-rc1-system-hard-lock-r354/ Quote Link to comment
somebuddy Posted January 26, 2020 Author Share Posted January 26, 2020 5 hours ago, trurl said: Are you really using IDE drives? SATA of course nodtalgia Quote Link to comment
somebuddy Posted January 26, 2020 Author Share Posted January 26, 2020 28 minutes ago, johnnie.black said: Ryzen based CPUs on Linux can lock up due to issues with c-states, make sure bios is up to date, then look for "Power Supply Idle Control" (or similar) and set it to "typical current idle" (or similar), or completely disable C-sates. More info here: https://forums.unraid.net/bug-reports/prereleases/670-rc1-system-hard-lock-r354/ I will try.. i hope this also fixes my lost vm and docker issue. Quote Link to comment
JorgeB Posted January 26, 2020 Share Posted January 26, 2020 With power supply control set correctly you can usually leave c-states enable. Quote Link to comment
somebuddy Posted January 26, 2020 Author Share Posted January 26, 2020 after changing the bios settings i tried if this fixes my docker / kvm problems.. i reinstalled the docker containers and tried to install a debian vm. After some working with the vm i ran into the following issue again. I think after a restart there is no more docker /vm like before. Maybe it is wrong to install the docker into appdata which uses my cache SSDs? Docker Tab: VM Tab: tower-diagnostics-20200126-1059.zip Quote Link to comment
itimpi Posted January 26, 2020 Share Posted January 26, 2020 The diagnostics show that your cache is failing to mount which could explain both your docker and VM problems. Quote Link to comment
JorgeB Posted January 26, 2020 Share Posted January 26, 2020 Cache drive filesystem is corrupt, you need to reformat, but before that run memtest, various apps are crashing/segfaulting, very likely you have a hardware problem, like bad RAM. Quote Link to comment
somebuddy Posted January 26, 2020 Author Share Posted January 26, 2020 OK.. thank you! I think a have a problem with my ram. It is brand new so i didn't mention that this could bei the problem. Quote Link to comment
itimpi Posted January 26, 2020 Share Posted January 26, 2020 Definitely a RAM problem Anything other than 0 errors when running the tests means you have a problem. Sometimes it is just a case of reseating the RAM modules. You could also try clocking them at a lower speed? Quote Link to comment
somebuddy Posted January 26, 2020 Author Share Posted January 26, 2020 36 minutes ago, itimpi said: Definitely a RAM problem Anything other than 0 errors when running the tests means you have a problem. Sometimes it is just a case of reseating the RAM modules. You could also try clocking them at a lower speed? There is one of two RAM modules faulty.. The other one runs in memtest without a problem until now. I will send it back this week. Thank you. Can you please tell me how the reformat the cache partition? Quote Link to comment
JorgeB Posted January 26, 2020 Share Posted January 26, 2020 Format button is below start array button, note that any data on the cache device will be deleted, if needed you can try using these recovery options, but btrfs is very intolerant of bad RAM, might be very corrupt. Quote Link to comment
somebuddy Posted January 26, 2020 Author Share Posted January 26, 2020 (edited) it seems that it runs stable at the moment thank you ! Edited January 26, 2020 by somebuddy Quote Link to comment
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.