July 9, 20241 yr Hello, I recently ran into issues with my 10 year old Unraid server after updating the OS. Current Version: 6.12.6 Previous Version: 6.11.5 I had many issues after the update, most notably seeing lots of BTRFS errors in the logs on my two SSD drives in my pool, with some entries referencing some drives in my array. I did some research online and saw some posts mentioning faulty RAM or loose SATA cables. Multiple times I powered down my system and made sure all cables were securely plugged in, both SATA and power. I have an internal and external SAS card running to a second case with more drives. Everything was connected properly. My system would work for a couple hours after turning back on before the BTRFS errors returned. When this happened some of my media shares would go offline while some would stay online. Sample of some errors I saw during this time. I decided to replace my i7-2600K/ASUS MB system with a i7-12700K/MSI bundle from Microcenter. I swapped in the new hardware, CPU, MB, RAM, PSU and put in my SAS cards. I had some issues with getting the flash drive to Boot but some old forum entries helped me fix that. I booted up the system and all the drives were detected. I started the array and launched Plex and started streaming some items to test. Things worked for 15-20 minutes before my stream froze. I checked Unraid logs and saw more BRTFS errors referencing the two SSD drives in my pool. The GUI became unresponsive and I had to forcibly shut down my system. I had to cycle the power a couple of times for Unraid to boot. My array wants to run a Parity check but I'm anxious to do so as I'm afraid I may get more errors during the process as my array is pretty large. I'm posting logs before (2024/07/04) and after (2024/07/09) I replaced the internal hardware to show the logs. My two SSDs are different brands but the same size. I am planning to swap those out for NVME drives since my new MB has multiple onboard slots to accommodate. I appreciate some help reviewing my diagnostics and helping me pinpoint the issues. Thanks tower-diagnostics-20240704-1118.zip tower-diagnostics-20240709-0138.zip Edited July 9, 20241 yr by Dradder1 Fix grammar errors
July 9, 20241 yr Community Expert Solution Jul 2 21:39:32 Tower kernel: BTRFS info (device sdf1): bdev /dev/sdu1 errs: wr 1853791, rd 28389, flush 8887, corrupt 4507, gen 0 This shows that one of the pool devices dropped offline in the past, run a correcting scrub and post the results.
July 9, 20241 yr Author Thanks for the quick reply. A couple of questions. Should I start the array and if so do I do it in maintenance mode? A parity check is currently set to start because I did not shut down properly. I disabled the Dynamix Trim plugin and now see it listed in the Plugin File Install Errors. Can I run the scrub from the terminal per the instructions here? https://docs.unraid.net/unraid-os/manual/storage-management/#scrub btrfs check --readonly /dev/sdX1 btrfs check --repair /dev/sdX1 Thanks again
July 9, 20241 yr Community Expert You can do it with the array started in normal mode, it can be done using the GUI, click on the pool then scroll down to the scrub section.
July 11, 20241 yr Author I started the array and ran the scrub on the pool and it repaired corrupted blocks. The system is running a parity check and corrected some sync errors. I'll let it finish then will test my system again and follow-up here. Thanks,
July 11, 20241 yr Community Expert OK, also see here for better pool monitoring, in case a device drops again: https://forums.unraid.net/topic/46802-faq-for-unraid-v6/?do=findComment&comment=700582
July 12, 20241 yr Author My system has been working fine since repairing the scrub. I've been able to add media to my shares and use some dockers without issue. I will continue to monitor and will look into incorporating the pool monitoring. Thank you!
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.