pappaq

Members
  • Posts

    205

Everything posted by pappaq

  1. My problem has returned, worse than ever. The system is pretty much locked up when the iowait issue occurs. It seems like memory writeback is the issue. Is there something I can do? I've found many threads about it but never a solution. Here are some screenshots from my netdata and the diagnostics. dringenet-ms-diagnostics-20231021-1038.zip
  2. With "Repair corrupted blocks" enabled, I assume?
  3. I am moving the files off the App_Cache to my array. I will replace the RAM on Thursday and swap the older 500GB SSD in the App_Cache for a 1TB SSD, to get an even 1TB App_Cache instead of the 1TB + 500GB combo, which resulted in 750GB. That was on my list anyway. I will recreate the Docker image then as well. How do I scrub the two pools? I'd start with the Data_cache, because the data of the other pool is currently being moved. Thanks for your support!
  4. I've removed two RAM modules and ran the test again for an hour without getting any errors. I then tried to get the server running again until the new RAM arrives, but the Docker service fails to start anyway. I'm getting this error. Did the RAM corrupt something? I've got a backup of all my appdata... New diagnostics are attached. dringenet-ms-diagnostics-20231002-1402.zip
  5. JorgeB, you have answered my question in another post: Just ordered unregistered ECC! Thanks for the support!
  6. Looks like we have a winner. It was up to 25 errors when I stopped the test. I'm thinking about getting unregistered ECC memory as a replacement. My Ryzen setup should support it. I'm currently reading up on it, as my knowledge about memory is a bit rusty. Would it prevent errors like these?
  7. Hey there, this morning I woke up to an error on my server. The Docker service didn't come up after last night's backup process. It seems like one of the cache drives in my application cache pool is failing. Both drive logs threw btrfs errors. After a reboot, the cache and the Docker service are up and running again. I've read a little here in the forums and it seems like one of the two drives is failing, but I can't really figure out which one. Could somebody help me figure out which one it is, or whether I have to replace both? Here are the diagnostics and the SMART logs of both drives. Thanks in advance! dringenet-ms-diagnostics-20231002-0808.zip dringenet-ms-smart-20231002-0822.zip dringenet-ms-smart-20231002-0823.zip
  8. Hello, as stated in the title, my server has detected hardware errors according to Fix Common Problems, but it does not tell me what is wrong. Here are my diagnostics, maybe someone can help me! dringenet-ms-diagnostics-20230702-1501.zip
  9. Yeah, thought so. Gonna do that tomorrow evening. Thanks for your feedback!
  10. Unfortunately, nothing changed after shutting down the VM. The wa value is still high and random cores go up to 100% load. Here are my diagnostics after shutting down the VM. Do you guys have an idea? dringenet-ms-diagnostics-20230424-1652.zip
  11. When the CPU spikes occur, the wa value peaks as well. I'm going to try your suggestion later. Thanks in advance. If it is the VM, what would be the next step? I'd like to lay out a plan to keep downtime to a minimum!
  12. I have not tried that yet. The lights in the flat are controlled by it, so I would like to test it when nobody else is home. In the meantime I've observed that the high CPU usage is on random cores, not only on the cores which are pinned to the VM. Could that be an indication against your idea?
  13. Hey, my server is currently having the issue of "getting stuck" every few seconds. Three CPU cores go up to 100%, file transfers stall and only continue after the CPU usage returns to normal. This repeats every few seconds. It has an impact on other services as well, but data transfer is impacted the most. top shows high shfs usage on one core. Could someone have a look at my logs and help me? Thanks in advance! dringenet-ms-diagnostics-20230419-1717.zip
  14. My USB flash drive was corrupt once again. I replaced it, restored the drive, and the server is back up and running again... 3 flash drives in 4 months. Can be closed.
  15. Yesterday I updated my Appdata Backup addon to version 2.5, and last night it pretty much fucked my whole system. My shares are gone and my Dockers won't start. I'm trying to restore the stick from a flash backup, but the flash tool is stuck at "syncing filesystem", and a normal copy of everything from the flash drive backup onto the stick causes the system to not boot anymore. Could somebody please look at my diagnostics and point me in a direction to fix this? dringenet-ms-diagnostics-20230403-1659.zip
  16. Hey there, I've been struggling with my boot flash drives for over a month now. First, my 4+ year old boot flash drive failed in late December 2022. I replaced it with a dirt cheap one on the 30th of December. Unraid changed it to read-only again at the beginning of this week. Then I searched for a good and hopefully long-lasting alternative and bought a Samsung stick, which was recommended in one of spaceinvader's videos. But I get this in the disk log. It's a brand new stick! Does it mean that the new stick was dead on arrival, or did I do something wrong in the migration process? Thanks in advance, I've attached the diagnostics. Cheers dringenet-ms-diagnostics-20230127-1426.zip
  17. I've got 22 Dockers running and one virtual machine (Home Assistant, which should not draw that much). I think the H310 pulls about 8W, unfortunately. The 56-61W are with all drives spun down, so I think I am at the lowest I can get. I'm trying to keep downtime as low as I can, so tinkering much more is not an option. But I am curious what would be possible with an Intel platform and a low-power CPU, because the extra power of the 1700 is not really needed. Only the IO of the ATX board is really necessary. Does anybody have experience with an Intel platform and a similar setup?
  18. Currently I'm trying to bring the power consumption of my server down because of the rising electricity prices. I can't get my Ryzen 1700 to use anything other than C1 and C2. I've set Global C-state Control = Enabled and Power Supply Idle Control = Low Current Idle in the BIOS and used your tweaks, including powertop --auto-tune. My setup: Asus B450 F Strix; Ryzen 1700; 7x WD 8TB HDD; 3x WD 4TB HDD; 4x SanDisk SSDs; 1x Crucial m.2; 1x nvidia GTX1050ti for transcoding; 5x Noctua 120mm fans running in "silent mode" according to the BIOS; 1x Dell H310 SAS controller + 40mm Noctua fan. The whole system idles at 56-61W, but I would love to bring this down even further. I am grateful for any advice!
  19. I've installed an older version through repository tags. Take a look at this. https://hub.docker.com/r/linuxserver/nextcloud/tags?page=1
  20. Same problem here. "linuxserver/nextcloud:140" downgrades too far.
  21. And I am having the "File not found." error again after rebooting my server. Attached are the latest diagnostics from yesterday. I don't want to set up my server from scratch again this time. Does anybody have a clue? dringenet-ms-diagnostics-20220714-2006.zip
  22. I've never had this error. BUT today the power was out for five hours and my server lost power abruptly. Could you guys please take a look at the diagnostics? Thanks dringenet-ms-diagnostics-20220714-2006.zip
  23. I've deleted the Docker image and the VM image and set up everything from scratch. The server has now been running without a problem for nearly 4 days straight. Looks solved. If it occurs again, I will report here.
  24. I suspect more and more that it has something to do with the shares. Every time I start working on the shares in the main tab, the error comes up. I will try to reproduce the behavior...
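
Several of the posts above refer to the "wa" (iowait) value shown by top. For anyone who wants to log that number themselves rather than watch it live, here is a minimal sketch in Python. It assumes a Linux system where the first line of /proc/stat holds the aggregate CPU counters in the usual order (user, nice, system, idle, iowait, irq, softirq, steal, ...). The function names are made up for illustration and are not part of Unraid, netdata, or any other tool mentioned in the posts.

```python
def parse_cpu_line(line):
    """Parse the aggregate 'cpu' line of /proc/stat into a list of tick counters."""
    fields = line.split()
    if not fields or fields[0] != "cpu":
        raise ValueError("expected the aggregate 'cpu' line")
    return [int(value) for value in fields[1:]]


def iowait_percent(before, after):
    """Return the share of CPU time spent in iowait between two samples, in percent."""
    first = parse_cpu_line(before)
    second = parse_cpu_line(after)
    deltas = [b - a for a, b in zip(first, second)]
    # Only the first eight counters are summed: guest and guest_nice are
    # already accounted for inside user and nice.
    total = sum(deltas[:8])
    if total <= 0:
        return 0.0
    # iowait is the fifth counter (index 4).
    return 100.0 * deltas[4] / total


def read_cpu_line(path="/proc/stat"):
    """Read the aggregate 'cpu' line from /proc/stat (Linux only)."""
    with open(path) as handle:
        return handle.readline()


# Worked example with made-up counter values (not taken from any diagnostics):
sample_before = "cpu 100 0 50 800 50 0 0 0 0 0"
sample_after = "cpu 200 0 150 1400 250 0 0 0 0 0"
example_wa = iowait_percent(sample_before, sample_after)
```

On a live system, one would call read_cpu_line() twice, a few seconds apart, and pass both lines to iowait_percent(). The result is an average over all cores, so a single stalled core will look less dramatic here than in a per-core graph.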