October 22, 20241 yr For the past couple of days, I have noticed some services offline when I go to access them. When I go look at my server homepage, it's running, but the parity has stopped and says 1 error. Where could i find this error, and what would I do to ensure it stops this behavior? Thanks! Please let me know what i need to post to be able to get help. I am semi new to the homelab scene, but I've been running for a little over a year with no issues.
October 22, 20241 yr Author 9 hours ago, JorgeB said: Please post the diagnostics Please see the attached diagnostics.. Thanks for your help! bigdaddy-diagnostics-20241022-1324.zip
October 23, 20241 yr Community Expert On 10/22/2024 at 1:01 AM, F0R3STLANE said: but the parity has stopped and says 1 error. What do you mean by this? Post a screenshot. Also, btrfs is detecting data corruption, you should run memtest, and then a scrub.
October 23, 20241 yr Author 5 hours ago, JorgeB said: What do you mean by this? Post a screenshot. Also, btrfs is detecting data corruption, you should run memtest, and then a scrub. Yes, so there is no real error or screenshot to show. The only thing I am able to see is when my whole house goes down, I run to my desktop to see that the Parity has stopped. When I start the Parity again, it would run a Parity check. I watched it run, it got to about 50-55%, and then I saw that the parity check basically ended. The parity would stay online and then stop again. I would do the same thing. Now it's time to figure out what's going on and get it fixed, so the parity will stay online. If there is possible Data corruption like you say, I will run the memtest and then when you say "scrub" what does this mean exactly? What would I run on the server? Thanks for your time and your help! The server and parity have been online for ongoing 9 hours now...
October 23, 20241 yr What is odd is that your 'services' go offline and you see that parity is stopped with an 'error' neither of these things should effect the other, if your parity was bad... the array would function as normal, thus your 'services/dockers' would continue as normal... Likely your issue is with the backend... ie PSU or motherboard... Have you recently added a new HDD, GPU? Are you sure that your PSU is stable and large enough to run your system? Edited October 23, 20241 yr by mathomas3
October 23, 20241 yr Author 7 minutes ago, mathomas3 said: What is odd is that your 'services' go offline and you see that parity is stopped with an 'error' neither of these things should effect the other, if your parity was bad... the array would function as normal, thus your 'services/dockers' would continue as normal... Likely your issue is with the backend... ie PSU or motherboard... Have you recently added a new HDD, GPU? Are you sure that your PSU is stable and large enough to run your system? You know what's funny.. Last night, as I was scratching my head trying to figure this all out, I went back there to open the server up. I noticed that the power supply is 650W and it's not a high end PSU.. I was already thinking about just buying a new power supply and throwing it in there. No new HDD, there is a GTX 970 in there but i was thinking to remove it because it does not get used (to my knowledge).. Any other ideas?! Thanks for your answer too!
October 23, 20241 yr IMO... pull the GPU and see if that makes things a stable, it might be enough to make things stable but given that it's likely not pulling much power(not being used) and your likely to add a HDD sometime down the road, I would opt to upgrade to a 800 watt PSU or higher
October 23, 20241 yr I was about to say that you could run a parity check to validate that you dont have enough power... but you have already run it... sooo... PSU upgrade is needed IMO
October 23, 20241 yr Author 5 minutes ago, mathomas3 said: I was about to say that you could run a parity check to validate that you dont have enough power... but you have already run it... sooo... PSU upgrade is needed IMO When i run the Parity checks, they are not able to finish.. its like they time out or something odd. I will start one, it will get to about 50-60% and then say its completed (shows 1 error) but i cant see what that error is. And then the parity wont be online.. i have to go in, start it up, and everything works.. VERY ODD!!! If a simple power supply does the trick, i will be very happy Thanks!
October 23, 20241 yr Just now, F0R3STLANE said: When i run the Parity checks, they are not able to finish.. its like they time out or something odd. I will start one, it will get to about 50-60% and then say its completed (shows 1 error) but i cant see what that error is. And then the parity wont be online.. i have to go in, start it up, and everything works.. VERY ODD!!! If a simple power supply does the trick, i will be very happy Thanks! Its likely the PSU... ie it's pulling the most power from the PSU when all the disks are spinning and reading/writing, thus it errors out... you could try pulling a HDD(moving data off of it, should you have enough free space) and removing the GPU... IMO that would be enough given you are likely on the knifes edge of what you need/have but if you have the $$$ to spare for a new PSU... that's the easiest thing to do and have peace of mind
October 23, 20241 yr Author 40 minutes ago, mathomas3 said: Its likely the PSU... ie it's pulling the most power from the PSU when all the disks are spinning and reading/writing, thus it errors out... you could try pulling a HDD(moving data off of it, should you have enough free space) and removing the GPU... IMO that would be enough given you are likely on the knifes edge of what you need/have but if you have the $$$ to spare for a new PSU... that's the easiest thing to do and have peace of mind Just ordered a 850W PSU and it will be here today! i will have it changed out today and pray for best results!
October 23, 20241 yr 2 minutes ago, F0R3STLANE said: Just ordered a 850W PSU and it will be here today! i will have it changed out today and pray for best results! Feels odd when recommending others to spend money when there is a chance that you could be wrong even though you feel it's the best advice... Praying that it fixes the issue for you... Good luck
October 23, 20241 yr Author 12 minutes ago, mathomas3 said: Feels odd when recommending others to spend money when there is a chance that you could be wrong even though you feel it's the best advice... Praying that it fixes the issue for you... Good luck Dont feel that way. Only reason i went ahead and ordered was because i was having the same thoughts last night when i looked at it and noticed it was only 650W power supply and its a old one. I will throw this 850W in there and be good.. fingers crossed! The server / parity has been online going on 12 hours now.
October 28, 20241 yr Author On 10/25/2024 at 7:42 PM, mathomas3 said: have you had any issues since upgrading the PSU? Hello! Soooo. I had one "unclean shutdown" since and i started the party again, ran the parity check and it went all the way thru with no errors. NOW, this morning, it stopped again. Saying Unclean Shutdown detected. No errors.. I do run a Debian VM on the server that hosts a game server for my son.. There is nothing "new" in the system.. I am not sure what i can do... Any advice would be GREAT! Thanks!
October 30, 20241 yr My guess is that there was a power flick that caused it to go down... do you use a UPS? Guessing not. I would recommend one but Im willing to guess that this is a one off... should this happen again soon, lets talk, but for now I think your good... Hoping
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.