JohanSF Posted November 8, 2018 Share Posted November 8, 2018 (edited) This is a continuation of: and with diagnostics as per Squids' instructions, however, I did have to reboot in order to start the docker service again.hal9000-diagnostics-20181108-1715.zip In response to I do have my appdata on the cache drive. I also think it did move last night, the 260 GB here makes sense as I downloaded large content just after the first crash. But it does indeed seem to have something to do with the cache drive and/or a container. Edited November 8, 2018 by JohanSF Quote Link to comment
JorgeB Posted November 8, 2018 Share Posted November 8, 2018 Your cache drive is fully allocated and needs a balance, see here: https://lime-technology.com/forums/topic/62230-out-of-space-errors-on-cache-drive/?do=findComment&comment=610551 Quote Link to comment
JohanSF Posted November 8, 2018 Author Share Posted November 8, 2018 (edited) 32 minutes ago, johnnie.black said: Your cache drive is fully allocated and needs a balance, see here: https://lime-technology.com/forums/topic/62230-out-of-space-errors-on-cache-drive/?do=findComment&comment=610551 Alright I don't really know what I am doing but you ask me to do btrfs balance start -dusage=75 /mnt/cache in the console right? Edited November 8, 2018 by JohanSF Quote Link to comment
JorgeB Posted November 8, 2018 Share Posted November 8, 2018 Yes, and like it mention on the thread linked: Quote If you get ENOSPC lower the 75 until you can complete a balance, e.g. try -dusage=50, 25 and so on, then do again with an higher number until you can do with at least 75. Quote Link to comment
JohanSF Posted November 8, 2018 Author Share Posted November 8, 2018 I think it's doing its thing now. Quote Link to comment
JorgeB Posted November 8, 2018 Share Posted November 8, 2018 Yes, if it's was going to error it's usually quick, but if you want post new diags when it finishes to confirm all is well. Quote Link to comment
JohanSF Posted November 8, 2018 Author Share Posted November 8, 2018 (edited) hal9000-diagnostics-20181108-1833.zipI got this: I stopped the parity check and tried this: All good now? - if yes, that was an easy fix, can you explain more about what is going on and how you diagnosed it? New diagnostics: hal9000-diagnostics-20181108-1833.zip Edited November 8, 2018 by JohanSF Quote Link to comment
JorgeB Posted November 8, 2018 Share Posted November 8, 2018 Yes, it's fine, and it shouldn't happen again, this only happens with older kernels, or users coming from older kernels and never ran a balance which I assume is your case. Before: Data Metadata System Id Path single single single Unallocated -- -------------- --------- -------- -------- ----------- 1 /dev/nvme0n1p1 474.93GiB 2.01GiB 4.00MiB 56.00KiB -- -------------- --------- -------- -------- ----------- Total 474.93GiB 2.01GiB 4.00MiB 56.00KiB Used 240.03GiB 1.27GiB 80.00KiB After: Data Metadata System Id Path single single single Unallocated -- -------------- --------- -------- -------- ----------- 1 /dev/nvme0n1p1 253.01GiB 3.01GiB 4.00MiB 220.92GiB -- -------------- --------- -------- -------- ----------- Total 253.01GiB 3.01GiB 4.00MiB 220.92GiB Used 240.02GiB 1.24GiB 64.00KiB Problem was the unallocated space, which you dind't have any. Quote Link to comment
JohanSF Posted November 8, 2018 Author Share Posted November 8, 2018 I cannot thank you enough, it is good to have a stable system again. Quote Link to comment
JohanSF Posted November 9, 2018 Author Share Posted November 9, 2018 (edited) I celebrated too early. The whole unRaid server crashed again now during the night. It must have been before 3:40 am as the mover has not run. Here is the syslog and diagnostics: syslog.txt (I know that Ihal9000-diagnostics-20181109-0622.zipserver to watch something on plex up until about 11 pm) hal9000-diagnostics-20181109-0622.zip It should also not be caused by my Ryzen 1700 processor as I have the zenstates script applied to disable C6 states: Edited November 9, 2018 by JohanSF Quote Link to comment
JohanSF Posted November 9, 2018 Author Share Posted November 9, 2018 (edited) I just updated to 6.6.5 and started the array. Next to the Array status on the main page it now says "BTRFS operation is running". Now it is unresponsive.. should I hard-restart the machine? This is becoming a little scary. Edited November 9, 2018 by JohanSF Quote Link to comment
bonienl Posted November 9, 2018 Share Posted November 9, 2018 Either a balance or scrub operation is being performed and array can not be stopped until this operation is completed. Quote Link to comment
JohanSF Posted November 9, 2018 Author Share Posted November 9, 2018 (edited) 4 minutes ago, bonienl said: Either a balance or scrub operation is being performed and array can not be stopped until this operation is completed. Ok. It is unresponsive in the way that on the main page, everything on the page under the disk status boxes is now missing. Using my phone with teamviewer to see it. I can also see that the log has red erros. I can post that when I get home. Edited November 9, 2018 by JohanSF Quote Link to comment
bonienl Posted November 9, 2018 Share Posted November 9, 2018 1 minute ago, JohanSF said: Ok. It is unresponsive in the way that on the main page, everything on the page under the disk status boxes is now missing. Using my phone with teamviewer to see it. That doesn't sound right. Do you use BTRFS for the array or the cache or both? Quote Link to comment
JohanSF Posted November 9, 2018 Author Share Posted November 9, 2018 Only for the cache. Quote Link to comment
bonienl Posted November 9, 2018 Share Posted November 9, 2018 It might be a corrupted cache file system. Can you post diagnostics. If the GUI doesn't work then use terminal/telnet and type 'diagnostics', the zip file will be saved on your flash device in the /logs folder. Probably need the help of the true expert @johnnie.black Quote Link to comment
JohanSF Posted November 9, 2018 Author Share Posted November 9, 2018 (edited) 1 hour ago, bonienl said: It might be a corrupted cache file system. Can you post diagnostics. If the GUI doesn't work then use terminal/telnet and type 'diagnostics', the zip file will be saved on your flash device in the /logs folder. Probably need the help of the true expert @johnnie.black I can click Download diagnostics but it is collecting diagnosis information forever and the download never happens. Trying with the terminal method I get "Starting diagnostics collection..." and nothing happens. Update: I cannot restart it remotely it seems, have to do a hard reset when I get home. I really hope the cache drive is not corrupted Edited November 9, 2018 by JohanSF Quote Link to comment
JorgeB Posted November 9, 2018 Share Posted November 9, 2018 If you can't get diags before rebooting, grab and post them right after rebooting. Quote Link to comment
JohanSF Posted November 9, 2018 Author Share Posted November 9, 2018 (edited) Got home to this log: Restarted the machine with hardware button. Here are the diagnostics before starting the array: hal9000-diagnostics-20181109-1539.zip It started, parity check runs and dockers started too. I am looking at this now: Should I start the Troubleshooting Mode in "Fix Common Problems"? Edit: Not sure I can do that though, the "Scanning" when I enter the page seems to stay there forever. This is in the log: Edited November 9, 2018 by JohanSF Quote Link to comment
John_M Posted November 9, 2018 Share Posted November 9, 2018 I'd like to know exactly what the nginx errors you're seeing are about, as I've seen them myself on occasion but I've never seen an explanation for them. The Web GUI pages are really quite complicated and for nginx to serve one up it has to retrieve the sources from multiple locations, most of which are dynamic and dependent on scripts completing and returning the necessary code. That looks as though it's failing here and causing the unresponsiveness. Quote Link to comment
JohanSF Posted November 9, 2018 Author Share Posted November 9, 2018 Can I restart the plugin? I don't want to restart the whole server now that the parity check is running. Quote Link to comment
JorgeB Posted November 9, 2018 Share Posted November 9, 2018 2 hours ago, JohanSF said: diagnostics before starting the array: Sorry, my fault, I meant diags right after starting the array. Quote Link to comment
JohanSF Posted November 9, 2018 Author Share Posted November 9, 2018 Here: hal9000-diagnostics-20181109-1756.zip Quote Link to comment
JorgeB Posted November 9, 2018 Share Posted November 9, 2018 Except for the nginx errors, which I also don't know what they mean, though I see them frequently, all appears good, cache is fine. Quote Link to comment
JohanSF Posted November 9, 2018 Author Share Posted November 9, 2018 That is good news, do you have any idea what do to about them? do I have to restart? Quote Link to comment
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.