MothyTim Posted June 27, 2022
Hi, I had the occasional crash with the RC releases, but it seems to have become more unstable with the stable releases! Not sure what's going on, but it crashed on Friday and nothing was working: no GUI and no response on the command line, so an unclean shutdown! And the same again this morning! I was suspicious of CA Backup because the crashes on the RCs seemed to happen at 6:00am, which is when it runs, but this morning's crash wasn't at that time! So I'm hoping someone cleverer than me can spot something. I've attached diagnostics from Friday and today. Thanks, Tim
tower-diagnostics-20220624-1452.zip tower-diagnostics-20220627-1114.zip
JorgeB Posted June 27, 2022
Enable the syslog server and post that after a crash.
MothyTim Posted June 27, 2022
Already enabled, here is the log! Sorry, I should have thought of including that!
syslog-10.19.64.2.log
JorgeB Posted June 27, 2022
May 14 00:41:06 Tower kernel: macvlan_broadcast+0x116/0x144 [macvlan]
May 14 00:41:06 Tower kernel: macvlan_process_broadcast+0xc7/0x110 [macvlan]
Macvlan call traces are usually the result of having dockers with a custom IP address. Switching to ipvlan should fix it (Settings -> Docker Settings -> Docker custom network type -> ipvlan (advanced view must be enabled, top right)), or see below for more info.
https://forums.unraid.net/topic/70529-650-call-traces-when-assigning-ip-address-to-docker-containers/
See also here:
https://forums.unraid.net/bug-reports/stable-releases/690691-kernel-panic-due-to-netfilter-nf_nat_setup_info-docker-static-ip-macvlan-r1356/
P.S. Unrelated, but you're also running out of RAM.
MothyTim Posted June 27, 2022
Ok, thanks for that. I've changed the Plex docker back to host, as that was the only one with a static IP; hopefully that'll fix it! I'll keep an eye on the RAM. It's probably when I'm running the Windows 11 VM; if so I'm not worried, but I'll check that it's nothing else! Thanks again for your help! Cheers, Tim
MothyTim Posted June 30, 2022
It crashed again last night, so sadly it seems that hasn't fixed the issue. It happened at around 20:30 - 20:45. The system wasn't totally hung like before, though, and I was able to restart properly! Logs attached, hopefully there is something there. Cheers, Tim
syslog-10.19.64.2.log tower-diagnostics-20220630-0839.zip
JorgeB Posted June 30, 2022
13 minutes ago, MothyTim said: It happened at around 20:30 - 20:45
Unfortunately there's nothing logged for that period. I assume it happened between these lines:
Jun 29 19:53:06 Tower autofan: Highest disk temp is 40C, adjusting fan speed from: 163 (63% @ 1548rpm) to: 140 (54% @ 1336rpm)
Jun 29 20:50:13 Tower kernel: mdcmd (36): set md_write_method 1
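Spotting a silent window like the one above by eye is tedious in a long log. Below is a minimal sketch of how the largest gaps between consecutive syslog lines could be found. The function names are hypothetical, the year is an assumption (the syslog timestamps in this thread carry none), and month parsing assumes an English locale.

```python
from datetime import datetime

# The two lines quoted in the post above, used as sample input.
SAMPLE = [
    "Jun 29 19:53:06 Tower autofan: Highest disk temp is 40C, adjusting fan speed "
    "from: 163 (63% @ 1548rpm) to: 140 (54% @ 1336rpm)",
    "Jun 29 20:50:13 Tower kernel: mdcmd (36): set md_write_method 1",
]

def parse_ts(line, year=2022):
    """Return the timestamp of a syslog line, or None if it has none.

    The year is an assumption passed in by the caller.
    """
    parts = line.split(None, 3)  # tolerant of "Jul 1" and "Jul  1"
    if len(parts) < 3:
        return None
    try:
        return datetime.strptime(
            f"{year} {parts[0]} {parts[1]} {parts[2]}", "%Y %b %d %H:%M:%S"
        )
    except ValueError:
        return None

def largest_gaps(lines, top=3, year=2022):
    """Return up to `top` (gap_seconds, line_before, line_after) tuples, largest first."""
    stamped = []
    for line in lines:
        ts = parse_ts(line, year)
        if ts is not None:
            stamped.append((ts, line.rstrip("\n")))
    gaps = []
    for (t0, before), (t1, after) in zip(stamped, stamped[1:]):
        gaps.append(((t1 - t0).total_seconds(), before, after))
    gaps.sort(key=lambda g: g[0], reverse=True)
    return gaps[:top]

for seconds, before, after in largest_gaps(SAMPLE):
    print(f"{seconds / 60:.0f} min of silence after: {before[:40]}...")
```

To run it against a real log, the lines of the saved syslog file would be passed in place of `SAMPLE`. A long gap only shows when logging stopped, not why.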
MothyTim Posted June 30, 2022
Ok, thanks for looking so quickly. I think it was something with docker, as Plex became unresponsive and when I tried to restart its container it threw a server error and the container wouldn't start. I guess I could have restarted the docker service, but I opted to restart the whole machine! Cheers, Tim
MothyTim Posted July 1, 2022
Ok, so a problem again. I think the GUI crashed, as all dockers etc. seemed to still be working but there was no GUI. The system wouldn't restart, so I had to hit the oh shit button! Logs attached! I'm wondering if I'll have to give up on 6.10 and go back to 6.9? Cheers, Tim
tower-diagnostics-20220701-1811.zip syslog-10.19.64.2.log
JorgeB Posted July 1, 2022
Jul 1 13:48:56 Tower kernel: general protection fault, probably for non-canonical address 0x9c0101000034: 0000 [#1] SMP PTI
Now there's something, but no Linux module/driver is mentioned, so I don't know whether it was a software or a hardware issue.
MothyTim Posted July 1, 2022
Ok thanks for looking, annoying that it’s not more specific! cheers, Tim
trurl Posted July 1, 2022
What do you get from the command line with this?
df -h /
Check periodically to see if usage is growing.
MothyTim Posted July 2, 2022
root@Tower:~# df -h /
Filesystem      Size  Used Avail Use% Mounted on
rootfs           16G  953M   15G   6% /
root@Tower:~#
MothyTim Posted July 2, 2022
root@Tower:~# df -h /
Filesystem      Size  Used Avail Use% Mounted on
rootfs           16G  959M   15G   7% /
root@Tower:~#
trurl Posted July 2, 2022
rootfs is the space the OS uses for its own files, and if it fills, all sorts of things can go wrong since the OS no longer has any space to work. Sometimes users will have something misconfigured, such as a docker host mapping to a path that isn't actual mounted storage, which would then be in rootfs. Your usage has grown a little, but it is nowhere near full yet and maybe it won't fill up. Just another thing you can check on if other things don't give a clue.
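The periodic check suggested above could also be automated. A minimal sketch, assuming Python 3 is available on the server (it is not part of a stock install as far as this thread shows); the function names and the hourly interval are hypothetical choices. The figures may differ slightly from what `df -h /` prints, since `df` rounds for display.

```python
import shutil
import time

def used_mib(path="/"):
    """Return the space currently used on the filesystem holding `path`, in MiB."""
    usage = shutil.disk_usage(path)  # named tuple: total, used, free (bytes)
    return usage.used / (1024 * 1024)

def growth(samples):
    """Return the change between the first and the latest sample (0 if fewer than two)."""
    return samples[-1] - samples[0] if len(samples) > 1 else 0

def watch(path="/", interval_s=3600, count=24):
    """Sample usage `count` times, `interval_s` seconds apart, printing each reading."""
    samples = []
    for i in range(count):
        samples.append(used_mib(path))
        stamp = time.strftime("%Y-%m-%d %H:%M:%S")
        print(f"{stamp} {path} used {samples[-1]:.0f} MiB "
              f"({growth(samples):+.0f} MiB since start)")
        if i < count - 1:
            time.sleep(interval_s)
    return samples

# One-off reading; call watch() instead for repeated sampling.
print(f"rootfs used now: {used_mib('/'):.0f} MiB")
```

Small swings in either direction are what the readings in this thread show; a steady climb across many samples is the pattern worth investigating.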
MothyTim Posted July 2, 2022
Ok thanks for the info, just ran it again.
root@Tower:~# df -h /
Filesystem      Size  Used Avail Use% Mounted on
rootfs           16G  953M   15G   6% /
root@Tower:~#
Cheers, Tim
MothyTim Posted July 3, 2022
And this morning's.
root@Tower:~# df -h /
Filesystem      Size  Used Avail Use% Mounted on
rootfs           16G  980M   15G   7% /
root@Tower:~#
Cheers, Tim
MothyTim Posted August 3, 2022
Ok, so it stayed up for a whole month and has crashed again this morning! So bad this time that it won't boot back up again! This is beyond frustrating now. I should have guessed a problem was looming, as CA Backup has failed to restart my docker containers for 2 days on the trot. I ran diagnostics before rebooting, as all the USB devices had disappeared from USB Manager! Really hope something shows in the logs this time. Cheers, Tim
tower-diagnostics-20220803-1238.zip
JorgeB Posted August 3, 2022
Aug 1 12:35:11 Tower kernel: general protection fault, probably for non-canonical address 0x9c0101000034: 0000 [#1] SMP PTI
Aug 1 13:01:44 Tower kernel: irq 16: nobody cared (try booting with the "irqpoll" option)
A couple of days ago there was the same error as last time, and then IRQ 16 also got disabled. That is possibly not a big issue, since the server worked for 2 more days. The main issue was that this USB controller stopped working:
Aug 3 12:35:33 Tower kernel: xhci_hcd 0000:00:14.0: xHCI host controller not responding, assume dead
And the flash drive was using it, so after that Unraid cannot continue to work correctly.
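The messages picked out of the syslog in this thread can be searched for mechanically. A minimal sketch: the pattern list is only the set of messages quoted in this thread, not a complete list of kernel faults, and the labels and function name are hypothetical.

```python
import re

# Patterns taken from the log lines quoted in this thread; labels are made up here.
PATTERNS = {
    "general protection fault": re.compile(r"general protection fault"),
    "irq disabled": re.compile(r"irq \d+: nobody cared"),
    "usb controller dead": re.compile(r"xHCI host controller not responding"),
    "macvlan call trace": re.compile(r"\[macvlan\]"),
}

# The three lines quoted in the post above, used as sample input.
SAMPLE = [
    "Aug 1 12:35:11 Tower kernel: general protection fault, probably for "
    "non-canonical address 0x9c0101000034: 0000 [#1] SMP PTI",
    'Aug 1 13:01:44 Tower kernel: irq 16: nobody cared (try booting with the "irqpoll" option)',
    "Aug 3 12:35:33 Tower kernel: xhci_hcd 0000:00:14.0: xHCI host controller "
    "not responding, assume dead",
]

def scan(lines):
    """Return (label, line) for every line matching one of PATTERNS, in log order."""
    hits = []
    for line in lines:
        for label, pattern in PATTERNS.items():
            if pattern.search(line):
                hits.append((label, line.rstrip("\n")))
    return hits

for label, line in scan(SAMPLE):
    print(f"[{label}] {line}")
```

A match only flags a line for a closer look; as noted earlier in the thread, a general protection fault with no module named does not say whether the cause is software or hardware.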
MothyTim Posted August 3, 2022
I've got a bad feeling that the USB on the motherboard has failed or is failing, as the mouse doesn't light up! It could be that a single port has failed; I'm just trying it with minimal stuff plugged in! The Unraid USB stick mounts and is browsable on my Mac. Do you think it might be corrupted? Should I reflash it? Thanks for your help again. Cheers, Tim
JorgeB Posted August 3, 2022
Run chkdsk on it but the flash should be OK.
MothyTim Posted August 3, 2022
Ok, ran First Aid on the stick on my Mac and it's booted up. The attached screen shows an error though:
XFS (md3): Metadata corruption detected at xfs_dinode_verify+0xa4/0x56f [xfs], inode 0x5c1bf6 dinode
XFS (md3): Unmount and run xfs_repair
Then lines of hex code! What do I do about that? Thanks, Tim
JorgeB Posted August 3, 2022
3 minutes ago, MothyTim said: What do I do about that?
3 minutes ago, MothyTim said: Unmount and run xfs_repair
Check filesystem on disk3.