DuzAwe

Members
  • Content Count

    55

Everything posted by DuzAwe

  1. Managed to catch this today. With the lightest of Google searches, it looks like it may be a bug/regression in the kernel. I could very well be wrong, but maybe? https://askubuntu.com/questions/1293945/20-10-complete-system-freeze-with-general-protection-fault-smp-nopti https://www.spinics.net/lists/linux-nfs/msg78091.html https://www.spinics.net/lists/amd-gfx/msg48596.html Jun 12 14:00:01 thelibrary kernel: Jun 12 14:00:44 thelibrary kernel: general protection fault, probably for non-canonical address 0x1090000ffffff76: 0000 [#1] SMP NOPTI Jun 12 14:00:44 thelibr
  2. I am in the rather unique position of being able to build an almost exact clone of my setup (RAM will be different). I could set up a VPN and provide Limetech with access for testing.
  3. A third hard reboot remounted everything..... syslog (3).7z
  4. Hey, woke up this morning to a read-only disk, restarted the array, and now I have an unmountable disk present for a cache pool. It's btrfs and spinning rust. It has the disk ID sdf. I don't know the best course of action.
  5. Welp, here we are again. Server died in the middle of a Plex stream. Nothing in the log that I can see. I have no br0 Dockers, and I have ECC memory. None of these issues happened on the very same hardware while I was on 6.8. syslog-1-6.txt.zip thelibrary-diagnostics-20210601-1748.zip
  6. So, I have Br0 off and all my dockers are host/bridge.
  7. Had a lock-up while moving some files around. It just stopped responding and I had to hard reset to get it back. Attached are mirrored logs, but I can't see anything. It happened between 9 and 11 this morning. syslog-28-05.7z
  8. Files don't seem to be ignored in my setup. I have tried .log, .nfo, .tgz, .tmp and *.log, *.nfo, *.tgz, *.tmp. I keep getting alerts about .nfos changing. Either it's a bug or I am doing something wrong.
  9. I had it set up as a script, installing and running as a cron job with Userscripts. Just follow the install instructions on the repo. I have stopped using it as I have had issues with macvlan crashes since moving to 6.9.
  10. If it is, I have this disabled at the BIOS level as well as in my boot.cfg, and I have still had lock-ups with macvlan.
  11. Same issues as K1ng0011, also with an ASRockRack board, same X470 base. Have had a great number of system lock-ups since December. Started for me with RC2. Not in a position to be able to create new VLANs. Have instead removed all my Dockers that used br0, as even with them stopped I was having issues. thelibrary-diagnostics-20210418-2211 (1).zip
  12. So, bug? My crashes/lock-ups started around RC2 and have been pretty random since. After I swapped to ECC memory I had a pretty smooth run of about two weeks, and then they started up again.
  13. Looks like I have had another kernel panic/macvlan crash, but no lock-up this time. I am able to export diags as a result; hopefully it shows something that can stop this altogether. As I said earlier, I don't have any Dockers with static custom IPs in my Docker setup any more. Apr 18 14:54:38 thelibrary kernel: ------------[ cut here ]------------ Apr 18 14:54:38 thelibrary kernel: WARNING: CPU: 6 PID: 13151 at net/netfilter/nf_conntrack_core.c:1120 __nf_conntrack_confirm+0x9b/0x1e6 [nf_conntrack] Apr 18 14:54:38 thelibrary kernel: Modules linked in: macvlan nvidia_uvm(PO) v
  14. Already removed, saw that info during this week. The only GPU stuff installed is the driver plugin, a user script to allow more than three streams, and Plex. As far as I am aware, nothing else is touching nvidia-smi or the GPU.
  15. OK, makes sense. Dockers are gone now. Attached is what remains. I have also changed from a bonded interface for my NIC to a bridge.
  16. Had two br0 Dockers, both of which were disabled at the time of the panic. I have removed them completely this morning. Would disabled Dockers still cause issues?
  17. I believe I have the same issue on 6.9.2. syslog
  18. So, on and off since my jump to 6.9-RC2, I have had a freeze issue. I have replaced the motherboard and RAM in that time. RAM is now ECC. Looking at the logs, in my understanding it looks like a driver crash? Either macvlan or Nvidia. Syslog attached, help is much appreciated. Server must be hard reset to get any access, so diags aren't possible. Apr 18 04:13:38 thelibrary kernel: NETDEV WATCHDOG: eth1 (igb): transmit queue 2 timed out Apr 18 04:13:38 thelibrary kernel: WARNING: CPU: 0 PID: 0 at net/sched/sch_generic.c:442 dev_watchdog+0xcf/0x12b Apr 18 04:13:38 thelibrary ke
  19. @S80_UK I'm mirroring logs to my USB and currently waiting for another crash. I have also removed the GPU stat plugin and have previously ruled out my RAM. Should I get another crash, I'll create my own thread.
  20. I appear to be in the same boat. Only adding my voice to amplify the message.
  21. It's just one of those things, I guess. With so many variables in a setup, it could be anything causing the issue to show up for some and not others. I had (hoping it's fixed now) the VLAN issue locking up my box every few days. If it helps at all, the firmware on my LSI is the most recent, 20-something I think. Need to update my sig. I also swapped to ECC memory in the last few weeks due to issues with BTRFS.
  22. I have almost all Seagate drives and have had no issues.
  23. Welp, thanks for your help again. I guess I'll be back at some point.
  24. Well, I feel like a fool. OK, looks clean. Opening filesystem to check... Checking filesystem on /dev/nvme1n1p1 UUID: cdb12f2a-8005-48a1-b8f7-bd0e1fc9fd43 [1/7] checking root items [2/7] checking extents [3/7] checking free space tree [4/7] checking fs roots [5/7] checking only csums items (without verifying data) [6/7] checking root refs [7/7] checking quota groups skipped (not enabled on this FS) found 266705154048 bytes used, no error found total csum bytes: 225909672 total tree bytes: 698073088 total fs tree bytes: 379420672 total extent tree bytes: 46546944 btree space waste byte
  25. Comes back with: root@thelibrary:~# btrfs check /dev/sdX1 Opening filesystem to check... ERROR: mount check: cannot open /dev/sdX1: No such file or directory ERROR: could not check mount status: No such file or directory
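
Several of the posts above quote syslog lines when hunting for the cause of a lock-up. A minimal sketch of how those lines could be pulled out of a mirrored syslog, assuming the "Mon DD HH:MM:SS host kernel: message" layout seen in the excerpts. The marker strings are copied from the quoted logs; the function name `find_kernel_events` and the second sample line are hypothetical and only there for illustration.

```python
import re

# Marker substrings copied from the log excerpts quoted in the posts.
MARKERS = (
    "general protection fault",
    "NETDEV WATCHDOG",
    "cut here",
    "WARNING: CPU:",
)

# Assumed layout: "Apr 18 04:13:38 thelibrary kernel: <message>"
LINE_RE = re.compile(r"^(\w{3}\s+\d+ \d{2}:\d{2}:\d{2}) (\S+) kernel: (.*)$")


def find_kernel_events(lines):
    """Return (timestamp, message) pairs for kernel lines containing a marker."""
    events = []
    for line in lines:
        match = LINE_RE.match(line.rstrip("\n"))
        if match is None:
            continue  # not a kernel line in the assumed layout
        timestamp, _host, message = match.groups()
        if any(marker in message for marker in MARKERS):
            events.append((timestamp, message))
    return events


if __name__ == "__main__":
    sample = [
        "Apr 18 04:13:38 thelibrary kernel: NETDEV WATCHDOG: eth1 (igb): transmit queue 2 timed out",
        "Apr 18 04:13:39 thelibrary sshd[123]: unrelated line",  # hypothetical filler
    ]
    for ts, msg in find_kernel_events(sample):
        print(ts, msg)
```

To run it against a real file, pass an open file handle instead of the sample list, for example `find_kernel_events(open("syslog"))`. This only narrows down where to look; it does not say which driver is at fault.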