mmjaberi

Members
  • Posts

    15
  • Joined

  • Last visited

mmjaberi's Achievements

Noob

Noob (1/14)

2

Reputation

  1. It was fixed for few days, again I started seeing it back in the logs after upgrading to 6.12
  2. I followed the steps you mentioned and it fixed the issue. Thanx 🤙
  3. Hello everyone, I've got a bit of a situation here. My unraid server's been acting up lately — freezes, crashes, you name it. I have to keep restarting it just to get things running smoothly again. I noticed my cache drive got set to read-only for some reason, and no matter what I do, I can't change it back to read-write. I backed up all the data and formatted the cache, thinking it might solve the crash problem, but the problem's still there. I've turned on syslog to gather some logs, and I'm also attaching the diagnostics output from my server. Anyone have any idea what's going on? syslog_unraid.zip unraid-diagnostics-20231001-0148.zip
  4. Hope it's not hardware issue 😓 I will get back to this post after testing it.
  5. Hello unRAID Community, I've been using an unRAID server for almost a month now. During the first week, everything was running smoothly, and I didn't encounter any issues. However, as I began to download more containers and plugins, I started experiencing various problems. The most common issue I'm facing is the server hanging and becoming unresponsive. Sometimes, containers refuse to shut down, and lately, the entire Docker service has been crashing. I haven't been able to pinpoint the exact cause of these issues. I've noticed that when a specific task runs (I've included the relevant logs below), the containers become unresponsive, and the server also becomes unresponsive. I've had to forcefully shut down the server every single day since the problem started. Apr 20 23:01:32 MJ-unRaid kernel: <TASK> Apr 20 23:01:32 MJ-unRaid kernel: ? writepage_delalloc+0xa5/0x155 Apr 20 23:01:32 MJ-unRaid kernel: ? __extent_writepage+0x18e/0x233 Apr 20 23:01:32 MJ-unRaid kernel: ? extent_write_cache_pages+0x22a/0x2f3 Apr 20 23:01:32 MJ-unRaid kernel: ? virt_to_slab+0x5/0x19 Apr 20 23:01:32 MJ-unRaid kernel: ? memcg_slab_free_hook+0x4b/0xf9 Apr 20 23:01:32 MJ-unRaid kernel: ? btrfs_drop_extent_cache+0x3a1/0x3c9 Apr 20 23:01:32 MJ-unRaid kernel: ? extent_writepages+0x7f/0xda Apr 20 23:01:32 MJ-unRaid kernel: ? do_writepages+0xb8/0x124 Apr 20 23:01:32 MJ-unRaid kernel: ? filemap_fdatawrite_wbc+0x5e/0x73 Apr 20 23:01:32 MJ-unRaid kernel: ? __filemap_fdatawrite_range+0x53/0x71 Apr 20 23:01:32 MJ-unRaid kernel: ? btrfs_fdatawrite_range+0x1d/0x4a Apr 20 23:01:32 MJ-unRaid kernel: ? start_ordered_ops.constprop.0+0x41/0x74 Apr 20 23:01:32 MJ-unRaid kernel: ? btrfs_sync_file+0xed/0x3ca Apr 20 23:01:32 MJ-unRaid kernel: ? do_fsync+0x33/0x51 Apr 20 23:01:32 MJ-unRaid kernel: ? __x64_sys_fsync+0xf/0x16 Apr 20 23:01:32 MJ-unRaid kernel: ? do_syscall_64+0x68/0x81 Apr 20 23:01:32 MJ-unRaid kernel: ? entry_SYSCALL_64_after_hwframe+0x63/0xcd Apr 20 23:01:32 MJ-unRaid kernel: </TASK> Apr 20 23:01:32 MJ-unRaid kernel: sysimgblt fb_sys_fops thermal wmi fan tpm_crb tpm_tis tpm_tis_core tpm video backlight acpi_tad acpi_pad button unix Apr 20 23:01:32 MJ-unRaid kernel: CPU: 12 PID: 6414 Comm: dockerd Tainted: P D O 5.19.17-Unraid #2 Apr 20 23:01:32 MJ-unRaid kernel: Hardware name: ASUS System Product Name/ROG MAXIMUS Z790 HERO, BIOS 0904 03/29/2023 Apr 20 23:01:32 MJ-unRaid kernel: RIP: 0010:do_exit+0x39/0x8e5 Apr 20 23:01:32 MJ-unRaid kernel: Code: 89 fd 53 48 83 ec 28 65 48 8b 04 25 28 00 00 00 48 89 44 24 20 31 c0 65 48 8b 1c 25 c0 bb 01 00 48 83 bb a0 07 00 00 00 74 02 <0f> 0b 48 8b bb c8 06 00 00 e8 b7 c0 7c 00 48 8b 83 c0 06 00 00 83 Apr 20 23:01:32 MJ-unRaid kernel: RSP: 0018:ffffc9000202bee0 EFLAGS: 00010086 Apr 20 23:01:32 MJ-unRaid kernel: RAX: 0000000000000000 RBX: ffff8881d3885000 RCX: 0000000000000000 Apr 20 23:01:32 MJ-unRaid kernel: RDX: 0000000000000000 RSI: ffffffff820d7be1 RDI: 0000000000000009 Apr 20 23:01:32 MJ-unRaid kernel: RBP: 0000000000000009 R08: 0000000000000000 R09: 0000000000aaaaaa Apr 20 23:01:32 MJ-unRaid kernel: R10: 0000000000000001 R11: 0000000000000001 R12: 0000000000000000 Apr 20 23:01:32 MJ-unRaid kernel: R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000010 Apr 20 23:01:32 MJ-unRaid kernel: FS: 00001465a9e50700(0000) GS:ffff88a02d300000(0000) knlGS:0000000000000000 Apr 20 23:01:32 MJ-unRaid kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 Apr 20 23:01:32 MJ-unRaid kernel: CR2: ffffffffffffffd6 CR3: 00000001bcaba004 CR4: 0000000000770ee0 Apr 20 23:01:32 MJ-unRaid kernel: PKRU: 55555554 Apr 20 23:01:32 MJ-unRaid kernel: Call Trace: Apr 20 23:01:32 MJ-unRaid kernel: <TASK> Apr 20 23:01:32 MJ-unRaid kernel: make_task_dead+0xba/0xba Apr 20 23:01:32 MJ-unRaid kernel: rewind_stack_and_make_dead+0x17/0x17 Apr 20 23:01:32 MJ-unRaid kernel: RIP: 0033:0x4baa7b Apr 20 23:01:32 MJ-unRaid kernel: Code: e8 2a e5 fa ff eb 88 cc cc cc cc cc cc cc cc e8 db 2c fb ff 48 8b 7c 24 10 48 8b 74 24 18 48 8b 54 24 20 48 8b 44 24 08 0f 05 <48> 3d 01 f0 ff ff 76 20 48 c7 44 24 28 ff ff ff ff 48 c7 44 24 30 Apr 20 23:01:32 MJ-unRaid kernel: RSP: 002b:000000c0001659e0 EFLAGS: 00000206 ORIG_RAX: 000000000000004a Apr 20 23:01:32 MJ-unRaid kernel: RAX: ffffffffffffffda RBX: 000000c00005c000 RCX: 00000000004baa7b Apr 20 23:01:32 MJ-unRaid kernel: RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000014 Apr 20 23:01:32 MJ-unRaid kernel: RBP: 000000c000165a20 R08: 0000000000000001 R09: 0000000000000000 Apr 20 23:01:32 MJ-unRaid kernel: R10: 000000c0017bcbd0 R11: 0000000000000206 R12: 0000000000000000 Apr 20 23:01:32 MJ-unRaid kernel: R13: 0000000000000000 R14: 000000c001886000 R15: ffffffffffffffff Apr 20 23:01:32 MJ-unRaid kernel: </TASK> At this point, I'm not sure if the problem lies with Docker itself or the plugins I've installed, but I'm guessing one of them might be the cause. I'm really frustrated and disappointed with my custom-built server, especially considering the amount of money I've invested in it. I've attached the diagnostics logs, syslogs, and a screenshot of the Docker service after it crashed. I would greatly appreciate any help or guidance you can provide to resolve these issues. Thank you in advance for your assistance. unraid-diagnostics-20230421-0034.zip syslog.txt
  6. Yes, I am using "NVIDIA GeForce RTX 3070" for transcoding.
  7. I managed to upgrade the Intel ME driver and all other Intel drivers, but it didn't change anything. 🙄 😔 Thank you very much @steveme for the guidance. 😇
  8. It's not related, I think it's bug as I read it in many online forums 🤓... Just wanted to know how did managed upgrade ME driver. I'm trying to figure out how to upgrade the Intel ME firmware on my ASUS ROG MAXIMUS Z790 HERO motherboard, as I'm running UNRAID OS exclusively on the server. The Intel ME driver is a Windows executable package, which I can't run in UNRAID. I've attempted to use a bootable Windows PE (Preinstalled Environment) with the package included, but that didn't work either. Has anyone with a similar setup managed to upgrade their Intel ME firmware 🤔? Any suggestions on how to approach this issue would be greatly appreciated. 🤝
  9. do you mean intel AMT tool (Active Management Technology)? I couldn't find anything called office tool in intel website. BTW did you also use Windows PE install the upgrade ?
  10. Could you tell me how you were able to upgrade the Intel ME 🧐? I tried using Windows PE and placed the Intel ME upgrade package in it for the upgrade, but it didn't work for me 🤯 and I ended up upgrading the BIOS only
  11. Even after upgrading my BIOS to the latest version, I continue to encounter the error mentioned below, which I've also removed from the attached log file. Apr 7 02:56:10 MJ-unRaid kernel: ACPI BIOS Error (bug): Failure creating named object [\_SB.PC00.PEG1.PEGP._DSM.USRG], AE_ALREADY_EXISTS (20220331/dsfield-184) Apr 7 02:56:10 MJ-unRaid kernel: ACPI Error: AE_ALREADY_EXISTS, CreateBufferField failure (20220331/dswload2-477) Apr 7 02:56:10 MJ-unRaid kernel: ACPI Error: Aborting method \_SB.PC00.PEG1.PEGP._DSM due to previous error (AE_ALREADY_EXISTS) (20220331/psparse-529) This time, I noticed a 'BTRFS' error that I believe might be related to my primary issue. However, unlike before, the server did not hang or become unresponsive this time around. I'm uncertain whether the BIOS update resolved the problem or if there's another underlying issue at play. Apr 6 21:38:55 MJ-unRaid kernel: BTRFS warning (device nvme0n1p1): csum failed root 5 ino 395 off 24314224640 csum 0xaad58a8b expected csum 0x54912a6a mirror 1 Apr 6 21:38:55 MJ-unRaid kernel: BTRFS error (device nvme0n1p1): bdev /dev/nvme0n1p1 errs: wr 0, rd 0, flush 0, corrupt 114, gen 0 Any thoughts or suggestions would be appreciated. syslog.txt
  12. Then I am gonna upgrade the Bios first to cross out the previous issue ACPI error, then will re-try the scenario and will post the logs here afterwards. Thanks for the support.
  13. What about the hanging thing 😅😅, how can I fix this issue. One more thing I forgot to mention here, when the server hangs I can see on the screen the following error message: “Kernel Panic Not Syncing… etc” is this something related to USB flash drive?
  14. I do have GPU installed on the motherboard . Is there any fix available for this bug or work around.
  15. Hello Unraid community, I am experiencing an issue with my Unraid server where it hangs whenever I move large amounts of data to the server sometimes when I ping the server IP address I am getting reply, but I can't access unraid web console or even ssh to the server. The server becomes unresponsive, and I have to shut it down manually and restart it each time this happens. While investigating the issue, I came across several ACPI-related errors in the syslog. I'm not sure if these errors are directly related to the hanging issue, but I'm hoping that resolving them might help improve my server's stability and performance. Here are the ACPI-related errors I found in the syslog: ( Please check the attached Unraid logs) ACPI BIOS Error (bug): Failure creating named object [_SB.PC00.PEG1.PEGP._DSM.USRG], AE_ALREADY_EXISTS (20220331/dsfield-184) ACPI Error: AE_ALREADY_EXISTS, CreateBufferField failure (20220331/dswload2-477) ACPI Error: Aborting method _SB.PC00.PEG1.PEGP._DSM due to previous error (AE_ALREADY_EXISTS) (20220331/psparse-529) To troubleshoot these errors, I have tried the following steps: Checked RAM and CPU utilization during file transfers (both under 10%). Updated the BIOS to the latest version. Unfortunately, the errors persist, and my server continues to hang during large file transfers. I would greatly appreciate any assistance or suggestions from the community to help resolve these ACPI errors and the server hanging issue. My hardware configuration is as follows: Motherboard: ASUS ROG MAXIMUS Z790 HERO. BIOS Version: 0816 CPU: Intel Core I9 139000K 3.00GHz RAM: Corsair Vengeance DDR5 128GB Pool Devices: 1 KINGSTON NVMe 2TB Thank you for your help! syslog.txt