ramiro

Members
  • Posts

    20
  • Joined

  • Last visited

Everything posted by ramiro

  1. I am running 6.12.3 , i am running 6.12 since the RC release, could it maybe be a communcation problem between the cpu's (as far as i saw the qpi link also has error correction so it also shouldn't be a problem).
  2. Shouldn't there atleast be an ecc error then , or can there be undetected memory errors with ecc ? The array still runs fine , actually ran 2 scrubs already because of the monthly scrub.
  3. So i also ran a balance and it also had no errors , so i will i have to worry now that this can happen any time again ?
  4. Hello I got a very weird problem yesterday , suddenly my raid10 with 6 Disks went read only with this error : Jul 24 23:07:09 ramiroserver dnsmasq[10431]: using nameserver 8.8.8.8#53 Jul 24 23:07:09 ramiroserver dnsmasq[10431]: using nameserver 2003:dc:2f25:200:6ef0:49ff:fe45:27eb#53 Jul 24 23:07:09 ramiroserver dnsmasq[10431]: using nameserver 2003:dc:2f12:fb01:a236:9fff:febe:59dc#53 Jul 24 23:12:45 ramiroserver kernel: BTRFS critical (device sdh1): corrupt leaf: block=28466229657600 slot=142 extent bytenr=22370527346688 len=131072 invalid extent refs, have 1 expect >= inline 4222615553 Jul 24 23:12:45 ramiroserver kernel: BTRFS info (device sdh1): leaf 28466229657600 gen 610063 total ptrs 208 free space 59 owner 2 Jul 24 23:12:45 ramiroserver kernel: #011item 0 key (22370491891712 168 65536) itemoff 16230 itemsize 53 Jul 24 23:12:45 ramiroserver kernel: #011#011extent refs 1 gen 365653 flags 1 Jul 24 23:12:45 ramiroserver kernel: #011#011extent refs 1 gen 552994 flags 1 Jul 24 23:12:45 ramiroserver kernel: #011#011ref#0: extent data backref root 5 objectid 9942256 offset 409495535616 count 1 Jul 24 23:12:45 ramiroserver kernel: #011item 207 key (22370586890240 168 282624) itemoff 5259 itemsize 53 Jul 24 23:12:45 ramiroserver kernel: #011#011extent refs 1 gen 554551 flags 1 Jul 24 23:12:45 ramiroserver kernel: #011#011ref#0: extent data backref root 5 objectid 9942256 offset 552465006592 count 1 Jul 24 23:12:45 ramiroserver kernel: BTRFS error (device sdh1): block=28466229657600 write time tree block corruption detected Jul 24 23:12:45 ramiroserver kernel: BTRFS: error (device sdh1) in btrfs_commit_transaction:2460: errno=-5 IO failure (Error while writing out transaction) Jul 24 23:12:45 ramiroserver kernel: BTRFS info (device sdh1: state E): forced readonly Jul 24 23:12:45 ramiroserver kernel: BTRFS warning (device sdh1: state E): Skipping commit of aborted transaction. Jul 24 23:12:45 ramiroserver kernel: BTRFS: error (device sdh1: state EA) in cleanup_transaction:1958: errno=-5 IO failure I took the array offline and ran btrfs check without repair and it showed that everything is fine , so i restarted the server and now the array is fully working , i ran a scrub today with repair option that came back without any errors : UUID: 2df74f3a-347d-4ae8-9f13-e760dc67014a Scrub started: Tue Jul 25 01:44:23 2023 Status: finished Duration: 11:32:36 Total to scrub: 15.90TiB Rate: 401.11MiB/s Error summary: no errors found I am running a dual 2011 system with ecc memory and had no error whatsoever before the btrfs error , the only problem is that due to the macvlan issue , i had 1 unclean reset a week before this , had the same thing 2 months ago thad corrupted my other raid1 array but that one did not came back after restart so i had to reformat it. I am a litte worried now if this problem will come back and what is the real issue ? Thank you
  5. i have the same problem every 2-3 days nginx stops working and has this error : Jun 29 08:05:09 ramiroserver nginx: 2023/06/29 08:05:09 [error] 28347#28347: nchan: Out of shared memory while allocating channel /disks. Increase nchan_max_reserved_memory. Jun 29 08:05:09 ramiroserver nginx: 2023/06/29 08:05:09 [error] 28347#28347: *668759 nchan: error publishing message (HTTP status code 507), client: unix:, server: , request: "POST /pub/disks?buffer_length=1 HTTP/1.1", host: "localhost" Jun 29 08:05:09 ramiroserver nginx: 2023/06/29 08:05:09 [crit] 28347#28347: ngx_slab_alloc() failed: no memory After nginx restart everything works again.
  6. Hello, After upgrading from RC to 6.12.1 my nginx doesn't want to start by itself. Everything is running samba vm's dockers but not nginx , the weird thing is that i have to wait for the opnsense vm to start up then i can type nginx start and everything works before that just nothing happens and status shows not running. Could nginx be waiting for some kind of network initialization? There is also no error about nginx in the log. With 6.12 rc5 everything was fine but with 6.12.1 i have this issue. Attache my diagnostics ramiroserver-diagnostics-20230623-2200.zip
  7. I looked into the code about the custom schedule scripts not starting after restart, i cannot see any function that creates them on reboot , could it be that it is not implented , or was i just blind ?
  8. I figured out a solution so if anybody else also needs this here is what i did. I installed ipmitool with nerdtools. Then i created a script that at the start of the array initalizes the watchdog : #!/bin/bash modprobe ipmi_si modprobe ipmi_devintf /usr/bin/ipmitool raw 0x06 0x24 0x04 0x01 0x00 0x00 0x70 0x17 And another script that every minute runs this command #!/bin/bash /usr/bin/ipmitool mc watchdog reset
  9. Has anybody else the problem that scripts with custom quene are not added to the /etc/cron.d/root file after a reboot ? I have to change something in the scripts and after i click apply they are back in the file.
  10. Hello , I have a Z9PE-D16 running unraid , for some reason i don't know yet my systems freezes ~ once a month. I saw that my board has a watchdog that would at least minimalize the downtime , but when i enable it the system restarts after 10 min because unraid is not sending anything to it. As far as i found out i should use the iTCO_wdt modul but it is not included in unraid ? Has anybody running a watchdog with unraid ? I found something about watchdog in this release notes https://wiki.unraid.net/Manual/Release_Notes/Unraid_OS_6.9.1 but nothing how to use it. Thank you Ramiro
  11. I attached my diagnostics. ramiroserver-diagnostics-20220709-1603.zip
  12. Hello , is still have this problem on 6.10.3 after restart host accest doesn't work, i have to stop the docker service disable host access then enable host acess and start the docker service. I am using a opnsense VM as my router so there is no network when unraid is startig but all the interfaces have a static address. This affects dockers that are not running on the main interface but are tryining to access a docker on the main interface(my Apache dockers are on vlan 20 , but my mysql docker is running on the host interface) .
  13. Had this bug on the released 6.10 today , very annoying especialy if you don't know about it and are looking everywhere why its not working.
  14. Thank you its working fine now.
  15. Yes i have exactly the same problem , it also resets the local endpoint address.
  16. Hello Already googled this issue just couldn't find anythig. The network stays active on standby just at 100 mbit , that maybe confues the card. But i found in the logs this error message Aug 6 21:14:24 ramiroserver kernel: bnx2x 0000:09:00.1 eth3: NIC Link is Up, 10000 Mbps full duplex, Flow control: ON - receive & transmit Aug 6 21:14:35 ramiroserver kernel: bnx2x: [bnx2x_timer:5796(eth3)]MFW seems hanged: drv_pulse (0x269b) != mcp_pulse (0x7fff) Aug 6 21:14:35 ramiroserver kernel: bnx2x: [bnx2x_acquire_hw_lock:2016(eth3)]lock_status 0xffffffff resource_bit 0x1 Aug 6 21:14:35 ramiroserver kernel: bnx2x: [bnx2x_hw_stats_update:869(eth3)]NIG timer max (4294967295) Aug 6 21:14:35 ramiroserver kernel: bnx2x 0000:09:00.1 eth3: MDC/MDIO access timeout ### [PREVIOUS LINE REPEATED 1 TIMES] ### Aug 6 21:14:35 ramiroserver kernel: bnx2x 0000:09:00.1 eth3: Error: Power fault on Port 0 has been detected and the power to that SFP+ module has been removed to prevent failure of the card. Please remove the SFP+ module and restart the system to clear this error. i just don't i have any idea what triggers it , it seems to happen when the other pc wakes up.
  17. Hello My Broadcom 57810S 10 Gigabit card stops to work after Unraids runs for some time , if i restart the Server everythings works again for some time , it was running fine for some months , but now i updated the Bios on PC that is directly connected to the Unraid Server and then this problem started , i think it usualy stops working when the PC that is directly connected goes into standby. Any idea where the problem is comming from ? Aug 6 21:37:13 ramiroserver kernel: bnx2x: [bnx2x_acquire_hw_lock:2016(eth3)]lock_status 0xffffffff resource_bit 0x1 Aug 6 21:37:13 ramiroserver kernel: bnx2x 0000:09:00.1 eth3: MDC/MDIO access timeout ### [PREVIOUS LINE REPEATED 1 TIMES] ### Aug 6 21:37:13 ramiroserver kernel: bnx2x: [bnx2x_state_wait:310(eth3)]timeout waiting for state 2 Aug 6 21:37:13 ramiroserver kernel: bnx2x: [bnx2x_func_stop:9107(eth3)]FUNC_STOP ramrod failed. Running a dry transaction Aug 6 21:37:13 ramiroserver kernel: bnx2x: [bnx2x_igu_int_disable:895(eth3)]BUG! Proper val not read from IGU! Aug 6 21:37:13 ramiroserver kernel: bnx2x: [bnx2x_func_hw_reset:6126(eth3)]Unknown reset_phase (0x0) from MCP Aug 6 21:37:19 ramiroserver kernel: bnx2x: [bnx2x_fw_command:3047(eth3)]FW failed to respond! Aug 6 21:37:19 ramiroserver kernel: bnx2x 0000:09:00.1 eth3: bc 7.13.54 Aug 6 21:37:19 ramiroserver kernel: bnx2x: [bnx2x_fw_dump_lvl:782(eth3)]3MCP PC at 0xffffffff Aug 6 21:37:19 ramiroserver kernel: bnx2x: [bnx2x_fw_dump_lvl:794(eth3)]Unable to dump trace buffer (mark ffffffff) Aug 6 21:37:19 ramiroserver kernel: bnx2x: [bnx2x_acquire_hw_lock:2016(eth3)]lock_status 0xffffffff resource_bit 0x800 syslog.2.txt
  18. Hello I sucessfuly installed Mojave on my unraid server with the instructions and it worked great , but offcourse the GUI is very laggy , so i passthrough a Radeon 280x to the vm and thats were the problems started , i can't get it to load the drivers do i have to flash the videocard to an efi BIOS or is it possibe without BIOS flash ? Also if the Videocard is detected will it work through the VNC server in mac os or will it not use the GPU like windows RDP , i have no screen connected to the unraid server thats why i would like to use mac os that way ?