martinjuhasz

Members
  • Posts

    37
  • Joined

  • Last visited

Everything posted by martinjuhasz

  1. Hello, so my server started throwing CRC errors on different drives. The cache drive especially, where CRC Errors increase by the hour (currently 15). Also i get different kernel warnings in my logs: ata5.00: failed command: READ FPDMA QUEUED ata5: hard resetting link and btrfs errors: BTRFS info (device dm-3): read error corrected: ino 4173238 off 1912832 (dev /dev/mapper/sdg1 sector 314771312) i have no idea what is happening. it doesnt seem like the disk is the problem? What might be the case? Any help greatly appreciated winston-diagnostics-20230905-1058.zip
  2. i've run the board for years in another desktop computer without problems. I'll get some new SATA first and hope for the best. If not, i'll grab another PSU (uhg). We are talking about the SATA Cables (for data), not the power connector to the PSU right? Thanks again for your awesome support Edit: Since both drives failed already at some point, it might be the PSU, damn
  3. @JorgeB sure, thanks! winston-diagnostics-20230322-1210.zip
  4. I've put my disks on different rails on my PSU and also changed connectors just to make sure. Today it happened again. @JorgeB does this look again like a power isse? If it is so, it seems i might need to replace the PSU and see if this (hopefully) resolves it. Mar 22 07:45:09 Winston emhttpd: spinning down /dev/sdc Mar 22 07:56:43 Winston kernel: program smartctl is using a deprecated SCSI ioctl, please convert it to SG_IO Mar 22 07:56:43 Winston kernel: program smartctl is using a deprecated SCSI ioctl, please convert it to SG_IO Mar 22 07:56:43 Winston kernel: program smartctl is using a deprecated SCSI ioctl, please convert it to SG_IO Mar 22 08:27:26 Winston kernel: program smartctl is using a deprecated SCSI ioctl, please convert it to SG_IO Mar 22 08:27:26 Winston kernel: program smartctl is using a deprecated SCSI ioctl, please convert it to SG_IO Mar 22 08:27:26 Winston kernel: program smartctl is using a deprecated SCSI ioctl, please convert it to SG_IO Mar 22 08:58:08 Winston kernel: program smartctl is using a deprecated SCSI ioctl, please convert it to SG_IO Mar 22 08:58:08 Winston kernel: program smartctl is using a deprecated SCSI ioctl, please convert it to SG_IO Mar 22 08:58:08 Winston kernel: program smartctl is using a deprecated SCSI ioctl, please convert it to SG_IO Mar 22 09:28:51 Winston kernel: program smartctl is using a deprecated SCSI ioctl, please convert it to SG_IO Mar 22 09:28:51 Winston kernel: program smartctl is using a deprecated SCSI ioctl, please convert it to SG_IO Mar 22 09:28:51 Winston kernel: program smartctl is using a deprecated SCSI ioctl, please convert it to SG_IO Mar 22 09:59:47 Winston kernel: program smartctl is using a deprecated SCSI ioctl, please convert it to SG_IO Mar 22 09:59:47 Winston kernel: program smartctl is using a deprecated SCSI ioctl, please convert it to SG_IO Mar 22 09:59:47 Winston kernel: program smartctl is using a deprecated SCSI ioctl, please convert it to SG_IO Mar 22 10:30:29 Winston kernel: program smartctl is using a deprecated SCSI ioctl, please convert it to SG_IO Mar 22 10:30:29 Winston kernel: program smartctl is using a deprecated SCSI ioctl, please convert it to SG_IO Mar 22 10:30:29 Winston kernel: program smartctl is using a deprecated SCSI ioctl, please convert it to SG_IO Mar 22 11:00:02 Winston emhttpd: read SMART /dev/sdc Mar 22 11:00:13 Winston emhttpd: read SMART /dev/sdb Mar 22 11:00:25 Winston kernel: sd 3:0:0:0: [sdd] tag#15 access beyond end of device Mar 22 11:00:25 Winston kernel: I/O error, dev sdd, sector 12915028112 op 0x0:(READ) flags 0x0 phys_seg 4 prio class 0 Mar 22 11:00:25 Winston kernel: md: disk1 read error, sector=12915028048 Mar 22 11:00:25 Winston kernel: md: disk1 read error, sector=12915028056 Mar 22 11:00:25 Winston kernel: md: disk1 read error, sector=12915028064 Mar 22 11:00:25 Winston kernel: md: disk1 read error, sector=12915028072 Mar 22 11:00:25 Winston emhttpd: read SMART /dev/sde Mar 22 11:00:35 Winston kernel: sd 3:0:0:0: [sdd] tag#16 access beyond end of device Mar 22 11:00:35 Winston kernel: I/O error, dev sdd, sector 12915028112 op 0x1:(WRITE) flags 0x0 phys_seg 4 prio class 0 Mar 22 11:00:35 Winston kernel: md: disk1 write error, sector=12915028048 Mar 22 11:00:35 Winston kernel: md: disk1 write error, sector=12915028056 Mar 22 11:00:35 Winston kernel: md: disk1 write error, sector=12915028064 Mar 22 11:00:35 Winston kernel: md: disk1 write error, sector=12915028072 Mar 22 11:01:38 Winston kernel: program smartctl is using a deprecated SCSI ioctl, please convert it to SG_IO Mar 22 11:01:38 Winston kernel: program smartctl is using a deprecated SCSI ioctl, please convert it to SG_IO Mar 22 11:01:38 Winston kernel: program smartctl is using a deprecated SCSI ioctl, please convert it to SG_IO I'm happy for any hints since this is so hard to debug
  5. ok something weird happened. now disk1 is in error state. That is another disk. which makes me suspicious that these errors are NOT the disks. i've downloaded diagnostics before a reboot this time. any idea what could happen here @JorgeB since now this error is across disks apparently ? winston-diagnostics-20230304-2232.zip
  6. Ah damn, should have grabbed the diag before checking the disk, well. Thanks for your insight. Will start to rebuild on top of it and see if anything is looking good. thanks!
  7. Hello, over night my disk2 came into error state Disk 2 in error state (disk dsbl) - WDC_WD80EZAZ-11TDBA0_JEH74D8M (sde) array has errors - Array has 1 disk with read errors I've checked the drive and it was powered down. these are shucked WD drives where i had to cover this 3rd pin to make it run. i suspect this became loose, but no way to confirm. i used an adapter to fix this issue permantently and the drive works again. I've ran a full SMART extended check which resulted in no errors. So i suspect the drive might not be faulty but just lost power. I'm unsure how to proceed. Should i run a preclear on the drive and if this is ok add it back to the array? How would that work? Stop Array Remove Drive from the Array Start Array Do a preclear on the unassigned drive Stop Array Add Drive again Start Array Any help appriciated. I added the SMART report of said drive2 and my diagnostics file. winston-smart-20230221-0952.zip winston-diagnostics-20230221-0952.zip
  8. ok. thank you so much for your time and help
  9. ok. so i moved files, formatted newly to btrfs encrypted, moved files again. Seems to be normal currently. i ran scrub and balance after moving files back and no errors so far. anything i can do besides that to verify its ok now? might happen over time again right?
  10. oh damn, okay. i might switch back to zfs then. really wanted to avoid doing "manual" local backup with rsync. would you mind elaborating what indicates that its metadata?
  11. Ok, it finished, these are the syslog outputs Nov 4 17:12:12 Winston ool www[15451]: /usr/local/emhttp/plugins/dynamix/scripts/btrfs_scrub 'start' '/mnt/cache' '' Nov 4 17:12:12 Winston kernel: BTRFS info (device dm-3): scrub: started on devid 1 Nov 4 17:12:12 Winston kernel: BTRFS info (device dm-3): scrub: started on devid 2 Nov 4 17:12:52 Winston kernel: BTRFS error (device dm-3): bdev /dev/mapper/sdf1 errs: wr 0, rd 0, flush 0, corrupt 5, gen 0 Nov 4 17:12:52 Winston kernel: BTRFS error (device dm-3): unable to fixup (regular) error at logical 81550143488 on dev /dev/mapper/sdf1 Nov 4 17:13:34 Winston kernel: BTRFS error (device dm-3): bdev /dev/mapper/sdf1 errs: wr 0, rd 0, flush 0, corrupt 6, gen 0 Nov 4 17:13:34 Winston kernel: BTRFS error (device dm-3): unable to fixup (regular) error at logical 122435252224 on dev /dev/mapper/sdf1 Nov 4 17:13:35 Winston kernel: BTRFS error (device dm-3): bdev /dev/mapper/sdf1 errs: wr 0, rd 0, flush 0, corrupt 7, gen 0 Nov 4 17:13:35 Winston kernel: BTRFS error (device dm-3): unable to fixup (regular) error at logical 123201454080 on dev /dev/mapper/sdf1 Nov 4 17:13:56 Winston kernel: BTRFS error (device dm-3): bdev /dev/mapper/sdf1 errs: wr 0, rd 0, flush 0, corrupt 8, gen 0 Nov 4 17:13:56 Winston kernel: BTRFS error (device dm-3): unable to fixup (regular) error at logical 179388588032 on dev /dev/mapper/sdf1 Nov 4 17:13:56 Winston kernel: BTRFS error (device dm-3): bdev /dev/mapper/sdf1 errs: wr 0, rd 0, flush 0, corrupt 9, gen 0 Nov 4 17:13:56 Winston kernel: BTRFS error (device dm-3): unable to fixup (regular) error at logical 179478081536 on dev /dev/mapper/sdf1 Nov 4 17:13:57 Winston kernel: BTRFS error (device dm-3): bdev /dev/mapper/sdf1 errs: wr 0, rd 0, flush 0, corrupt 10, gen 0 Nov 4 17:13:57 Winston kernel: BTRFS error (device dm-3): unable to fixup (regular) error at logical 180374007808 on dev /dev/mapper/sdf1 Nov 4 17:13:57 Winston kernel: BTRFS error (device dm-3): bdev /dev/mapper/sdf1 errs: wr 0, rd 0, flush 0, corrupt 11, gen 0 Nov 4 17:13:57 Winston kernel: BTRFS error (device dm-3): unable to fixup (regular) error at logical 185834225664 on dev /dev/mapper/sdf1 Nov 4 17:13:58 Winston kernel: BTRFS error (device dm-3): bdev /dev/mapper/sdf1 errs: wr 0, rd 0, flush 0, corrupt 12, gen 0 Nov 4 17:13:58 Winston kernel: BTRFS error (device dm-3): unable to fixup (regular) error at logical 186957123584 on dev /dev/mapper/sdf1 Nov 4 17:13:58 Winston kernel: BTRFS error (device dm-3): bdev /dev/mapper/sdf1 errs: wr 0, rd 0, flush 0, corrupt 13, gen 0 Nov 4 17:13:58 Winston kernel: BTRFS error (device dm-3): unable to fixup (regular) error at logical 186955014144 on dev /dev/mapper/sdf1 Nov 4 17:13:58 Winston kernel: BTRFS error (device dm-3): bdev /dev/mapper/sdf1 errs: wr 0, rd 0, flush 0, corrupt 14, gen 0 Nov 4 17:13:58 Winston kernel: BTRFS error (device dm-3): unable to fixup (regular) error at logical 187113508864 on dev /dev/mapper/sdf1 Nov 4 17:13:58 Winston kernel: BTRFS error (device dm-3): bdev /dev/mapper/sdf1 errs: wr 0, rd 0, flush 0, corrupt 15, gen 0 Nov 4 17:13:58 Winston kernel: BTRFS error (device dm-3): unable to fixup (regular) error at logical 187287994368 on dev /dev/mapper/sdf1 Nov 4 17:13:58 Winston kernel: BTRFS error (device dm-3): bdev /dev/mapper/sdf1 errs: wr 0, rd 0, flush 0, corrupt 16, gen 0 Nov 4 17:13:58 Winston kernel: BTRFS error (device dm-3): unable to fixup (regular) error at logical 187306131456 on dev /dev/mapper/sdf1 Nov 4 17:13:58 Winston kernel: BTRFS error (device dm-3): bdev /dev/mapper/sdf1 errs: wr 0, rd 0, flush 0, corrupt 17, gen 0 Nov 4 17:13:58 Winston kernel: BTRFS error (device dm-3): unable to fixup (regular) error at logical 187558371328 on dev /dev/mapper/sdf1 Nov 4 17:14:01 Winston kernel: scrub_handle_errored_block: 52 callbacks suppressed Nov 4 17:14:01 Winston kernel: btrfs_dev_stat_print_on_error: 52 callbacks suppressed Nov 4 17:14:01 Winston kernel: BTRFS error (device dm-3): bdev /dev/mapper/sdf1 errs: wr 0, rd 0, flush 0, corrupt 70, gen 0 Nov 4 17:14:01 Winston kernel: scrub_handle_errored_block: 52 callbacks suppressed Nov 4 17:14:01 Winston kernel: BTRFS error (device dm-3): unable to fixup (regular) error at logical 191955169280 on dev /dev/mapper/sdf1 Nov 4 17:14:01 Winston kernel: BTRFS error (device dm-3): bdev /dev/mapper/sdf1 errs: wr 0, rd 0, flush 0, corrupt 71, gen 0 Nov 4 17:14:01 Winston kernel: BTRFS error (device dm-3): unable to fixup (regular) error at logical 191962398720 on dev /dev/mapper/sdf1 Nov 4 17:14:01 Winston kernel: BTRFS error (device dm-3): bdev /dev/mapper/sdf1 errs: wr 0, rd 0, flush 0, corrupt 72, gen 0 Nov 4 17:14:01 Winston kernel: BTRFS error (device dm-3): unable to fixup (regular) error at logical 192054423552 on dev /dev/mapper/sdf1 Nov 4 17:14:02 Winston kernel: BTRFS error (device dm-3): bdev /dev/mapper/sdf1 errs: wr 0, rd 0, flush 0, corrupt 73, gen 0 Nov 4 17:14:02 Winston kernel: BTRFS error (device dm-3): unable to fixup (regular) error at logical 194658598912 on dev /dev/mapper/sdf1 Nov 4 17:14:02 Winston kernel: BTRFS error (device dm-3): bdev /dev/mapper/sdf1 errs: wr 0, rd 0, flush 0, corrupt 74, gen 0 Nov 4 17:14:02 Winston kernel: BTRFS error (device dm-3): unable to fixup (regular) error at logical 194773180416 on dev /dev/mapper/sdf1 Nov 4 17:14:02 Winston kernel: BTRFS error (device dm-3): bdev /dev/mapper/sdf1 errs: wr 0, rd 0, flush 0, corrupt 75, gen 0 Nov 4 17:14:02 Winston kernel: BTRFS error (device dm-3): unable to fixup (regular) error at logical 194779500544 on dev /dev/mapper/sdf1 Nov 4 17:14:02 Winston kernel: BTRFS error (device dm-3): bdev /dev/mapper/sdf1 errs: wr 0, rd 0, flush 0, corrupt 76, gen 0 Nov 4 17:14:02 Winston kernel: BTRFS error (device dm-3): unable to fixup (regular) error at logical 194793492480 on dev /dev/mapper/sdf1 Nov 4 17:14:02 Winston kernel: BTRFS error (device dm-3): bdev /dev/mapper/sdf1 errs: wr 0, rd 0, flush 0, corrupt 77, gen 0 Nov 4 17:14:02 Winston kernel: BTRFS error (device dm-3): unable to fixup (regular) error at logical 194779983872 on dev /dev/mapper/sdf1 Nov 4 17:14:02 Winston kernel: BTRFS error (device dm-3): bdev /dev/mapper/sdf1 errs: wr 0, rd 0, flush 0, corrupt 78, gen 0 Nov 4 17:14:02 Winston kernel: BTRFS error (device dm-3): unable to fixup (regular) error at logical 194928234496 on dev /dev/mapper/sdf1 Nov 4 17:14:02 Winston kernel: BTRFS error (device dm-3): bdev /dev/mapper/sdf1 errs: wr 0, rd 0, flush 0, corrupt 79, gen 0 Nov 4 17:14:02 Winston kernel: BTRFS error (device dm-3): unable to fixup (regular) error at logical 194943197184 on dev /dev/mapper/sdf1 Nov 4 17:16:46 Winston kernel: scrub_handle_errored_block: 1 callbacks suppressed Nov 4 17:16:46 Winston kernel: btrfs_dev_stat_print_on_error: 1 callbacks suppressed Nov 4 17:16:46 Winston kernel: BTRFS error (device dm-3): bdev /dev/mapper/sdf1 errs: wr 0, rd 0, flush 0, corrupt 81, gen 0 Nov 4 17:16:46 Winston kernel: BTRFS error (device dm-3): fixed up error at logical 801075355648 on dev /dev/mapper/sdf1 Nov 4 17:17:54 Winston kernel: BTRFS error (device dm-3): bdev /dev/mapper/sdg1 errs: wr 0, rd 0, flush 0, corrupt 1, gen 0 Nov 4 17:17:54 Winston kernel: scrub_handle_errored_block: 1 callbacks suppressed Nov 4 17:17:54 Winston kernel: BTRFS error (device dm-3): unable to fixup (regular) error at logical 953985130496 on dev /dev/mapper/sdg1 Nov 4 17:18:01 Winston kernel: BTRFS info (device dm-3): scrub: finished on devid 2 with status: 0 Nov 4 17:18:12 Winston kernel: BTRFS error (device dm-3): bdev /dev/mapper/sdf1 errs: wr 0, rd 0, flush 0, corrupt 82, gen 0 Nov 4 17:18:12 Winston kernel: BTRFS error (device dm-3): unable to fixup (regular) error at logical 953985130496 on dev /dev/mapper/sdf1 Nov 4 17:18:30 Winston kernel: BTRFS error (device dm-3): bdev /dev/mapper/sdf1 errs: wr 0, rd 0, flush 0, corrupt 83, gen 0 Nov 4 17:18:30 Winston kernel: BTRFS error (device dm-3): fixed up error at logical 846948855808 on dev /dev/mapper/sdf1 Nov 4 17:19:06 Winston kernel: BTRFS error (device dm-3): bdev /dev/mapper/sdf1 errs: wr 0, rd 0, flush 0, corrupt 84, gen 0 Nov 4 17:19:06 Winston kernel: BTRFS error (device dm-3): fixed up error at logical 861991755776 on dev /dev/mapper/sdf1 Nov 4 17:20:33 Winston emhttpd: spinning down /dev/sdc Nov 4 17:20:42 Winston kernel: BTRFS info (device dm-3): scrub: finished on devid 1 with status: 0 Nov 4 17:21:52 Winston emhttpd: spinning down /dev/sde Nov 4 17:24:41 Winston emhttpd: spinning down /dev/sdd not sure what that means? not any files listet here i tried resolving "logical" with inspect, but this doesnt result in any output, f.e.: btrfs inspect-internal logical-resolve 81550143488 /mnt/cache
  12. ok, one more question if you might ask. doing it in the ui, should i Repair corrupted blocks or not? i guess so? Is not checking like a dry run?
  13. ok. with scrub you mean i should run btrfs scrub start /mnt/cache ? will do if this is correct.
  14. sure winston-diagnostics-20221104-1347.zip
  15. Hello, i'm getting this message whenever i start my array: pool BTRFS too many profiles (You can ignore this warning when a pool balance operation is in progress) There is no balancing operation taking place. i also did a full balance without any changes, reboot, nothing helped. btrfs fi df /mnt/cache Data, single: total=124.01GiB, used=38.28GiB Data, RAID1: total=289.00GiB, used=142.08GiB System, RAID1: total=32.00MiB, used=80.00KiB System, DUP: total=8.00MiB, used=0.00B Metadata, RAID1: total=1.00GiB, used=302.75MiB Metadata, DUP: total=1.00GiB, used=155.50MiB GlobalReserve, single: total=270.20MiB, used=0.00B WARNING: Multiple block group profiles detected, see 'man btrfs(5)'. WARNING: Data: single, raid1 WARNING: Metadata: raid1, dup WARNING: System: raid1, dup I'm a bit lost on what to do? Anyone might help me out?
  16. yep that worked, thanks. would be cool if this would show up in the info box before install. i think there is even a specific field for this in the templates for "things to do before install"
  17. Also tried installing Baikal freshly but failed with permission errors: its the default template. folders seem to be mapped correctly and get created in appdata but it still fails and folders are empty. doesnt matter if nobody or root owns the folder.
  18. i created a new share, same problem Jan 29 21:01:07 Winston emhttpd: shcmd (197): /usr/local/sbin/mover |& logger & Jan 29 21:01:07 Winston root: mover: started Jan 29 21:01:07 Winston move: file: /mnt/disk2/t3/Monsieur Claude 2.mkv Jan 29 21:01:07 Winston kernel: shfs[17432]: segfault at 0 ip 00000000004043cc sp 0000151c89700780 error 4 in shfs[402000+c000] Jan 29 21:01:07 Winston kernel: Code: 48 8b 45 f0 c9 c3 55 48 89 e5 48 83 ec 20 48 89 7d e8 48 89 75 e0 c7 45 fc 00 00 00 00 8b 45 fc 48 63 d0 48 8b 45 e0 48 01 d0 <0f> b6 00 3c 2f 74 43 8b 05 67 df 00 00 85 c0 78 2f e8 fe df ff ff Jan 29 21:01:07 Winston move: create_parent: /mnt/disk2 error: Software caused connection abort Jan 29 21:01:07 Winston move: move_object: /mnt/disk2/t3: Transport endpoint is not connected Jan 29 21:01:07 Winston root: mover: finished Jan 29 21:01:07 Winston root: Fix Common Problems: Other Warning: Mover logging is enabled winston-diagnostics-20220129-2102.zip
  19. uhrg ok. any idea what i can try next? i rebooted and mover seems to move the file of the newly created test2-share now.
  20. i don't mind rebooting. but i think this problem persists through reboots. /bin/ls: cannot access '/mnt/user': Transport endpoint is not connected total 16 1930 drwxr-xr-x 9 root root 180 Jan 29 19:52 ./ 1 drwxr-xr-x 20 root root 420 Jan 29 17:52 ../ 256 drwxrwxrwx 1 nobody users 38 Jan 29 20:02 cache/ 128 drwxrwxrwx 11 nobody users 187 Jan 29 20:02 disk1/ 128 drwxrwxrwx 16 nobody users 282 Jan 29 20:02 disk2/ 1 drwxrwxrwt 2 nobody users 40 Jan 29 17:46 disks/ 1 drwxrwxrwt 2 nobody users 40 Jan 29 17:46 remotes/ ? d????????? ? ? ? ? ? user/ 18302628885633695872 drwxrwxrwx 1 nobody users 187 Jan 29 20:02 user0/
  21. well, its weird. created another share where i can't get it working currently no matter if i start/stop the array. the error message tho has changed a bit this time Jan 29 19:50:56 Winston move: create_parent: /mnt/disk2 error: Software caused connection abort Jan 29 19:50:56 Winston move: move_object: /mnt/disk2/test2: Transport endpoint is not connected Jan 29 19:50:56 Winston root: mover: finished if i cant get this to work i might have to switch back to xfs. that would be a bummer not to be able to have a cache drive thats guaranteed to be backed up.
  22. ok so with a new share the same happened. but somehow after i stopped the array and restarted it (without reboot) mover was working fine once. now after adding another array it failed again. stopped -> started, everything seems ok for the moment.