Jump to content

JorgeB

Moderators
  • Posts

    67,530
  • Joined

  • Last visited

  • Days Won

    707

Everything posted by JorgeB

  1. It's very possible, like mentioned most likely connection or power problem, firmware might or not have an impact, but good to upgrade if possible.
  2. Your missed the trailing slash, look for a disk4 folder on disk3.
  3. Adding a new cache device by default results in a raid1 pool, it won't increase available space, also not a good idea to add a device with a full filesystem, besides the device you added dropped offline (check cables), but first you need to free up some space on cache, move or delete some data.
  4. Backup the key from the flash drive, re-create it using the USB tool, restore the key.
  5. Problems with multiple devices: Oct 2 09:35:46 Tower kernel: sd 9:0:5:0: Power-on or device reset occurred Oct 2 09:35:46 Tower kernel: sd 9:0:7:0: Power-on or device reset occurred Oct 2 09:35:46 Tower kernel: sd 9:0:1:0: Power-on or device reset occurred Oct 2 09:35:46 Tower kernel: sd 9:0:2:0: Power-on or device reset occurred Oct 2 09:35:46 Tower kernel: sd 9:0:4:0: Power-on or device reset occurred This usually suggests a connection/power problem. Also make sure to update the LSI firmware: mpt2sas_cm0: LSISAS2308: FWVersion(20.00.06.00) All p20 releases except 20.00.07.00 have known issues
  6. Rebuilding the disk on top would re-partition it, IIRC he already did that before doing the new config, but didn't finish the rebuild.
  7. Try booting in safe mode with all services disable, if no errors after a couple of days start turning them on one by one.
  8. I honestely don't know, never did much testing on that, time permitting I'll try doing some over the weekend, with different workloads and a couple of controllers, to see if there's any clear difference one way or the other.
  9. Possibly, maybe better to leave it disabled for now, I will since I don't see any performance advantage.
  10. Sorry missed this part earlier, no, that wasn't the problem, since the checksum error would always be logged in the syslog, and if I tried to copy the file locally I would always get an i/o error, with SMB (and aio read enable) it would still copy successfully despite the checksum error. BTW, just did a test on beta30, with aio enable, and it's behaving correctly, i.e., a file with just a single sector corrupt will fail to copy, tried 20 times, and always got this: At the same time a checksum error is logged (before only this part happened): So that issue appears to be fixed.
  11. Yep, two SASLP, you should difinetely get rid of those, it's not clear the disk is OK, you need to reboot an post ne diags, but still they can cause more trouble when there are errors because the driver crashes. Also make sure scheduled parity checks are set to non correct.
  12. IIRC I used dd to write some zeros on existing data, the larger the write (corruption), the more likely it would cause an i/o error during an SMB transfer, a single corrupt sector would always (or almost always) go through.
  13. In one of my main servers I have both aio read and write set to 0, on another just read, never noticed any issue with writes, but didn't do much testing, my original issue was btrfs ignoring cheksum errors on reads, performance was similar when I tested at the time.
  14. It's logged as a disk problem, you should
  15. I remember this also caused an issue with btrfs failing to give an i/o error if data corruption was found, I still have those entries in my Samba extra settings to disable aio.
  16. Yep, get one of the recommended LSI HBAs.
  17. Difficult to guess without the diags, next time it happens grab them before rebooting.
  18. Preclear script has nothing to do with the array, like mentioned Unraid requires all assigned devices to have a unique identification, maybe that string can be changed in the controller, you can also try toggling Setings -> Display Settings -> Display world-wide-name in device ID
  19. Problem appears to be the raid controller, it's adding the same string to all the Seagate disks (2001b4d2000000000), Unraid requires all disks to have a unique identification.
  20. The other controller is Asmedia: 01:00.0 SATA controller [0106]: ASMedia Technology Inc. Device [1b21:0625] (rev ff) Kernel modules: ahci But it's failing to initialize: Oct 8 04:03:31 Tower kernel: ahci 0000:01:00.0: controller reset failed (0xffffffff) Oct 8 04:03:31 Tower kernel: ahci: probe of 0000:01:00.0 failed with error -5 According to this booting with pci=noacpi solves the issue, it's probably an unknown controller with Linux.
  21. Yes, and the 9300-8i is IT mode only, note that you'd need new cables though. Performance in this case would be the same as a SAS2 PCIe 3.0 controller, like the 9207 for example, you can check some benchmarks here:
  22. Any LSI with a SAS2008/2308/3008/3408 chipset in IT mode, e.g., 9201-8i, 9211-8i, 9207-8i, 9300-8i, 9400-8i, etc and clones, like the Dell H200/H310 and IBM M1015, these latter ones need to be crossflashed. You can get one for example in "the art of server" shop on ebay, it's a reputable seller of IT mode HBAs.
×
×
  • Create New...