  • 6.10.0-rc2 dropping drives in external enclosure


    Shonky
    • Urgent

    The system is a Microserver Gen 8 with "07:00.0 SATA controller: Marvell Technology Group Ltd. 88SE9230 PCIe 2.0 x2 4-port SATA 6 Gb/s RAID Controller (rev 11)" connected to an external drive enclosure via an eSATA port through a port multiplier. This setup works fine in 6.9.2, and the enclosure has been part of my system for at least 4 years.

     

    Drives sdb, sdc, sdd and sde are in the main chassis, and sdi and sdg are in the enclosure. sdh and sdj are also installed but are not part of the array. The drives are getting quite full, and I was going to add a new drive soon.
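
The kernel log refers to ports (ata8, ata12, ...) while the layout above uses drive letters, so it can help to record which drive sits on which port before a drop happens. The following is a minimal sketch, assuming a Linux system where the resolved `/sys/block/sdX` path contains an `ataN` component. The function names and the sample path in the comment are illustrative assumptions, not taken from the diagnostics.

```python
import glob
import os
import re

# Matches the "ataN" directory inside a resolved sysfs device path.
ATA_PORT = re.compile(r"/(ata\d+)/")

# Hypothetical example of a resolved path (the bridge address is made up):
# /sys/devices/pci0000:00/0000:00:1c.4/0000:07:00.0/ata8/host8/target8:0:0/8:0:0:0/block/sdg


def ata_port_from_sysfs_path(path):
    """Return the 'ataN' component of a resolved sysfs path, or None."""
    match = ATA_PORT.search(path)
    return match.group(1) if match else None


def map_block_devices(sys_block="/sys/block"):
    """Map each sdX entry under sys_block to its ata port (or None)."""
    mapping = {}
    for entry in sorted(glob.glob(os.path.join(sys_block, "sd*"))):
        resolved = os.path.realpath(entry)
        mapping[os.path.basename(entry)] = ata_port_from_sysfs_path(resolved)
    return mapping


if __name__ == "__main__":
    for dev, port in map_block_devices().items():
        print(dev, port or "(no ata port in path)")
```

Once a drive has been detached, its entry disappears from `/sys/block`, so this is only useful when run while all drives are still present.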

     

    I upgraded from 6.9.2 to 6.10.0-rc2. Initially everything started OK. However, shortly after booting, as I was just checking things over, it started dropping all the drives in the enclosure, both those in the array and those not in it.

     

    Below is the end of the log from the 6.10.0-rc2 boot, taken from /boot/logs. I didn't request it. Is it created automatically?

     

    Nov  8 12:22:00 Mars root: Fix Common Problems Version 2021.08.05
    Nov  8 12:26:54 Mars kernel: ata8: SATA link down (SStatus 0 SControl 300)
    Nov  8 12:27:00 Mars kernel: ata8: SATA link down (SStatus 0 SControl 300)
    Nov  8 12:27:05 Mars kernel: ata11: SATA link down (SStatus 0 SControl 300)
    Nov  8 12:27:10 Mars kernel: ata12: SATA link down (SStatus 0 SControl 300)
    Nov  8 12:27:15 Mars kernel: ata12: SATA link down (SStatus 0 SControl 300)
    Nov  8 12:27:21 Mars kernel: ata8: SATA link down (SStatus 0 SControl 300)
    Nov  8 12:27:21 Mars kernel: ata8.00: disabled
    Nov  8 12:27:21 Mars kernel: ata11: SATA link down (SStatus 0 SControl 300)
    Nov  8 12:27:21 Mars kernel: sd 8:0:0:0: rejecting I/O to offline device
    Nov  8 12:27:21 Mars kernel: blk_update_request: I/O error, dev sdg, sector 3910845296 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0
    Nov  8 12:27:21 Mars kernel: md: disk5 read error, sector=3910845232
    Nov  8 12:27:26 Mars kernel: ata13: SATA link down (SStatus 0 SControl 300)
    Nov  8 12:27:31 Mars kernel: ata8.00: detaching (SCSI 8:0:0:0)
    Nov  8 12:27:31 Mars kernel: sd 8:0:0:0: [sdg] Synchronizing SCSI cache
    Nov  8 12:27:31 Mars kernel: sd 8:0:0:0: [sdg] Synchronize Cache(10) failed: Result: hostbyte=0x04 driverbyte=DRIVER_OK
    Nov  8 12:27:31 Mars kernel: sd 8:0:0:0: [sdg] Stopping disk
    Nov  8 12:27:31 Mars kernel: sd 8:0:0:0: [sdg] Start/Stop Unit failed: Result: hostbyte=0x04 driverbyte=DRIVER_OK
    Nov  8 12:27:31 Mars kernel: ata11: SATA link down (SStatus 0 SControl 300)
    Nov  8 12:27:31 Mars kernel: ata11.00: disabled
    Nov  8 12:27:31 Mars kernel: ata12: SATA link down (SStatus 0 SControl 300)
    Nov  8 12:27:31 Mars kernel: ata12.00: disabled
    Nov  8 12:27:31 Mars kernel: ata13: SATA link down (SStatus 0 SControl 300)
    Nov  8 12:27:31 Mars kernel: sd 12:0:0:0: rejecting I/O to offline device
    Nov  8 12:27:31 Mars kernel: blk_update_request: I/O error, dev sdi, sector 3910845296 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0
    Nov  8 12:27:31 Mars kernel: md: disk4 read error, sector=3910845232
    Nov  8 12:27:31 Mars kernel: XFS (md5): metadata I/O error in "xfs_da_read_buf+0xa3/0x103 [xfs]" at daddr 0xe91ac330 len 8 error 5
    Nov  8 12:27:31 Mars kernel: XFS (md5): metadata I/O error in "xfs_da_read_buf+0xa3/0x103 [xfs]" at daddr 0xe91ac330 len 8 error 5
    Nov  8 12:27:31 Mars kernel: XFS (md5): metadata I/O error in "xfs_da_read_buf+0xa3/0x103 [xfs]" at daddr 0xe91ac330 len 8 error 5
    Nov  8 12:27:31 Mars kernel: XFS (md5): metadata I/O error in "xfs_da_read_buf+0xa3/0x103 [xfs]" at daddr 0xe91ac330 len 8 error 5
    Nov  8 12:27:31 Mars kernel: XFS (md5): metadata I/O error in "xfs_da_read_buf+0xa3/0x103 [xfs]" at daddr 0xe91ac330 len 8 error 5
    Nov  8 12:27:31 Mars kernel: XFS (md5): metadata I/O error in "xfs_da_read_buf+0xa3/0x103 [xfs]" at daddr 0xe91ac330 len 8 error 5
    Nov  8 12:27:31 Mars kernel: XFS (md5): metadata I/O error in "xfs_da_read_buf+0xa3/0x103 [xfs]" at daddr 0xe91ac330 len 8 error 5
    Nov  8 12:27:31 Mars kernel: XFS (md5): metadata I/O error in "xfs_da_read_buf+0xa3/0x103 [xfs]" at daddr 0xe91ac330 len 8 error 5
    Nov  8 12:27:31 Mars kernel: XFS (md5): metadata I/O error in "xfs_da_read_buf+0xa3/0x103 [xfs]" at daddr 0xe91ac330 len 8 error 5
    Nov  8 12:27:31 Mars kernel: XFS (md5): metadata I/O error in "xfs_da_read_buf+0xa3/0x103 [xfs]" at daddr 0xe91ac330 len 8 error 5
    Nov  8 12:27:31 Mars kernel: md: disk5 read error, sector=6143051080
    Nov  8 12:27:31 Mars kernel: blk_update_request: I/O error, dev sdi, sector 6143051144 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0
    Nov  8 12:27:31 Mars kernel: md: disk4 read error, sector=6143051080
    Nov  8 12:27:32 Mars kernel: md: disk5 read error, sector=6143089464
    Nov  8 12:27:32 Mars kernel: blk_update_request: I/O error, dev sdi, sector 6143089528 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0
    Nov  8 12:27:32 Mars kernel: md: disk4 read error, sector=6143089464
    Nov  8 12:27:37 Mars kernel: ata12.00: detaching (SCSI 12:0:0:0)
    Nov  8 12:27:37 Mars kernel: ata11.00: detaching (SCSI 11:0:0:0)
    Nov  8 12:27:37 Mars kernel: sd 11:0:0:0: [sdh] Synchronizing SCSI cache
    Nov  8 12:27:37 Mars kernel: sd 11:0:0:0: [sdh] Synchronize Cache(10) failed: Result: hostbyte=0x04 driverbyte=DRIVER_OK
    Nov  8 12:27:37 Mars kernel: sd 11:0:0:0: [sdh] Stopping disk
    Nov  8 12:27:37 Mars kernel: sd 11:0:0:0: [sdh] Start/Stop Unit failed: Result: hostbyte=0x04 driverbyte=DRIVER_OK
    Nov  8 12:27:37 Mars kernel: sd 12:0:0:0: [sdi] Synchronizing SCSI cache
    Nov  8 12:27:37 Mars kernel: sd 12:0:0:0: [sdi] Synchronize Cache(10) failed: Result: hostbyte=0x04 driverbyte=DRIVER_OK
    Nov  8 12:27:37 Mars kernel: sd 12:0:0:0: [sdi] Stopping disk
    Nov  8 12:27:37 Mars kernel: sd 12:0:0:0: [sdi] Start/Stop Unit failed: Result: hostbyte=0x04 driverbyte=DRIVER_OK
    Nov  8 12:27:37 Mars kernel: ata13: SATA link down (SStatus 0 SControl 300)
    Nov  8 12:27:37 Mars kernel: ata13.00: disabled
    Nov  8 12:27:37 Mars emhttpd: error: get_device_size, 1549: No such device or address (6): open: /dev/sdj
    Nov  8 12:27:37 Mars emhttpd: error: device_inventory, 1704: No such file or directory (2): readlink: /sys/dev/block/8:112
    Nov  8 12:27:37 Mars emhttpd: error: device_inventory, 1704: No such file or directory (2): readlink: /sys/dev/block/8:129
    Nov  8 12:27:37 Mars emhttpd: error: device_inventory, 1704: No such file or directory (2): readlink: /sys/dev/block/8:113
    Nov  8 12:27:37 Mars emhttpd: error: device_inventory, 1704: No such file or directory (2): readlink: /sys/dev/block/8:128
    Nov  8 12:27:37 Mars kernel: ata13.00: detaching (SCSI 13:0:0:0)
    Nov  8 12:27:37 Mars kernel: sd 13:0:0:0: [sdj] Synchronizing SCSI cache
    Nov  8 12:27:37 Mars kernel: sd 13:0:0:0: [sdj] Synchronize Cache(10) failed: Result: hostbyte=0x04 driverbyte=DRIVER_OK
    Nov  8 12:27:37 Mars kernel: sd 13:0:0:0: [sdj] Stopping disk
    Nov  8 12:27:37 Mars kernel: sd 13:0:0:0: [sdj] Start/Stop Unit failed: Result: hostbyte=0x04 driverbyte=DRIVER_OK
    Nov  8 12:27:37 Mars unassigned.devices: Warning: Can't get rotational setting of 'sdj'.
    Nov  8 12:27:44 Mars kernel: md: disk4 read error, sector=282765536
    Nov  8 12:27:44 Mars kernel: md: disk5 read error, sector=282765536
    Nov  8 12:27:44 Mars kernel: XFS (md4): metadata I/O error in "xfs_da_read_buf+0xa3/0x103 [xfs]" at daddr 0x10daa8e0 len 8 error 5
    Nov  8 12:27:44 Mars kernel: md: disk5 read error, sector=5988858984
    Nov  8 12:27:44 Mars kernel: md: disk5 read error, sector=5988858992
    Nov  8 12:27:44 Mars kernel: md: disk5 read error, sector=5988859000
    Nov  8 12:27:44 Mars kernel: md: disk5 read error, sector=5988859008
    Nov  8 12:27:44 Mars kernel: md: disk4 read error, sector=5988858984
    Nov  8 12:27:44 Mars kernel: md: disk4 read error, sector=5988858992
    Nov  8 12:27:44 Mars kernel: md: disk4 read error, sector=5988859000
    Nov  8 12:27:44 Mars kernel: md: disk4 read error, sector=5988859008
    Nov  8 12:27:44 Mars kernel: md: disk4 read error, sector=6008286176
    Nov  8 12:27:44 Mars kernel: md: disk5 read error, sector=6008286176
    Nov  8 12:27:44 Mars kernel: XFS (md4): metadata I/O error in "xfs_da_read_buf+0xa3/0x103 [xfs]" at daddr 0x1661f2be0 len 8 error 5
    Nov  8 12:27:44 Mars kernel: md: disk5 read error, sector=4035502192
    Nov  8 12:27:44 Mars kernel: md: disk5 read error, sector=4035502200
    Nov  8 12:27:44 Mars kernel: md: disk5 read error, sector=4035502208
    Nov  8 12:27:44 Mars kernel: md: disk5 read error, sector=4035502216
    Nov  8 12:27:44 Mars kernel: md: disk4 read error, sector=4035502192
    Nov  8 12:27:44 Mars kernel: md: disk4 read error, sector=4035502200
    Nov  8 12:27:44 Mars kernel: md: disk4 read error, sector=4035502208
    Nov  8 12:27:44 Mars kernel: md: disk4 read error, sector=4035502216
    Nov  8 12:27:44 Mars kernel: md: disk5 read error, sector=5990007848
    Nov  8 12:27:44 Mars kernel: md: disk5 read error, sector=5990007856
    Nov  8 12:27:44 Mars kernel: md: disk5 read error, sector=5990007864
    Nov  8 12:27:44 Mars kernel: md: disk5 read error, sector=5990007872
    Nov  8 12:27:44 Mars kernel: md: disk4 read error, sector=5990007848
    Nov  8 12:27:44 Mars kernel: md: disk4 read error, sector=5990007856
    Nov  8 12:27:44 Mars kernel: md: disk4 read error, sector=5990007864
    Nov  8 12:27:44 Mars kernel: md: disk4 read error, sector=5990007872
    Nov  8 12:27:50 Mars kernel: md: disk4 read error, sector=66720
    Nov  8 12:27:50 Mars kernel: md: disk5 read error, sector=66720
    Nov  8 12:27:50 Mars kernel: XFS (md4): metadata I/O error in "xfs_da_read_buf+0xa3/0x103 [xfs]" at daddr 0x104a0 len 8 error 5
    Nov  8 12:27:50 Mars kernel: XFS (md4): metadata I/O error in "xfs_da_read_buf+0xa3/0x103 [xfs]" at daddr 0x104a0 len 8 error 5
    Nov  8 12:27:50 Mars kernel: XFS (md4): metadata I/O error in "xfs_da_read_buf+0xa3/0x103 [xfs]" at daddr 0x104a0 len 8 error 5
    Nov  8 12:27:50 Mars kernel: XFS (md4): metadata I/O error in "xfs_da_read_buf+0xa3/0x103 [xfs]" at daddr 0x104a0 len 8 error 5
    Nov  8 12:27:50 Mars kernel: XFS (md4): metadata I/O error in "xfs_da_read_buf+0xa3/0x103 [xfs]" at daddr 0x104a0 len 8 error 5
    Nov  8 12:27:50 Mars kernel: XFS (md4): metadata I/O error in "xfs_da_read_buf+0xa3/0x103 [xfs]" at daddr 0x104a0 len 8 error 5
    Nov  8 12:27:50 Mars kernel: XFS (md4): metadata I/O error in "xfs_da_read_buf+0xa3/0x103 [xfs]" at daddr 0x104a0 len 8 error 5
    Nov  8 12:27:50 Mars kernel: XFS (md4): metadata I/O error in "xfs_da_read_buf+0xa3/0x103 [xfs]" at daddr 0x104a0 len 8 error 5
    Nov  8 12:27:50 Mars kernel: md: disk4 read error, sector=2303691736
    ....
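
To make a long excerpt like this easier to read, the sketch below counts link-down events per port and I/O errors per device, and pairs each port with its drive letter through the SCSI address in the "detaching" and "[sdX]" lines. It is only a rough aid, assuming the line formats shown above. The function name `summarize` and the shortened `SAMPLE` (lines copied from the log above) are illustrative.

```python
import re
from collections import Counter

# A few lines copied from the log above; in practice, read the whole syslog file.
SAMPLE = """\
Nov  8 12:26:54 Mars kernel: ata8: SATA link down (SStatus 0 SControl 300)
Nov  8 12:27:21 Mars kernel: blk_update_request: I/O error, dev sdg, sector 3910845296 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0
Nov  8 12:27:21 Mars kernel: md: disk5 read error, sector=3910845232
Nov  8 12:27:31 Mars kernel: ata8.00: detaching (SCSI 8:0:0:0)
Nov  8 12:27:31 Mars kernel: sd 8:0:0:0: [sdg] Synchronizing SCSI cache
Nov  8 12:27:31 Mars kernel: ata12: SATA link down (SStatus 0 SControl 300)
Nov  8 12:27:31 Mars kernel: blk_update_request: I/O error, dev sdi, sector 3910845296 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0
Nov  8 12:27:31 Mars kernel: md: disk4 read error, sector=3910845232
Nov  8 12:27:37 Mars kernel: ata12.00: detaching (SCSI 12:0:0:0)
Nov  8 12:27:37 Mars kernel: sd 12:0:0:0: [sdi] Synchronizing SCSI cache
"""

LINK_DOWN = re.compile(r"kernel: (ata\d+): SATA link down")
DETACH = re.compile(r"kernel: (ata\d+)\.\d+: detaching \(SCSI ([\d:]+)\)")
SD_NAME = re.compile(r"kernel: sd ([\d:]+): \[(sd[a-z]+)\]")
IO_ERROR = re.compile(r"I/O error, dev (sd[a-z]+),")
MD_ERROR = re.compile(r"kernel: md: (disk\d+) read error")


def summarize(log_text):
    """Count link-down, I/O and md read errors, and map ata ports to sdX names."""
    link_down, io_errors, md_errors = Counter(), Counter(), Counter()
    scsi_to_port, scsi_to_dev = {}, {}
    for line in log_text.splitlines():
        m = LINK_DOWN.search(line)
        if m:
            link_down[m.group(1)] += 1
        m = DETACH.search(line)
        if m:
            scsi_to_port[m.group(2)] = m.group(1)
        m = SD_NAME.search(line)
        if m:
            scsi_to_dev[m.group(1)] = m.group(2)
        m = IO_ERROR.search(line)
        if m:
            io_errors[m.group(1)] += 1
        m = MD_ERROR.search(line)
        if m:
            md_errors[m.group(1)] += 1
    # Join the two maps on the SCSI address (e.g. "8:0:0:0").
    port_to_dev = {
        port: scsi_to_dev[addr]
        for addr, port in scsi_to_port.items()
        if addr in scsi_to_dev
    }
    return {
        "link_down": dict(link_down),
        "io_errors": dict(io_errors),
        "md_errors": dict(md_errors),
        "port_to_dev": port_to_dev,
    }


if __name__ == "__main__":
    for key, value in summarize(SAMPLE).items():
        print(key, value)
```

Run against the full log, a summary like this would show at a glance whether every dropped port belongs to the enclosure behind the port multiplier.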

     

    I didn't really want to hang around long, so I fairly quickly reverted to 6.9.2, and everything is working again. I started a parity check, and it is going OK after 200GB of checking.

     

    I wouldn't call this urgent, but it's not a minor issue or an annoyance either. I would consider it a major problem, certainly for the next release. The priority options aren't the best, IMO.

     

     

    mars-diagnostics-20211108-1230.zip









  • Priority Definitions

     

    Minor = Something not working correctly.

     

    Urgent = Server crash, data loss, or other showstopper.

     

    Annoyance = Doesn't affect functionality but should be fixed.

     

    Other = Announcement or other non-issue.
