
Repeat drive failures - but only during midnight housekeeping



The same drive keeps flaking out in the middle of the night, roughly once every couple of months. Yesterday I reseated the SATA cable at both the drive and the motherboard, and the drive died again overnight. Usually I just reboot and rebuild from parity, since it's a newer drive. Can someone take a peek at the attached logs and give me any indication of the cause or a solution? TIA

 

Some snippets:

Jun 21 00:00:51 Sophos kernel: ata9.00: exception Emask 0x0 SAct 0xffffffff SErr 0x0 action 0x6 frozen
Jun 21 00:00:51 Sophos kernel: ata9.00: failed command: WRITE FPDMA QUEUED
Jun 21 00:00:51 Sophos kernel: ata9.00: cmd 61/20:00:20:3c:a4/00:00:00:00:00/40 tag 0 ncq dma 16384 out
Jun 21 00:00:51 Sophos kernel:         res 40/00:01:01:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout)
Jun 21 00:00:51 Sophos kernel: ata9.00: status: { DRDY }
Jun 21 00:00:51 Sophos kernel: ata9.00: failed command: WRITE FPDMA QUEUED
Jun 21 00:00:51 Sophos kernel: ata9.00: cmd 61/80:08:20:82:a4/00:00:00:00:00/40 tag 1 ncq dma 65536 out
Jun 21 00:00:51 Sophos kernel:         res 40/00:01:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)


Jun 21 00:01:30 Sophos kernel: ata9: hard resetting link
Jun 21 00:01:30 Sophos kernel: ata9: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
Jun 21 00:01:30 Sophos kernel: ata9.00: supports DRM functions and may not be fully accessible
Jun 21 00:01:45 Sophos kernel: ata9.00: qc timeout (cmd 0xef)
Jun 21 00:01:45 Sophos kernel: ata9.00: failed to set xfermode (err_mask=0x4)
Jun 21 00:01:45 Sophos kernel: ata9: hard resetting link
Jun 21 00:01:55 Sophos kernel: ata9: softreset failed (1st FIS failed)
Jun 21 00:01:55 Sophos kernel: ata9: hard resetting link
Jun 21 00:02:05 Sophos kernel: ata9: softreset failed (1st FIS failed)
Jun 21 00:02:05 Sophos kernel: ata9: hard resetting link
Jun 21 00:02:40 Sophos kernel: ata9: softreset failed (1st FIS failed)
Jun 21 00:02:40 Sophos kernel: ata9: limiting SATA link speed to 3.0 Gbps
Jun 21 00:02:40 Sophos kernel: ata9: hard resetting link
Jun 21 00:02:45 Sophos kernel: ata9: softreset failed (1st FIS failed)
Jun 21 00:02:45 Sophos kernel: ata9: reset failed, giving up
Jun 21 00:02:45 Sophos kernel: ata9.00: disabled
Jun 21 00:02:45 Sophos kernel: ata9: EH complete
Jun 21 00:02:45 Sophos kernel: sd 9:0:0:0: [sdj] tag#12 UNKNOWN(0x2003) Result: hostbyte=0x04 driverbyte=0x00 cmd_age=149s
Jun 21 00:02:45 Sophos kernel: sd 9:0:0:0: [sdj] tag#18 UNKNOWN(0x2003) Result: hostbyte=0x04 driverbyte=0x00 cmd_age=0s
Jun 21 00:02:45 Sophos kernel: sd 9:0:0:0: [sdj] tag#18 CDB: opcode=0x2a 2a 00 0b e0 15 10 00 00 30 00
Jun 21 00:02:45 Sophos kernel: blk_update_request: I/O error, dev sdj, sector 199234832 op 0x1:(WRITE) flags 0x8800 phys_seg 6 prio class 0
Jun 21 00:02:45 Sophos kernel: sd 9:0:0:0: [sdj] tag#12 CDB: opcode=0x93 93 08 00 00 00 00 00 00 30 00 00 00 00 20 00 00
Jun 21 00:02:45 Sophos kernel: blk_update_request: I/O error, dev sdj, sector 12288 op 0x3:(DISCARD) flags 0x800 phys_seg 1 prio class 0
Jun 21 00:02:45 Sophos kernel: BTRFS error (device sdj1): bdev /dev/sdj1 errs: wr 1, rd 0, flush 0, corrupt 0, gen 0
Jun 21 00:02:45 Sophos kernel: BTRFS warning (device sdj1): direct IO failed ino 265 rw 1,34817 sector 0xbe01528 len 0 err no 10
Jun 21 00:02:45 Sophos kernel: sd 9:0:0:0: [sdj] Read Capacity(16) failed: Result: hostbyte=0x04 driverbyte=0x00
Jun 21 00:02:45 Sophos kernel: BTRFS error (device sdj1): bdev /dev/sdj1 errs: wr 2, rd 0, flush 0, corrupt 0, gen 0
Jun 21 00:02:45 Sophos kernel: BTRFS warning (device sdj1): direct IO failed ino 265 rw 1,34817 sector 0xbe01540 len 0 err no 10
Jun 21 00:02:45 Sophos kernel: sd 9:0:0:0: [sdj] Sense not available.
Jun 21 00:02:45 Sophos kernel: sd 9:0:0:0: [sdj] tag#19 UNKNOWN(0x2003) Result: hostbyte=0x04 driverbyte=0x00 cmd_age=0s
Jun 21 00:02:45 Sophos kernel: sd 9:0:0:0: [sdj] tag#19 CDB: opcode=0x2a 2a 00 0b e0 14 88 00 00 08 00
Jun 21 00:02:45 Sophos kernel: blk_update_request: I/O error, dev sdj, sector 199234696 op 0x1:(WRITE) flags 0x8800 phys_seg 1 prio class 0
Jun 21 00:02:45 Sophos kernel: BTRFS error (device sdj1): bdev /dev/sdj1 errs: wr 3, rd 0, flush 0, corrupt 0, gen 0
Jun 21 00:02:45 Sophos kernel: BTRFS warning (device sdj1): direct IO failed ino 265 rw 1,34817 sector 0xbe01490 len 0 err no 10
Jun 21 00:02:45 Sophos kernel: sd 9:0:0:0: [sdj] tag#20 UNKNOWN(0x2003) Result: hostbyte=0x04 driverbyte=0x00 cmd_age=106s
Jun 21 00:02:45 Sophos kernel: sd 9:0:0:0: [sdj] tag#20 CDB: opcode=0x2a 2a 00 0b e0 14 70 00 00 18 00

 

Jun 21 00:02:45 Sophos kernel: BTRFS warning (device sdj1): direct IO failed ino 265 rw 1,34817 sector 0xdfbdfc0 len 0 err no 10
Jun 21 00:02:45 Sophos kernel: sd 9:0:0:0: [sdj] tag#18 access beyond end of device
Jun 21 00:02:45 Sophos kernel: BTRFS warning (device sdj1): direct IO failed ino 265 rw 1,34817 sector 0xdfbdfe0 len 0 err no 10
Jun 21 00:02:45 Sophos kernel: sd 9:0:0:0: [sdj] tag#19 access beyond end of device
Jun 21 00:02:45 Sophos kernel: BTRFS warning (device sdj1): direct IO failed ino 265 rw 1,34817 sector 0xdfbe000 len 0 err no 10
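For anyone reading the CDB lines above, the bytes can be decoded by hand. Below is a minimal Python sketch, assuming the standard SCSI layouts for WRITE(10) (opcode 0x2a) and WRITE SAME(16) (opcode 0x93). The name `decode_cdb` is only illustrative and not part of any tool.

```python
def decode_cdb(hex_bytes: str) -> dict:
    """Decode LBA and block count from a WRITE(10) or WRITE SAME(16) CDB.

    hex_bytes is the space-separated byte string the kernel prints after
    "CDB: opcode=0x..". Only the two opcodes seen in the snippets are handled.
    """
    cdb = bytes.fromhex(hex_bytes.replace(" ", ""))
    opcode = cdb[0]
    if opcode == 0x2A:
        # WRITE(10): 4-byte big-endian LBA in bytes 2-5, 2-byte length in bytes 7-8
        return {
            "op": "WRITE(10)",
            "lba": int.from_bytes(cdb[2:6], "big"),
            "blocks": int.from_bytes(cdb[7:9], "big"),
        }
    if opcode == 0x93:
        # WRITE SAME(16): 8-byte LBA in bytes 2-9, 4-byte length in bytes 10-13,
        # UNMAP flag is bit 3 (0x08) of byte 1
        return {
            "op": "WRITE SAME(16)",
            "lba": int.from_bytes(cdb[2:10], "big"),
            "blocks": int.from_bytes(cdb[10:14], "big"),
            "unmap": bool(cdb[1] & 0x08),
        }
    raise ValueError(f"opcode 0x{opcode:02x} not handled in this sketch")


if __name__ == "__main__":
    print(decode_cdb("2a 00 0b e0 15 10 00 00 30 00"))
    print(decode_cdb("93 08 00 00 00 00 00 00 30 00 00 00 00 20 00 00"))
```

The tag#18 write decodes to LBA 199234832, the same sector the blk_update_request line reports. The 0x93 command has the UNMAP bit set and targets LBA 12288, which lines up with the DISCARD error at sector 12288.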

sophos-diagnostics-20210621-0733 (1).zip

57 minutes ago, stultus said:

Jun 21 00:02:45 Sophos kernel: ata9: softreset failed (1st FIS failed)
Jun 21 00:02:45 Sophos kernel: ata9: reset failed, giving up
Jun 21 00:02:45 Sophos kernel: ata9.00: disabled

This means the drive is dropping offline. Replace or swap the cables, both SATA and power, or connect the SSD to a different controller.
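To check whether the drops really cluster around one time of day and one port before and after swapping cables, a rough way is to count the libata error lines per port and hour. This is a minimal Python sketch, assuming the syslog line format shown in the snippets above. The function name and the list of trouble strings are my own choices, not anything from the diagnostics tooling.

```python
import re
from collections import Counter

# Matches lines like:
#   Jun 21 00:01:30 Sophos kernel: ata9: hard resetting link
#   Jun 21 00:02:45 Sophos kernel: ata9.00: disabled
LINE_RE = re.compile(
    r"^\w{3}\s+\d+\s+(?P<hour>\d{2}):\d{2}:\d{2}\s+\S+\s+kernel:\s+"
    r"(?P<port>ata\d+)(?:\.\d+)?:\s+(?P<msg>.*)$"
)

# Substrings taken from the messages in this thread (an assumption, not a full list)
TROUBLE = (
    "failed command",
    "hard resetting link",
    "softreset failed",
    "reset failed",
    "disabled",
)


def ata_trouble_by_hour(lines):
    """Count trouble lines keyed by (ATA port, hour of day)."""
    counts = Counter()
    for line in lines:
        m = LINE_RE.match(line.strip())
        if m and any(t in m["msg"] for t in TROUBLE):
            counts[(m["port"], int(m["hour"]))] += 1
    return counts


if __name__ == "__main__":
    sample = [
        "Jun 21 00:00:51 Sophos kernel: ata9.00: failed command: WRITE FPDMA QUEUED",
        "Jun 21 00:01:30 Sophos kernel: ata9: hard resetting link",
        "Jun 21 00:01:30 Sophos kernel: ata9: SATA link up 6.0 Gbps (SStatus 133 SControl 300)",
        "Jun 21 00:02:45 Sophos kernel: ata9.00: disabled",
    ]
    for (port, hour), n in sorted(ata_trouble_by_hour(sample).items()):
        print(f"{port} hour={hour:02d} lines={n}")
```

Feed it the lines of the syslog from the diagnostics zip. If the counts stay on the same port and hour after a cable swap, that points away from the cable and toward the drive or the controller.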
