June 24, 20215 yr Lately I have been noticing random reboots and lots of parity checks. Things were going well for a while though so I did not think much of it until today when I had a reboot while stopping the array. I clicked on the disk log information and two of the disks were spitting a bunch of warnings and errors. After a Google search I determined it could be a bad SATA cable so I swapped that out. Still having the issue. Below is some of the Disk log as well as a full attached log for reference. Any help would be appreciated. Thanks Jun 23 22:30:40 Sauron kernel: ata11: hard resetting link Jun 23 22:30:50 Sauron kernel: ata11: softreset failed (1st FIS failed) Jun 23 22:30:50 Sauron kernel: ata11: hard resetting link Jun 23 22:30:59 Sauron kernel: ata11: SATA link up 1.5 Gbps (SStatus 113 SControl 310) Jun 23 22:30:59 Sauron kernel: ata11.00: configured for UDMA/33 Jun 23 22:30:59 Sauron kernel: ata11: EH complete Jun 23 22:31:00 Sauron kernel: ata11.00: exception Emask 0x10 SAct 0xff00 SErr 0x90200 action 0xe frozen Jun 23 22:31:00 Sauron kernel: ata11.00: irq_stat 0x00400000, PHY RDY changed Jun 23 22:31:00 Sauron kernel: ata11: SError: { Persist PHYRdyChg 10B8B } Jun 23 22:31:00 Sauron kernel: ata11.00: failed command: READ FPDMA QUEUED Jun 23 22:31:00 Sauron kernel: ata11.00: cmd 60/40:40:98:e4:60/05:00:00:00:00/40 tag 8 ncq dma 688128 in Jun 23 22:31:00 Sauron kernel: ata11.00: status: { DRDY } Jun 23 22:31:00 Sauron kernel: ata11.00: failed command: READ FPDMA QUEUED Jun 23 22:31:00 Sauron kernel: ata11.00: cmd 60/40:48:d8:e9:60/00:00:00:00:00/40 tag 9 ncq dma 32768 in Jun 23 22:31:00 Sauron kernel: ata11.00: status: { DRDY } Jun 23 22:31:00 Sauron kernel: ata11.00: failed command: READ FPDMA QUEUED Jun 23 22:31:00 Sauron kernel: ata11.00: cmd 60/10:50:18:ea:60/05:00:00:00:00/40 tag 10 ncq dma 663552 in Jun 23 22:31:00 Sauron kernel: ata11.00: status: { DRDY } Jun 23 22:31:00 Sauron kernel: ata11.00: failed command: READ FPDMA QUEUED Jun 23 22:31:00 Sauron kernel: ata11.00: cmd 60/40:58:28:ef:60/05:00:00:00:00/40 tag 11 ncq dma 688128 in Jun 23 22:31:00 Sauron kernel: ata11.00: status: { DRDY } Jun 23 22:31:00 Sauron kernel: ata11.00: failed command: READ FPDMA QUEUED Jun 23 22:31:00 Sauron kernel: ata11.00: cmd 60/70:60:68:f4:60/00:00:00:00:00/40 tag 12 ncq dma 57344 in Jun 23 22:31:00 Sauron kernel: ata11.00: status: { DRDY } Jun 23 22:31:00 Sauron kernel: ata11.00: failed command: READ FPDMA QUEUED Jun 23 22:31:00 Sauron kernel: ata11.00: cmd 60/40:68:d8:f4:60/05:00:00:00:00/40 tag 13 ncq dma 688128 in Jun 23 22:31:00 Sauron kernel: ata11.00: status: { DRDY } Jun 23 22:31:00 Sauron kernel: ata11.00: failed command: READ FPDMA QUEUED Jun 23 22:31:00 Sauron kernel: ata11.00: cmd 60/40:70:18:fa:60/05:00:00:00:00/40 tag 14 ncq dma 688128 in Jun 23 22:31:00 Sauron kernel: ata11.00: status: { DRDY } Jun 23 22:31:00 Sauron kernel: ata11.00: failed command: READ FPDMA QUEUED Jun 23 22:31:00 Sauron kernel: ata11.00: cmd 60/40:78:58:ff:60/05:00:00:00:00/40 tag 15 ncq dma 688128 in Jun 23 22:31:00 Sauron kernel: ata11.00: status: { DRDY } Jun 23 22:31:00 Sauron kernel: ata11: hard resetting link sauron-syslog-20210624-0226.zip
June 24, 20215 yr Disk errors look more like a connection/power problem, for the reboots see here, also next time please post the full diagnostics instead.
June 24, 20215 yr Author Seems like it rebooted again an hour ago. I uploaded a diagnostic this time. When I have time later I can look at the page about reboots and try some other things. Thanks sauron-diagnostics-20210624-0921.zip
June 25, 20215 yr Author So I think I just confirmed that the issue is either with the sata cable or the sata port. I hot swapped the two troublesome drives with two other ones on my hot swappable case and now a different drive is spitting out errors. Weirdly only one drive is having problems post swap as opposed to two. I wonder if I can try seating the bad one better in the drive bay to see if that solves it. If anyone else has any solutions though I would appreciate that as well. Thanks sauron-diagnostics-20210624-2118.zip
Archived
This topic is now archived and is closed to further replies.