Reddog1400 Posted August 10, 2022 Share Posted August 10, 2022 A few weeks ago, I had a disk that got disabled after some read errors. I thought maybe the disk was going bad, so I pulled it out of the array and precleared it. Disk passed preclear. I also bought another drive just in case. I have put the new drive into the array and the rebuild keeps throwing errors and shutting down the array. There have been a lot of wonky things going on the past couple of weeks. I am unsure what to do. I have attached diagnostics. defiant-syslog-20220810-1329.zip defiant-diagnostics-20220810-0927.zip Quote Link to comment
JorgeB Posted August 10, 2022 Share Posted August 10, 2022 Problem with the onboard SATA controller, quite common with some Ryzen servers especially under load, best bet is to use an add-on controller. Quote Link to comment
trurl Posted August 10, 2022 Share Posted August 10, 2022 Disk7 SMART report looks fine. Probably nothing wrong with the original disk either. Connection problems are much more common that bad disks. And it looks like you have connection/controller problems on multiple disks, causing read errors. You should be able to see these in the Errors column on Main - Array Devices. This might have even broken user shares since diagnostics isn't showing any. You have a Marvell controller, these are NOT recommended. However Spoiler Aug 10 09:25:28 Defiant kernel: ata5.00: exception Emask 0x0 SAct 0x8fc SErr 0x0 action 0x6 frozen Aug 10 09:25:28 Defiant kernel: ata5.00: failed command: READ FPDMA QUEUED Aug 10 09:25:28 Defiant kernel: ata5.00: cmd 60/40:10:58:6d:4a/05:00:04:00:00/40 tag 2 ncq dma 688128 in Aug 10 09:25:28 Defiant kernel: res 40/00:01:01:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout) Aug 10 09:25:28 Defiant kernel: ata5.00: status: { DRDY } Aug 10 09:25:28 Defiant kernel: ata5.00: failed command: READ FPDMA QUEUED Aug 10 09:25:28 Defiant kernel: ata5.00: cmd 60/40:18:98:72:4a/05:00:04:00:00/40 tag 3 ncq dma 688128 in Aug 10 09:25:28 Defiant kernel: res 40/00:01:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout) Aug 10 09:25:28 Defiant kernel: ata5.00: status: { DRDY } Aug 10 09:25:28 Defiant kernel: ata5.00: failed command: READ FPDMA QUEUED Aug 10 09:25:28 Defiant kernel: ata5.00: cmd 60/38:20:d8:77:4a/04:00:04:00:00/40 tag 4 ncq dma 552960 in Aug 10 09:25:28 Defiant kernel: res 40/00:01:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout) Aug 10 09:25:28 Defiant kernel: ata5.00: status: { DRDY } Aug 10 09:25:28 Defiant kernel: ata5.00: failed command: READ FPDMA QUEUED Aug 10 09:25:28 Defiant kernel: ata5.00: cmd 60/40:28:10:7c:4a/05:00:04:00:00/40 tag 5 ncq dma 688128 in Aug 10 09:25:28 Defiant kernel: res 40/00:00:01:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout) Aug 10 09:25:28 Defiant kernel: ata5.00: status: { DRDY } Aug 10 09:25:28 Defiant kernel: ata5.00: failed command: READ FPDMA QUEUED Aug 10 09:25:28 Defiant kernel: ata5.00: cmd 60/40:30:50:81:4a/05:00:04:00:00/40 tag 6 ncq dma 688128 in Aug 10 09:25:28 Defiant kernel: res 40/00:01:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout) Aug 10 09:25:28 Defiant kernel: ata5.00: status: { DRDY } Aug 10 09:25:28 Defiant kernel: ata5.00: failed command: READ FPDMA QUEUED Aug 10 09:25:28 Defiant kernel: ata5.00: cmd 60/78:38:90:86:4a/02:00:04:00:00/40 tag 7 ncq dma 323584 in Aug 10 09:25:28 Defiant kernel: res 40/00:01:01:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout) Aug 10 09:25:28 Defiant kernel: ata5.00: status: { DRDY } Aug 10 09:25:28 Defiant kernel: ata5.00: failed command: READ FPDMA QUEUED Aug 10 09:25:28 Defiant kernel: ata5.00: cmd 60/50:58:08:89:4a/04:00:04:00:00/40 tag 11 ncq dma 565248 in Aug 10 09:25:28 Defiant kernel: res 40/00:01:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout) Aug 10 09:25:28 Defiant kernel: ata5.00: status: { DRDY } Aug 10 09:25:28 Defiant kernel: ata5: hard resetting link Aug 10 09:25:28 Defiant kernel: ata6.00: exception Emask 0x0 SAct 0x3f8000 SErr 0x0 action 0x6 frozen Aug 10 09:25:28 Defiant kernel: ata6.00: failed command: READ FPDMA QUEUED Aug 10 09:25:28 Defiant kernel: ata6.00: cmd 60/40:78:58:6d:4a/05:00:04:00:00/40 tag 15 ncq dma 688128 in Aug 10 09:25:28 Defiant kernel: res 40/00:01:01:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout) Aug 10 09:25:28 Defiant kernel: ata6.00: status: { DRDY } Aug 10 09:25:28 Defiant kernel: ata6.00: failed command: READ FPDMA QUEUED Aug 10 09:25:28 Defiant kernel: ata6.00: cmd 60/40:80:98:72:4a/05:00:04:00:00/40 tag 16 ncq dma 688128 in Aug 10 09:25:28 Defiant kernel: res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout) Aug 10 09:25:28 Defiant kernel: ata6.00: status: { DRDY } Aug 10 09:25:28 Defiant kernel: ata6.00: failed command: READ FPDMA QUEUED Aug 10 09:25:28 Defiant kernel: ata6.00: cmd 60/38:88:d8:77:4a/04:00:04:00:00/40 tag 17 ncq dma 552960 in Aug 10 09:25:28 Defiant kernel: res 40/00:ff:00:00:00/00:00:00:00:00/40 Emask 0x4 (timeout) Aug 10 09:25:28 Defiant kernel: ata6.00: status: { DRDY } Aug 10 09:25:28 Defiant kernel: ata6.00: failed command: READ FPDMA QUEUED Aug 10 09:25:28 Defiant kernel: ata6.00: cmd 60/40:90:10:7c:4a/05:00:04:00:00/40 tag 18 ncq dma 688128 in Aug 10 09:25:28 Defiant kernel: res 40/00:01:01:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout) Aug 10 09:25:28 Defiant kernel: ata6.00: status: { DRDY } Aug 10 09:25:28 Defiant kernel: ata6.00: failed command: READ FPDMA QUEUED Aug 10 09:25:28 Defiant kernel: ata6.00: cmd 60/40:98:50:81:4a/05:00:04:00:00/40 tag 19 ncq dma 688128 in Aug 10 09:25:28 Defiant kernel: res 40/00:01:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout) Aug 10 09:25:28 Defiant kernel: ata6.00: status: { DRDY } Aug 10 09:25:28 Defiant kernel: ata6.00: failed command: READ FPDMA QUEUED Aug 10 09:25:28 Defiant kernel: ata6.00: cmd 60/78:a0:90:86:4a/02:00:04:00:00/40 tag 20 ncq dma 323584 in Aug 10 09:25:28 Defiant kernel: res 40/00:01:01:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout) Aug 10 09:25:28 Defiant kernel: ata6.00: status: { DRDY } Aug 10 09:25:28 Defiant kernel: ata6.00: failed command: READ FPDMA QUEUED Aug 10 09:25:28 Defiant kernel: ata6.00: cmd 60/50:a8:08:89:4a/04:00:04:00:00/40 tag 21 ncq dma 565248 in Aug 10 09:25:28 Defiant kernel: res 40/00:01:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout) Aug 10 09:25:28 Defiant kernel: ata6.00: status: { DRDY } Aug 10 09:25:28 Defiant kernel: ata6: hard resetting link Aug 10 09:25:28 Defiant kernel: ata2.00: exception Emask 0x0 SAct 0x7f00 SErr 0x0 action 0x6 frozen Aug 10 09:25:28 Defiant kernel: ata2.00: failed command: READ FPDMA QUEUED Aug 10 09:25:28 Defiant kernel: ata2.00: cmd 60/40:40:58:6d:4a/05:00:04:00:00/40 tag 8 ncq dma 688128 in Aug 10 09:25:28 Defiant kernel: res 40/00:ff:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout) Aug 10 09:25:28 Defiant kernel: ata2.00: status: { DRDY } Aug 10 09:25:28 Defiant kernel: ata2.00: failed command: READ FPDMA QUEUED Aug 10 09:25:28 Defiant kernel: ata2.00: cmd 60/40:48:98:72:4a/05:00:04:00:00/40 tag 9 ncq dma 688128 in Aug 10 09:25:28 Defiant kernel: res 40/00:01:00:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout) Aug 10 09:25:28 Defiant kernel: ata2.00: status: { DRDY } Aug 10 09:25:28 Defiant kernel: ata2.00: failed command: READ FPDMA QUEUED Aug 10 09:25:28 Defiant kernel: ata2.00: cmd 60/38:50:d8:77:4a/04:00:04:00:00/40 tag 10 ncq dma 552960 in Aug 10 09:25:28 Defiant kernel: res 40/00:01:01:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout) Aug 10 09:25:28 Defiant kernel: ata2.00: status: { DRDY } Aug 10 09:25:28 Defiant kernel: ata2.00: failed command: READ FPDMA QUEUED Aug 10 09:25:28 Defiant kernel: ata2.00: cmd 60/40:58:10:7c:4a/05:00:04:00:00/40 tag 11 ncq dma 688128 in Aug 10 09:25:28 Defiant kernel: res 40/00:ff:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout) Aug 10 09:25:28 Defiant kernel: ata2.00: status: { DRDY } Aug 10 09:25:28 Defiant kernel: ata2.00: failed command: READ FPDMA QUEUED Aug 10 09:25:28 Defiant kernel: ata2.00: cmd 60/40:60:50:81:4a/05:00:04:00:00/40 tag 12 ncq dma 688128 in Aug 10 09:25:28 Defiant kernel: res 40/00:01:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout) Aug 10 09:25:28 Defiant kernel: ata2.00: status: { DRDY } Aug 10 09:25:28 Defiant kernel: ata2.00: failed command: READ FPDMA QUEUED Aug 10 09:25:28 Defiant kernel: ata2.00: cmd 60/78:68:90:86:4a/02:00:04:00:00/40 tag 13 ncq dma 323584 in Aug 10 09:25:28 Defiant kernel: res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout) Aug 10 09:25:28 Defiant kernel: ata2.00: status: { DRDY } Aug 10 09:25:28 Defiant kernel: ata2.00: failed command: READ FPDMA QUEUED Aug 10 09:25:28 Defiant kernel: ata2.00: cmd 60/50:70:08:89:4a/04:00:04:00:00/40 tag 14 ncq dma 565248 in Aug 10 09:25:28 Defiant kernel: res 40/00:01:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout) Aug 10 09:25:28 Defiant kernel: ata2.00: status: { DRDY } Aug 10 09:25:28 Defiant kernel: ata2: hard resetting link Aug 10 09:25:37 Defiant kernel: ata1.00: exception Emask 0x0 SAct 0x8fe SErr 0x0 action 0x6 frozen Aug 10 09:25:37 Defiant kernel: ata1.00: failed command: READ FPDMA QUEUED Aug 10 09:25:37 Defiant kernel: ata1.00: cmd 60/40:08:58:6d:4a/05:00:04:00:00/40 tag 1 ncq dma 688128 in Aug 10 09:25:37 Defiant kernel: res 40/00:01:00:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout) Aug 10 09:25:37 Defiant kernel: ata1.00: status: { DRDY } Aug 10 09:25:37 Defiant kernel: ata1.00: failed command: READ FPDMA QUEUED Aug 10 09:25:37 Defiant kernel: ata1.00: cmd 60/40:10:98:72:4a/05:00:04:00:00/40 tag 2 ncq dma 688128 in Aug 10 09:25:37 Defiant kernel: res 40/00:ff:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout) Aug 10 09:25:37 Defiant kernel: ata1.00: status: { DRDY } Aug 10 09:25:37 Defiant kernel: ata1.00: failed command: READ FPDMA QUEUED Aug 10 09:25:37 Defiant kernel: ata1.00: cmd 60/38:18:d8:77:4a/04:00:04:00:00/40 tag 3 ncq dma 552960 in Aug 10 09:25:37 Defiant kernel: res 40/00:01:00:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout) Aug 10 09:25:37 Defiant kernel: ata1.00: status: { DRDY } Aug 10 09:25:37 Defiant kernel: ata1.00: failed command: READ FPDMA QUEUED Aug 10 09:25:37 Defiant kernel: ata1.00: cmd 60/40:20:10:7c:4a/05:00:04:00:00/40 tag 4 ncq dma 688128 in Aug 10 09:25:37 Defiant kernel: res 40/00:01:00:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout) Aug 10 09:25:37 Defiant kernel: ata1.00: status: { DRDY } Aug 10 09:25:37 Defiant kernel: ata1.00: failed command: READ FPDMA QUEUED Aug 10 09:25:37 Defiant kernel: ata1.00: cmd 60/40:28:50:81:4a/05:00:04:00:00/40 tag 5 ncq dma 688128 in Aug 10 09:25:37 Defiant kernel: res 40/00:01:01:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout) Aug 10 09:25:37 Defiant kernel: ata1.00: status: { DRDY } Aug 10 09:25:37 Defiant kernel: ata1.00: failed command: READ FPDMA QUEUED Aug 10 09:25:37 Defiant kernel: ata1.00: cmd 60/78:30:90:86:4a/02:00:04:00:00/40 tag 6 ncq dma 323584 in Aug 10 09:25:37 Defiant kernel: res 40/00:01:00:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout) Aug 10 09:25:37 Defiant kernel: ata1.00: status: { DRDY } Aug 10 09:25:37 Defiant kernel: ata1.00: failed command: READ FPDMA QUEUED Aug 10 09:25:37 Defiant kernel: ata1.00: cmd 60/50:38:08:89:4a/04:00:04:00:00/40 tag 7 ncq dma 565248 in Aug 10 09:25:37 Defiant kernel: res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout) Aug 10 09:25:37 Defiant kernel: ata1.00: status: { DRDY } Aug 10 09:25:37 Defiant kernel: ata1.00: failed command: READ FPDMA QUEUED Aug 10 09:25:37 Defiant kernel: ata1.00: cmd 60/08:58:50:00:00/00:00:00:00:00/40 tag 11 ncq dma 4096 in Aug 10 09:25:37 Defiant kernel: res 40/00:ff:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout) Aug 10 09:25:37 Defiant kernel: ata1.00: status: { DRDY } Aug 10 09:25:37 Defiant kernel: ata1: hard resetting link Aug 10 09:25:38 Defiant kernel: ata5: softreset failed (1st FIS failed) Aug 10 09:25:38 Defiant kernel: ata5: hard resetting link Aug 10 09:25:38 Defiant kernel: ata2: softreset failed (1st FIS failed) Aug 10 09:25:38 Defiant kernel: ata2: hard resetting link Aug 10 09:25:38 Defiant kernel: ata6: softreset failed (1st FIS failed) Aug 10 09:25:38 Defiant kernel: ata6: hard resetting link Aug 10 09:25:47 Defiant kernel: ata1: softreset failed (1st FIS failed) Aug 10 09:25:47 Defiant kernel: ata1: hard resetting link Aug 10 09:25:48 Defiant kernel: ata6: softreset failed (1st FIS failed) Aug 10 09:25:48 Defiant kernel: ata6: hard resetting link Aug 10 09:25:48 Defiant kernel: ata2: softreset failed (1st FIS failed) Aug 10 09:25:48 Defiant kernel: ata2: hard resetting link Aug 10 09:25:48 Defiant kernel: ata5: softreset failed (1st FIS failed) Aug 10 09:25:48 Defiant kernel: ata5: hard resetting link Aug 10 09:25:57 Defiant kernel: ata1: softreset failed (1st FIS failed) Aug 10 09:25:57 Defiant kernel: ata1: hard resetting link Aug 10 09:26:23 Defiant kernel: ata6: softreset failed (1st FIS failed) Aug 10 09:26:23 Defiant kernel: ata6: limiting SATA link speed to 3.0 Gbps Aug 10 09:26:23 Defiant kernel: ata6: hard resetting link Aug 10 09:26:23 Defiant kernel: ata2: softreset failed (1st FIS failed) Aug 10 09:26:23 Defiant kernel: ata2: limiting SATA link speed to 3.0 Gbps Aug 10 09:26:23 Defiant kernel: ata2: hard resetting link Aug 10 09:26:23 Defiant kernel: ata5: softreset failed (1st FIS failed) Aug 10 09:26:23 Defiant kernel: ata5: limiting SATA link speed to 3.0 Gbps Aug 10 09:26:23 Defiant kernel: ata5: hard resetting link Aug 10 09:26:28 Defiant kernel: ata6: softreset failed (1st FIS failed) Aug 10 09:26:28 Defiant kernel: ata6: reset failed, giving up Aug 10 09:26:28 Defiant kernel: ata6.00: disabled Aug 10 09:26:28 Defiant kernel: ata6: EH complete Aug 10 09:26:28 Defiant kernel: sd 6:0:0:0: [sde] tag#6 UNKNOWN(0x2003) Result: hostbyte=0x04 driverbyte=DRIVER_OK cmd_age=90s Aug 10 09:26:28 Defiant kernel: sd 6:0:0:0: [sde] tag#6 CDB: opcode=0x88 88 00 00 00 00 00 04 4a 89 08 00 00 04 50 00 00 Spoiler [1:0:0:0] disk ATA WDC WD101EMAZ-11 0A81 /dev/sdb /dev/sg1 state=running queue_depth=32 scsi_level=6 type=0 device_blocked=0 timeout=30 dir: /sys/bus/scsi/devices/1:0:0:0 [/sys/devices/pci0000:00/0000:00:01.3/0000:03:00.1/ata1/host1/target1:0:0/1:0:0:0] [2:0:0:0] disk ATA WDC WD140EDGZ-11 0A85 /dev/sdc /dev/sg2 state=running queue_depth=32 scsi_level=6 type=0 device_blocked=0 timeout=30 dir: /sys/bus/scsi/devices/2:0:0:0 [/sys/devices/pci0000:00/0000:00:01.3/0000:03:00.1/ata2/host2/target2:0:0/2:0:0:0] [5:0:0:0] disk ATA WDC WD40EFRX-68N 0A82 /dev/sdd /dev/sg3 state=running queue_depth=32 scsi_level=6 type=0 device_blocked=0 timeout=30 dir: /sys/bus/scsi/devices/5:0:0:0 [/sys/devices/pci0000:00/0000:00:01.3/0000:03:00.1/ata5/host5/target5:0:0/5:0:0:0] [6:0:0:0] disk ATA WDC WD40EFRX-68N 0A82 /dev/sde /dev/sg4 state=running queue_depth=32 scsi_level=6 type=0 device_blocked=0 timeout=30 dir: /sys/bus/scsi/devices/6:0:0:0 [/sys/devices/pci0000:00/0000:00:01.3/0000:03:00.1/ata6/host6/target6:0:0/6:0:0:0] it looks like the problems are all coming from this controller instead: 03:00.1 SATA controller [0106]: Advanced Micro Devices, Inc. [AMD] 400 Series Chipset SATA Controller [1022:43c8] (rev 01) Subsystem: ASMedia Technology Inc. 400 Series Chipset SATA Controller [1b21:1062] Not sure if that is on the motherboard. If not reseat it. And you must always double check all connections when mucking about inside. Shutdown, check all connections, all disks, SATA and power, both ends, including splitters. Reboot, restart rebuild, post new diagnostics. Quote Link to comment
trurl Posted August 10, 2022 Share Posted August 10, 2022 Just now, trurl said: Not sure if that is on the motherboard. 2 minutes ago, JorgeB said: Problem with the onboard SATA controller Quote Link to comment
Reddog1400 Posted August 10, 2022 Author Share Posted August 10, 2022 I will go through and reseat and recheck all the cables. What is a recommended SATA board? Quote Link to comment
trurl Posted August 10, 2022 Share Posted August 10, 2022 1 minute ago, Reddog1400 said: recommended SATA board Quote Link to comment
Reddog1400 Posted August 10, 2022 Author Share Posted August 10, 2022 So, something like this? https://www.newegg.com/lsi00302-sata-sas/p/N82E16816118183?Item=9SIB7VEH151745 With these cables? https://www.newegg.com/p/238-00A4-00003?Item=9SIACJF7BC9148 I am very new to SAS. I just want to make sure I am getting the right hardware. Quote Link to comment
JorgeB Posted August 10, 2022 Share Posted August 10, 2022 Yes, assuming it's genuine. Quote Link to comment
JonathanM Posted August 10, 2022 Share Posted August 10, 2022 I don't think the 92xx series has been manufactured by LSI for quite some time, so the only genuine ones are used server pulls. If it states new, it's probably counterfeit. Quote Link to comment
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.