desertfox_0815 Posted August 14, 2021 Share Posted August 14, 2021 (edited) Hey guys, i do have a severe problem with my unraid server. my parity disk1 kept restarting on parity checks (Seagate Ironwolf 8Tb CMR). Smartchecks are clear and free of errors. Even though i've replaced the disk with a new disk same type. I've changed the sata cable and powercable and my new drive keeps doing this while rebuilding parity information. on every disk-restart parity checks slows down due to the spinup of the disk. with the new disk rebuild runs longer until paritydisk1 restarts. do you have any idea? Diagnostics log is attached. Thanks alot sideinfo: i've enabled encryption on my array so i have to rebuild parity. tower-diagnostics-20210814-0829.zip Edited August 14, 2021 by desertfox_0815 texterror Quote Link to comment
desertfox_0815 Posted August 14, 2021 Author Share Posted August 14, 2021 I've enabled the smart value "command time out" on both parity disks and parity disk2 gets errors. is this something to worry about!? should i change to single parity until the source error is found?! Quote Link to comment
JorgeB Posted August 14, 2021 Share Posted August 14, 2021 Constant ATA errors on both parity disks, replace/swap cables, including power and try again. Quote Link to comment
desertfox_0815 Posted August 14, 2021 Author Share Posted August 14, 2021 Ok thanks, cables are already ordered. Will arrive monday. could you tell me where you found these errors? Quote Link to comment
JorgeB Posted August 14, 2021 Share Posted August 14, 2021 You can see them in the syslog, see if they go way after the cables swap: Aug 14 07:53:02 Tower kernel: ata2.00: status: { DRDY } Aug 14 07:53:02 Tower kernel: ata2.00: failed command: WRITE FPDMA QUEUED Aug 14 07:53:02 Tower kernel: ata2.00: cmd 61/40:b0:e0:aa:bb/05:00:2c:00:00/40 tag 22 ncq dma 688128 out Aug 14 07:53:02 Tower kernel: res 40/00:a0:60:a0:bb/00:00:2c:00:00/40 Emask 0x50 (ATA bus error) Aug 14 07:53:02 Tower kernel: ata2.00: status: { DRDY } Aug 14 07:53:02 Tower kernel: ata2: hard resetting link Aug 14 07:53:12 Tower kernel: ata2: softreset failed (1st FIS failed) Aug 14 07:53:12 Tower kernel: ata2: hard resetting link Aug 14 07:53:14 Tower kernel: ata2: SATA link up 1.5 Gbps (SStatus 113 SControl 310) Aug 14 07:53:14 Tower kernel: ata2.00: configured for UDMA/133 Aug 14 07:53:14 Tower kernel: ata2: EH complete Aug 14 07:54:07 Tower kernel: ata16.00: exception Emask 0x10 SAct 0xf8000 SErr 0x190002 action 0xe frozen Aug 14 07:54:07 Tower kernel: ata16.00: irq_stat 0x80400000, PHY RDY changed Aug 14 07:54:07 Tower kernel: ata16: SError: { RecovComm PHYRdyChg 10B8B Dispar } Aug 14 07:54:07 Tower kernel: ata16.00: failed command: WRITE FPDMA QUEUED Aug 14 07:54:07 Tower kernel: ata16.00: cmd 61/40:78:50:88:91/05:00:2d:00:00/40 tag 15 ncq dma 688128 out Aug 14 07:54:07 Tower kernel: res 40/00:78:50:88:91/00:00:2d:00:00/40 Emask 0x10 (ATA bus error) Aug 14 07:54:07 Tower kernel: ata16.00: status: { DRDY } Aug 14 07:54:07 Tower kernel: ata16.00: failed command: WRITE FPDMA QUEUED Aug 14 07:54:07 Tower kernel: ata16.00: cmd 61/48:80:90:8d:91/01:00:2d:00:00/40 tag 16 ncq dma 167936 out Aug 14 07:54:07 Tower kernel: res 40/00:78:50:88:91/00:00:2d:00:00/40 Emask 0x10 (ATA bus error) Aug 14 07:54:07 Tower kernel: ata16.00: status: { DRDY } Aug 14 07:54:07 Tower kernel: ata16.00: failed command: WRITE FPDMA QUEUED Aug 14 07:54:07 Tower kernel: ata16.00: cmd 61/88:88:d8:8e:91/04:00:2d:00:00/40 tag 17 ncq dma 593920 out Aug 14 07:54:07 Tower kernel: res 40/00:78:50:88:91/00:00:2d:00:00/40 Emask 0x10 (ATA bus error) Aug 14 07:54:07 Tower kernel: ata16.00: status: { DRDY } Aug 14 07:54:07 Tower kernel: ata16.00: failed command: WRITE FPDMA QUEUED Aug 14 07:54:07 Tower kernel: ata16.00: cmd 61/40:90:60:93:91/05:00:2d:00:00/40 tag 18 ncq dma 688128 out Aug 14 07:54:07 Tower kernel: res 40/00:78:50:88:91/00:00:2d:00:00/40 Emask 0x10 (ATA bus error) Aug 14 07:54:07 Tower kernel: ata16.00: status: { DRDY } Aug 14 07:54:07 Tower kernel: ata16.00: failed command: WRITE FPDMA QUEUED Aug 14 07:54:07 Tower kernel: ata16.00: cmd 61/a0:98:a0:98:91/02:00:2d:00:00/40 tag 19 ncq dma 344064 out Aug 14 07:54:07 Tower kernel: res 40/00:78:50:88:91/00:00:2d:00:00/40 Emask 0x10 (ATA bus error) Currently ATA2 is parity, ATA16 is parity2. Quote Link to comment
desertfox_0815 Posted August 18, 2021 Author Share Posted August 18, 2021 Changed Powercables...Problem is gone. I was able to rebuild parity without errors. the "safety" parity check was also clean. Thanks alot 1 Quote Link to comment
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.