xthursdayx Posted April 12, 2016 Share Posted April 12, 2016 I've found that I'm getting this series of repeating errors in my server's log. Any idea what is causing it? I have 4 pending sectors and 85 reallocated events (though 0 reallocated sectors) on my cache drive it seems (see my next post for the errors from my cache drive). Is this the cause of the errors in my SysLog? Do I need to swap out my cache? The number of currently pending sectors doesn't seem to be increasing. Apr 12 02:01:20 Tower kernel: ata9.00: exception Emask 0x0 SAct 0x4 SErr 0x0 action 0x6 frozen Apr 12 02:01:20 Tower kernel: ata9.00: failed command: WRITE FPDMA QUEUED Apr 12 02:01:20 Tower kernel: ata9.00: cmd 61/08:10:00:a9:8c/00:00:0d:00:00/40 tag 2 ncq 4096 out Apr 12 02:01:20 Tower kernel: res 40/00:ff:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout) Apr 12 02:01:20 Tower kernel: ata9.00: status: { DRDY } Apr 12 02:01:20 Tower kernel: ata9: hard resetting link Apr 12 02:01:20 Tower kernel: ata9: SATA link up 3.0 Gbps (SStatus 123 SControl 300) Apr 12 02:01:20 Tower kernel: ata9.00: configured for UDMA/133 Apr 12 02:01:20 Tower kernel: ata9.00: device reported invalid CHS sector 0 Apr 12 02:01:20 Tower kernel: ata9: EH complete Link to comment
xthursdayx Posted April 12, 2016 Author Share Posted April 12, 2016 Also I'm assuming this information from my cache drive SMART info might be related: 196 Reallocated event count 0x0032 100 100 000 Old age Always Never 85 197 Current pending sector 0x0022 100 100 000 Old age Always Never 4 And this is coming up in my cache's SMART error log: ATA Error Count: 119 (device log contains only the most recent five errors) CR = Command Register [HEX] FR = Features Register [HEX] SC = Sector Count Register [HEX] SN = Sector Number Register [HEX] CL = Cylinder Low Register [HEX] CH = Cylinder High Register [HEX] DH = Device/Head Register [HEX] DC = Device Command Register [HEX] ER = Error register [HEX] ST = Status register [HEX] Powered_Up_Time is measured from power on, and printed as DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes, SS=sec, and sss=millisec. It "wraps" after 49.710 days. Error 119 occurred at disk power-on lifetime: 1459 hours (60 days + 19 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 36 9a 02 4a ef Error: UNC 54 sectors at LBA = 0x0f4a029a = 256508570 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- 25 00 00 d0 01 4a e0 08 22d+20:04:30.300 READ DMA EXT ea 00 00 00 00 00 a0 08 22d+20:04:30.300 FLUSH CACHE EXT c8 00 80 50 01 4a ef 08 22d+20:04:30.200 READ DMA ca 00 08 c0 00 00 e0 08 22d+20:04:30.200 WRITE DMA c8 00 20 30 01 4a ef 08 22d+20:04:30.200 READ DMA Error 118 occurred at disk power-on lifetime: 1458 hours (60 days + 18 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 2e 9a 02 4a ef Error: UNC 46 sectors at LBA = 0x0f4a029a = 256508570 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- 25 00 00 c8 00 4a e0 08 22d+18:54:13.200 READ DMA EXT c8 00 80 48 00 4a ef 08 22d+18:54:13.200 READ DMA c8 00 20 28 00 4a ef 08 22d+18:54:13.200 READ DMA 35 00 30 d0 a9 3c e0 08 22d+18:54:13.200 WRITE DMA EXT c8 00 08 20 00 4a ef 08 22d+18:54:13.200 READ DMA Error 117 occurred at disk power-on lifetime: 1445 hours (60 days + 5 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 05 9b 99 a7 e1 Error: UNC 5 sectors at LBA = 0x01a7999b = 27761051 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- 25 00 08 98 99 a7 e0 08 22d+05:34:20.100 READ DMA EXT ef 10 02 00 00 00 a0 08 22d+05:34:20.100 SET FEATURES [Enable SATA feature] 27 00 00 00 00 00 e0 08 22d+05:34:20.100 READ NATIVE MAX ADDRESS EXT [OBS-ACS-3] ec 00 00 00 00 00 a0 08 22d+05:34:20.100 IDENTIFY DEVICE ef 03 42 00 00 00 a0 08 22d+05:34:20.100 SET FEATURES [set transfer mode] Error 116 occurred at disk power-on lifetime: 1445 hours (60 days + 5 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 05 9b 99 a7 e1 Error: UNC 5 sectors at LBA = 0x01a7999b = 27761051 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- 25 00 08 98 99 a7 e0 08 22d+05:34:16.300 READ DMA EXT ef 10 02 00 00 00 a0 08 22d+05:34:16.300 SET FEATURES [Enable SATA feature] 27 00 00 00 00 00 e0 08 22d+05:34:16.300 READ NATIVE MAX ADDRESS EXT [OBS-ACS-3] ec 00 00 00 00 00 a0 08 22d+05:34:16.300 IDENTIFY DEVICE ef 03 42 00 00 00 a0 08 22d+05:34:16.300 SET FEATURES [set transfer mode] Error 115 occurred at disk power-on lifetime: 1445 hours (60 days + 5 hours) When the command that caused the error occurred, the device was active or idle. After command completion occurred, registers were: ER ST SC SN CL CH DH -- -- -- -- -- -- -- 40 51 05 9b 99 a7 e1 Error: UNC 5 sectors at LBA = 0x01a7999b = 27761051 Commands leading to the command that caused the error were: CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name -- -- -- -- -- -- -- -- ---------------- -------------------- 25 00 08 98 99 a7 e0 08 22d+05:34:12.500 READ DMA EXT ef 10 02 00 00 00 a0 08 22d+05:34:12.500 SET FEATURES [Enable SATA feature] 27 00 00 00 00 00 e0 08 22d+05:34:12.500 READ NATIVE MAX ADDRESS EXT [OBS-ACS-3] ec 00 00 00 00 00 a0 08 22d+05:34:12.500 IDENTIFY DEVICE ef 03 42 00 00 00 a0 08 22d+05:34:12.500 SET FEATURES [set transfer mode] Link to comment
JorgeB Posted April 12, 2016 Share Posted April 12, 2016 Probably a bad disk, you can do an extended SMART test to confirm. Link to comment
Recommended Posts
Archived
This topic is now archived and is closed to further replies.