Can you force sectors to remap on a parity disk?


Recommended Posts

This weekend was not a good weekend for my unRAID.  My Hitachi parity drive started throwing read errors, 215 to be exactly.  SMART still lists the health status as good, but looking at the SMART data, the Reallocated Sectors Count is 7 and the Current Pending Sector Count is 7.  Uncorrectable Sector Count is 0.

 

I did a parity verify and aside from a bunch of read errors, it passed.  All errors point to the same LBA, every time I check it.  SMART reports 30 errors in the history, but the past five are all identical, all at the same LBA (before and after a parity verify).

 

Is there a way to get rid of these read errors?  The pending sectors will not remap until written to, and who knows how long that might be.  But I'll continue getting read errors until it does.

 

I ran the Hitachi DFT tool and according to it, the drive is good and I can't RMA it until the DFT tool reports an error code, so I'm forced to use it for something.  7 reallocated sectors isn't too bad I guess, but I still don't like it.  Fortunately I have two pre-cleared Hitachi drives in backup, so I just popped one in and rebuilt parity and I'm all green now.

 

But what to do with this drive?  Can I still trust it enough to put it back in my pool of spare parity drives?  Is it worth it to use it as my next data drive?  I guess pre-clearing it should force it to remap the pending sectors (which will take reallocated sectors up to 14)?  Since I can't RMA it yet, I feel like I should be using it until it is failed enough to get the RMA.  The drive is only 5 months old, so it's a waste if I can't use it or get it RMA'd.

 

What's everyone's thoughts?

Link to comment

As you said, running the preclear script on it should re-allocate the sectors.  If it is just the few sectors you'll know soon enough.

 

Make sure you use the most recent preclear script.  (It is version 1.4 as if this post)

check the version with

preclear_disk.sh -v

 

Joe L.

Link to comment

They should remap themselves during a parity check. Will be interested to see if preclearing remap sectors that the parity check does not.

No, a parity check is a read operation.  There is no why for unRAID to know what to save in the sector until it is written.

 

You could set a new disk configuration and force a write of parity with what should be an identical copy of itself.

 

Joe L.

Link to comment

They should remap themselves during a parity check. Will be interested to see if preclearing remap sectors that the parity check does not.

No, a parity check is a read operation.  There is no why for unRAID to know what to save in the sector until it is written.

 

You could set a new disk configuration and force a write of parity with what should be an identical copy of itself.

 

Joe L.

 

There are 2 situations - either there is a read error on the parity disk, or there is not a read error on the parity disk.

 

1.  If there is a read error, unRAID will reconstruct the parity sector and rewrite it to the disk.  This will force a remap.

 

2.  If there is NOT a read error, yet the smart system detects a failing sector, it will invisibly remap the sector.

 

This is my understanding of how the smart system is supposed to operate.

 

Either way, the parity check should eventually remap a bad or marginal sector.

Link to comment

There are 2 situations - either there is a read error on the parity disk, or there is not a read error on the parity disk.

 

1.  If there is a read error, unRAID will reconstruct the parity sector and rewrite it to the disk.  This will force a remap.

 

2.  If there is NOT a read error, yet the smart system detects a failing sector, it will invisibly remap the sector.

 

This is my understanding of how the smart system is supposed to operate.

 

Either way, the parity check should eventually remap a bad or marginal sector.

I've seen "smartctl" reports of sectors pending re-allocation on both the parity and data drives. 

 

My suspicion is that those fall into your "#2" category of being detected by the SMART firmware on the disk.  I also suspect they may have been detected during the normal "read-ahead" of sectors on a disk, so no error is returned to the OS, since the sector with the error was not being requested.

 

Joe L.

Link to comment

Well, running the Advanced Test on the Hitachi DFT tool did make the sectors remap.  I guess it must read and write as part of whatever test it does.  So the drive now stands at 14 reallocated sectors, 0 pending and the DFT still still says the drive is green and not eligible for an RMA.

 

So my dilemma is whether to shelve it and use it as a spare parity drive, rebuild parity on it and remove my current spare with 0 bad sectors (hate to rebuild since I just did it two days ago), or insert it as a data disk back into my array, use it and see if it continues to develop bad sectors.

 

My warranty is good until 2013, so I guess I have some time to get some use out of it.

 

I'm currently pre-clearing it, so we will see if that results in anymore reallocated sectors.  The first time I precleared it (when it was new), it took 30 hours.  I started last night at 10 pm and this morning at 9 am, it was 60% through Part 2, showing a 120 MB/s write speed.  So it seems to be on track.

 

I wish I knew how many reallocated sectors it would need to see for Hitachi to consider it failed.

Link to comment

I wish I knew how many reallocated sectors it would need to see for Hitachi to consider it failed.

Most large disks have a reserve pool of several thousand sectors.  I don't know of any manufacturer that tells their threshold.

 

Just keep an eye on it.

 

Joe L.

 

Okay, there may be a problem with the drive pre-clearing. 

 

The drive went through Step 1 to Step 10 and each step says done.  However, doing a post read, it appears to have hung at 12%.  It hasn't moved from 12% in hours and my syslog just keeps repeating the following:

 

Feb  8 13:08:49 Tower kernel: Descriptor sense data with sense descriptors (in hex):

Feb  8 13:08:49 Tower kernel:        72 0b 00 00 00 00 00 0c 00 0a 80 00 00 00 00 00

Feb  8 13:08:49 Tower kernel:        00 00 00 27

Feb  8 13:08:49 Tower kernel: sd 0:0:0:0: [sdb] ASC=0x0 ASCQ=0x0 (Drive related)

Feb  8 13:08:49 Tower kernel: sd 0:0:0:0: [sdb] CDB: cdb[0]=0x28: 28 00 ab 1f 2b 28 00 00 08 00 (Drive related)

Feb  8 13:08:49 Tower kernel: end_request: I/O error, dev sdb, sector 2870946600 (Errors)

Feb  8 13:08:49 Tower kernel: ata1: translated ATA stat/err 0x51/04 to SCSI SK/ASC/ASCQ 0xb/00/00 (Drive related)

Feb  8 13:08:49 Tower kernel: ata1: status=0x51 { DriveReady SeekComplete Error } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: error=0x04 { DriveStatusError } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: translated ATA stat/err 0x51/04 to SCSI SK/ASC/ASCQ 0xb/00/00 (Drive related)

Feb  8 13:08:49 Tower kernel: ata1: status=0x51 { DriveReady SeekComplete Error } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: error=0x04 { DriveStatusError } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: translated ATA stat/err 0x51/04 to SCSI SK/ASC/ASCQ 0xb/00/00 (Drive related)

Feb  8 13:08:49 Tower kernel: ata1: status=0x51 { DriveReady SeekComplete Error } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: error=0x04 { DriveStatusError } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: translated ATA stat/err 0x51/04 to SCSI SK/ASC/ASCQ 0xb/00/00 (Drive related)

Feb  8 13:08:49 Tower kernel: ata1: status=0x51 { DriveReady SeekComplete Error } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: error=0x04 { DriveStatusError } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: translated ATA stat/err 0x51/04 to SCSI SK/ASC/ASCQ 0xb/00/00 (Drive related)

Feb  8 13:08:49 Tower kernel: ata1: status=0x51 { DriveReady SeekComplete Error } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: error=0x04 { DriveStatusError } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: translated ATA stat/err 0x51/04 to SCSI SK/ASC/ASCQ 0xb/00/00 (Drive related)

Feb  8 13:08:49 Tower kernel: ata1: status=0x51 { DriveReady SeekComplete Error } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: error=0x04 { DriveStatusError } (Errors)

Feb  8 13:08:49 Tower kernel: sd 0:0:0:0: [sdb] Result: hostbyte=0x00 driverbyte=0x08 (System)

Feb  8 13:08:49 Tower kernel: sd 0:0:0:0: [sdb] Sense Key : 0xb [current] [descriptor] (Drive related)

Feb  8 13:08:49 Tower kernel: Descriptor sense data with sense descriptors (in hex):

Feb  8 13:08:49 Tower kernel:        72 0b 00 00 00 00 00 0c 00 0a 80 00 00 00 00 00

Feb  8 13:08:49 Tower kernel:        00 00 00 67

Feb  8 13:08:49 Tower kernel: sd 0:0:0:0: [sdb] ASC=0x0 ASCQ=0x0 (Drive related)

Feb  8 13:08:49 Tower kernel: sd 0:0:0:0: [sdb] CDB: cdb[0]=0x28: 28 00 ab 1f 32 68 00 00 08 00 (Drive related)

Feb  8 13:08:49 Tower kernel: end_request: I/O error, dev sdb, sector 2870948456 (Errors)

Feb  8 13:08:49 Tower kernel: ata1: translated ATA stat/err 0x51/04 to SCSI SK/ASC/ASCQ 0xb/00/00 (Drive related)

Feb  8 13:08:49 Tower kernel: ata1: status=0x51 { DriveReady SeekComplete Error } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: error=0x04 { DriveStatusError } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: translated ATA stat/err 0x51/04 to SCSI SK/ASC/ASCQ 0xb/00/00 (Drive related)

Feb  8 13:08:49 Tower kernel: ata1: status=0x51 { DriveReady SeekComplete Error } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: error=0x04 { DriveStatusError } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: translated ATA stat/err 0x51/04 to SCSI SK/ASC/ASCQ 0xb/00/00 (Drive related)

Feb  8 13:08:49 Tower kernel: ata1: status=0x51 { DriveReady SeekComplete Error } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: error=0x04 { DriveStatusError } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: translated ATA stat/err 0x51/04 to SCSI SK/ASC/ASCQ 0xb/00/00 (Drive related)

Feb  8 13:08:49 Tower kernel: ata1: status=0x51 { DriveReady SeekComplete Error } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: error=0x04 { DriveStatusError } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: translated ATA stat/err 0x51/04 to SCSI SK/ASC/ASCQ 0xb/00/00 (Drive related)

Feb  8 13:08:49 Tower kernel: ata1: status=0x51 { DriveReady SeekComplete Error } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: error=0x04 { DriveStatusError } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: translated ATA stat/err 0x51/04 to SCSI SK/ASC/ASCQ 0xb/00/00 (Drive related)

Feb  8 13:08:49 Tower kernel: ata1: status=0x51 { DriveReady SeekComplete Error } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: error=0x04 { DriveStatusError } (Errors)

Feb  8 13:08:49 Tower kernel: sd 0:0:0:0: [sdb] Result: hostbyte=0x00 driverbyte=0x08 (System)

Feb  8 13:08:49 Tower kernel: sd 0:0:0:0: [sdb] Sense Key : 0xb [current] [descriptor] (Drive related)

Feb  8 13:08:49 Tower kernel: Descriptor sense data with sense descriptors (in hex):

Feb  8 13:08:49 Tower kernel:        72 0b 00 00 00 00 00 0c 00 0a 80 00 00 00 00 00

Feb  8 13:08:49 Tower kernel:        00 00 00 af

 

 

Actually, here are the log entries preceeding the first error:

 

Feb  8 10:40:14 Tower dhcpcd[1942]: sending DHCP_REQUEST for 192.168.2.183 to 192.168.2.1

Feb  8 10:40:14 Tower dhcpcd[1942]: dhcpIPaddrLeaseTime=86400 in DHCP server response.

Feb  8 10:40:14 Tower dhcpcd[1942]: dhcpT1value is missing in DHCP server response. Assuming 43200 sec

Feb  8 10:40:14 Tower dhcpcd[1942]: dhcpT2value is missing in DHCP server response. Assuming 75600 sec

Feb  8 10:40:14 Tower dhcpcd[1942]: DHCP_ACK received from  (192.168.2.1)

Feb  8 11:43:34 Tower kernel:  sdb: sdb1

Feb  8 11:43:44 Tower kernel: udev: starting version 141

Feb  8 12:55:45 Tower kernel: sas: command 0xc4242840, task 0xf0bc1400, timed out: BLK_EH_NOT_HANDLED

Feb  8 12:55:45 Tower kernel: sas: Enter sas_scsi_recover_host

Feb  8 12:55:45 Tower kernel: sas: trying to find task 0xf0bc1400

Feb  8 12:55:45 Tower kernel: sas: sas_scsi_find_task: aborting task 0xf0bc1400

Feb  8 12:55:45 Tower kernel: /usr/src/sas/trunk/mvsas_tgt/mv_sas.c 1701:mvs_abort_task:rc= 5

Feb  8 12:55:45 Tower kernel: sas: sas_scsi_find_task: querying task 0xf0bc1400

Feb  8 12:55:45 Tower kernel: /usr/src/sas/trunk/mvsas_tgt/mv_sas.c 1645:mvs_query_task:rc= 5

Feb  8 12:55:45 Tower kernel: sas: sas_scsi_find_task: task 0xf0bc1400 failed to abort

Feb  8 12:55:45 Tower kernel: sas: task 0xf0bc1400 is not at LU: I_T recover

Feb  8 12:55:45 Tower kernel: sas: I_T nexus reset for dev 0100000000000000

Feb  8 12:55:45 Tower kernel: sas: I_T 0100000000000000 recovered

Feb  8 12:55:45 Tower kernel: sas: --- Exit sas_scsi_recover_host

Feb  8 12:55:45 Tower kernel: ata1: translated ATA stat/err 0x51/04 to SCSI SK/ASC/ASCQ 0xb/00/00

Feb  8 12:55:45 Tower kernel: ata1: status=0x51 { DriveReady SeekComplete Error }

Feb  8 12:55:45 Tower kernel: ata1: error=0x04 { DriveStatusError }

Feb  8 12:55:45 Tower kernel: ata1: translated ATA stat/err 0x51/04 to SCSI SK/ASC/ASCQ 0xb/00/00

Feb  8 12:55:45 Tower kernel: ata1: status=0x51 { DriveReady SeekComplete Error }

Feb  8 12:55:45 Tower kernel: ata1: error=0x04 { DriveStatusError }

Feb  8 12:55:45 Tower kernel: ata1: translated ATA stat/err 0x51/04 to SCSI SK/ASC/ASCQ 0xb/00/00

Feb  8 12:55:45 Tower kernel: ata1: status=0x51 { DriveReady SeekComplete Error }

Feb  8 12:55:45 Tower kernel: ata1: error=0x04 { DriveStatusError }

Feb  8 12:55:45 Tower kernel: ata1: translated ATA stat/err 0x51/04 to SCSI SK/ASC/ASCQ 0xb/00/00

Feb  8 12:55:45 Tower kernel: ata1: status=0x51 { DriveReady SeekComplete Error }

Feb  8 12:55:45 Tower kernel: ata1: error=0x04 { DriveStatusError }

Feb  8 12:55:45 Tower kernel: ata1: translated ATA stat/err 0x51/04 to SCSI SK/ASC/ASCQ 0xb/00/00

Feb  8 12:55:45 Tower kernel: ata1: status=0x51 { DriveReady SeekComplete Error }

Feb  8 12:55:45 Tower kernel: ata1: error=0x04 { DriveStatusError }

Feb  8 12:55:45 Tower kernel: sd 0:0:0:0: [sdb] Result: hostbyte=0x00 driverbyte=0x08

Feb  8 12:55:45 Tower kernel: sd 0:0:0:0: [sdb] Sense Key : 0xb [current] [descriptor]

 

I would post the entire log, but it is over 2GB.

Link to comment

I wish I knew how many reallocated sectors it would need to see for Hitachi to consider it failed.

Most large disks have a reserve pool of several thousand sectors.  I don't know of any manufacturer that tells their threshold.

 

Just keep an eye on it.

 

Joe L.

 

Okay, there may be a problem with the drive pre-clearing. 

 

The drive went through Step 1 to Step 10 and each step says done.  However, doing a post read, it appears to have hung at 12%.  It hasn't moved from 12% in hours and my syslog just keeps repeating the following:

 

Feb  8 13:08:49 Tower kernel: Descriptor sense data with sense descriptors (in hex):

Feb  8 13:08:49 Tower kernel:         72 0b 00 00 00 00 00 0c 00 0a 80 00 00 00 00 00

Feb  8 13:08:49 Tower kernel:         00 00 00 27

Feb  8 13:08:49 Tower kernel: sd 0:0:0:0: [sdb] ASC=0x0 ASCQ=0x0 (Drive related)

Feb  8 13:08:49 Tower kernel: sd 0:0:0:0: [sdb] CDB: cdb[0]=0x28: 28 00 ab 1f 2b 28 00 00 08 00 (Drive related)

Feb  8 13:08:49 Tower kernel: end_request: I/O error, dev sdb, sector 2870946600 (Errors)

Feb  8 13:08:49 Tower kernel: ata1: translated ATA stat/err 0x51/04 to SCSI SK/ASC/ASCQ 0xb/00/00 (Drive related)

Feb  8 13:08:49 Tower kernel: ata1: status=0x51 { DriveReady SeekComplete Error } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: error=0x04 { DriveStatusError } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: translated ATA stat/err 0x51/04 to SCSI SK/ASC/ASCQ 0xb/00/00 (Drive related)

Feb  8 13:08:49 Tower kernel: ata1: status=0x51 { DriveReady SeekComplete Error } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: error=0x04 { DriveStatusError } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: translated ATA stat/err 0x51/04 to SCSI SK/ASC/ASCQ 0xb/00/00 (Drive related)

Feb  8 13:08:49 Tower kernel: ata1: status=0x51 { DriveReady SeekComplete Error } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: error=0x04 { DriveStatusError } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: translated ATA stat/err 0x51/04 to SCSI SK/ASC/ASCQ 0xb/00/00 (Drive related)

Feb  8 13:08:49 Tower kernel: ata1: status=0x51 { DriveReady SeekComplete Error } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: error=0x04 { DriveStatusError } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: translated ATA stat/err 0x51/04 to SCSI SK/ASC/ASCQ 0xb/00/00 (Drive related)

Feb  8 13:08:49 Tower kernel: ata1: status=0x51 { DriveReady SeekComplete Error } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: error=0x04 { DriveStatusError } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: translated ATA stat/err 0x51/04 to SCSI SK/ASC/ASCQ 0xb/00/00 (Drive related)

Feb  8 13:08:49 Tower kernel: ata1: status=0x51 { DriveReady SeekComplete Error } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: error=0x04 { DriveStatusError } (Errors)

Feb  8 13:08:49 Tower kernel: sd 0:0:0:0: [sdb] Result: hostbyte=0x00 driverbyte=0x08 (System)

Feb  8 13:08:49 Tower kernel: sd 0:0:0:0: [sdb] Sense Key : 0xb [current] [descriptor] (Drive related)

Feb  8 13:08:49 Tower kernel: Descriptor sense data with sense descriptors (in hex):

Feb  8 13:08:49 Tower kernel:         72 0b 00 00 00 00 00 0c 00 0a 80 00 00 00 00 00

Feb  8 13:08:49 Tower kernel:         00 00 00 67

Feb  8 13:08:49 Tower kernel: sd 0:0:0:0: [sdb] ASC=0x0 ASCQ=0x0 (Drive related)

Feb  8 13:08:49 Tower kernel: sd 0:0:0:0: [sdb] CDB: cdb[0]=0x28: 28 00 ab 1f 32 68 00 00 08 00 (Drive related)

Feb  8 13:08:49 Tower kernel: end_request: I/O error, dev sdb, sector 2870948456 (Errors)

Feb  8 13:08:49 Tower kernel: ata1: translated ATA stat/err 0x51/04 to SCSI SK/ASC/ASCQ 0xb/00/00 (Drive related)

Feb  8 13:08:49 Tower kernel: ata1: status=0x51 { DriveReady SeekComplete Error } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: error=0x04 { DriveStatusError } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: translated ATA stat/err 0x51/04 to SCSI SK/ASC/ASCQ 0xb/00/00 (Drive related)

Feb  8 13:08:49 Tower kernel: ata1: status=0x51 { DriveReady SeekComplete Error } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: error=0x04 { DriveStatusError } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: translated ATA stat/err 0x51/04 to SCSI SK/ASC/ASCQ 0xb/00/00 (Drive related)

Feb  8 13:08:49 Tower kernel: ata1: status=0x51 { DriveReady SeekComplete Error } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: error=0x04 { DriveStatusError } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: translated ATA stat/err 0x51/04 to SCSI SK/ASC/ASCQ 0xb/00/00 (Drive related)

Feb  8 13:08:49 Tower kernel: ata1: status=0x51 { DriveReady SeekComplete Error } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: error=0x04 { DriveStatusError } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: translated ATA stat/err 0x51/04 to SCSI SK/ASC/ASCQ 0xb/00/00 (Drive related)

Feb  8 13:08:49 Tower kernel: ata1: status=0x51 { DriveReady SeekComplete Error } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: error=0x04 { DriveStatusError } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: translated ATA stat/err 0x51/04 to SCSI SK/ASC/ASCQ 0xb/00/00 (Drive related)

Feb  8 13:08:49 Tower kernel: ata1: status=0x51 { DriveReady SeekComplete Error } (Errors)

Feb  8 13:08:49 Tower kernel: ata1: error=0x04 { DriveStatusError } (Errors)

Feb  8 13:08:49 Tower kernel: sd 0:0:0:0: [sdb] Result: hostbyte=0x00 driverbyte=0x08 (System)

Feb  8 13:08:49 Tower kernel: sd 0:0:0:0: [sdb] Sense Key : 0xb [current] [descriptor] (Drive related)

Feb  8 13:08:49 Tower kernel: Descriptor sense data with sense descriptors (in hex):

Feb  8 13:08:49 Tower kernel:         72 0b 00 00 00 00 00 0c 00 0a 80 00 00 00 00 00

Feb  8 13:08:49 Tower kernel:         00 00 00 af

 

 

Actually, here are the log entries preceeding the first error:

 

Feb  8 10:40:14 Tower dhcpcd[1942]: sending DHCP_REQUEST for 192.168.2.183 to 192.168.2.1

Feb  8 10:40:14 Tower dhcpcd[1942]: dhcpIPaddrLeaseTime=86400 in DHCP server response.

Feb  8 10:40:14 Tower dhcpcd[1942]: dhcpT1value is missing in DHCP server response. Assuming 43200 sec

Feb  8 10:40:14 Tower dhcpcd[1942]: dhcpT2value is missing in DHCP server response. Assuming 75600 sec

Feb  8 10:40:14 Tower dhcpcd[1942]: DHCP_ACK received from  (192.168.2.1)

Feb  8 11:43:34 Tower kernel:  sdb: sdb1

Feb  8 11:43:44 Tower kernel: udev: starting version 141

Feb  8 12:55:45 Tower kernel: sas: command 0xc4242840, task 0xf0bc1400, timed out: BLK_EH_NOT_HANDLED

Feb  8 12:55:45 Tower kernel: sas: Enter sas_scsi_recover_host

Feb  8 12:55:45 Tower kernel: sas: trying to find task 0xf0bc1400

Feb  8 12:55:45 Tower kernel: sas: sas_scsi_find_task: aborting task 0xf0bc1400

Feb  8 12:55:45 Tower kernel: /usr/src/sas/trunk/mvsas_tgt/mv_sas.c 1701:mvs_abort_task:rc= 5

Feb  8 12:55:45 Tower kernel: sas: sas_scsi_find_task: querying task 0xf0bc1400

Feb  8 12:55:45 Tower kernel: /usr/src/sas/trunk/mvsas_tgt/mv_sas.c 1645:mvs_query_task:rc= 5

Feb  8 12:55:45 Tower kernel: sas: sas_scsi_find_task: task 0xf0bc1400 failed to abort

Feb  8 12:55:45 Tower kernel: sas: task 0xf0bc1400 is not at LU: I_T recover

Feb  8 12:55:45 Tower kernel: sas: I_T nexus reset for dev 0100000000000000

Feb  8 12:55:45 Tower kernel: sas: I_T 0100000000000000 recovered

Feb  8 12:55:45 Tower kernel: sas: --- Exit sas_scsi_recover_host

Feb  8 12:55:45 Tower kernel: ata1: translated ATA stat/err 0x51/04 to SCSI SK/ASC/ASCQ 0xb/00/00

Feb  8 12:55:45 Tower kernel: ata1: status=0x51 { DriveReady SeekComplete Error }

Feb  8 12:55:45 Tower kernel: ata1: error=0x04 { DriveStatusError }

Feb  8 12:55:45 Tower kernel: ata1: translated ATA stat/err 0x51/04 to SCSI SK/ASC/ASCQ 0xb/00/00

Feb  8 12:55:45 Tower kernel: ata1: status=0x51 { DriveReady SeekComplete Error }

Feb  8 12:55:45 Tower kernel: ata1: error=0x04 { DriveStatusError }

Feb  8 12:55:45 Tower kernel: ata1: translated ATA stat/err 0x51/04 to SCSI SK/ASC/ASCQ 0xb/00/00

Feb  8 12:55:45 Tower kernel: ata1: status=0x51 { DriveReady SeekComplete Error }

Feb  8 12:55:45 Tower kernel: ata1: error=0x04 { DriveStatusError }

Feb  8 12:55:45 Tower kernel: ata1: translated ATA stat/err 0x51/04 to SCSI SK/ASC/ASCQ 0xb/00/00

Feb  8 12:55:45 Tower kernel: ata1: status=0x51 { DriveReady SeekComplete Error }

Feb  8 12:55:45 Tower kernel: ata1: error=0x04 { DriveStatusError }

Feb  8 12:55:45 Tower kernel: ata1: translated ATA stat/err 0x51/04 to SCSI SK/ASC/ASCQ 0xb/00/00

Feb  8 12:55:45 Tower kernel: ata1: status=0x51 { DriveReady SeekComplete Error }

Feb  8 12:55:45 Tower kernel: ata1: error=0x04 { DriveStatusError }

Feb  8 12:55:45 Tower kernel: sd 0:0:0:0: [sdb] Result: hostbyte=0x00 driverbyte=0x08

Feb  8 12:55:45 Tower kernel: sd 0:0:0:0: [sdb] Sense Key : 0xb [current] [descriptor]

 

I would post the entire log, but it is over 2GB.

I think it is safe to say the drive is having problems.

 

You'll want to terminate the pre-clear before you run out of memory for the log file.

(and no, no need to post the entire 2Gig of the syslog)

Joe L.

Link to comment

The stupid drive still passes the Hitachi DFT.  It looks like during the pre-clear it went from 14-16 reallocated sectors and it currently has 1 pending.  So those added bad sectors must be what hung the pre-clear.

 

Looks like this drive is going to be a slow burn failure, which sucks for the warranty.  I guess I need to e-mail them to see if it can be returned anyway.

Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.