Cache Drive Failure


aim60

Recommended Posts

I'm running 6.8.1 with a 3-drive raid1 cache pool.  I've been getting Errors on Cache Pool messages, and many of the following in the logs:

    BTRFS error (device sdf1): error writing primary super block to device 1

    kernel: sd 7:0:0:0: [sdf] tag#15 UNKNOWN(0x2003) Result: hostbyte=0x04 driverbyte=0x00

    kernel: sd 7:0:0:0: [sdf] tag#15 CDB: opcode=0x2a 2a 00 1d 9b af c0 00 00 58 00

 

The device still shows a green ball on the Main tab, but smartctl returns

    Short INQUIRY response, skip product id
    A mandatory SMART command failed: exiting. To continue, add one or more '-T permissive' options.

 

I'm assuming that the drive needs replacement.

 

My only use of cache is VM's and Dockers, and I have disabled both services, as well as temporarily disabling the mover job.

 

I would like to remove the cache pool from the configuration until I can obtain a replacement.

 

I assume that the procedure is to stop the array and unassign all of the cache drives, and start the array.   Then when the new drive arrives, reassign the 2 good drives and the replacement, and start the array.  Please confirm my procedure.

 

I only have remote access to the server for the next few days.

 

Thanks

 


 

tower7-diagnostics-20200208-1324B.zip

Link to comment

Cache1 dropped offline:

Feb  7 10:24:38 Tower7 kernel: ata6: hard resetting link
Feb  7 10:24:43 Tower7 kernel: ata6: link is slow to respond, please be patient (ready=0)
Feb  7 10:24:48 Tower7 kernel: ata6: COMRESET failed (errno=-16)
Feb  7 10:24:48 Tower7 kernel: ata6: hard resetting link
Feb  7 10:24:53 Tower7 kernel: ata6: link is slow to respond, please be patient (ready=0)
Feb  7 10:24:58 Tower7 kernel: ata6: COMRESET failed (errno=-16)
Feb  7 10:24:58 Tower7 kernel: ata6: hard resetting link
Feb  7 10:25:03 Tower7 kernel: ata6: link is slow to respond, please be patient (ready=0)
Feb  7 10:25:33 Tower7 kernel: ata6: COMRESET failed (errno=-16)
Feb  7 10:25:33 Tower7 kernel: ata6: limiting SATA link speed to 3.0 Gbps
Feb  7 10:25:33 Tower7 kernel: ata6: hard resetting link
Feb  7 10:25:38 Tower7 kernel: ata6: COMRESET failed (errno=-16)
Feb  7 10:25:38 Tower7 kernel: ata6: reset failed, giving up
Feb  7 10:25:38 Tower7 kernel: ata6.00: disabled

 

Likely a cable/connection issue, see here for more info:

https://forums.unraid.net/topic/46802-faq-for-unraid-v6/?do=findComment&comment=700582

 

Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.