Remove failed drive from array - preserve data on it



Hi! 

One of my drives failed today, and I have attached the diagnostics. I think the culprit is the SMART attribute "UDMA CRC Error Count", although that error does not always seem to be serious. Do you have any thoughts on it?

 

I have more space than I need on the array, so I would like to remove the drive from the array while keeping the data that was on it. The drive is 4TB and I have more than 4TB free on the array. The guide in the Unraid wiki is a bit confusing, so I would be very happy if someone could walk me through how to achieve this.
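As a sanity check on the "more than 4TB free" claim, here is a minimal sketch that compares the data used on the failed disk with the free space on the remaining disks. The function names and the mount points in the comments (`/mnt/disk1` and so on) are assumptions for illustration, not taken from the diagnostics, and the check only looks at the total, not at whether each individual file fits on a single disk.

```python
import shutil


def free_bytes(mount_points):
    """Return the free space in bytes for each mount point.

    The mount points are supplied by the caller; paths such as
    /mnt/disk1 are an assumption about how the array disks are mounted.
    """
    return {path: shutil.disk_usage(path).free for path in mount_points}


def fits_on_remaining(used_on_failed, free_per_disk):
    """True if the total free space covers the data on the failed disk.

    used_on_failed: bytes of data currently stored on the failed disk.
    free_per_disk:  mapping of disk name -> free bytes on that disk.
    """
    return sum(free_per_disk.values()) >= used_on_failed


# Example with made-up numbers: a full 4 TB disk and two other disks.
TB = 10**12
example_free = {"disk1": 3 * TB, "disk2": 2 * TB}
print(fits_on_remaining(4 * TB, example_free))
```

On a live system the second argument could come from something like `free_bytes(["/mnt/disk1", "/mnt/disk2"])`, again assuming those paths exist.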

 

Thank you!

Erik

monsterservern-diagnostics-20210419-1028.zip


Thank you for the suggestion.

Unfortunately, I tried replacing the SATA cable (the drive is connected to an HBA card, so I switched it to another SATA connector from that card), but the drive is still disabled by Unraid. All other drives connected to that HBA card seem to be working normally.

 

The UDMA CRC error count has increased from about 5 to 42 over the last couple of months.

 

/Erik

37 minutes ago, JorgeB said:

Diags are after rebooting so we can't see what happened, but CRC errors are a connection problem, usually a bad SATA cable.

Since replacing the SATA cable did not help, is there anything else in the SMART report that indicates why the drive failed? The drive is a Seagate IronWolf 4TB.

 

#	ATTRIBUTE NAME	FLAG	VALUE	WORST	THRESHOLD	TYPE	UPDATED	FAILED	RAW VALUE
1	Raw read error rate	0x000f	077	064	044	Pre-fail	Always	Never	55465484
3	Spin up time	0x0003	095	093	000	Pre-fail	Always	Never	0
4	Start stop count	0x0032	100	100	020	Old age	Always	Never	132
5	Reallocated sector count	0x0033	100	100	010	Pre-fail	Always	Never	0
7	Seek error rate	0x000f	089	060	045	Pre-fail	Always	Never	803115697
9	Power on hours	0x0032	088	088	000	Old age	Always	Never	11183 (164 86 0)
10	Spin retry count	0x0013	100	100	097	Pre-fail	Always	Never	0
12	Power cycle count	0x0032	100	100	020	Old age	Always	Never	132
184	End-to-end error	0x0032	100	100	099	Old age	Always	Never	0
187	Reported uncorrect	0x0032	100	100	000	Old age	Always	Never	0
188	Command timeout	0x0032	100	099	000	Old age	Always	Never	1
189	High fly writes	0x003a	100	100	000	Old age	Always	Never	0
190	Airflow temperature cel	0x0022	071	062	040	Old age	Always	Never	29 (min/max 29/30)
191	G-sense error rate	0x0032	100	100	000	Old age	Always	Never	0
192	Power-off retract count	0x0032	100	100	000	Old age	Always	Never	3
193	Load cycle count	0x0032	071	071	000	Old age	Always	Never	59695
194	Temperature celsius	0x0022	029	040	000	Old age	Always	Never	29 (0 17 0 0 0)
197	Current pending sector	0x0012	100	100	000	Old age	Always	Never	0
198	Offline uncorrectable	0x0010	100	100	000	Old age	Offline	Never	0
199	UDMA CRC error count	0x003e	200	199	000	Old age	Always	Never	42
240	Head flying hours	0x0000	100	253	000	Old age	Offline	Never	3737 (41 241 0)
241	Total lbas written	0x0000	100	253	000	Old age	Offline	Never	30794778888
242	Total lbas read	0x0000	100	253	000	Old age	Offline	Never	237940858011
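For anyone who wants to scan a table like the one above programmatically, here is a rough sketch that parses the tab-separated rows and lists attributes whose normalized VALUE has dropped to or below THRESHOLD. It assumes the ten-column layout shown in this post; `SmartAttr`, `parse_smart_table` and `at_or_below_threshold` are names made up for this example, not part of any SMART tool.

```python
from typing import NamedTuple


class SmartAttr(NamedTuple):
    attr_id: int
    name: str
    value: int
    worst: int
    threshold: int
    raw: str


def parse_smart_table(text):
    """Parse tab-separated SMART rows; skip the header and malformed lines."""
    attrs = []
    for line in text.splitlines():
        fields = line.strip().split("\t")
        # Expect: id, name, flag, value, worst, threshold, type, updated, failed, raw
        if len(fields) != 10 or not fields[0].isdigit():
            continue
        attrs.append(
            SmartAttr(
                attr_id=int(fields[0]),
                name=fields[1],
                value=int(fields[3]),
                worst=int(fields[4]),
                threshold=int(fields[5]),
                raw=fields[9],
            )
        )
    return attrs


def at_or_below_threshold(attrs):
    """Attributes with a non-zero threshold whose value is at or below it."""
    return [a for a in attrs if a.threshold > 0 and a.value <= a.threshold]


# Two rows copied from the table above, with tabs written out explicitly.
SAMPLE = (
    "#\tATTRIBUTE NAME\tFLAG\tVALUE\tWORST\tTHRESHOLD\tTYPE\tUPDATED\tFAILED\tRAW VALUE\n"
    "5\tReallocated sector count\t0x0033\t100\t100\t010\tPre-fail\tAlways\tNever\t0\n"
    "199\tUDMA CRC error count\t0x003e\t200\t199\t000\tOld age\tAlways\tNever\t42\n"
)

attrs = parse_smart_table(SAMPLE)
print([a.name for a in at_or_below_threshold(attrs)])
```

By my reading, no attribute in the full table above is at or below its threshold, which matches the FAILED column showing "Never" on every row. The CRC count of 42 only shows up in the raw value.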

 

/Erik
