Jump to content
Sign in to follow this  
Falcon

Help ! Problem with drives ! 2 drives

5 posts in this topic Last Reply

Recommended Posts

I have problem with 2 drives at almost the same time.

 

First 1 got several read errors from disk121 a 2 TB disk

 

Disk 12:

Device  Identification                                                                                          Temp  Size    Free        Read        Write    Errors

disk12  WDC_WD20EARS-00MVWB0_WD-WCAZA3191168 (sdm) 1953514552 26°C 2 TB 1.82 TB 238120 3109 11000

 

As you see this disk have a lot off hardware errors

 

It was not so bad because unraid fixed all the errors since it have parity check.

Vendor Specific SMART Attributes with Thresholds:

ID# ATTRIBUTE_NAME          FLAG    VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE

  1 Raw_Read_Error_Rate    0x002f  095  095  051    Pre-fail  Always      -      112283

  3 Spin_Up_Time            0x0027  191  164  021    Pre-fail  Always      -      5441

  4 Start_Stop_Count        0x0032  100  100  000    Old_age  Always      -      927

  5 Reallocated_Sector_Ct  0x0033  160  160  140    Pre-fail  Always      -      773

  7 Seek_Error_Rate        0x002e  200  200  000    Old_age  Always      -      0

  9 Power_On_Hours          0x0032  096  096  000    Old_age  Always      -      3173

10 Spin_Retry_Count        0x0032  100  100  000    Old_age  Always      -      0

11 Calibration_Retry_Count 0x0032  100  253  000    Old_age  Always      -      0

12 Power_Cycle_Count      0x0032  100  100  000    Old_age  Always      -      76

192 Power-Off_Retract_Count 0x0032  200  200  000    Old_age  Always      -      44

193 Load_Cycle_Count        0x0032  196  196  000    Old_age  Always      -      14811

194 Temperature_Celsius    0x0022  124  114  000    Old_age  Always      -      26

196 Reallocated_Event_Count 0x0032  001  001  000    Old_age  Always      -      513

197 Current_Pending_Sector  0x0032  200  198  000    Old_age  Always      -      315

198 Offline_Uncorrectable  0x0030  200  199  000    Old_age  Offline      -      48

199 UDMA_CRC_Error_Count    0x0032  200  200  000    Old_age  Always      -      0

200 Multi_Zone_Error_Rate  0x0008  123  102  000    Old_age  Offline      -      20695

 

Oct 24 20:35:07 Fileserver kernel: md: disk12 read error

Oct 24 20:35:07 Fileserver kernel: handle_stripe read error: 321338184/12, count: 1

 

While I was making a copy to a local drive I got another Error with disk9

 

One 300 GB drive have been disable since unraid had problem to write to the drive:

 

DISK_DSBL_NP /dev/md9 /mnt/disk9 300.08G 132.30G 45% 167.78G

 

Now I have copied everything from disk9 and all I could recover from disk12 to other disks in unraid + to other external drives.

 

My question is: What should I do now to disable both disk9 and disk12 ?

I have taken disk9 out of the raid group and I am running Pre-Clear on it right now.

 

What I think I should do is:

 

1. Insert disk9 into the raid group again to rebuild it. It will maybe not be correct since I have read errors from disk12....

2. When disk9 is rebuild take the disk12 out of the raid group.

3. Recalc the parity disk.

 

What will unraid do when it reach the read errors on disk12 when it rebuilding the disk9 ?

 

Is it possible to remove both drives at once and just rebuild the parity disk ?

 

Running Unraid 5 Beta 9.

 

Thanks to unraid I have NOT lost any data yet !

 

Looking forward to get som answers...

ErrorsDrives.png.747b45f79d79943dc2533b2413e7965d.png

Share this post


Link to post

What makes you sure that you have not lost any data? With 2 drives down the data on each of those drives may be lost. There are recovery procedures that may revive one of the damages disks. Resurrecting one of them will then allow the other to be rebuilt. If there is no data on either of those disks then you can remove them and click "New Config" under Utils.

Share this post


Link to post
What makes you sure that you have not lost any data? With 2 drives down the data on each of those drives may be lost. There are recovery procedures that may revive one of the damages disks. Resurrecting one of them will then allow the other to be rebuilt.

 

Only disk9 is totaly down, disk12 working but is full of read errors. I think I have lost 3 files, but I manage out mostly all files out from the drive with read errors. Maybe some of the errors is on filesystem level.

 

I use Crashplan to all my important data on the unraid server.

 

If there is no data on either of those disks then you can remove them and click "New Config" under Utils

 

I will check that, Thank you.

Share this post


Link to post

Thank you again, now the Parity is rebuilding.

 

Nice feature to have in an situation like this !  ;D

 

 

 

Share this post


Link to post

If disk 12 is giving read errors then the data on the rebuilt drive will be in error at locations corresponding to those read errors.

Share this post


Link to post

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

Sign in to follow this