[SOLVED] Parity Sync - slowing down drastically


Neo_x


Hi team

Any ideas as to why a parity sync would suddenly drop in speed by more than half?

I am trying to complete a parity rebuild to a new 8TB drive (tested with preclear beforehand), and for the first 13 hours or so it was performing admirably (roughly 35MB/s). Since this afternoon, however, it has dropped to 10MB/s.

 

I don't have anything writing to or reading from the array, so it is really strange to see this:

image.thumb.png.14f7ef8bcaf71d53cee062ff243e2735.png

I have disabled anything extra (cachedirs, and now for troubleshooting even the Docker and VM services). No details in the syslog as to a cause. Can anybody maybe give an idea?

 

Not willing to wait 7-plus days for a parity sync to finish at current speeds :(, but I also don't want to give up on the 25% / 24 hours already completed...
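
For reference, the "7 plus days" figure can be reproduced with some rough arithmetic. This is only a minimal sketch: it assumes the speed stays constant, that 8TB means 8×10^12 bytes, and that MB/s means 10^6 bytes per second. The function name `remaining_days` is made up for this example.

```python
def remaining_days(total_bytes, fraction_done, speed_bytes_per_s):
    """Estimate days left for a sync, assuming a constant speed."""
    remaining_bytes = total_bytes * (1.0 - fraction_done)
    return remaining_bytes / speed_bytes_per_s / 86400  # 86400 s per day

TOTAL_BYTES = 8e12  # assumed: 8TB parity drive, decimal terabytes

# 25% done, as stated in the post
print(round(remaining_days(TOTAL_BYTES, 0.25, 10e6), 1))  # at 10MB/s -> 6.9
print(round(remaining_days(TOTAL_BYTES, 0.25, 35e6), 1))  # at 35MB/s -> 2.0
```

So at the original 35MB/s the remaining 75% would take about two days instead of about a week.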

 

TIA

neo_X

 

 

 

 

storage-diagnostics-20180826-2322.zip

Edited by Neo_x
SOLVED
38 minutes ago, Neo_x said:

performing admirably (roughly 35MB/s).

Personally, I'd consider that to be the low end of unacceptable :)  But, with the age of your system, it might be the best you can get...

 

Other than that, your parity drive being reconstructed IS currently overheating.  No clue, though, whether that impacts write speeds or not.

ID# ATTRIBUTE_NAME          FLAGS    VALUE WORST THRESH FAIL RAW_VALUE
  1 Raw_Read_Error_Rate     POSR--   079   064   044    -    75318304
  3 Spin_Up_Time            PO----   089   089   000    -    0
  4 Start_Stop_Count        -O--CK   100   100   020    -    6
  5 Reallocated_Sector_Ct   PO--CK   100   100   010    -    0
  7 Seek_Error_Rate         POSR--   074   060   045    -    24152096
  9 Power_On_Hours          -O--CK   100   100   000    -    97
 10 Spin_Retry_Count        PO--C-   100   100   097    -    0
 12 Power_Cycle_Count       -O--CK   100   100   020    -    6
184 End-to-End_Error        -O--CK   100   100   099    -    0
187 Reported_Uncorrect      -O--CK   100   100   000    -    0
188 Command_Timeout         -O--CK   100   100   000    -    0
189 High_Fly_Writes         -O-RCK   100   100   000    -    0
190 Airflow_Temperature_Cel -O---K   037   035   040    NOW  63 (Min/Max 56/65 #8613)
191 G-Sense_Error_Rate      -O--CK   100   100   000    -    849
192 Power-Off_Retract_Count -O--CK   100   100   000    -    2
193 Load_Cycle_Count        -O--CK   100   100   000    -    32
194 Temperature_Celsius     -O---K   063   065   000    -    63 (0 27 0 0 0)
195 Hardware_ECC_Recovered  -O-RC-   079   005   000    -    75318304
197 Current_Pending_Sector  -O--C-   100   100   000    -    0
198 Offline_Uncorrectable   ----C-   100   100   000    -    0
199 UDMA_CRC_Error_Count    -OSRCK   200   200   000    -    0
240 Head_Flying_Hours       ------   100   253   000    -    91 (8 9 0)
241 Total_LBAs_Written      ------   100   253   000    -    20626008160
242 Total_LBAs_Read         ------   100   253   000    -    46744423067
                            ||||||_ K auto-keep
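
If you want to keep an eye on this without reading the whole table each time, here is a minimal sketch that pulls the temperature rows out of attribute output like the above. It assumes the text uses the same column layout as shown (ID, name, flags, VALUE, WORST, THRESH, FAIL, raw value). The 45°C warning limit and the names `temperatures` and `WARN_C` are illustrative choices for this example, not values from the drive's spec.

```python
TEMP_IDS = {190, 194}  # Airflow_Temperature_Cel, Temperature_Celsius

def temperatures(smart_text):
    """Return {attribute name: current temperature in C} for temperature rows."""
    temps = {}
    for line in smart_text.splitlines():
        fields = line.split()
        # Expected columns: ID NAME FLAGS VALUE WORST THRESH FAIL RAW...
        if len(fields) < 8 or not fields[0].isdigit():
            continue  # skips the header, legend and blank lines
        if int(fields[0]) in TEMP_IDS:
            temps[fields[1]] = int(fields[7])  # first token of the raw value
    return temps

# Two rows copied from the table above
SAMPLE = """\
ID# ATTRIBUTE_NAME          FLAGS    VALUE WORST THRESH FAIL RAW_VALUE
190 Airflow_Temperature_Cel -O---K   037   035   040    NOW  63 (Min/Max 56/65 #8613)
194 Temperature_Celsius     -O---K   063   065   000    -    63 (0 27 0 0 0)
"""

WARN_C = 45  # hypothetical warning limit, adjust to taste

for name, temp in temperatures(SAMPLE).items():
    status = "HOT" if temp >= WARN_C else "ok"
    print(f"{name}: {temp} C [{status}]")
```

Both rows report 63°C here, so both get flagged.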

 

Edited by Squid

65°C would be a warranty-breaking temperature for most HDDs.

You need to fix this issue immediately. Hard disks don't really produce much heat, but even 10W of heat without moving air will result in very significant temperature increases. Your fans must move air over the drives, and preferably cool air sucked into the case, not air that has already been heated by the CPU, PSU, graphics card etc.


Thx guys. I will put a temporary measure in place to see if I can drop the temperature.
Currently air is being sucked over the drives, pushed through the CPU and then out, so that shouldn't be an issue. It might be that the fans are a bit low on CFM, but I will try to manage/upgrade with a controller.

2 hours ago, Neo_x said:

Thx guys. I will put a temporary measure in place to see if I can drop the temperature.
Currently air is being sucked over the drives, pushed through the CPU and then out, so that shouldn't be an issue. It might be that the fans are a bit low on CFM, but I will try to manage/upgrade with a controller.

 

Even slow fans would normally manage better. Do all the drives get that hot, or do you have some drives that have their air circulation blocked or that aren't in the direct path of the moving air?

On 8/27/2018 at 10:43 PM, pwm said:

 

Even slow fans would normally manage better. Do all the drives get that hot, or do you have some drives that have their air circulation blocked or that aren't in the direct path of the moving air?

Must admit, stubborn me did not want to believe it, but it seems temperature was indeed the cause. The stubborn, errr, I mean loyal old server just wanted me to remove it from the server cabinet and put it right next to my bed before it would complete a parity sync (combined with a massive 75 cm fan for the warmer times of day).


 

I believe there are other possible issues afoot, possibly some of the controllers, as I had 4 drives drop out of the array yesterday during fan troubleshooting, which required me to run xfs_repair before I could mount them again. Will monitor it closely.

 

I am getting a fan controller in the next week to see if I can manage the temperature a bit better as well.

 

 

Thx for the help, team!

 
