Jump to content

Issues with read/parity check speed.


Recommended Posts

My parity check and read check speed is abysmal. Max 2MB/s but I am able to read from and write to the array at GB speed over the network.

 

I've read through the logs and found a couple of errors but no idea how to interpret them or how to proceed.

 

At one time the parity check speed was fine so somewhere along the line I've added or changed something that broke it.

 

Help from the experts would be appreciated.

 

Specs of my setup:

 

- Norco ds 24 bay disk shelf with 18 disk's in the array. Connected via 6 sff8088 cables to two LSI9201-16e sas cards on an Asus PRIME B450M Ryzen 3 3200g 16GB DDR4

 

tower-diagnostics-20240122-1549.zip

Edited by Vincent77
Specs
Link to comment
Jan 22 21:26:44 Tower kernel: sd 2:0:6:0: Power-on or device reset occurred
### [PREVIOUS LINE REPEATED 1 TIMES] ###
Jan 22 21:26:45 Tower kernel: sd 10:0:0:0: Power-on or device reset occurred
Jan 22 21:26:50 Tower kernel: mpt2sas_cm0: log_info(0x31110d00): originator(PL), code(0x11), sub_code(0x0d00)
### [PREVIOUS LINE REPEATED 1 TIMES] ###
Jan 22 21:26:51 Tower flash_backup: adding task: /usr/local/emhttp/plugins/dynamix.my.servers/scripts/UpdateFlashBackup update
Jan 22 21:26:55 Tower kernel: sd 2:0:6:0: Power-on or device reset occurred
Jan 22 21:26:55 Tower kernel: sd 2:0:3:0: Power-on or device reset occurred
### [PREVIOUS LINE REPEATED 1 TIMES] ###
Jan 22 21:26:56 Tower kernel: sd 2:0:6:0: Power-on or device reset occurred
Jan 22 21:26:56 Tower kernel: sd 10:0:0:0: Power-on or device reset occurred
Jan 22 21:27:00 Tower kernel: mpt2sas_cm0: log_info(0x31110d01): originator(PL), code(0x11), sub_code(0x0d01)
Jan 22 21:27:00 Tower kernel: mpt2sas_cm0: log_info(0x31110d00): originator(PL), code(0x11), sub_code(0x0d00)
Jan 22 21:27:04 Tower kernel: sd 2:0:4:0: Power-on or device reset occurred
### [PREVIOUS LINE REPEATED 1 TIMES] ###
Jan 22 21:27:05 Tower kernel: sd 2:0:6:0: Power-on or device reset occurred
### [PREVIOUS LINE REPEATED 1 TIMES] ###
Jan 22 21:27:10 Tower kernel: mpt2sas_cm1: log_info(0x31110d00): originator(PL), code(0x11), sub_code(0x0d00)
### [PREVIOUS LINE REPEATED 1 TIMES] ###
Jan 22 21:27:10 Tower kernel: mpt2sas_cm0: log_info(0x31110d00): originator(PL), code(0x11), sub_code(0x0d00)
### [PREVIOUS LINE REPEATED 4 TIMES] ###
Jan 22 21:27:15 Tower kernel: sd 2:0:6:0: Power-on or device reset occurred
Jan 22 21:27:16 Tower kernel: sd 10:0:0:0: Power-on or device reset occurred
Jan 22 21:27:16 Tower kernel: sd 2:0:6:0: Power-on or device reset occurred

 

This usually means a power/connection issue, check/replace cables for those devices.

Link to comment
21 minutes ago, Vincent77 said:

I did notice reads and writes are much higher on that disk, and climbing slowly. 

Your appdata, domains, and system shares are on the array, including disk16. Better if those shares have all their files on cache or other fast pool so Docker/VM will perform better and so array disks can spindown since these files are always open.

 

But you don't appear to have cache or any other pools. What is the purpose of your nvme?

Link to comment
31 minutes ago, JorgeB said:

This usually means a power/connection issue, check/replace cables for those devices.

 

I'll grab a new PSU and required connectors tomorrow. I think the current PSU in the disk shelf is a Corsair 550w.

 

19 minutes ago, trurl said:

But you don't appear to have cache or any other pools. What is the purpose of your nvme?

 

The nvme was the cache drive originally but I disabled it at some point a while back fiddling, trying anything to fix this issue. I'll set it back now.

Link to comment
2 minutes ago, Vincent77 said:

The nvme was the cache drive originally but I disabled it at some point a while back fiddling, trying anything to fix this issue. I'll set it back now.

There will be more work to do to get those shares back on cache. We can work on that later. For now, Disable Docker and VM Manager in Settings.

Link to comment

Today I did some testing.  I removed all the disks from the array except for one and did a read check. No problems it ran at 120MB/s.

 

I then added each disk one at a time, doing a read test after adding each one. What I found was I have three disks that when added to the array individually, will cause the read check to crawl.

 

Disks 4, 8 and 24 are causing the problem.

 

I am able to read and write to these disks at full speed, yet they slow the read check down to under 2MB/s.

 

I don't really want to remove these 3 disks from the array as it's 32TB or storage that otherwise performs as they should.

 

Tomorrow I will clear and format one to see if that does anything.

Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

×
×
  • Create New...