UhClem

Members
  • Posts

    270

Everything posted by UhClem

  1. The (unique/specific) C246 chipset on your own [3Server] motherboard is flaky. (You should get a direct replacement.) The best documentation (readily available) for the issue is the output of:

     grep -e "ATA-10" -e "AER: Corr" -e "FPDMA QUE" -e "USB disc" syslog.txt

     Use the syslog.txt from the 20210930-0723 .zip file. It has all 4 HDDs throwing errors. The "ATA-10" pattern just documents which HDD is ata[1357].00 . I'm pretty sure there are also relevant NIC errors in there, but I'm networking-ignorant. Note that all of these errors emanate from devices on the C246. Please examine a syslog.txt from your test-run on your Gen10 MS+; that box also uses the C246, but its syslog.txt will have none of these errors. Attached is the output from the above command (filtered thru uniq -c, for brevity).

     c246.txt
  2. Understood. Given the ~20 (non-empty) reviews [mostly Russia/E. Eur], the product is as-advertised and functions properly; only negative appears to be seller's (lack of padded) packaging. I'm sure the community will welcome your report. Mazel tov.
  3. I have some 4TB's that do 200 MB/s (typical 4TBs max ~150-160); typical 8TBs ~200; 12TBs ~240; 16TBs 260+ . [ Kafka wrote: "Better to have and not need, than to need and not have." ] 170+ orders & 60+ reviews (@4.8/5) [for what it's worth ??]
  4. Weird indeed! BUT it is not a cable issue, nor a disk issue. It is a flaky motherboard, specifically the Intel C246 chipset. (your syslog.txt files are gory with details) [ not an Unraid user ; but enjoy weird problems ]
  5. No !!!! NOT that card. Unless you really want/need to restrict yourself to a x1 physical slot; hence limiting your total throughput to ~850 MB/s. If you have a x4 physical slot (which is at least x2 electrical) this one looks like an excellent value: https://www.aliexpress.com/item/4001269633905.html getting full PCIe3 x2 throughput of ~1700 MB/s (at < 30 $USD)
  6. As for the 7GB/s "estimate", note that it assumes that the 9300-8i itself actually has the muscle. It probably does, but it is likely that it will be PCIe limited to slightly less than 7.0 GB/s (ref: your x4 test which got 3477; [i.e. < 7000/2]) Interesting. The under-performance of my script suggests another deficiency of the PMC implementation (vs LSI) : My script uses hdparm -t which does read()'s of 2MB (which the kernel deconstructs to multiple 512KB max to the device/controller). Recall that LSI graphic you included which quantified the Databolt R/W throughput for different Request sizes (the x-axis). There was a slight decrease of Read throughput at larger request sizes (64KB-512KB). I suspect that an analogous graph for the PMC edge-buffering expander would show a more pronounced tail off.
  7. Try the B option. It might help, or not ... Devices (and buses) can act strange when you push their limits. The nvmx script uses a home-brew prog instead of hdparm. Though I haven't used it myself, you can check out fio for doing all kinds of testing of storage. I completely agree with you. I do not completely agree with this. I'll send you a PM.
  8. Certainly ... but as an old-school hardcore hacker, I wonder if it could have been (at least a few %) better. I have to wonder if any very large, and very competent, potential customer (e.g., GOOG, AMZN, MSFT), did a head-to-head comparison between LSI & PMC before placing their 1000+ unit chip order. That lays the whole story out--with good quantitative details. I commend LSI. And extra credit for "underplaying" their hand. Note how they used "jumps from 4100 MB/s to 5200 MB/s" when their own graph plot clearly shows ~5600. (and that is =~ your own 5520) I suspect that the reduction in read speed, but not write speed, is due to the fact that writing can take advantage of "write-behind" (similar to HDD's and OS's), but reading can not do "read-ahead" (whereas HDD's and OS's can). Thanks for the verification.
  9. You're getting there ... 😀 Maybe try a different testing procedure. See the attached script. I use variations of it for SAS/SATA testing. Usage:

     ~/bin [ 1298 ] # ndk a b c d e
     /dev/sda: 225.06 MB/s
     /dev/sdb: 219.35 MB/s
     /dev/sdc: 219.68 MB/s
     /dev/sdd: 194.17 MB/s
     /dev/sde: 402.01 MB/s
     Total = 1260.27 MB/s

     ndk_sh.txt

     Speaking of testing (different script though) ...

     ~/bin [ 1269 ] # nvmx 0 1 2 3 4
     /dev/nvme0n1: 2909.2 MB/sec
     /dev/nvme1n1: 2907.0 MB/sec
     /dev/nvme2n1: 2751.0 MB/sec
     /dev/nvme3n1: 2738.8 MB/sec
     /dev/nvme4n1: 2898.5 MB/sec
     Total = 14204.5 MB/sec
     ~/bin [ 1270 ] # for i in {1..10}; do nvmx 0 1 2 3 4 | grep Total; done
     Total = 14205.8 MB/sec
     Total = 14205.0 MB/sec
     Total = 14207.5 MB/sec
     Total = 14205.8 MB/sec
     Total = 14203.3 MB/sec
     Total = 14210.6 MB/sec
     Total = 14207.0 MB/sec
     Total = 14208.0 MB/sec
     Total = 14203.4 MB/sec
     Total = 14201.9 MB/sec
     ~/bin [ 1271 ] #

     PCIe3 x16 slot [on HP ML30 Gen10, E-2234 CPU] nothing exotic
  10. Excellent evidence! But, to me, very disappointing that the implementations (both LSI & PMC, apparently) of this feature are this sub-optimal. Probably a result of cost/benefit analysis with regard to SATA users (the peasant class--"Let them eat cake."). Also surprising that this hadn't come to light previously. Speaking of the LSI/PMC thing ... Intel's SAS3 expanders (such as the OP's) are documented, by Intel, to use PMC expander chips. How did you verify that your SM backplane actually uses an LSI expander chip (I could not find anything from Supermicro themselves; and I'm not confident relying on a "distributor" website)? Do any of the sg_ utils expose that detail? The reason for my "concern" is that the coincidence of both OP's & your results, with same 9300-8i and same test (Unraid parity check) [your 12*460 =~ OP's 28*200] but different??? expander chip is curious.
  11. Please keep things in context. OP wrote: Since the OP seemed to think that an x16 card was necessary, I replied: And then you conflated the limitations of particular/"typical" PCIe3 SAS/SATA HBAs with the limits of the PCIe3 bus itself. In order to design/configure an optimal storage subsystem, one needs to understand, and differentiate, the limitations of the PCIe bus, from the designs, and shortcomings, of the various HBA (& expander) options. If I had a single PCIe3 x8 slot and 32 (fast enough) SATA HDDs, I could get 210-220 MB/sec on each drive concurrently. For only 28 drives, 240-250. (Of course, you are completely free to doubt me on this ...) And, two months ago, before prices of all things storage got crazy, HBA + expansion would have cost < $100. ===== Specific problems warrant specific solutions. Eschew mediocrity.
  12. In my direct, first-hand, experience, it is 7100+ MB/sec. (I also measured 14,200+ MB/sec on PCIe3 x16). I used a PCIe3 x16 card supporting multiple (NVMe) devices. [In a x8 slot for the first measurement.] [Consider: a decent PCIe3 x4 NVMe SSD can attain 3400-3500 MB/sec.] That table's "Typical" #s are factoring in an excessive amount of transport layer overhead. I'm pretty certain that the spec for SAS3 expanders eliminates the (SAS2) "binding" of link speed to device speed. I.e., Databolt is just Marketing. Well, that's two tests of the 9300, with different dual-link SAS3 expanders and different device mix, that are both capped at ~5600 ... prognosis: muscle deficiency [in the 9300].
  13. It looks to me like you are not limited by PCIe bandwidth. PCIe gen3 @ x8 is good for (real-world) ~7000 MB/sec. If you are getting a little over 200 MB/sec each for 28 drives, that's ~6000 MB/sec. (You are obviously using a Dual-link connection HBA<==>Expander which is good for >> 7000 [9000].) Either your 9300 does not have the muscle to exceed 6000, or you have one (or more) drives that are stragglers, handicapping the (parallel) parity operation. (I'm assuming you are not CPU-limited--I don't use unraid.)
  14. OK. I'd still suggest 24 hrs of MPrime (aka Prime95) Torture-Blend on the one in play here.
  15. JB, do you have ECC memory? (I know it's not a guarantee, but it gets you 90-95% of the way there.)
  16. The whole time, or just post-read verify ?? (I don't use Unraid, but I vaguely recollect the details of Joe's preclear.) No, it will not affect the CPU usage. It does (effectively) eliminate the I/O bottleneck of the on-board (chipset) Sata sub-system. That CPU usage you saw during pre-clear (x2) should not guide any (re-configure) decision you make.
  17. I've looked into "staggered spinup" for a DIY DAS. The key search term you want to research is "disk PUIS" ... Power Up In Standby. It looks a little tricky, but quite doable. (That isn't a solution for me because my drives are connected via a SAS expander.) [I don't use Unraid.]
  18. Before recommending a seller, I'd like to make sure we are seeking the best solution. Based on your system specs (mobo/cpu), you actually have 5 PCIe g3 x8 slots -- OR only 3 slots if one (or two) need to supply x16 lanes (each). If it is the latter case (you need to "free up" an x16 slot), then I'd suggest considering a SAS(/SATA) expander, which can use one of the soon-to-be x0 slots (expanders only use a PCIe slot for power (no signals/lanes) and a place to live). If it's the former case (you want to repurpose an x8 slot now used by an H200), then, yes, probably a 16 (or more) port HBA is the answer. Instead of a 9201-16i though, I'd go for an Adaptec ASR-71605 (or -72405). Less $$, it's faster PCIe Gen3 (vs Gen2 for 9201) and 4500+ MB/s (vs ~3000 MB/s), and it's low-profile (repurposing flexibility in future). Only negative is that its mini-SAS ports are 8643 (vs 8087 on H200/9201), so new breakout cables are needed. I recently bought a SAS expander to play with, and despite going for low cost, was pleasantly surprised. I bought a Lenovo 03X3834 on eBay for $15 shipped from CN/HK. Ordered on 23Nov, and it arrived (NH,USA) on 06Dec. Very well-packaged (anti-static + bubble-wrap + box). Works fine! The seller's eBay store is JiaWen2108; they sell lots of HBAs, expanders, cables.
  19. Thanks! Good points. I think that spec'd endurance (e.g., 600 TBW for 860 EVO 1TB) won't be an issue, for all but extreme use cases. (For a data point, I used an 860 EVO 500GB in a DVR (DirecTV HR24) for the last year. It had ~8000 hours and ~30 TBW when I secure-erased it. Sadly, I didn't think to do any write-performance tests before the erase.) An "extreme use case" might be an array for multi HD security cameras [e.g. 4 feeds @ 10GB/hr each (24/7) =~ 350 TBW/year]. Note, though, that you need a near-server-level NVME to exceed 1 PBW rating (for 1 TB device). As you said, the NVME-for-parity does offer significant performance "head-room", such that its write speed can degrade (as expected w/o trim) with no effect on array write speed. It also allows one to forego turbo mode, eliminating read-contention with the other (N-1) data SSDs [during array writes] ... and a few watts of juice.
  20. A question, please ... [I might be missing something, since I don't use Unraid.] : For an all-SSD array, wouldn't turbo-mode alleviate the parity-dominating aspect (with no detrimental side-effects)? ["good" SSDs (sata), and in decent "trim", will get write speeds very close to their read speeds, no?]
  21. 520-byte format Although this is more common on SAS drives, it is still a possibility on SATA drives. Beyond explicit disclosure by the re-seller, it could be indicated in a Model #. In many cases, but not all, there are tools available to re-format such drives. (It is not a common issue ... so ... just a heads-up)
  22. Yes. And, it is a drag to have to "re-buy" (different) cables. But, if it helps to rationalize/justify going with the 71605, note that you might get some added flexibility and/or future-proofing. The 71605 is a low-profile card (be sure to get the bracket you want/need); and it is PCIe Gen3, so it would likely suffice if you only gave it 4 (g3) lanes (=~ 12x275).
  23. Maybe the answer is "staring you in the face". See (currently) 3 threads below this one, same sub-forum. (LSI is not the only game in town ...)
  24. [ Assuming that: also means using a separate 8088 input connector [else there's a tiny chance that the (single?) 8088-IN is flaky] ] Then, I suspect that you might have a glitchy H810 controller. In either case, if you have an 8088=>4xSata breakout cable, you could "test" the H810 independent of the MD1200.
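
The PCIe arithmetic in posts 11 through 13 (28 drives at ~200 MB/s each against a PCIe3 x8 slot) can be checked with a few lines of Python. This is only a rough sketch: the 8 GT/s lane rate and 128b/130b encoding are the standard PCIe 3.0 figures, the ~7000 MB/s "real-world" x8 number is taken from the posts, and the function names are made up for illustration.

```python
# Rough PCIe 3.0 bandwidth budget for an HBA feeding many drives.
# Assumption: PCIe 3.0 runs at 8 GT/s per lane with 128b/130b encoding.
# Assumption: REAL_WORLD_X8_MB_S is the ~7000 MB/s figure quoted in the posts,
# not something computed or measured here.

GT_PER_S = 8e9            # raw transfers (bits) per second, per lane
ENCODING = 128 / 130      # 128b/130b line coding
REAL_WORLD_X8_MB_S = 7000.0


def pcie3_raw_mb_s(lanes: int) -> float:
    """Post-encoding bandwidth in MB/s (1 MB = 10**6 bytes), before protocol overhead."""
    return lanes * GT_PER_S * ENCODING / 8 / 1e6


def per_drive_mb_s(total_mb_s: float, drives: int) -> float:
    """Even share of a bandwidth budget when all drives stream concurrently."""
    return total_mb_s / drives


def aggregate_mb_s(drives: int, mb_s_each: float) -> float:
    """Total throughput demanded by N drives at a given per-drive rate."""
    return drives * mb_s_each


if __name__ == "__main__":
    print("x8 raw ceiling  :", round(pcie3_raw_mb_s(8), 1), "MB/s")
    print("28 drives @ 200 :", aggregate_mb_s(28, 200.0), "MB/s")
    print("share, 32 drives:", per_drive_mb_s(REAL_WORLD_X8_MB_S, 32), "MB/s")
    print("share, 28 drives:", per_drive_mb_s(REAL_WORLD_X8_MB_S, 28), "MB/s")
```

With the 7000 MB/s budget, the even share falls inside the 210-220 (32 drives) and 240-250 (28 drives) ranges claimed in post 11, and 28 drives at 200 MB/s need less than the x8 budget, which is consistent with the suspicion that the 9300 (not the bus) is the cap.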
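
The SSD endurance estimate in post 19 (4 camera feeds at 10 GB/hr each, 24/7, against a 600 TBW rating) is simple enough to reproduce. A minimal sketch, assuming decimal units (1 TB = 1000 GB) and a 365-day year; the function names are hypothetical, and the inputs are the figures from the post.

```python
# Back-of-envelope SSD write-endurance estimate.
# Assumption: decimal units (1 TB = 1000 GB) and a 365-day year.
# Input figures (4 feeds, 10 GB/hr, 600 TBW, 30 TBW in ~8000 hours) come from post 19.

HOURS_PER_YEAR = 24 * 365


def tb_written_per_year(feeds: int, gb_per_hour_each: float) -> float:
    """Terabytes written per year by N continuous feeds."""
    return feeds * gb_per_hour_each * HOURS_PER_YEAR / 1000.0


def years_to_rated_tbw(rated_tbw: float, tb_per_year: float) -> float:
    """How long a drive takes to reach its rated TBW at a steady write rate."""
    return rated_tbw / tb_per_year


def annualized_tb(tb_written: float, hours: float) -> float:
    """Scale an observed (TB written, hours) pair to a full year."""
    return tb_written / hours * HOURS_PER_YEAR


if __name__ == "__main__":
    cameras = tb_written_per_year(feeds=4, gb_per_hour_each=10.0)
    print("cameras, TB/year      :", round(cameras, 1))
    print("years to 600 TBW      :", round(years_to_rated_tbw(600.0, cameras), 2))
    print("DVR, annualized TB/yr :", round(annualized_tb(30.0, 8000.0), 2))
```

The camera case lands at roughly 350 TB/year, matching the post, while the DVR data point annualizes to about a tenth of that.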