Jump to content

gumby327

Members
  • Posts

    67
  • Joined

  • Last visited

Posts posted by gumby327

  1. MB is:  Micro-Star International Co., Ltd. MAG B550M MORTAR WIFI (MS-7C94), Version 1.0
    American Megatrends International, LLC., Version 1.94
    BIOS dated: Thu 23 Sep 2021 12:00:00 AM CDT

     

    Processor is AMD Ryzen 7 5700G with Radeon Graphics @ 3800 MHz

     

    32 gig filling all 4 slots.

     

    I have a HBA LSI Broadcom SAS 9300-8i 8-port 12Gb/s SATA+SAS PCI-Express 3.0 Low Profile Host.

     

    Issue #1:  I think I got further than most.  My VM does work to a point.  It faces sudden death due to the LED (Micro Star International MYSTIC LIGHT) sub system.  For some reason it works about 15 minutes on my NVIDIA RTX 2060 then poof it is gone.  That is collateral damage for not reading blogs before purchase, but it is a NAS anyway so who cares.  I did try to use it for my older iTunes ripping, but it is not stable enough for that.  I would show the error, but I cannot even keep it running long enough anymore to get the error.  Things went south one day when I decided to pass that hardware into the VM.

     

    Issue #2:  My primary problem is stability in the HDD's in my HBA SAS SATA.  It suffers the equivalent of rolling brownouts with CRC errors.  I thought it was old drives.  I replaced all 8 of them over several months.  Now it is failing on brand new ones.  To fix it, I take the drive out, place it in a Windows box, reformat, then put it back into the Unraid server.  It does Parity and all is good again.  So, I ordered better cables and also a new M.2 NVMe HBA SAS card.  I will try skipping the SATA board connections completely and see if it improves it.  This is my last mod, if this fails then the entire project is a fail.  I have another server running on Intel and it has never had any problems.  I think that is what I have learned here is AMD and unRaid is a no go.

    gumby-diagnostics-20220226-1140.zip

  2. Hi, yesterday I got an alert off of my unRAID servers during an rsync to my backup server.  It was the last of my really old hard drives.  Back story on that was I wanted to store videos for Plex on a windows box and bought a few refurbished 4tb drives off of Amazon.  Then a few more, then a few more.  One day I decided to try unRAID out and that is when my eyes were opened to servers and their silent killers.  Over the next few months those old drives overheated and suffered a LOT of sudden death.  Now, I am on 100% new drives and life is a little easier.  Anyway my initial backup share was the last of those old drives and when it went out I thought no biggy, pop in a new one and it will restore from parity.  I have been trying out Dynamix File Integrity plugin this past week and it had completed no errors.  I wanted to inspect my Acronis backup files to see that they were still viable.  Sure enough I had massive bit-rot.  None of the PC's requiring restoration (were still working), I was happy to have learned it now rather than later.  I had lost a quarter of my backup files.  When?  was it Windows, or unRAID?  who knows?  All I can say is bit rot is real and never fall into the happy ignorance that you have backups so you are good.  Test your data quality as part of a regular routine.  Over the 4 decades I have been at this I have lost digital content, lots of it.  The stuff most precious to you and your family is normally the first to go.  In my professional life we often call it digital Alzheimer's when the data is right in front of you, but you have no idea some is missing.

  3. [solved]  The MSI B550M has many built in settings to handle about everything one could come across.  If you try to navigate to them you will get lost.  You can however use the handy search utility for key phrases like "V" and "IOMMU".  They had every problem I faced covered.  It is a highly capable VM motherboard.

  4. You are correct.  It has been locked out so it is not being diagnosed.  I wend into the syslog, reviewed that, and found a way to get at the smart on it.  It has 5 years and 9 months of life on it and when I look at the lock out event it had a large stack of IO faults.  I did check the cabling revealing no issues.  It is running 20 degrees F hotter than all the other drives.  With all the facts I did order some brand new drives that get here on Thursday.  It is running meantime one drive down with no problem.  I ignored all the feedback and attempted to shuck some MyBook's from a local store.  I ended up putting them back together and hooking them to my router.  The SMR design started throwing CRC's right out of the gate.  One of the local buy's was a WD Red drive and it was not SMR.  I always try to buy drives in blocks of three.  One goes in, during the rebuild another one goes out, that goes in and I have a spare for next.  I am almost done cycling through all the old hardware idea and am on mostly fresh brand new drives now.  Thanks for looking at that though.  I really appreciate you guys.

  5. I have a Windows VM and EVGA NVIDIA 2060 on an AMD motherboard MSI B550M.  Everything works.  All I have to do is show bios that there is nothing plugged into the PGPU making it look at the iGPU for it's primary monitor.  Yes, I have it set to default to iGPU in bios.  But, the only way I have a free and clear NVIDIA 2060 for the Windows VM is don't let BIOS see it in use or available.

     

    I guess no replies tells me no one is in love with this motherboard and CPU combo.

     

  6. I think ... (I am a systems developer)  there is more value in the dockers themselves like Plex Server to build out a file read cache.  On demand when a movie is called for it starts writing it out to a NVMe or even a memory ram disk.  Then when (like last night) your wife is watching Deadpool, and you are watching Real Player One, both are on Disk4 and both are high IO.  One of you will buffer and the buffer is related to the count of arms in your hard drive (one).

     

    It is my belief that reading contiguously from disk to any sort of a ram disk or NVMe would be faster than a typical LAN bandwidth.  So, I would like to see the Plex developers create a setting for pre-cache locations.

    • Upvote 2
  7. I had nextcloudpi on my unraid server for a day or so.  Even though it is not a PI server it loaded up on my Ryzen 5 5600 with 16 gig of ram and no reverse proxy, just a local connection to the host.  The thing loaded all my data up in very short order.  It got me to thinking I wanted to step back and install a database and a full nextcloud.  That too was not half bad.  The slowness became apparent after I started using the proxy.  So, I went back and turned it off, but the performance was not great.  I am holding out to get my 350 gig of data loaded into it.  I see it is slow, ram is not being touched, network is idle, disk IO is in the k's not the megs like it was doing.  I never got rid of my SMB and my robocopy scripts.  So, I may be dumping my DNS all together.

  8. I am running a multi terabit SSD for my cache.  According to the guides it says when the cache fills the data writes direct onto the array.  Well, the array was in the middle of a move and I had data coming in from two sources and very large files.  My diagnostic logs look like a blood bath.  I have a SATA onboard controller that supports 6 drives.  On that I had the parity plus cache and also a spare SSD that is in the array.  There are also 8 drives on a SAS to SATA PCIe 8x controller.  It has 12 hours to go on a read check and I would assume decisions would need to come after that as to replacing what hardware.  I had two Toshiba 4tb 128 meg of cache drives what are in the same serial number range that were side by side on ports on the motherboard that and those were the failing drives this time.  Any drive coming out of unraid with that failure has never recovered when placed on other controllers.

     

    Earlier in the day I attempted to mount a multi-terabit SSD and that all failed.  You will see that goofing around in the logs as well.

    gumby-diagnostics-20211225-2120.zip

  9. solved it... the work I did to make the network robust actually paid off big.  What I failed to do was go back to RoboCopy and set my max threads back up.  I have to apologize with submitting so many self solved problems... I am a senior technologist with a fortune 500 manufacturing firm.  I have been working in technologies since the mid 80's.  So, I am new to this forum, but been around a bit.

  10. There was a time earlier that I was bragging that my speed was hitting the ceiling of my network capability and I had never experienced that with any other NAS software.  Then suddenly I started hitting a ceiling 20~40 MBs.  I have been looking at my LAN and have troubleshot from switch to switch and cross switches.  When I set a Windows PC beside this NAS with Aida32 Network Benchmark I am getting a much higher max bandwidth than this PC (NAS) has.  So, I added a second PCI NIC to my media creator PC (next to my NAS) and also a second PCI NIC to this NAS.  In the unRAID NAS I primary bonded to eth1 and ran backup to eth0.  This thinking was to analyze the possibility of hardware fail.  Nothing has changed, no matter what I run through at it it is very very slow.

     

    I even went so far as to make some new CAT6 wires and tested them.

    gumby-diagnostics-20211224-0927.zip

  11. You are probably thinking this is a negative comment, but it really isn't.  I have a Plus license and I have learned through a couple of weeks of tinkering how this goes correctly.  Now, I had decided to go back and do a full reset with all of your (moderator) feedback as well as google'd lessons learned.  I have built a new empty flash drive and placed my Key on it.  I have chosen to erase all the drives and lined up my new media across the 40TB of drives.  Once this is done, nothing will be finer, nothing to improve, it will be perfect.

     

    Because the world has different PC's and no two use cases are similar, I fully get why we do a POC phase before we "go physical"

  12. Yes 2.0    It was the plague of multiples.  The USB is toast it shows as raw.  I have a old backup I am placing on another USB.  I know it is a major setback.  The second of the problems was that I made my lan wire loose all the yanking on the heavy server.  So it was not finding network.

     

×
×
  • Create New...