• Data loss on disks , disks falling out of array randomly


    MacModMachine
    • Closed

    I think i found a possibly major defect. my diagnostics are running the beta , however this happens also in 6.8.3. 

     

    Lets start off with the issue. 

     

    after 7-8 days roughly. the disks will fall out of the array. ( i know you will say its the cables , the cards ,hdd ect. however its not). this usually can be fixed by removing and readding the disk to the array(letting it rebuid). however when i do that. it shows the disk as empty now....all data is gone.

     

    i have tried this on several different combinations of motherboard/cpu , controllers...all amd systems.

     

    pulling out the motherboard and cpu and swapping with a 7th gen intel cpu works fine, i left it running for a few weeks with no issues.

     

    i have made another unraid flash drive , started from scratch. issue still shows up.

     

    tried this hardware in many combinations :

     

    B450F motherboard

    A320I-K motherboard

    AM4 3400G

    AM4 3600

    Hitatchi 3tb disks

    brand new 8tb wd disks

    brand new 8th seagate compute disks

    timetec memory , gskill memory in varios sizes and even new packages

    LSI 2008 based controller

    PERC H310 flashed to IT mode

    H200 HP flashed to IT mode

    motherboard sata connectors

    Brand new 800w corsair power supply.

     

     

    the logs show like the disk is failing , however if i swap the disk out. with a brand new one....another will show failed until this happens to all the disks. then the brand new ones will start doing the same.

     

    im completely lost at this point....i have had to revert to my intel machine running my backup array. all the parts listed above except the 3tb and controllers are brand new. those parts when swapped into the intel system have no issues.

     

    all of the HBA's are cooled , nothing in the system builds have anything running hot. all within a 40-50c max operating temp. hdd's are running at a 35C average. i have tried different pcie slots, controller combinations , reflashing controllers. they work in the intel system. 

     

    i have been running unraid for 10+ years , this is the first time i have had an issue.

    fileserver-diagnostics-20200623-1206.zip




    User Feedback

    Recommended Comments

    This is a general support issue, first thing to do is to update the LSI firmware since it's on a very old release:

    Jun 23 06:41:02 fileserver kernel: mpt2sas_cm0: LSISAS2008: FWVersion(07.15.08.00), ChipRevision(0x03), BiosVersion(07.39.00.00)

    Current one is 20.00.07.00, if after that you still need help please start a thread on the general support forum.

    Link to comment


    Join the conversation

    You can post now and register later. If you have an account, sign in now to post with your account.
    Note: Your post will require moderator approval before it will be visible.

    Guest
    Add a comment...

    ×   Pasted as rich text.   Restore formatting

      Only 75 emoji are allowed.

    ×   Your link has been automatically embedded.   Display as a link instead

    ×   Your previous content has been restored.   Clear editor

    ×   You cannot paste images directly. Upload or insert images from URL.


  • Status Definitions

     

    Open = Under consideration.

     

    Solved = The issue has been resolved.

     

    Solved version = The issue has been resolved in the indicated release version.

     

    Closed = Feedback or opinion better posted on our forum for discussion. Also for reports we cannot reproduce or need more information. In this case just add a comment and we will review it again.

     

    Retest = Please retest in latest release.


    Priority Definitions

     

    Minor = Something not working correctly.

     

    Urgent = Server crash, data loss, or other showstopper.

     

    Annoyance = Doesn't affect functionality but should be fixed.

     

    Other = Announcement or other non-issue.