Jump to content
  • System crash seems to happen when doing Parity check


    Which6
    • Urgent

    After several crashes of my server I upgraded my Bios on my Asus Steel Legend mother board to the latest bios, created a new USB incase of a failed dongle; and yet it still crashes.  All Temps seem good but since the parity check is happening overnight I can't be sure. I have attached the diagnostics report for possible review. Thanks for any suggestions or input. 

    Unconfirmed 640343.crdownload Unconfirmed 551103.crdownload




    User Feedback

    Recommended Comments

    Both diags are just after rebooting, so not much to see, enable the syslog server and post that after a crash.

     

    P.S. next time please make sure you download the files properly first, to avoid us having to rename them to zip.

    Link to comment

    Thanks, I'm new to the diagnostics. The server has never had issues until now.  I'll resubmit after next crash. Thanks for the fast response.

    Link to comment

    Are your sure there are no power related issues?   A parity check would be a time of maximum load.

    Link to comment

    Not that I can verify.  I misspoke about the motherboard It is an ASRock Z690 Steel Legend
    American Megatrends International, LLC., Version 18.04
    BIOS dated: Thursday, 06-06-2024. The power supply In the machine is way overkill in anticipation of growth. it's a Seasonic Prime GX-1300. I'm not on an UPS though.

    Link to comment

    The syslog in the diagnostics is the RAM version that starts afresh every time the system is booted.  You should enable the syslog server (probably with the option to Mirror to Flash set) to get a syslog that survives a reboot so we can see what leads up to a crash.  The mirror to flash option is the easiest to set up (and if used the file is then automatically included in any diagnostics), but if you are worried about excessive wear on the flash drive you can put your server's address into the remote server field.  

    Link to comment

    Thanks, I've set up the Mirror to flash option now. I'll send a new diagnostic file once it crashes again.  Thanks for the information and fast reply assistance

    Link to comment

    Crashed again last night. I have attached the downloaded Diagnostics file.

    Please let me know if I need a different file or other information to solve this. I hope this pulled the correct information. I had the Sys log  server set to Mirror syslog to flash.

    Thank you for the assistance

    tower-diagnostics-20240625_0527.zip

    Link to comment

    Multiple call traces logged, start by running memtest, if nothing is found, and because memtest is only definitive if it finds errors, try running the server with just one stick of RAM, if the same try with a different one, that will basically rule out bad RAM, if issues persist, another thing you can try is to boot the server in safe mode with all docker containers/VMs disabled, let it run as a basic NAS for a few days, if it still crashes it's likely a hardware problem, if it doesn't start turning on the other services one by one.
     

    Link to comment

    I'll let you know the out come. I'll also leave the Sys log in mirror setting to send you a file later if it crashes again while testing.  Thank you once again for the quick action on this.

    Link to comment

    Yep, It appears you were correct on your diagnosis. Attached a shot of the Memtest. I had another 64GB DDR kit and have switched out for the new RAM. It's a different Mgf. using Micron chips which may or may not make a difference but all seems to be running smooth so far.  If anything changes I'll let y'all know. 

    Thank you  JorgeB for the excellent assistance.

    MemTest errors 2.pdf

    • Like 1
    Link to comment


    Join the conversation

    You can post now and register later. If you have an account, sign in now to post with your account.
    Note: Your post will require moderator approval before it will be visible.

    Guest
    Add a comment...

    ×   Pasted as rich text.   Restore formatting

      Only 75 emoji are allowed.

    ×   Your link has been automatically embedded.   Display as a link instead

    ×   Your previous content has been restored.   Clear editor

    ×   You cannot paste images directly. Upload or insert images from URL.


  • Status Definitions

     

    Open = Under consideration.

     

    Solved = The issue has been resolved.

     

    Solved version = The issue has been resolved in the indicated release version.

     

    Closed = Feedback or opinion better posted on our forum for discussion. Also for reports we cannot reproduce or need more information. In this case just add a comment and we will review it again.

     

    Retest = Please retest in latest release.


    Priority Definitions

     

    Minor = Something not working correctly.

     

    Urgent = Server crash, data loss, or other showstopper.

     

    Annoyance = Doesn't affect functionality but should be fixed.

     

    Other = Announcement or other non-issue.

×
×
  • Create New...