Mytherium

Members, 6 posts

Posts posted by Mytherium

  1. Hi there,

    I've had issues with Unraid crashing unexpectedly for a number of weeks now, and I'm finding it nearly impossible to track down what exactly the issue is because multiple things happen when it finally does crash.

     

    Some of this behaviour I've posted about in a different thread, but to recap: the System Event Logs from my motherboard are very vague. I disabled C1E Support in the BIOS as a troubleshooting step, which did nothing. Changing memory speeds made no difference either.

     

    I've configured remote logging on my Unraid server so I can watch its behaviour as it crashes. I've just had my longest uptime in a long time (about 11 days), and this morning it began crashing again.

     

    It seems like Unraid is having page fault errors, but testing all 8 of my 16GB ECC sticks turned up nothing.

    Attached are my logs from the past 48 hours, but these are probably the highlights for what's kicked this whole process off: 

    Quote

    2019-10-12 10:59:01    Kernel.Alert    LEANDER    Oct 12 10:59:00 Leander kernel: housekeeper: Corrupted page table at address 1505b890d010

    Quote

    2019-10-13 00:01:07    Kernel.Alert    LEANDER    Oct 13 00:01:06 Leander kernel: BUG: Bad page cache in process pihole-FTL  pfn:6ec9a8

    Quote

    2019-10-13 00:47:10    Kernel.Alert    LEANDER    Oct 13 00:47:09 Leander kernel: BUG: Bad page map in process python  pte:ffff888b0887c958 pmd:f674ae067
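For anyone who wants to pull the same kind of lines out of a long syslog, here is a minimal Python sketch. It assumes the log is plain text with one message per line; the three patterns are taken from the kernel messages quoted above and are not an exhaustive list of memory-corruption signatures. The function name `find_page_errors` and the unrelated filler line in the sample are made up for illustration.

```python
import re

# Patterns copied from the three kernel messages quoted in the post.
# This list is illustrative only, not a complete set of signatures.
PATTERNS = [
    re.compile(r"Corrupted page table at address \S+"),
    re.compile(r"BUG: Bad page cache in process \S+"),
    re.compile(r"BUG: Bad page map in process \S+"),
]


def find_page_errors(lines):
    """Return (line_number, matched_text) for every line matching a pattern."""
    hits = []
    for number, line in enumerate(lines, start=1):
        for pattern in PATTERNS:
            match = pattern.search(line)
            if match:
                hits.append((number, match.group(0)))
                break  # one hit per line is enough
    return hits


# Sample input: three lines from the post plus one invented filler line.
sample = [
    "Oct 12 10:59:00 Leander kernel: housekeeper: Corrupted page table at address 1505b890d010",
    "Oct 13 00:01:06 Leander kernel: BUG: Bad page cache in process pihole-FTL  pfn:6ec9a8",
    "Oct 13 00:30:00 Leander kernel: (invented filler line, not from the real log)",
    "Oct 13 00:47:09 Leander kernel: BUG: Bad page map in process python  pte:ffff888b0887c958 pmd:f674ae067",
]

for number, text in find_page_errors(sample):
    print(number, text)
```

To run it against a real file, pass the open file object instead of `sample`, since iterating a text file yields its lines.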

     

    In addition to this, I'm seeing nginx fall apart: the Unraid webGUI slowly stops working, as pictured below, until nginx finally returns a 504 error.

     

    ANY help is greatly appreciated; I'm out of ideas.

    leanderwut.PNG

    syslog from Oct 12-13 2019.txt

  2. 2 hours ago, bonienl said:

    I believe there is another version of the memtest available which runs under UEFI, but I don't know/have the details.

    Perhaps somebody else reading this, may know this and have some pointers.

    You might need to make a bootable USB drive with the latest version of MemTest86 (https://www.memtest86.com/download.htm). I've had trouble booting the built-in version that Unraid ships, even on a board with a legacy BIOS.

  3. Hey there,

    I've been running into a similar situation where Unraid crashes unexpectedly and the only way to recover is to power-cycle the machine.

    I've tried using ECC and non-ECC DIMMs, but it just keeps crashing. The System Event Log of my X9DR3-LN4F+ keeps reporting the same OS Stop Shutdown event on... FAN6 and with very strange timestamps. Only FAN2-4 are populated in the system and temperatures are under control because the system idles while I try to troubleshoot these crashes (which tend to be every 3-8 days or so).

     

    The SEL records lead nowhere. I disabled C1E Support in the BIOS as a troubleshooting step, which did nothing. Changing memory speeds made no difference either. I also suspect the MDS vulnerability may be a factor, because /sys/devices/system/cpu/vulnerabilities/mds reads "Mitigation: Clear CPU buffers; SMT vulnerable".
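To see the status of every entry in that sysfs directory at once, rather than just `mds`, something like the following Python sketch could be used. It assumes a Linux kernel that exposes `/sys/devices/system/cpu/vulnerabilities`; the helper names `read_vulnerabilities` and `smt_vulnerable` are hypothetical, and the check is a plain substring match on the wording quoted above.

```python
from pathlib import Path


def read_vulnerabilities(base="/sys/devices/system/cpu/vulnerabilities"):
    """Map each file name in the sysfs directory to its one-line status.

    Returns an empty dict if the directory does not exist (for example on
    a kernel or OS that does not expose it).
    """
    directory = Path(base)
    if not directory.is_dir():
        return {}
    report = {}
    for entry in sorted(directory.iterdir()):
        if entry.is_file():
            report[entry.name] = entry.read_text().strip()
    return report


def smt_vulnerable(status):
    """True if the status text says SMT is still vulnerable."""
    return "SMT vulnerable" in status


if __name__ == "__main__":
    for name, status in read_vulnerabilities().items():
        flag = "  <-- SMT vulnerable" if smt_vulnerable(status) else ""
        print(f"{name}: {status}{flag}")
```

This only reports what the kernel says about mitigations; it does not show whether MDS has anything to do with the crashes.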

     

    For now I have the syslog being saved to my USB drive, but I'll try to set up an external logging server when I can. Any help is greatly appreciated.

     

    Attached are screenshots of the terminal before rebooting the machine on two recent crashes, logs of those most recent crashes and a screenshot from the IPMI SEL log.

     

    My system is using Unraid v6.7.2

    2x E5-2670 v1 at 2.60GHz

    128GB of Samsung ECC memory at 1600MHz M393B2G70BH0-YK0

    Supermicro X9DR3-LN4F+ rev1.10 using BIOS v3.3, IPMI firmware 3.48

    thumbnail (2).jpg

    thumbnail (1).jpg

    thumbnail.jpg

    syslog_01_01_2019.txt syslog_28_09_2019.txt

  4. Hey, so I've been using OpenVPN on my server for a while now. I stumbled upon spaceinvader's June 2019 update video for it and wanted to update my configuration. Going through the guide, I found that the admin user cannot have its password changed, nor can I delete the user through the GUI. Is this a known issue, and is there a workaround? This kind of oversight seems to be a recurring theme among OpenVPN versions.

     

    Here's a clip of me using the default 'password', changing it to '123', and the admin user still accepting the old password. I'm using openvpn-as version 2.7.5.

    https://giant.gfycat.com/DarkDisgustingBrownbear.webm
