Soxekaj Posted April 27, 2020 Share Posted April 27, 2020 (edited) I have been experiencing a weird issue for the past couple of months. Shortly after a parity check finishes or i cancel the check, the server reboots. This of course triggers a new parity check and so the circle is complete. I have syslog setup to mirror to flash, and when checking with cat, it runs through a bunch of log, but once it finishes, I can only access the logs from after the crash. Anyone have any input on why the crashes might be happening or maybe a way to see the logs from before the crash, so that might bring answers to light. Edited May 20, 2020 by Soxekaj Quote Link to comment
testdasi Posted April 27, 2020 Share Posted April 27, 2020 If you set up syslog to mirror to flash then the file on the flash drive should contain both pre and post crash. Just attach the file. Crashes like that are hard to diagnosed though. We'll see if there's anything useful in the log. While you are at it, attach the diagnostic zip (Tools -> Diagnostics -> attach the whole zip file). My first hunch is PSU. Second hunch is RAM. Quote Link to comment
Frank1940 Posted April 27, 2020 Share Posted April 27, 2020 (edited) You have setup the Syslog Server per these instructions??? https://forums.unraid.net/topic/46802-faq-for-unraid-v6/page/2/?tab=comments#comment-781601 Two things: First, attach the syslog that you get. (I see that @testdasi has already requested that.) Give us a date and approximate time to be looking at if it is large syslog. Second, hook up a monitor and trigger the crash by stopping the parity check. Perhaps, you could use a camera to get a photo of the problem. (Prepare to be quick as it may last only a few seconds. If nothing else, describe what you see. Does it look like everything was fine and then a restart or does the system vomit a whole bunch of stuff (like a core dump) that does not look like normal syslog entries? Edited April 27, 2020 by Frank1940 Quote Link to comment
Frank1940 Posted April 27, 2020 Share Posted April 27, 2020 One more thing. Does it run normally for a while after the parity check or it virtually instantaneous with the finish. Is there an possibility that a child or pet (cats sometimes are a problem here) may be pushing the Reset button. IF you even suspect this could be a problem, disconnect the switch leads at the MB end. Quote Link to comment
Soxekaj Posted April 27, 2020 Author Share Posted April 27, 2020 The crash doesn't happen instantly when the parity check stops, but usually withing 30min or so (it varies) I managed to get syslog of what is happening just before the crash and after. These are for the last 2 times it happened (early this morning and this afternoon) Also attached is the diagnostics as requested. tower-diagnostics-20200427-1608.zip syslog1.txt syslog2.txt Quote Link to comment
Frank1940 Posted April 27, 2020 Share Posted April 27, 2020 From what I can tell, the Syslog Server is not setup properly. Did you set it up using the instructions in the post above? The file created by the Syslog Server in the logs folder/directory of the flash drive and it is named syslog without any extension. Quote Link to comment
Soxekaj Posted April 28, 2020 Author Share Posted April 28, 2020 It is set up correctly, these are just grabs from putty because i had problems transfering the file off the machine. I can try again if you want the entire log. Quote Link to comment
Frank1940 Posted April 28, 2020 Share Posted April 28, 2020 If necessary, shut the server down (after stopping the parity check), pull the Flash drive and copy it from the flash drive on your PC. While you have it in the PC make a complete backup of the contents of the flash drive. Also run a chkdsk on the flash drive. Quote Link to comment
Soxekaj Posted April 29, 2020 Author Share Posted April 29, 2020 Here we go, complete syslog file. I made a copy of the content while i had it in my pc as you sugested. Ran chkdsk, nothing found: syslog Quote Link to comment
testdasi Posted April 29, 2020 Share Posted April 29, 2020 What's your RAM speed? Are you using XMP? Also can you let us know the last time it crashed / unexpectedly reboot? Was it this morning? Your 23MB syslog spanned almost the whole April so it contains too much information (e.g. your intentional reboot is mixed up with the crash) so we need to narrow it down. Quote Link to comment
Soxekaj Posted April 29, 2020 Author Share Posted April 29, 2020 RAM is runnuing at base clock speed (2666), don't think i have XMP active, but not sure, been a while since i was in bios and i don't have a graphics card in the machine (model F i3, so no on chip graphics). I have not done intentional reboots for a while, so if you look at April that is probably only crashes for that period. Quote Link to comment
Frank1940 Posted April 29, 2020 Share Posted April 29, 2020 (edited) Doing a search of your syslog using Linux version 4 as the search term, I noticed that the reboot seems to occur with fifteen minutes of when the parity finishes or is stopped. There is no clue as to anything happening beyond what would be expected. It appears that the sever is in a idle state. I would check the BIOS and make sure that it s not set to some power saving mode. If it is change to high performance mode as a test. You could also run memtst (a boot option) for 24 hours and see if that detects anything. You might also consider changing the PS. They have been found to be the culprit in other similar cases. You might have one in your junk box, borrow one from a friend or a loan of one from a vendor with a liberal return policy... EDIT: Do one thing at a time, so if something helps, you know what it is! Edited April 29, 2020 by Frank1940 Quote Link to comment
Soxekaj Posted April 30, 2020 Author Share Posted April 30, 2020 22 hours ago, testdasi said: What's your RAM speed? Are you using XMP? Are you recommending running with XMP active? Will get a hold of a gpu, so that i can access BIOS and check settings for power savings. Quote Link to comment
itimpi Posted April 30, 2020 Share Posted April 30, 2020 12 minutes ago, Soxekaj said: Are you recommending running with XMP active? Will get a hold of a gpu, so that i can access BIOS and check settings for power savings. No. Many people do not realize that XMP is an overclocked setting and thus the question. Quote Link to comment
Soxekaj Posted April 30, 2020 Author Share Posted April 30, 2020 6 minutes ago, itimpi said: No. Many people do not realize that XMP is an overclocked setting and thus the question. That is what i was assuming, but given my predicament, I just wanna get everything right. I do have a hard time seeing why XMP or any other hardware component would cause this, since it happens pretty tight around 15 min after parity check finishes. If it was hardware related, wouldn't it also happen during the check, or is the hardware being hit differently after the check? Quote Link to comment
testdasi Posted April 30, 2020 Share Posted April 30, 2020 10 minutes ago, Soxekaj said: That is what i was assuming, but given my predicament, I just wanna get everything right. I do have a hard time seeing why XMP or any other hardware component would cause this, since it happens pretty tight around 15 min after parity check finishes. If it was hardware related, wouldn't it also happen during the check, or is the hardware being hit differently after the check? Hardware-related instability can be hard to predict / understand. For example, with Precision Boost on (i.e. AMD-certified automatic overclock), my system is rock solid stable. Browsing, gaming, transcoding etc, all fine. The only exception is a very specific Lightroom job that does not even load up on the CPU (not even ONE core to 100%!) and it reliably crashes my whole server every single time despite being run in a VM. The point here is you can't really predict when an innate instability will rear its ugly head. Quote Link to comment
Soxekaj Posted May 20, 2020 Author Share Posted May 20, 2020 Disabling XMP did the trick. Quote Link to comment
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.