Jump to content

System freezes - diagnostic steps


boomam

Recommended Posts

Hi,
Twice in the last week my unraid box has done a hard crash in the early hours of the morning.

As in, blank screen, non-responsive, needs a reset.


The difference this morning is that i had set the syslog set to write to the disk so i have the log....but i'm not seeing anything useful in it.
In fact, there's an entire gap of time in the log, strangely enough.
 
3:10am you can see the CA Backup/Restore running and completing.
 ...Nothing in-between...
 7:02am the drives spin down down.
 ...Nothing in-between...
 9:13am you can see the system start to come back up

(or maybe from before when i plugged in a keyboard, its hard to tell as the log doesn't specifically show what i would expect to see during a boot at the beginning).
 
Looking over the logs, SMART statuses & both an hour before and then after the reboot shows no errors that i can see either.
 
The only 'theory' i have is that according to the time schedules i have for mover, backups, etc. the appdata backup runs about an hour before the mover, but i dont see a mover entry in the log...
 
How do i go about diagnosing this further?

As its previously ran for close to a month without a single issue (all new hardware/new build).

 

Key information

Version: v6.8.2

Plugins: 

  • Advanced Copy and Move
  • CA Auto Update Applications
  • CA Backup / Restore Appdata
  • CA Cleanup Appdata
  • CA Dynamix Unlimited Width
  • CA Mover Tuning
  • Community Applications
  • Dynamix Active Streams
  • Dynamix Cache Directories
  • Dynamix SSD TRIM
  • Dynamix System Information
  • Dynamix System Statistics
  • Dynamix System Temperature
  • Enhansed Log Viewer
  • Fix Common Problems
  • Nerd Tools (for Perl installation)
  • Network Statistics
  • Parity Check Tuning
  • Preclear Disks
  • ProFTPd
  • Speedtest Command Line Tool
  • Unassigned Devices
  • Unassigned Devices Plus
  • unBalance
  • User Scripts
  • VM Backup
  • Wake On Lan support

 

Hardware:

  • AMD Ryzen 3200G
  • 16GB (2x8Gb) PC3200
  • Asrock X570M Pro4
  • 3x Seagate IronWolf 4Tb (Pool)
  • 2x Crucial MX500 1Tb (Cache Drive + Unassigned)
  • APC BackupUPS 650VA
  • Average CPU Temp (Idle/Load) - 35*c/50*c
  • Average HDD Temp (Idle/Load) - 35*c/50*c
  • Average SSD Temp (Idle/Load) - 40*c/45*c

 

Log (attached)

  • Anonymised server name and some other parts.
  • Client 192.168.0.25 errors in the log was intentional testing with the LetsEncrypt Reverse Proxy.
  • Time of freeze based on the logs is sometime between 3:10am and 9am today (February 8th).


Thanks!

syslog-192.168.0.250.log

Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

×
×
  • Create New...