Hi, I'm not sure where to post this, so please move it if there is a better spot.
I've been trying to track down random lockups of my Unraid server for the last few months. The system becomes unresponsive (no WebUI, Telnet, FTP or file access) for several minutes, then everything goes back to normal. The same error always appears in the logs:
Feb 24 03:45:57 Server kernel: timekeeping watchdog: Marking clocksource 'tsc' as unstable, because the skew is too large:
Feb 24 03:45:57 Server kernel: 'hpet' wd_now: 9c965a15 wd_last: 9c5bb7c6 mask: ffffffff
Feb 24 03:45:57 Server kernel: 'tsc' cs_now: 30da64fb569e cs_last: 2ff9f2884ef6 mask: ffffffffffffffff
Feb 24 03:45:57 Server kernel: Hangcheck: hangcheck value past margin!
Feb 24 03:45:57 Server kernel: Switched to clocksource hpet
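For anyone reading the log lines above, here is a rough Python sketch of how the wd_now/wd_last and cs_now/cs_last values can be turned into elapsed ticks and approximate seconds. The helper name masked_delta and both clock rates are assumptions for illustration only: 14.31818 MHz is a common HPET rate, and 3.2 GHz is the nominal clock of the Phenom II X4 955. The rates the kernel actually calibrated on this board may differ.

```python
def masked_delta(now_hex, last_hex, mask_hex):
    """Ticks elapsed between two counter readings, wrapped to the counter width."""
    now = int(now_hex, 16)
    last = int(last_hex, 16)
    mask = int(mask_hex, 16)
    return (now - last) & mask


# Values copied from the 'hpet' and 'tsc' log lines above.
hpet_delta = masked_delta("9c965a15", "9c5bb7c6", "ffffffff")
tsc_delta = masked_delta("30da64fb569e", "2ff9f2884ef6", "ffffffffffffffff")

# ASSUMED clock rates, not read from this system.
HPET_HZ = 14_318_180       # common HPET frequency
TSC_HZ = 3_200_000_000     # nominal Phenom II X4 955 clock

hpet_seconds = hpet_delta / HPET_HZ
tsc_seconds = tsc_delta / TSC_HZ
hpet_wrap_seconds = 2**32 / HPET_HZ  # time for a 32-bit HPET counter to wrap

print(f"hpet: {hpet_delta} ticks, about {hpet_seconds:.2f} s")
print(f"tsc:  {tsc_delta} ticks, about {tsc_seconds:.2f} s")
print(f"32-bit hpet wraps about every {hpet_wrap_seconds:.0f} s")
```

If the assumed rates are roughly right, the tsc delta works out to about five minutes while the 32-bit hpet delta is well under a second. That would be consistent with the hpet counter wrapping once during a stall of several minutes, so the skew message may be a symptom of the lockup rather than its cause.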
I managed to track the problem down to having the Emby docker running. If I disable the docker, the server runs flawlessly. I set up Plex instead, and the system ran perfectly for a week. When I re-enabled the Emby docker, the system crashed again within a few hours. I have nuked the Emby docker and started again, and have also tried two different repositories.
Running Unraid 6.1.8 Pro, with no extra plugins other than the Emby docker.
System is:
AMD Phenom II X4 955, ASUS M5A97 EVO, 4GB RAM, 120GB SSD cache (docker and appdata on the SSD), 11 HDD 22TB array, all disks formatted in XFS.
Diagnostic info is attached.
server-diagnostics-20160224-0740.zip
Emby_Log.txt