knutarn Posted January 4, 2020 Share Posted January 4, 2020 (edited) Hey! My Unraid server start to crash efter 3year of drift. Get this: https://imgur.com/DBS25t7 and 3-8 haure after i get this: https://imgur.com/QTxCWlv and server need hard reboot to work again. What happen here? Info: Model: NS1 M/B: Gigabyte Technology Co., Ltd. - Z68X-UD3H-B3 CPU: Intel® Core™ i5-2500K CPU @ 3700 HVM: Enabled IOMMU: Disabled Cache: 852672 kB Memory: 16 GB (max. installable capacity 32 GB) Network: eth0: 1000 Mb/s, full duplex, mtu 1500 Kernel: Linux 4.14.16-unRAID x86_64 OpenSSL: 1.0.2n LOG: Jan 4 19:49:15 Elsa kernel: virbr0: port 1(virbr0-nic) entered disabled state Jan 4 19:49:15 Elsa kernel: br0: port 2(vnet0) entered blocking state Jan 4 19:49:15 Elsa kernel: br0: port 2(vnet0) entered disabled state Jan 4 19:49:15 Elsa kernel: device vnet0 entered promiscuous mode Jan 4 19:49:15 Elsa kernel: br0: port 2(vnet0) entered blocking state Jan 4 19:49:15 Elsa kernel: br0: port 2(vnet0) entered forwarding state Jan 4 19:49:15 Elsa kernel: usb 1-1.5: reset full-speed USB device number 3 using ehci-pci Jan 4 19:49:15 Elsa kernel: usb 1-1.5: reset full-speed USB device number 3 using ehci-pci Jan 4 19:49:16 Elsa avahi-daemon[3213]: Joining mDNS multicast group on interface vnet0.IPv6 with address fe80::fc54:fXe6b:554e. Jan 4 19:49:16 Elsa avahi-daemon[3213]: New relevant interface vnet0.IPv6 for mDNS. Jan 4 19:49:16 Elsa avahi-daemon[3213]: Registering new address record for fe80::fX:ff:fe6b:554e on vnet0.*. Jan 4 19:49:18 Elsa kernel: usb 1-1.5: reset full-speed USB device number 3 using ehci-pci Jan 4 19:49:18 Elsa kernel: usb 1-1.5: reset full-speed USB device number 3 using ehci-pci Jan 4 19:49:20 Elsa kernel: usb 1-1.5: reset full-speed USB device number 3 using ehci-pci Jan 4 19:49:21 Elsa kernel: usb 1-1.5: reset full-speed USB device number 3 using ehci-pci Jan 4 19:49:21 Elsa kernel: usb 1-1.5: reset full-speed USB device number 3 using ehci-pci Jan 4 19:49:21 Elsa kernel: usb 1-1.5: reset full-speed USB device number 3 using ehci-pci Jan 4 19:49:23 Elsa kernel: usb 1-1.5: reset full-speed USB device number 3 using ehci-pci Jan 4 19:49:23 Elsa kernel: usb 1-1.5: reset full-speed USB device number 3 using ehci-pci Jan 4 19:49:24 Elsa kernel: usb 1-1.5: reset full-speed USB device number 3 using ehci-pci Jan 4 19:49:25 Elsa kernel: usb 1-1.5: reset full-speed USB device number 3 using ehci-pci Jan 4 19:49:25 Elsa kernel: usb 1-1.5: reset full-speed USB device number 3 using ehci-pci Jan 4 19:49:25 Elsa kernel: usb 1-1.5: reset full-speed USB device number 3 using ehci-pci Jan 4 19:50:01 Elsa sSMTP[5049]: Creating SSL connection to host Jan 4 19:50:01 Elsa sSMTP[5049]: SSL connection using XXXX Jan 4 19:50:03 Elsa sSMTP[5049]: Sent mail for [email protected] (221 XXXXX.XXXXX.no closing connection) uid=0 username=XXXX outbytes=667 Jan 4 19:55:43 Elsa kernel: CPU1: Core temperature above threshold, cpu clock throttled (total events = 1) Jan 4 19:55:43 Elsa kernel: CPU0: Package temperature above threshold, cpu clock throttled (total events = 1) Jan 4 19:55:43 Elsa kernel: CPU2: Package temperature above threshold, cpu clock throttled (total events = 1) Jan 4 19:55:43 Elsa kernel: CPU3: Package temperature above threshold, cpu clock throttled (total events = 1) Jan 4 19:55:43 Elsa kernel: CPU1: Package temperature above threshold, cpu clock throttled (total events = 1) Jan 4 19:55:43 Elsa kernel: CPU1: Core temperature/speed normal Jan 4 19:55:43 Elsa kernel: CPU0: Package temperature/speed normal Jan 4 19:55:43 Elsa kernel: CPU3: Package temperature/speed normal Jan 4 19:55:43 Elsa kernel: CPU2: Package temperature/speed normal Jan 4 19:55:43 Elsa kernel: CPU1: Package temperature/speed normal Edited January 4, 2020 by knutarn Quote Link to comment
FreeMan Posted January 4, 2020 Share Posted January 4, 2020 3 hours ago, knutarn said: My Unraid server start to crash efter 3year of drift. Not sure exactly what this means. Has it been working just fine for the last 3 years and these crashes are new/sudden with no configuration changes? Also, go to Tools | Diagnostics and download and post the complete diagnostics .ZIP file in your next post. You can also go to Settings | Network Services | Syslog Server and set "Mirror syslog to flash" to capture the syslog prior to any future crash. Note that this will write a lot of info to your boot flash drive, so you don't want to leave this on all the time, only for trouble shooting purposes. Quote Link to comment
knutarn Posted January 5, 2020 Author Share Posted January 5, 2020 9 hours ago, FreeMan said: Not sure exactly what this means. Has it been working just fine for the last 3 years and these crashes are new/sudden with no configuration changes? Also, go to Tools | Diagnostics and download and post the complete diagnostics .ZIP file in your next post. You can also go to Settings | Network Services | Syslog Server and set "Mirror syslog to flash" to capture the syslog prior to any future crash. Note that this will write a lot of info to your boot flash drive, so you don't want to leave this on all the time, only for trouble shooting purposes. Here is the file. Server has been online for 13h now. And not crash yet. I cant find Syslog server in Network services. knutarn elsa-diagnostics-20200105-0858.zip Quote Link to comment
itimpi Posted January 5, 2020 Share Posted January 5, 2020 9 minutes ago, knutarn said: Here is the file. Server has been online for 13h now. And not crash yet. I cant find Syslog server in Network services. knutarn elsa-diagnostics-20200105-0858.zip 98.77 kB · 0 downloads The syslog server was a feature that first became available on the 6.7 series of Unraid releases (current stable release is 6.8) and your diagnostics show you are on a much older release (6.4.1). Quote Link to comment
knutarn Posted January 5, 2020 Author Share Posted January 5, 2020 (edited) 52 minutes ago, itimpi said: The syslog server was a feature that first became available on the 6.7 series of Unraid releases (current stable release is 6.8) and your diagnostics show you are on a much older release (6.4.1). The server crash now. Then o going to make a update on it. What is the easy way to make a update? EDIT: Make update now:) Edited January 5, 2020 by knutarn Quote Link to comment
knutarn Posted January 5, 2020 Author Share Posted January 5, 2020 Done! new diag file elsa-diagnostics-20200105-1009.zip Quote Link to comment
knutarn Posted January 5, 2020 Author Share Posted January 5, 2020 now it start to make this in the log: Jan 5 10:53:49 Elsa kernel: CPU1: Core temperature above threshold, cpu clock throttled (total events = 1) Jan 5 10:53:49 Elsa kernel: CPU2: Package temperature above threshold, cpu clock throttled (total events = 1) Jan 5 10:53:49 Elsa kernel: CPU0: Package temperature above threshold, cpu clock throttled (total events = 1) Jan 5 10:53:49 Elsa kernel: CPU3: Package temperature above threshold, cpu clock throttled (total events = 1) Jan 5 10:53:49 Elsa kernel: CPU1: Package temperature above threshold, cpu clock throttled (total events = 1) Jan 5 10:53:49 Elsa kernel: CPU1: Core temperature/speed normal Jan 5 10:53:49 Elsa kernel: CPU2: Package temperature/speed normal Jan 5 10:53:49 Elsa kernel: CPU3: Package temperature/speed normal Jan 5 10:53:49 Elsa kernel: CPU0: Package temperature/speed normal Jan 5 10:53:49 Elsa kernel: CPU1: Package temperature/speed normal But the server is working just normal Add a new diagnostics now. elsa-diagnostics-20200105-1055.zip Quote Link to comment
JorgeB Posted January 5, 2020 Share Posted January 5, 2020 8 minutes ago, knutarn said: But the server is working just normal It might be, but CPU is throttling down due to overheating, this usually means cooler needs cleaning. Quote Link to comment
knutarn Posted January 5, 2020 Author Share Posted January 5, 2020 10 minutes ago, johnnie.black said: It might be, but CPU is throttling down due to overheating, this usually means cooler needs cleaning. Are you sure? Then i going to clean it Quote Link to comment
itimpi Posted January 5, 2020 Share Posted January 5, 2020 11 minutes ago, knutarn said: Are you sure? Then i going to clean it Especially since if the temperature is getting above a critical level the CPU may forcibly close down the server explaining yous 'crashes'. Quote Link to comment
knutarn Posted January 5, 2020 Author Share Posted January 5, 2020 5 minutes ago, itimpi said: Especially since if the temperature is getting above a critical level the CPU may forcibly close down the server explaining yous 'crashes'. Yes. The server is running but the screen is crashes like the picture and SD card light stopp (USB). But server is running, so I need to take the power cord from it to reboot it. If CPU going to hot, shuld the server just shutdown? Quote Link to comment
mrbilky Posted January 5, 2020 Share Posted January 5, 2020 (edited) Your system seems to be overheating your cpu is doing what it should when this occurs check fans and cooler to ensure all is working and clean also my not hurt to replace the thermal paste on the IHS Edited January 5, 2020 by mrbilky Quote Link to comment
knutarn Posted January 5, 2020 Author Share Posted January 5, 2020 5 hours ago, mrbilky said: Your system seems to be overheating your cpu is doing what it should when this occurs check fans and cooler to ensure all is working and clean also my not hurt to replace the thermal paste on the IHS I get the crash now. How can i read syslog? Quote Link to comment
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.