Random Shutdown


KDP

As of last night my server has been shutting down at random, and I would like to check that I am troubleshooting it efficiently. I originally built the server over a decade ago. Two years ago I replaced the case, power supply, and fans. I am currently running UNRAID 6.11.5.

 

Last night I was downloading a large series of files and adding them to my array when the server shut down for the first time. When I turned it back on and logged in, I received a notice that the unassigned device I use for all of my extracting had returned to normal temperature. I assumed overheating was the reason for the shutdown and continued on with my evening. The downloading and extracting completed without any further issues, and I stopped worrying.

 

About an hour later the server shut down again. I started it back up and did not see any notices indicating what had happened, so I turned on the syslog server to capture any further issues. A couple of hours later the server was still running and I went to bed. When I woke up this morning, the server was powered down again. I looked in the syslog server log and there is nothing but time entries after my last action before bed (I changed a share from public to private).

 

I then looked at the IPMI event log while the server was down. The only entries are months old (probably from my last reboot before the current problem), and they mention FAN 1:

 

30009	2023/05/01 20:23:03	FAN 1	Fan	Lower Non-Critical - Going Low - Asserted
30010	2023/05/01 20:23:03	FAN 1	Fan	Lower Critical - Going Low - Asserted
30011	2023/05/01 20:23:03	FAN 1	Fan	Lower Non-Recoverable - Going Low - Asserted
30012	2023/05/01 20:24:36	FAN 1	Fan	Lower Non-Recoverable - Going Low - Deasserted
30013	2023/05/01 20:24:36	FAN 1	Fan	Lower Critical - Going Low - Deasserted
30014	2023/05/01 20:24:36	FAN 1	Fan	Lower Non-Critical - Going Low - Deasserted

 

This error does not appear for the two power-ups after the shutdowns. All other readings for temperatures and voltages are green (good). I decided to open up the server, blow out the dust, and check that everything was seated properly, which it seemed to be. I started the server again and watched the fans. One CPU fan looks abnormal at the low RPM used during boot: it spins smoothly, hitches for a fraction of a second, then spins smoothly again. Once the server has booted and the RPM increases, it runs fine. For peace of mind I will be ordering a replacement.

 

I checked the IPMI again and see new entries, which I assume refer to the fan in question:

 

30015	1970/01/14 19:47:42	FAN 1	Fan	Lower Non-Critical - Going Low - Asserted
30016	1970/01/14 19:47:42	FAN 1	Fan	Lower Critical - Going Low - Asserted
30017	1970/01/14 19:47:43	FAN 1	Fan	Lower Non-Recoverable - Going Low - Asserted
30018	1970/01/14 19:50:04	FAN 1	Fan	Lower Non-Recoverable - Going Low - Deasserted
30019	1970/01/14 19:50:55	FAN 1	Fan	Lower Critical - Going Low - Deasserted
30020	1970/01/14 19:50:55	FAN 1	Fan	Lower Non-Critical - Going Low - Deasserted
30021	1970/01/14 19:52:36	FAN 1	Fan	Lower Non-Critical - Going Low - Asserted
30022	1970/01/14 19:53:30	FAN 1	Fan	Lower Non-Critical - Going Low - Deasserted
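
For anyone who wants to see how long the fan sat below each threshold, here is a minimal Python sketch that pairs each "Asserted" event with its matching "Deasserted" event. It assumes the log lines are tab-separated exactly as pasted above; the function name `low_speed_intervals` is made up for illustration and is not part of any IPMI tool.

```python
from datetime import datetime

# Event lines copied from the IPMI log above (assumed tab-separated).
SEL_LINES = """\
30015\t1970/01/14 19:47:42\tFAN 1\tFan\tLower Non-Critical - Going Low - Asserted
30016\t1970/01/14 19:47:42\tFAN 1\tFan\tLower Critical - Going Low - Asserted
30017\t1970/01/14 19:47:43\tFAN 1\tFan\tLower Non-Recoverable - Going Low - Asserted
30018\t1970/01/14 19:50:04\tFAN 1\tFan\tLower Non-Recoverable - Going Low - Deasserted
30019\t1970/01/14 19:50:55\tFAN 1\tFan\tLower Critical - Going Low - Deasserted
30020\t1970/01/14 19:50:55\tFAN 1\tFan\tLower Non-Critical - Going Low - Deasserted
30021\t1970/01/14 19:52:36\tFAN 1\tFan\tLower Non-Critical - Going Low - Asserted
30022\t1970/01/14 19:53:30\tFAN 1\tFan\tLower Non-Critical - Going Low - Deasserted
""".splitlines()


def low_speed_intervals(lines):
    """Pair each 'Asserted' event with the next matching 'Deasserted' event.

    Returns a list of (sensor, threshold, seconds_below) tuples, in the
    order the deassert events appear in the log.
    """
    open_events = {}  # (sensor, threshold) -> time the threshold was crossed
    intervals = []
    for line in lines:
        if not line.strip():
            continue
        _event_id, stamp, sensor, _kind, event = line.split("\t")
        when = datetime.strptime(stamp, "%Y/%m/%d %H:%M:%S")
        threshold, _direction, state = [p.strip() for p in event.split(" - ")]
        key = (sensor, threshold)
        if state == "Asserted":
            open_events[key] = when
        elif state == "Deasserted" and key in open_events:
            seconds = (when - open_events.pop(key)).total_seconds()
            intervals.append((sensor, threshold, int(seconds)))
    return intervals


if __name__ == "__main__":
    for sensor, threshold, seconds in low_speed_intervals(SEL_LINES):
        print(f"{sensor}: below '{threshold}' for {seconds} s")
```

The timestamps only need to be consistent with each other for this to work, so the 1970 dates do not affect the durations.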

 

I am not sure whether the 1970 dates on these entries are simply because the clock had not yet synced with an NTP server. For peace of mind I will replace the CMOS battery as well, although the time page in the IPMI now shows the correct time.

 

I have included my diagnostics file, although it was taken after the reboot and will likely contain no useful information. The server is again running a parity check, as it does after an unclean shutdown, and everything is in the green in both the UNRAID dashboard and the IPMI.

 

Should I be looking at anything else or performing any other diagnostics?

elvis-diagnostics-20230805-1120.zip
