Crash tonight: end kernel panic - not syncing: Fatal exception in interrupt


I noticed my server was not responding. I went in via IPMI and saw this, so I rebooted. The GUI is up and I'm logged in. This is freaking me out. When I started the array after the reboot I got this message, but the NVMe cache drive seemed to mount OK.
 

1915476891_nvmemissing.JPG.1972cd3b2a2ba406b448784e8c6f8280.JPG

 

Any idea what caused this? I'd really appreciate some help.

Should my next step be a MemTest?


1338616555_KernelPanic2.png.1b3ae4ef7823fdcd1e00dde8dfaa1f94.png

 

tower-syslog-20210107-0522.zip

 

 

Edited by adminmat
6 minutes ago, JorgeB said:

Pool seems fine, at least for now, if it keeps crashing try this to see if it catches anything.

Thanks Jorge. When it crashes, does it put the data on the disks at risk? Or does it shut them down gracefully? Is there something I can do to make sure the disks are stopped properly when I reboot? I get the option to "Power off - Orderly Shutdown" with my motherboard BMC.

 

Power-On-Server-via-IPMI-2.0.png.3ddbd0620cbc656d7c4aafc6db4383fb.png

2 hours ago, JorgeB said:

If it crashes there's always some risk of filesystem corruption, but not much you can do about that.

Right. So I set up the rsyslog server on another machine (Ubuntu in ESXi) and it's currently mirroring the log. And a thank you to @Frank1940 for the help too. If anyone has additional thoughts as to why it would crash, please let me know.

12 hours ago, JorgeB said:

I ran the Scrub on the Cache drive. These instructions are a little outdated for 6.9.0-rc2. I assume now you just click "Scrub" and that's it. I don't see this option on the Cache 2 drive, so I assume when scrubbing Cache it does both? The "Check" button is greyed out and says "Check is only available when array is Started in Maintenance mode", which conflicts with the Wiki instructions for BTRFS.

Also, do I need to run the Balance? I've not done this before.

 

10 hours ago, adminmat said:

so I assume when scrubbing Cache it does both?

Correct.

 

10 hours ago, adminmat said:

The "Check" button is greyed out and says "Check is only available when array is Started in Maintenance mode"

This is correct: only a scrub can be run with the filesystem online. A btrfs check should only be run without the repair option.

 

11 hours ago, adminmat said:

Also, do I need to run the Balance?

Usually not needed as a regular operation.
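
For anyone who prefers the command line to the GUI buttons, the scrub/check distinction above can be sketched roughly as follows. This is only an illustration, not the official Unraid procedure: the mount point `/mnt/cache` and the device `/dev/nvme0n1p1` are assumptions, and the helper names `scrub_cmd`, `check_cmd` and `run` are made up for this example. The scrub targets the mounted pool, so it covers every device in it, while the check targets an unmounted device and is kept read-only.

```python
import subprocess
from typing import List


def scrub_cmd(mount_point: str) -> List[str]:
    """Build a scrub command for a mounted (online) btrfs filesystem.

    -B keeps the scrub in the foreground so the exit status is available.
    A scrub on the mount point covers all devices in the pool.
    """
    return ["btrfs", "scrub", "start", "-B", mount_point]


def check_cmd(device: str) -> List[str]:
    """Build a read-only check command for an unmounted btrfs device.

    --readonly is passed explicitly; the repair option is deliberately
    never added here.
    """
    return ["btrfs", "check", "--readonly", device]


def run(cmd: List[str]) -> subprocess.CompletedProcess:
    """Run a command and capture its output without raising on failure."""
    return subprocess.run(cmd, check=False, capture_output=True, text=True)


# Example (paths are assumptions, adjust to your own system):
#   print(run(scrub_cmd("/mnt/cache")).stdout)       # pool mounted, array started
#   print(run(check_cmd("/dev/nvme0n1p1")).stdout)   # only with the pool unmounted
```

The check command should only be run when the filesystem is not mounted, which on Unraid means starting the array in Maintenance mode first.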

  • 9 months later...

Generally you should start your own support thread (or use the support thread for help with a specific plugin or docker). But since this is an old thread and nobody else is using it we can try to help you here.

 

A syslog server is needed because otherwise the syslog will not survive a reboot. But you should also attach your diagnostics to your NEXT post in this thread. That will give us more information about your hardware and configuration than we can get from the syslog alone.
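
Before relying on a remote syslog server to catch the next crash, it can be worth confirming that messages actually arrive there. A minimal sketch using Python's standard `logging.handlers.SysLogHandler` is below. It assumes the receiving server listens on UDP port 514; the function name `make_remote_logger` and the address in the usage comment are hypothetical.

```python
import logging
import logging.handlers
import socket


def make_remote_logger(host: str, port: int = 514) -> logging.Logger:
    """Return a logger that sends each record to a remote syslog server over UDP."""
    logger = logging.getLogger(f"syslog-test-{host}-{port}")
    logger.setLevel(logging.INFO)
    logger.propagate = False  # do not also print through the root logger
    if not logger.handlers:   # avoid stacking handlers on repeated calls
        handler = logging.handlers.SysLogHandler(
            address=(host, port), socktype=socket.SOCK_DGRAM
        )
        handler.setFormatter(logging.Formatter("syslog-test: %(message)s"))
        logger.addHandler(handler)
    return logger


# Usage (the address is an assumption, use your own syslog server):
#   make_remote_logger("192.168.1.50").info("test message before the next crash")
```

If the test line shows up in the log file on the receiving machine, the path to the server works. UDP gives no delivery confirmation, so a missing line means checking the listener and any firewall rules on the receiving side.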
