6.6.6 system crash


Gog


Good morning

 

My Unraid server is unstable: it crashes every 7-14 days. Last time I gathered logs with the troubleshooting tools and got this:

 

Feb  3 02:46:51 Tower kernel: mdcmd (1097): spindown 9
Feb  3 02:46:55 Tower kernel: BUG: unable to handle kernel paging request at fffff8ffdfd3f008
Feb  3 02:46:55 Tower kernel: PGD 0 P4D 0 
Feb  3 02:46:55 Tower kernel: Oops: 0000 [#2] SMP PTI
Feb  3 02:46:55 Tower kernel: CPU: 3 PID: 9875 Comm: php-fpm Tainted: G    B D W         4.18.20-unRAID #1
Feb  3 02:46:55 Tower kernel: Hardware name: Supermicro X10SL7-F/X10SL7-F, BIOS 3.0 04/24/2015
Feb  3 02:46:55 Tower kernel: RIP: 0010:unmap_page_range+0x69b/0x88a
Feb  3 02:46:55 Tower kernel: Code: 0c ff 8c 24 88 00 00 00 e9 89 00 00 00 48 b9 ff ff ff ff ff ff ff 01 b8 f5 ff 7f 00 48 21 f9 48 c1 e0 29 48 c1 e1 06 48 01 c1 <4c> 8b 41 08 48 89 c8 41 f6 c0 01 74 04 49 8d 40 ff 4c 8b 40 08 41 
Feb  3 02:46:55 Tower kernel: RSP: 0018:ffffc90003cfbce8 EFLAGS: 00010286
Feb  3 02:46:55 Tower kernel: RAX: ffffea0000000000 RBX: 000015271688c000 RCX: fffff8ffdfd3f000
Feb  3 02:46:55 Tower kernel: RDX: ffff880101607e98 RSI: 0000000000000000 RDI: 3e00003bff7f4fc0
Feb  3 02:46:55 Tower kernel: RBP: 00001527169cf000 R08: 0000000000000000 R09: ffffea00103316c0
Feb  3 02:46:55 Tower kernel: R10: ffffea00103316c0 R11: 0000000000000001 R12: ffff8802b8d414e0
Feb  3 02:46:55 Tower kernel: R13: 0000000000000000 R14: ffffc90003cfbde8 R15: ffff880283d2f268
Feb  3 02:46:55 Tower kernel: FS:  0000000000000000(0000) GS:ffff88041fcc0000(0000) knlGS:0000000000000000
Feb  3 02:46:55 Tower kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Feb  3 02:46:55 Tower kernel: CR2: fffff8ffdfd3f008 CR3: 0000000001e0a005 CR4: 00000000001606e0
Feb  3 02:46:55 Tower kernel: Call Trace:
Feb  3 02:46:55 Tower kernel: unmap_vmas+0x4b/0x7f
Feb  3 02:46:55 Tower kernel: exit_mmap+0xc8/0x16a
Feb  3 02:46:55 Tower kernel: ? __switch_to_asm+0x35/0x70
Feb  3 02:46:55 Tower kernel: mmput+0x4d/0xe5
Feb  3 02:46:55 Tower kernel: do_exit+0x3b0/0x8b0
Feb  3 02:46:55 Tower kernel: ? handle_mm_fault+0x159/0x1a8
Feb  3 02:46:55 Tower kernel: do_group_exit+0x9a/0x9a
Feb  3 02:46:55 Tower kernel: __x64_sys_exit_group+0xf/0xf
Feb  3 02:46:55 Tower kernel: do_syscall_64+0x57/0xe6
Feb  3 02:46:55 Tower kernel: entry_SYSCALL_64_after_hwframe+0x44/0xa9
Feb  3 02:46:55 Tower kernel: RIP: 0033:0x152714ce7a66
Feb  3 02:46:55 Tower kernel: Code: Bad RIP value.
Feb  3 02:46:55 Tower kernel: RSP: 002b:00007ffe8d00e668 EFLAGS: 00000246 ORIG_RAX: 00000000000000e7
Feb  3 02:46:55 Tower kernel: RAX: ffffffffffffffda RBX: 0000152714de5760 RCX: 0000152714ce7a66
Feb  3 02:46:55 Tower kernel: RDX: 0000000000000000 RSI: 000000000000003c RDI: 0000000000000000
Feb  3 02:46:55 Tower kernel: RBP: 0000000000000000 R08: 00000000000000e7 R09: ffffffffffffff78
Feb  3 02:46:55 Tower kernel: R10: 0000000000000006 R11: 0000000000000246 R12: 0000152714de5760
Feb  3 02:46:55 Tower kernel: R13: 0000000000000007 R14: 0000152714dee428 R15: 0000000000000000
Feb  3 02:46:55 Tower kernel: Modules linked in: ipt_REJECT iptable_mangle tun macvlan veth xt_nat ipt_MASQUERADE iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 iptable_filter ip_tables nf_nat reiserfs xfs md_mod ipmi_devintf igb i2c_algo_bit x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel pcbc ipmi_ssif aesni_intel aes_x86_64 crypto_simd cryptd glue_helper intel_cstate intel_uncore intel_rapl_perf mpt3sas ahci sata_mv libahci i2c_i801 button i2c_core video thermal fan backlight pcc_cpufreq raid_class scsi_transport_sas ipmi_si ie31200_edac [last unloaded: i2c_algo_bit]
Feb  3 02:46:55 Tower kernel: CR2: fffff8ffdfd3f008
Feb  3 02:46:55 Tower kernel: ---[ end trace 70b9f100a2be40b9 ]---
Feb  3 02:46:55 Tower kernel: RIP: 0033:0x150b13e38207
Feb  3 02:46:55 Tower kernel: Code: Bad RIP value.
Feb  3 02:46:55 Tower kernel: RSP: 002b:00007ffe46f9e000 EFLAGS: 00010206
Feb  3 02:46:55 Tower kernel: RAX: 0000150b1547a097 RBX: 0000150b1547a2c4 RCX: 0000150b1547a097
Feb  3 02:46:55 Tower kernel: RDX: 0000000000000008 RSI: 0000150b1547a129 RDI: 0000000000000000
Feb  3 02:46:55 Tower kernel: RBP: 00007ffe46f9e0d0 R08: 0000000040000000 R09: 000055ac2a6af2e0
Feb  3 02:46:55 Tower kernel: R10: 0000000000000000 R11: 0000150b1540fe30 R12: 000055ac2a77b420
Feb  3 02:46:55 Tower kernel: R13: 00000000000f4241 R14: 0000150b13e85000 R15: 0000150b13e8cff0
Feb  3 02:46:55 Tower kernel: FS:  0000000000000000(0000) GS:ffff88041fcc0000(0000) knlGS:0000000000000000
Feb  3 02:46:55 Tower kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Feb  3 02:46:55 Tower kernel: CR2: 0000150b13e381dd CR3: 0000000001e0a005 CR4: 00000000001606e0
Feb  3 02:46:55 Tower kernel: Fixing recursive fault but reboot is needed!
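For anyone digging through a full syslog like this, here is a minimal sketch of how the fault headline lines could be pulled out so the first one is easy to spot. The function name `find_kernel_faults` and the choice of markers are my own assumptions for illustration, not part of any Unraid tool.

```python
import re

# Markers that usually open or close a kernel fault report in syslog.
# "RIP: 0010:" is the kernel-mode instruction pointer line; the later
# "RIP: 0033:" lines are the user-space register dump and are skipped.
FAULT_PATTERN = re.compile(r"kernel: (BUG:|Oops:|RIP: 0010:|---\[ end trace)")


def find_kernel_faults(lines):
    """Return (line_number, text) pairs for lines that match a fault marker."""
    hits = []
    for number, line in enumerate(lines, start=1):
        if FAULT_PATTERN.search(line):
            hits.append((number, line.rstrip("\n")))
    return hits


# A few lines copied from the log above, used as sample input.
sample = [
    "Feb  3 02:46:51 Tower kernel: mdcmd (1097): spindown 9",
    "Feb  3 02:46:55 Tower kernel: BUG: unable to handle kernel paging request at fffff8ffdfd3f008",
    "Feb  3 02:46:55 Tower kernel: Oops: 0000 [#2] SMP PTI",
    "Feb  3 02:46:55 Tower kernel: RIP: 0010:unmap_page_range+0x69b/0x88a",
    "Feb  3 02:46:55 Tower kernel: RIP: 0033:0x152714ce7a66",
]

for number, text in find_kernel_faults(sample):
    print(number, text)
```

In the log above, the `[#2]` counter and the `D` taint flag suggest this was not the first oops since boot, so the earliest hit in the complete syslog is probably the more useful one to look at.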

 

Google searches did not help much.

 

After this, the server won't boot; it drops into the EFI shell.

 

I had this happen before, but with the original USB stick (512MB, it's been a while) the troubleshooting tools didn't have enough room to save logs. Now the Kingston DTSE9 16GB won't boot.

 

Troubleshooting logs are attached.  Any help is appreciated.

tower-diagnostics-20190203-0229.zip

FCPsyslog_tail.zip


I had something similar yesterday: suddenly my CPU options were disabled, even though the server had run for 16 days straight.

Then I tried to reboot through the interface and it would not boot any more.

 

It turned out that my flash drive had somehow been taken hostage by ransomware, GandCrab version 5.0.4, so my whole server was toast. The ransomware went so far that the whole USB drive is now gone: even formatting and reinitialising it, and even a DoD wipe, did not get it to work with the USB creator.

 

Try checking your old USB on a Windows PC and look for file extensions of 5 or more random letters, in most cases capitals.
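If there are a lot of files on the stick, the check could be scripted. This is only a rough sketch of the heuristic described here (a final extension of 5 or more capital letters); the names `suspicious_files` and `is_suspicious_name` are made up for the example, and a match is a hint to look closer, not proof of ransomware.

```python
import re
import sys
from pathlib import Path

# Heuristic from the post: a final extension of 5 or more capital letters.
# This is an assumption about how encrypted files look, not a signature.
SUSPICIOUS_EXT = re.compile(r"^\.[A-Z]{5,}$")


def is_suspicious_name(name):
    """Return True if the file name ends in an extension of 5+ capitals."""
    return bool(SUSPICIOUS_EXT.match(Path(name).suffix))


def suspicious_files(root):
    """Yield files under root (searched recursively) that match the heuristic."""
    for path in Path(root).rglob("*"):
        if path.is_file() and is_suspicious_name(path.name):
            yield path


if __name__ == "__main__" and len(sys.argv) > 1:
    # Pass the mount point or drive letter of the USB stick as the argument.
    for found in suspicious_files(sys.argv[1]):
        print(found)
```

Run it with the path of the USB stick as the only argument; files with normal extensions such as `.cfg` or `.txt` are not reported.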

 

The problem with this and newer versions is that it is not decryptable, because you only get the private key when you pay 1200 dollars' worth of bitcoin or dashcoin.

 

Why are there such idiotic people on this planet, just irritating hard-working people and trying to take money we need to pay bills with?

 

On 2/4/2019 at 8:41 AM, sojab0on said:

I had something similar yesterday: suddenly my CPU options were disabled, even though the server had run for 16 days straight.

Then I tried to reboot through the interface and it would not boot any more.

It turned out that my flash drive had somehow been taken hostage by ransomware, GandCrab version 5.0.4, so my whole server was toast. The ransomware went so far that the whole USB drive is now gone: even formatting and reinitialising it, and even a DoD wipe, did not get it to work with the USB creator.

Try checking your old USB on a Windows PC and look for file extensions of 5 or more random letters, in most cases capitals.

The problem with this and newer versions is that it is not decryptable, because you only get the private key when you pay 1200 dollars' worth of bitcoin or dashcoin.

Why are there such idiotic people on this planet, just irritating hard-working people and trying to take money we need to pay bills with?

 

I made a backup of the flash before rebuilding it and the files are clean, so I don't think it was ransomware.
