Jump to content

FramedLight


Recommended Posts

New Unraid setup Frequent unclean shutdowns

 

Good evening everyone.

 

I have been using uraid for a few years quite happily on a old desktop which finally decided to giveup. So I decided to to give a i dedicated server machine a try.

 

I have been able to set up my array with no issue, But when moving data from my desktop to the new server or installing dockers I am encountering frequent unclean shutdowns.

Some information on my server

 

HP ProLiant DL165 G7 Server

BIOS: HP Version O37. Dated: 10/17/2012
CPU: AMD Opteron™ 6276 @ 2300 MHz
HVM: Enabled
IOMMU: Disabled
Cache: 768 KiB, 16384 KiB, 12288 KiB, 768 KiB, 16384 KiB, 12288 KiB
Memory: 32 GiB DDR3 Single-bit ECC (max. installable capacity 512 GiB)
Network: bond0: fault-tolerance (active-backup), mtu 1500
 eth0: interface down
 eth1: 100 Mbps, full duplex, mtu 1500
 eth2: interface down
 eth3: interface down
Kernel: Linux 4.19.107-Unraid x86_64
OpenSSL: 1.1.1d

 

From the SYSTEM LOG after the last unclean shutdowns, I can see the following notable entries

 

Jun 24 20:17:19 Tower kernel: pci 0000:04:00.0: BAR 10: failed to assign [mem size 0x00020000 64bit]
Jun 24 20:17:19 Tower kernel: pci 0000:04:00.1: BAR 7: no space for [mem size 0x00020000 64bit]
Jun 24 20:17:19 Tower kernel: pci 0000:04:00.1: BAR 7: failed to assign [mem size 0x00020000 64bit]
Jun 24 20:17:19 Tower kernel: pci 0000:04:00.1: BAR 10: no space for [mem size 0x00020000 64bit]
Jun 24 20:17:19 Tower kernel: pci 0000:04:00.1: BAR 10: failed to assign [mem size 0x00020000 64bit]
Jun 24 20:17:19 Tower kernel: pci 0000:00:04.0: PCI bridge to [bus 04]
Jun 24 20:17:19 Tower kernel: pci 0000:00:04.0:   bridge window [io  0xe000-0xefff]
Jun 24 20:17:19 Tower kernel: pci 0000:00:04.0:   bridge window [mem 0xfeb00000-0xfebfffff]
Jun 24 20:17:19 Tower kernel: pci 0000:03:00.0: BAR 7: assigned [mem 0xfea00000-0xfea1ffff 64bit]
Jun 24 20:17:19 Tower kernel: pci 0000:03:00.0: BAR 10: no space for [mem size 0x00020000 64bit]
Jun 24 20:17:19 Tower kernel: pci 0000:03:00.0: BAR 10: failed to assign [mem size 0x00020000 64bit]
Jun 24 20:17:19 Tower kernel: pci 0000:03:00.1: BAR 7: no space for [mem size 0x00020000 64bit]
Jun 24 20:17:19 Tower kernel: pci 0000:03:00.1: BAR 7: failed to assign [mem size 0x00020000 64bit]
Jun 24 20:17:19 Tower kernel: pci 0000:03:00.1: BAR 10: no space for [mem size 0x00020000 64bit]
Jun 24 20:17:19 Tower kernel: pci 0000:03:00.1: BAR 10: failed to assign [mem size 0x00020000 64bit]


Jun 24 20:17:19 Tower kernel: mce: [Hardware Error]: Machine check events logged
Jun 24 20:17:19 Tower kernel: mce: [Hardware Error]: CPU 8: Machine Check: 0 Bank 4: f200002000020c0f
Jun 24 20:17:19 Tower kernel: mce: [Hardware Error]: TSC 0 
Jun 24 20:17:19 Tower kernel: mce: [Hardware Error]: PROCESSOR 2:600f12 TIME 1593026205 SOCKET 0 APIC 8 microcode 600063e
Jun 24 20:17:19 Tower kernel:  #9 #10 #11 #12 #13 #14 #15
Jun 24 20:17:19 Tower kernel: .... node  #2, CPUs:   #16 #17 #18 #19 #20 #21 #22 #23
Jun 24 20:17:19 Tower kernel: .... node  #3, CPUs:   #24
Jun 24 20:17:19 Tower kernel: mce: [Hardware Error]: Machine check events logged
Jun 24 20:17:19 Tower kernel: mce: [Hardware Error]: CPU 24: Machine Check: 0 Bank 4: f200001000020c0f
Jun 24 20:17:19 Tower kernel: mce: [Hardware Error]: TSC 0 
Jun 24 20:17:19 Tower kernel: mce: [Hardware Error]: PROCESSOR 2:600f12 TIME 1593026205 SOCKET 1 APIC 28 microcode 600063e
Jun 24 20:17:19 Tower kernel: #25 #26 #27 #28 #29 #30 #31

 

I've collected the last system logs and diagnostics files

 

Any help with this is greatly appreciated

 

David

tower-diagnostics-20200624-2037.zip tower-syslog-20200624-1856.zip

Edited by Framedlight
log uploads
Link to comment
1 hour ago, Framedlight said:

Hi trurl.

 

Thank you for the quick reply.

 

Not as yet.

 

Would this be the memtest from the boot menu, or is there a memtest plugin that I can use?

 

Regards

 

David  

Memtest from the boot menu if you boot in legacy mode.

if you boot in UEFI mode then you need to download the memtest version from the internet that is UEFI compatible and put that on a USB stick to run it.

 

you cannot use a plugin as memtest takes over the whole machine while it is running.

Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

×
×
  • Create New...