HARD power downs


Recommended Posts

Hello all,

 

I am very new to unRAID but not necessarily new to computers and servers. I am in the process of converting my buddy's Ubuntu Server over to unRAID. The server boots, created the array, tried installing/using dockers and plugins, everything runs great. Love unRAID. I have yet to purchase a licence (19 days to go) because I am having a hell of a time with this thing and hard power downs. Literally powers off like it was unplugged, no errors, nothing on the terminal (I recorded a video), Fix Common Problems Troubleshooting mode doesn't record errors, even tried a syslog tail. Nothing. Thus far, I've found this ONLY happens when copying large files to the server, in this case, large media files. The server can run full parity checks/corrections with no problem and will run for days if left alone. All things point to hardware but I wanted some opinions or other ideas before digging in to what's left (ie: MoBo and CPU). Here is the server and my process thus far:

 

Components:
 - Motherboard: Supermicro C7SIM-Q
 - CPU: Intel Core i5 CPU 750 @ 2.67GHz
 - RAM: 4x G.Skill RipjawsX F3-12800CL9-4GBXL (16GB total)
 - SATA HBA: * Supermicro AOC-SAS2LP-MV8 * (I understand the implications of this but hear me out)
 - Parity Drive: 1x WD Green 3TB
 - Array HDDs: 1x WD Red 3TB, 3x WD Green 2TB
 - SSDs: 1x ADATA SP550 120GB (cache)
 - Flash: Sandisk Cruzer Fit 16GB (boot)
 - Video card: NVIDIA GeForce 7900 GT/GTO
 - PSU: Seasonic X Series fanless

 

Secondary Components tried:
 - RAM 2: 2x Crucial 8GB DDR3L-1600 UDIMM (CT102464BD160B)
 - PSU 2: Antec EA-750 750w

 

Combinations of hardware tested (no specific order and multiple combinations):
 - No video card
 - 8GB RAM
 - No SATA HBA card (SDD/HDDs direct to MoBo)
 - No cache drive
 - Updated BIOS to newest
 - Updated SATA HBA firmware to newest
 - Different PSU (see PSU 2)
 - Disconnect cache drive entirely
 - Flash drive in new port
 - Different RAM (see RAM 2)

 - New array with RED drive as parity

 

Tests and different settings:
 - Memtest86 for 6+ passes (24hrs), no errors on RAM 1

 - Virtualization enabled/disabled

 - C states enabled/disabled

 - BIOS to defaults

 

This has been an ongoing process for over a week. Diags attached. Appreciate ANY ideas/thoughts/comments/snide remarks.

lyserver-diagnostics-20190109-1716.zip

Link to comment
9 hours ago, UncleBacon said:

So I cleaned the CPU and heatsink, new thermal compound. Runs fine but as soon as I start a copy to the server it power's down. It's also back at my buddy's place so different power entirely. I'm really at a loss... 

Difficult to see how this can be anything other than hardware related?  The only things that spring to mind would be PSU problems or heating problems (causing thermal shutdown).  However you seem to have checked for those as far as I can see.  Have you checked all fans are spinning - some systems shutdown if a fan appears jammed.

Edited by itimpi
Link to comment

I have checked and cleaned all the fans. There was one somewhat noisy and slow fan so I removed it completely. I am leaning toward hardware too, just seems strange the previous OS (Ubuntu Server) ran with no issues, exact same hardware. The only difference would be the flash drive but I've tried multiple drives with unRAID.

Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.