molesza Posted December 22, 2011 Posted December 22, 2011 Hi I have just installed an APC UPS on my server. I have set unraid to shutdown after the battery reaches 95% as we have frequent power cuts and want to save battery. Problem is unraid wants to shutdown but never does (powerdown script installed). So basically the UPS will run out of battery and unraid machine will lose power. I have attached my syslog which seems to have quite a few errors. Right at the end of the syslog is where I tried to get unraid to shutdown by unplugging power from the UPS. I then plugged the power back in before the UPS ran out of battery. I would like to add that the only thing I could access after unraid tried to powerdown was unmenu. standard gui was not accessible and sabnzbd,sickbeard,couchpotato were all down. So might it be unmenu that is causing the array to stay online? syslog-2011-12-22.txt
molesza Posted December 22, 2011 Author Posted December 22, 2011 just want to add that I established a telnet session and issued the command "powerdown -r" to reboot the server and nothing is happening. Looks like I am going to have to force shutdown and run a parity sync. Or is there another way I cleanly shutdown?
Johnm Posted December 22, 2011 Posted December 22, 2011 I would have the unraid shut down rather quickly, sometimes it takes a while to spin up drives and sync then power down. I would look to see what open files you might have during your test. I know if I have been using my mac or my HTPC on my unraid, they tend to keep files opened and locked on me preventing a clean shutdown.. in a real power outage, my mac/htpc would go poof since i don't have them on the ups to cause this issue. If i did have have the MAC or HTPC on a UPS, I would leave that switch/hub off the UPS or shut them down first. Edit: I didnt really look at your syslog, but /dev/sdg SN# 5LSEQC2R has a massive amount of errors. Dec 22 16:09:15 Tower smartctl[5268]: 1 Raw_Read_Error_Rate 0x000f 088 077 006 Pre-fail Always - 209476411 Dec 22 16:09:15 Tower smartctl[5268]: 3 Spin_Up_Time 0x0003 093 092 000 Pre-fail Always - 0 Dec 22 16:09:15 Tower smartctl[5268]: 4 Start_Stop_Count 0x0032 096 096 020 Old_age Always - 5118 Dec 22 16:09:15 Tower smartctl[5268]: 5 Reallocated_Sector_Ct 0x0033 100 100 036 Pre-fail Always - 0 Dec 22 16:09:15 Tower smartctl[5268]: 7 Seek_Error_Rate 0x000f 084 060 030 Pre-fail Always - 274875534 Dec 22 16:09:15 Tower smartctl[5268]: 9 Power_On_Hours 0x0032 077 077 000 Old_age Always - 20242 Dec 22 16:09:15 Tower smartctl[5268]: 10 Spin_Retry_Count 0x0013 100 099 097 Pre-fail Always - 0 Dec 22 16:09:15 Tower smartctl[5268]: 12 Power_Cycle_Count 0x0032 100 100 020 Old_age Always - 152 Dec 22 16:09:15 Tower smartctl[5268]: 187 Reported_Uncorrect 0x0032 100 100 000 Old_age Always - 0 Dec 22 16:09:15 Tower smartctl[5268]: 189 High_Fly_Writes 0x003a 100 100 000 Old_age Always - 0 Dec 22 16:09:15 Tower smartctl[5268]: 190 Airflow_Temperature_Cel 0x0022 058 045 045 Old_age Always In_the_past 42 (Min/Max 35/42) Dec 22 16:09:15 Tower smartctl[5268]: 194 Temperature_Celsius 0x0022 042 055 000 Old_age Always - 42 (0 15 0 0) Dec 22 16:09:15 Tower smartctl[5268]: 195 Hardware_ECC_Recovered 0x001a 062 043 000 Old_age Always - 209476411 Dec 22 16:09:15 Tower smartctl[5268]: 197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 0
Joe L. Posted December 22, 2011 Posted December 22, 2011 The issue had nothing to do with unMENU, but with this kernel OOps: Dec 22 16:09:54 Tower kernel: BUG: unable to handle kernel NULL pointer dereference at (null) Dec 22 16:09:54 Tower kernel: IP: [<c103892c>] queue_delayed_work_on+0x33/0xbf Dec 22 16:09:54 Tower kernel: *pdpt = 0000000034ee6001 *pde = 0000000000000000 Dec 22 16:09:54 Tower kernel: Oops: 0000 [#1] SMP Dec 22 16:09:54 Tower kernel: Modules linked in: md_mod xor ide_gd_mod pata_jmicron r8169 jmicron ahci libahci i2c_i801 i2c_core ata_piix Dec 22 16:09:54 Tower kernel: Dec 22 16:09:54 Tower kernel: Pid: 154, comm: sync_supers Not tainted 3.1.0-unRAID #2 OEM OEM/IP35 PRO(P35+ICH9R) Dec 22 16:09:54 Tower kernel: EIP: 0060:[<c103892c>] EFLAGS: 00010246 CPU: 0 Dec 22 16:09:54 Tower kernel: EIP is at queue_delayed_work_on+0x33/0xbf Dec 22 16:09:54 Tower kernel: EAX: f87c8138 EBX: ffffffff ECX: f87c8134 EDX: 00000000 Dec 22 16:09:54 Tower kernel: ESI: 00000000 EDI: f87c8134 EBP: f0a5fed0 ESP: f0a5fec4 Dec 22 16:09:54 Tower kernel: DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068 Dec 22 16:09:54 Tower kernel: Process sync_supers (pid: 154, ti=f0a5e000 task=f0a461c0 task.ti=f0a5e000) Dec 22 16:09:54 Tower kernel: Stack: Dec 22 16:09:54 Tower kernel: f87b8000 ded8bcd4 ded8bc80 f0a5fedc c1038da9 0000000a f0a5ff40 c10d8d23 Dec 22 16:09:54 Tower kernel: f87b8000 00000012 00000000 e2ea13f0 0014b794 03dc3ab0 00000000 003c021f Dec 22 16:09:54 Tower kernel: ded8bc98 00000012 00000004 f87b8000 000017a5 00000000 de9e6000 f16a17c0 Dec 22 16:09:54 Tower kernel: Call Trace: Dec 22 16:09:54 Tower kernel: [<c1038da9>] queue_delayed_work+0x1b/0x1e Dec 22 16:09:54 Tower kernel: [<c10d8d23>] do_journal_end+0x740/0x923 Dec 22 16:09:54 Tower kernel: [<c10d8f61>] journal_end_sync+0x5b/0x63 Dec 22 16:09:54 Tower kernel: [<c10ccf9f>] reiserfs_sync_fs+0x32/0x51 Dec 22 16:09:54 Tower kernel: [<c10ccfcb>] reiserfs_write_super+0xd/0xf Dec 22 16:09:54 Tower kernel: [<c108196b>] sync_supers+0x5e/0x9c Dec 22 16:09:54 Tower kernel: [<c1068b46>] ? bdi_sched_wait+0xc/0xc Dec 22 16:09:54 Tower kernel: [<c1068b7b>] bdi_sync_supers+0x35/0x44 Dec 22 16:09:54 Tower kernel: [<c103bc31>] kthread+0x62/0x67 Dec 22 16:09:54 Tower kernel: [<c103bbcf>] ? kthread_worker_fn+0x10a/0x10a Dec 22 16:09:54 Tower kernel: [<c130beb6>] kernel_thread_helper+0x6/0xd Dec 22 16:09:54 Tower kernel: Code: d6 53 89 c3 f0 0f ba 29 00 19 d2 31 c0 85 d2 0f 85 9d 00 00 00 83 79 10 00 74 04 0f 0b eb fe 8d 41 04 39 41 04 74 04 0f 0b eb fe <f6> 06 02 b8 08 00 00 00 75 19 89 c8 e8 7b e4 ff ff 85 c0 74 08 Dec 22 16:09:54 Tower kernel: EIP: [<c103892c>] queue_delayed_work_on+0x33/0xbf SS:ESP 0068:f0a5fec4 Dec 22 16:09:54 Tower kernel: CR2: 0000000000000000 Dec 22 16:09:54 Tower kernel: ---[ end trace 1672edd1c18249b7 ]--
Recommended Posts
Archived
This topic is now archived and is closed to further replies.