unRaid 6.4.1 Freezes after 2 hours



Hello, I'm new to unRAID.

I installed a fresh copy of unRAID on my SanDisk Cruzer 16 GB (newly ordered via Amazon), added all disks to unRAID and created the array.

When I now try to build the parity, it freezes. The parity disk is a 4 TB WD Red.

Some forum posts say the RAM is sometimes the problem, so I ran the memory test (5.5) overnight: 3 passes, 0 errors.

This morning I copied my data to the server. After 100 GB it froze again.

The SMART results are not the best, I know. I had some problems with SATA cables a few months ago and replaced them all.

The PSU is a picoPSU with 150 watts, but the 12 V goes directly to the drives, so 150 watts for the system is enough. (I also tested with a 500 watt BeQuiet: same error.)

I have now booted the server with one stick of memory (8 GB); normally it has 16 GB. It has worked with Proxmox for a year.

When it freezes I can't log in via SSH, the GUI is down, and the monitor shows something (next time I'll take a picture), but when I connect a keyboard there is no input.

Maybe someone has an idea.

 

Plugins installed:

Community Applications (Andrew Zawadzki, 2018.01.28a, up-to-date)
A plugin to keep your Docker application lists up to date and easily sort them by category and add them to your running containers. unRaid v6.1+ only.

Dynamix SSD TRIM (Bergware, 2017.04.23a, up-to-date)
Dynamix SSD TRIM creates a cron job to do regular SSD TRIM operations on all mount points which support the operation. The command 'fstrim -a -v' is executed at the given interval.

Dynamix System Information (Bergware, 2017.11.18b, up-to-date)
Dynamix System Information shows various details of your system hardware and BIOS. This includes processor, memory and sub-system components.

Dynamix System Statistics (Bergware, 2018.02.04, up-to-date)
Dynamix System Stats shows in real time the disk utilization and critical system resources, such as CPU usage, memory usage, interface bandwidth and disk I/O bandwidth.

Dynamix System Temperature (Bergware, 2017.12.06, up-to-date)
Dynamix System Temperature shows in real time the temperature of the system CPU and motherboard. Temperatures can be displayed in Celsius or Fahrenheit. Your hardware must support the necessary probes, and additional software drivers may be required too.

Fix Common Problems (Andrew Zawadzki, 2018.01.21, up-to-date)
A plugin to diagnose and suggest fixes for common problems, configuration mistakes, etc.

Nerd Tools (dmacias72, 2017.10.03a, up-to-date)
Go to NerdPack in Settings to install extra CLI packages for advanced users. Use at your own risk. Not officially supported by LimeTech. Contains: iftop, iotop, screen, lshw, kbd, unrar, bwm-ng, strace, git, lftp, subversion, python, sshfs, iperf, p7zip... 60+ packages.

Unassigned Devices (dlandon, 2018.01.09b, up-to-date)
This plugin uses UDEV to automount and share disks that are not part of your unRAID array. Available devices are listed under the "Main/Unassigned Devices" tab.

cube-diagnostics-20180212-1649.zip

 

Plex is installed but not started. With Plex started it's the same error.

Edited by jacko1337
Link to comment

https://www.amazon.de/PicoPSU-150-XT-DC-DC-Netzteil-power-supply/dp/B0045IXKTQ/ref=sr_1_sc_3?ie=UTF8&qid=1518453947&sr=8-3-spell&keywords=picpsu

This is the picoPSU with a 12 V input.

It generates 5 V and 3.3 V from the 12 V input, which comes from, for example, something like this: https://www.amazon.de/Netzteil-Laufwerke-Lichtschläuche-LED-Strips-geeignet/dp/B006Z9TQE6/ref=pd_bxgy_147_img_2?_encoding=UTF8&pd_rd_i=B006Z9TQE6&pd_rd_r=H0TS8BG862X8WYWTAZ21&pd_rd_w=utNNi&pd_rd_wg=oxaKh&psc=1&refRID=H0TS8BG862X8WYWTAZ21

So I've got the 150 watt picoPSU and a 400 watt power supply feeding it. The drives mostly use 12 V, and the picoPSU is strong enough for the board plus some 5 V for the drives. What I want to say is: the server draws about 60-70 watts from the wall, so the PSU is strong enough. Do you understand me? I work as an electrician. It's hard to tell, but with a 500 watt BeQuiet it freezes the same way.

Link to comment

Yeah, sure. When the server is idle with 8 drives it consumes about 40 watts. A normal PSU is very inefficient at this power level.

The drives are powered by 400 watts; I think this should be enough. Sure, I could plug in a 600 watt Enermax, but I'm very sure it won't help.

The server shows 12.02 volts on the 12 V rail.

When the drives spin up it's even 12.2 volts, so there is enough power.

Otherwise, why would Proxmox run for a year, and now that I install unRAID I need a stronger power supply? The server freezes at idle; is it supposed to consume more power than under 100% load with Proxmox? I don't understand that.

And there are only six 3.5" drives, two 2.5" drives and one SSD. That's a lot of power, but the PSU delivers 33 amps.
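
As a rough sanity check of the 12 V budget described in this post, here is a minimal sketch. The 33 A rating and the count of six 3.5" drives come from the post; the 2 A per-drive spin-up current is an assumed, illustrative figure (not from the thread or a datasheet), and the 2.5" drives and the SSD are left out on the assumption that they draw mainly from 5 V.

```python
# Rough 12 V budget for the drive supply described in the post.
RAIL_VOLTS = 12.0
RAIL_AMPS = 33.0               # from the post: the PSU delivers 33 amps
N_35_DRIVES = 6                # from the post: six 3.5" drives
SPINUP_AMPS_PER_DRIVE = 2.0    # ASSUMPTION: illustrative value, check the datasheet


def rail_watts(volts, amps):
    """Power available on a rail, in watts."""
    return volts * amps


def spinup_amps(n_drives, amps_per_drive):
    """Worst case: every drive spins up at the same moment."""
    return n_drives * amps_per_drive


capacity_w = rail_watts(RAIL_VOLTS, RAIL_AMPS)
peak_a = spinup_amps(N_35_DRIVES, SPINUP_AMPS_PER_DRIVE)
headroom_a = RAIL_AMPS - peak_a

print(f"12 V capacity: {capacity_w:.0f} W")
print(f"assumed spin-up peak: {peak_a:.1f} A, headroom: {headroom_a:.1f} A")
```

A budget like this only covers average and peak current. It cannot show noise or ground problems on the supply lines.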

Edited by jacko1337
Link to comment
While I was typing this, the server froze again and I was able to take a picture of the screen.

 

IMG_20180212_181120.jpg

Link to comment

OK, I tested with Broadcom; it's not the cause. But the server is much more stable now; I reflashed the USB device.

Here is a log from when I copy files from my Win10 machine to unRAID. After 3 minutes it stops copying.

 

Feb 14 19:38:07 Cube kernel: general protection fault: 0000 [#1] PREEMPT SMP PTI
Feb 14 19:38:07 Cube kernel: Modules linked in: ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 iptable_filter ip_tables nf_nat xfs md_mod nct6775 hwmon_vid bonding r8169 mii bnx2 x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel pcbc aesni_intel aes_x86_64 crypto_simd glue_helper cryptd intel_cstate wmi_bmof intel_uncore intel_rapl_perf i2c_i801 i2c_core video ahci libahci backlight ie31200_edac wmi thermal button fan [last unloaded: mii]
Feb 14 19:38:07 Cube kernel: CPU: 1 PID: 4234 Comm: unraidd Not tainted 4.14.16-unRAID #1
Feb 14 19:38:07 Cube kernel: Hardware name: System manufacturer System Product Name/P8Z77-V LX2, BIOS 0305 08/10/2012
Feb 14 19:38:07 Cube kernel: task: ffff880225d6b800 task.stack: ffffc900013f8000
Feb 14 19:38:07 Cube kernel: RIP: 0010:handle_stripe+0x523/0x12a4 [md_mod]
Feb 14 19:38:07 Cube kernel: RSP: 0018:ffffc900013fbdc0 EFLAGS: 00010202
Feb 14 19:38:07 Cube kernel: RAX: ffff880222af1a68 RBX: ffff8801fd927500 RCX: ffff8801fd9c8000
Feb 14 19:38:07 Cube kernel: RDX: 00000000000000c8 RSI: 000000000000000a RDI: ffff8801fd92753c
Feb 14 19:38:07 Cube kernel: RBP: 0000000000000000 R08: 00000000fffffffc R09: ffffea0007f66680
Feb 14 19:38:07 Cube kernel: R10: ffffc900013fbdc0 R11: ffff8801fd927420 R12: ffff8801fd927630
Feb 14 19:38:07 Cube kernel: R13: 0000000000000008 R14: 0900000000000000 R15: 0000000000000001
Feb 14 19:38:07 Cube kernel: FS:  0000000000000000(0000) GS:ffff88022fa80000(0000) knlGS:0000000000000000
Feb 14 19:38:07 Cube kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Feb 14 19:38:07 Cube kernel: CR2: 0000152fcc338000 CR3: 0000000004c0a006 CR4: 00000000001606e0
Feb 14 19:38:07 Cube kernel: Call Trace:
Feb 14 19:38:07 Cube kernel: unraidd+0xaf/0xff [md_mod]
Feb 14 19:38:07 Cube kernel: ? md_open+0x2c/0x2c [md_mod]
Feb 14 19:38:07 Cube kernel: ? md_thread+0xb7/0xc7 [md_mod]
Feb 14 19:38:07 Cube kernel: ? handle_stripe+0x12a4/0x12a4 [md_mod]
Feb 14 19:38:07 Cube kernel: md_thread+0xb7/0xc7 [md_mod]
Feb 14 19:38:07 Cube kernel: ? wait_woken+0x68/0x68
Feb 14 19:38:07 Cube kernel: kthread+0x111/0x119
Feb 14 19:38:07 Cube kernel: ? kthread_create_on_node+0x3a/0x3a
Feb 14 19:38:07 Cube kernel: ? do_group_exit+0x95/0x95
Feb 14 19:38:07 Cube kernel: ret_from_fork+0x35/0x40
Feb 14 19:38:07 Cube kernel: Code: 7c 24 40 0f 8d a0 00 00 00 49 69 d7 c8 00 00 00 4c 8b b4 13 38 01 00 00 4d 85 f6 0f 84 80 00 00 00 f6 84 13 31 01 00 00 01 74 76 <49> 8b 0e 48 85 c9 48 89 8c 13 38 01 00 00 74 02 0f 0b 48 8b 4b 
Feb 14 19:38:07 Cube kernel: RIP: handle_stripe+0x523/0x12a4 [md_mod] RSP: ffffc900013fbdc0
Feb 14 19:38:07 Cube kernel: ---[ end trace 4b600eeefd60757e ]---
Feb 14 19:38:07 Cube kernel: ------------[ cut here ]------------
Feb 14 19:38:07 Cube kernel: WARNING: CPU: 1 PID: 4234 at kernel/exit.c:771 do_exit+0x48/0x896
Feb 14 19:38:07 Cube kernel: Modules linked in: ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 iptable_filter ip_tables nf_nat xfs md_mod nct6775 hwmon_vid bonding r8169 mii bnx2 x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel pcbc aesni_intel aes_x86_64 crypto_simd glue_helper cryptd intel_cstate wmi_bmof intel_uncore intel_rapl_perf i2c_i801 i2c_core video ahci libahci backlight ie31200_edac wmi thermal button fan [last unloaded: mii]
Feb 14 19:38:07 Cube kernel: CPU: 1 PID: 4234 Comm: unraidd Tainted: G      D         4.14.16-unRAID #1
Feb 14 19:38:07 Cube kernel: Hardware name: System manufacturer System Product Name/P8Z77-V LX2, BIOS 0305 08/10/2012
Feb 14 19:38:07 Cube kernel: task: ffff880225d6b800 task.stack: ffffc900013f8000
Feb 14 19:38:07 Cube kernel: RIP: 0010:do_exit+0x48/0x896
Feb 14 19:38:07 Cube kernel: RSP: 0018:ffffc900013fbef0 EFLAGS: 00010206
Feb 14 19:38:07 Cube kernel: RAX: ffffc900013fbe50 RBX: ffff880225d6b800 RCX: 0000000000000000
Feb 14 19:38:07 Cube kernel: RDX: ffff8801fc888400 RSI: ffff88022fa96478 RDI: 000000000000000b
Feb 14 19:38:07 Cube kernel: RBP: 000000000000000b R08: 000000000000000f R09: 00000000ffffff00
Feb 14 19:38:07 Cube kernel: R10: ffffffff81d66080 R11: ffffffff81a4e220 R12: ffff8801fd927630
Feb 14 19:38:07 Cube kernel: R13: 0000000000000008 R14: 0900000000000000 R15: 0000000000000001
Feb 14 19:38:07 Cube kernel: FS:  0000000000000000(0000) GS:ffff88022fa80000(0000) knlGS:0000000000000000
Feb 14 19:38:07 Cube kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Feb 14 19:38:07 Cube kernel: CR2: 0000152fcc338000 CR3: 0000000004c0a006 CR4: 00000000001606e0
Feb 14 19:38:07 Cube kernel: Call Trace:
Feb 14 19:38:07 Cube kernel: ? kthread+0x111/0x119
Feb 14 19:38:07 Cube kernel: rewind_stack_do_exit+0x17/0x20
Feb 14 19:38:07 Cube kernel: Code: 30 07 00 00 48 85 c0 74 24 48 8b 10 48 39 d0 75 1a 48 8b 48 10 48 8d 50 10 48 39 d1 75 0d 48 8b 50 20 48 83 c0 20 48 39 c2 74 02 <0f> ff 65 8b 05 a7 b2 fc 7e 25 00 ff 1f 00 48 c7 c7 99 cc b2 81 
Feb 14 19:38:07 Cube kernel: ---[ end trace 4b600eeefd60757f ]---
Feb 14 19:38:07 Cube kernel: note: unraidd[4234] exited with preempt_count 1
Feb 14 19:39:20 Cube login[3912]: ROOT LOGIN  on '/dev/tty1'

Link to comment

I also suspect the power supply. The BeQuiet option is suspect, possibly because of a split 12 V rail. The initial approach of powering the drives using the 12 volts going into the Pico PSU and the 5 volts from its output is not a good idea, because of the possibility of ground voltage differences between the external 12 volts and its ground and the locally regulated 5 volts and its ground (from the Pico PSU). These two grounds may be very similar, but I would not be surprised to see a few hundred millivolts of high-frequency noise where there should be none. In addition, you have the possibility of noisy ground currents from the drives going back to the motherboard rather than directly to the supply. This would then upset the regulation of the other supplies to the motherboard. You won't see these problems with a regular voltmeter; some high-frequency analysis tools would be needed.

I would start with a good standard PC power supply with a single 12 volt rail. Then you can debug the rest of the system as needed. If the problem is eventually found not to be the power supply, you could then revert to the original arrangement if you need to.

Link to comment

I already borrowed a single-rail 800 watt PSU from a friend.

It's not the fault. The system is stable until I copy from Windows to unRAID. I think Samba is crashing, because the web interface and console work fine.

It's the error I posted above.

Edit 1:

It seems my picoPSU works. The server has been running fine since I changed the RAM setting from auto to 1333 MHz. (The 1333 MHz RAM had been set to 1600 by the auto setting. Damn ASUS, but okay, now it's running.)
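
A mismatch like this can sometimes be spotted from a running Linux system, since `dmidecode -t memory` usually lists a rated `Speed` and a configured speed for each DIMM. The sketch below parses such output; the sample text is a hypothetical excerpt built from the numbers in this post, not a real capture from this server, and the function name `overclocked_dimms` is made up for illustration. Whether a given BIOS reports both fields accurately is an assumption.

```python
import re

# HYPOTHETICAL excerpt shaped like `dmidecode -t memory` output. The values
# mirror the numbers mentioned in the post; this is not a real capture.
SAMPLE = """\
Memory Device
    Locator: DIMM_A1
    Speed: 1333 MHz
    Configured Clock Speed: 1600 MHz
"""

# Older dmidecode prints "Configured Clock Speed", newer "Configured Memory Speed".
FIELD_RE = re.compile(
    r"^\s*(Speed|Configured (?:Clock|Memory) Speed):\s*(\d+)\s*(?:MHz|MT/s)",
    re.MULTILINE,
)


def overclocked_dimms(text):
    """Return (rated, configured) pairs where the DIMM runs above its rating."""
    results = []
    for block in text.split("Memory Device")[1:]:
        fields = {name: int(value) for name, value in FIELD_RE.findall(block)}
        rated = fields.get("Speed")
        configured = fields.get(
            "Configured Clock Speed", fields.get("Configured Memory Speed")
        )
        if rated and configured and configured > rated:
            results.append((rated, configured))
    return results


print(overclocked_dimms(SAMPLE))
```

On a real system the input would be the text printed by `dmidecode -t memory` run as root, instead of `SAMPLE`.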

 

Thanks for all your help; the thread can be closed.

Edited by jacko1337
Link to comment
