unRAID Server Release 6.2.0-beta21 Available



I'm having trouble reaching the beta machine: I can no longer access the web GUI, pings go unanswered, and PuTTY won't connect either. However, the Docker containers are still running and working. Plex and OpenVPN-AS still work, but it looks like I'm going to have to do a hard reset, as I can't even downgrade to the stable version.



 

Try one thing first.  Give a quick push of the power button and see if your server shuts down gracefully.  I know that is one of the features of the Powerdown plugin but it may also exist in LT's powerdown routine.  The Powerdown plugin will also write out either the syslog or diagnostic file to the Flash Drive if it is successful. 


On beta 21, I'm also experiencing problem with copying large files to the server.  I do have a VM running, but the file transfers are usually from a secondary unraid server to my main unraid server, so nothing is copied to/from the VM.

 

The copies start fine (70-80 Mbps), but within a few seconds they slow down to a crawl, and eventually stop altogether.  At the same time unraid web gui becomes unresponsive.  In the last few times this happened, I was able to stop the array, but not to restart the server (had to manually hit the reset button). 

 

Changing the num_stripes setting didn't seem to make any difference: the first large file copy hung the server again. However, this time I was able to stop the array and restart the server from the GUI, so maybe that is an improvement?  I've never been able to see anything in the log suggesting a problem.  After restarting, the server seems fine for a few days.

 

Try disabling network offload, see the Tips and Tweaks page.  The plugin makes the change easy to perform.

 

I've installed the stats plugin and run a test copy: a ~16GB file from my Windows 10 PC to the main unRAID server.  The copy starts fast as usual, and I can see almost all RAM getting used up as "cache" (see pic). BTW, is that normal?  When the RAM is used up, the speed slows down gradually.

At around 17:22 I turned off the one VM I have running (it's now set to use only 2GB of RAM), and I could see the copy speed up again (bump at 17:23) until the cache was full again.  After 17:36 the speed recovered to around 20 MBps and was stable until the end of the copy.

 

Your description sounds normal, image looks about normal to me, but perhaps I may be missing your point?  Writes are super fast while there's cache room to fill.  I'm not sure if your description is saying it's good, or it's bad, or it's back to normal after the first post?  (Sorry for my confusion, I may be missing what you are trying to point out.)
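
For anyone who wants to watch this while a copy runs, one option is to sample the kernel's memory counters rather than rely on a graph. The sketch below parses text in the format of /proc/meminfo and reports the Cached, Dirty and Writeback figures. The function names (`parse_meminfo`, `cache_summary`) and the sample values are made up for illustration and are not taken from the server discussed above.

```python
def parse_meminfo(text):
    """Parse /proc/meminfo-style text into a dict of {field: integer value}.

    Most fields are reported in kB; a few (e.g. HugePages_Total) are plain counts.
    """
    fields = {}
    for line in text.splitlines():
        name, sep, rest = line.partition(":")
        if not sep:
            continue  # skip lines that are not "Name: value"
        parts = rest.split()
        if parts and parts[0].isdigit():
            fields[name.strip()] = int(parts[0])
    return fields


def cache_summary(fields):
    """Return the page-cache related counters converted from kB to MiB."""
    keys = ("MemTotal", "Cached", "Dirty", "Writeback")
    return {k: fields.get(k, 0) / 1024 for k in keys}


# Made-up sample values, for illustration only.
SAMPLE = """MemTotal:       16384000 kB
MemFree:          204800 kB
Cached:         14336000 kB
Dirty:           1048576 kB
Writeback:         20480 kB
"""

if __name__ == "__main__":
    try:
        with open("/proc/meminfo") as fh:  # present on Linux systems
            text = fh.read()
    except OSError:
        text = SAMPLE  # fall back to the sample on other systems
    for key, mib in cache_summary(parse_meminfo(text)).items():
        print(f"{key:>10}: {mib:10.1f} MiB")
```

A large Cached figure with Dirty climbing during the copy would be consistent with writes being buffered in RAM before they reach the disks, which matches the "fast while there's cache room" description above.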


Syslog is filled up with /dev/sda errors

 

May 30 13:47:11 Raptor kernel: usb 4-1: USB disconnect, device number 2
May 30 13:47:15 Raptor kernel: FAT-fs (sda1): Directory bread(block 32768) failed
May 30 14:21:01 Raptor kernel: fat__get_entry: 6 callbacks suppressed
May 30 14:21:01 Raptor kernel: FAT-fs (sda1): Directory bread(block 32768) failed
May 30 14:21:01 Raptor kernel: ------------[ cut here ]------------
May 30 14:21:01 Raptor kernel: WARNING: CPU: 0 PID: 28498 at fs/fs-writeback.c:2100 __mark_inode_dirty+0x1d2/0x1e3()
May 30 14:21:01 Raptor kernel: bdi-block not registered
May 30 14:21:01 Raptor kernel: Modules linked in: ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_conntrack_ipv4 nf_nat_ipv4 iptable_filter ip_tables nf_nat md_mod fbcon bitblit fbcon_rotate fbcon_ccw fbcon_ud fbcon_cw softcursor font ast drm_kms_helper cfbfillrect cfbimgblt cfbcopyarea ttm drm agpgart syscopyarea sysfillrect sysimgblt fb_sys_fops fb ahci i2c_i801 libahci fbdev x86_pkg_temp_thermal coretemp kvm_intel igb kvm ptp pps_core i2c_algo_bit ipmi_si acpi_cpufreq
May 30 14:21:01 Raptor kernel: CPU: 0 PID: 28498 Comm: php Not tainted 4.4.6-unRAID #1
May 30 14:21:01 Raptor kernel: Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./E3C226D2I, BIOS P3.30 06/04/2015
May 30 14:21:01 Raptor kernel: Call Trace:
May 30 14:21:01 Raptor kernel: [<ffffffff811c36de>] mark_fsinfo_dirty+0x28/0x2a
May 30 14:21:01 Raptor kernel: [<ffffffff811c4212>] fat_alloc_clusters+0x327/0x40e
May 30 14:21:01 Raptor kernel: [<ffffffff81110e7f>] ? terminate_walk+0x21/0x82
May 30 14:21:01 Raptor kernel: [<ffffffff81086f16>] ? rcu_is_watching+0x10/0x22
May 30 14:21:01 Raptor kernel: [<ffffffff81119798>] ? __d_instantiate+0x84/0xda
May 30 14:21:01 Raptor kernel: [<ffffffff811c2d6c>] fat_alloc_new_dir+0x30/0x1ca
May 30 14:21:01 Raptor kernel: [<ffffffff811c83ca>] ? vfat_lookup+0xdb/0x145
May 30 14:21:01 Raptor kernel: [<ffffffff811c946b>] vfat_mkdir+0x4d/0x140
May 30 14:21:01 Raptor kernel: ---[ end trace fd44b47a9ddf242e ]---

sda is generally your Flash Drive.  I would suspect some sort of corruption on that device.  Suggest you run chkdsk (or the Apple equivalent) on it.

I've just run chkdsk on my Windows PC - No errors / issues reported :(

It's happened twice in May... I'm using a SanDisk Ultra Fit, which is a pretty decent USB drive ;) and a server-grade ASRock board. I can try a USB 2 port, but I wish I didn't need to, haha.

 

No other cause you can think of? :) There's not a 'chkdsk' tool included with unRAID by any chance? :)

 

There is, since early in the 6.1 series I think, and it's automatically run at start, but you can only see its messages if you are watching the console (usually have to page back).  If you happened to reboot after you got all the boot drive corruption messages above, then it was fixed at that time, and would account for a clean drive when tested on Windows.  That's the first time I've seen FAT corruption bad enough to cause a Call Trace, odd.

 

As Squid said, there have been a number of reports of unreliability with USB 3.0 ports and drives.  I suspect a future kernel will have all the workarounds necessary for all the quirks currently in them.
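
To check whether the flash errors line up with USB disconnects, a quick scan of the syslog can help. This is a minimal sketch whose patterns are based only on the message formats in the excerpt above; the function name `summarize_flash_errors` is hypothetical, and treating `usb 4-1` as the flash drive's port is an assumption.

```python
import re
from collections import Counter

# Patterns modelled on the syslog lines quoted above.
FAT_ERROR = re.compile(r"kernel: FAT-fs \((?P<dev>\w+)\): (?P<msg>.+)$")
USB_DISCONNECT = re.compile(r"kernel: usb (?P<port>[\d.-]+): USB disconnect")


def summarize_flash_errors(lines):
    """Count FAT-fs errors per device and collect ports with USB disconnects."""
    fat_errors = Counter()
    disconnects = []
    for line in lines:
        match = FAT_ERROR.search(line)
        if match:
            fat_errors[match.group("dev")] += 1
            continue
        match = USB_DISCONNECT.search(line)
        if match:
            disconnects.append(match.group("port"))
    return fat_errors, disconnects


# Four lines copied from the excerpt above.
SAMPLE = [
    "May 30 13:47:11 Raptor kernel: usb 4-1: USB disconnect, device number 2",
    "May 30 13:47:15 Raptor kernel: FAT-fs (sda1): Directory bread(block 32768) failed",
    "May 30 14:21:01 Raptor kernel: fat__get_entry: 6 callbacks suppressed",
    "May 30 14:21:01 Raptor kernel: FAT-fs (sda1): Directory bread(block 32768) failed",
]

if __name__ == "__main__":
    errors, ports = summarize_flash_errors(SAMPLE)
    print("FAT-fs errors per device:", dict(errors))
    print("USB disconnects on ports:", ports)
```

In the excerpt, the USB disconnect is logged four seconds before the first FAT-fs error. That would fit the flash drive dropping off the bus rather than the file system being damaged first, though the log alone does not prove it.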


I'm doing a data rebuild on 2 disks. I upgraded 2 of my 2TB disks to 4TB disks. So far so good with the dual rebuild. (I have backups and still have the original drives, so I figured I'd give it a go.)

Looking in the shares folder, I see some inconsistency in what's shown as protected (green triangle). Where the share comment says cache only, the share is just that: cache only.

Appdata and Downloads (cache only) are correct. But Upload, domains and system show the orange triangle. I don't think that's correct: no data, files or folders are on the array for those shares. Shouldn't they be green, protected by the cache pool?

Unless this has something to do with the "Enable copy on write" setting (mine are all Auto) or another setting? But I can't find any difference when I compare the settings of Downloads to Upload.

Or are the cache drives part of the rebuild? I'm not sure whether what I see is correct or a bug.

 

[Image: im35bV2.png]

 

Parity finished, all back to Green.  ???



 

Sorry, so everything is actually working, but when I RDP to my Windows machine it doesn't let me connect to //tower/, although I can connect to //server/, which is on 1.6.9.

So I'm not sure whether this might be a VPN issue or a firewall rule.



 

RobJ, I'm afraid I did a very poor job of explaining.  :)  I'm definitely experiencing consistently slower than normal copies (down to a few KBps or even zero speed for stretches of time) when large files are involved.  It did not seem to matter whether:

- the VM is running or not (except for a small speed bump if the VM was on and is turned off during the copy)

- the num_stripes fix is set to 8192 or left at default

- network offload is on or off

I actually went back to 6.1.9 last night, and to my surprise the problem persisted, so I'm now focusing on ruling out a problem at my end. I'll report back if I find anything.


Not a big deal but shouldn't these notifications be the same?

 

Dual parity sync after a new config:

 

Event: unRAID Parity disk error
Subject: Warning [TOWER5] - Parity disk, parity-sync in progress

Event: unRAID Parity2 error
Subject: Warning [TOWER5] - Parity2, drive not ready, content being reconstructed


When trying to configure SMTP notifications using Zoho mail, Dynamix broke completely and showed this error message:

Warning: syntax error, unexpected $end, expecting TC_DOLLAR_CURLY or TC_QUOTED_STRING or '"' in /boot/config/plugins/dynamix/dynamix.cfg on line 15 in /usr/local/emhttp/plugins/dynamix/include/Wrappers.php on line 19
Warning: array_replace_recursive(): Argument #2 is not an array in /usr/local/emhttp/plugins/dynamix/include/Wrappers.php on line 19
Warning: extract() expects parameter 1 to be array, null given in /usr/local/emhttp/plugins/dynamix/template.php on line 30

 

 

Thanks,

anthonws.



Try getting rid of any single or double quotes in the password
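
The "unexpected $end ... expecting '"'" warning suggests the parser reached the end of the file while still inside a quoted string, which is what a stray double quote in a saved value would cause. Below is a rough sketch of a check for that, assuming the file holds INI-style `key="value"` lines. The section name, key names and values in the sample are hypothetical, and backslash-escaped quotes are not handled.

```python
def find_suspect_lines(text):
    """Return (line_number, line) pairs whose double-quoted value is either
    left unclosed or contains a double quote inside it."""
    suspects = []
    for number, line in enumerate(text.splitlines(), start=1):
        stripped = line.strip()
        if not stripped or stripped.startswith(("[", ";", "#")):
            continue  # blank lines, section headers, comments
        key, sep, value = stripped.partition("=")
        if not sep:
            continue  # not a key=value line
        value = value.strip()
        if value.startswith('"'):
            closed = len(value) >= 2 and value.endswith('"')
            inner = value[1:-1] if closed else value[1:]
            if not closed or '"' in inner:
                suspects.append((number, line))
    return suspects


# Hypothetical file contents; the real dynamix.cfg keys may differ.
SAMPLE = '''[smtp]
server="smtp.example.com"
user="someone@example.com"
pass="pa"ssword"
'''

if __name__ == "__main__":
    for number, line in find_suspect_lines(SAMPLE):
        print(f"line {number}: {line}")
```

Running something like this against a copy of the file should point at the same line number the warning reports, if a quote in the password is indeed the cause.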

 



 

Are you writing to a ReiserFS drive that is quite full?



 

I write to a (samba) share.  All the drives are XFS.  The array is ~87% full, but there is still 4.4TB of total space available. The drive with the least free space still has 170GB free, and most of the other drives have > 300GB of free space.


In case it hasn't been posted here, I was able to fix my hardware passthrough issue (in ESXi) using the following fix.  So I'm not able to run 6.2 virtualized.

 

Is it safe to assume that you meant to say that after the *fix* that you are s/not/now able to run 6.2 virtualized?



 

Yes, meant to say now.  Thanks for the correction.



Try getting rid of any single or double quotes in the password


 

Thanks! Will do!


New issue here:  Last night I noticed that videos wouldn't play on any of my devices from my Plex server Docker container.  This morning, checking the unRAID box showed all the drives powered down, and Plex still wouldn't play.  Thinking I needed to reboot the box, I tried to shut the Docker containers down; all shut down except for the CrashPlan one.  I tried to take the array offline through the GUI, and it accepted the command but then became unresponsive.  I got the diagnostics via the command line and attempted to restart using the shutdown -r command, but nothing happened.  I had to power off and power back on.  Diags attached.

 

Edit:  to clarify, shutdown -r said something like "RESTARTING NOW!", but then nothing happened.

 

Same thing happened again and I think I can easily reproduce it.  The server becomes unresponsive (SMB shares), a look at the GUI shows all drives spun down, and after a couple of clicks the GUI becomes unresponsive. Have to cycle power to get it working again.

 

The trigger seems to be having my Win10 VM running.  I'd left it turned off for a few days and everything worked fine.  I turned it on yesterday and within a few hours this happened.  What else can I do to troubleshoot?


Just a brief update, we are pretty confident we have discovered the bug causing deadlocks and system hangs and are in the process of testing patched code now before rolling out a new release.  Thank you all for your patience with us as we worked to get to the bottom of this very nasty bug.


 

That's great.  Thanks for letting us know.


Can I suggest a feature / option now that you have NVMe support?

Would it be possible to set a second set of temperature warnings / thresholds for NVMe drives?

They get really hot and have a much higher temperature range than SATA SSDs or mechanical drives. If the mover is running a large transfer, these drives can easily spike to 60C+ (they thermal throttle at 70C).

If I leave the default thresholds, I get heat warnings constantly. It would be nice to set the array to one set of temperature thresholds and the cache pool or NVMe drives to another.



You can change the temperature thresholds for each drive separately. Click on the drive name under the Main tab.

This topic is now closed to further replies.