• [6.8.3] shfs error results in lost /mnt/user


    JorgeB
    • Minor

    There are several reports in the forums of this shfs error causing /mnt/user to go away:

     

    May 14 14:06:42 Tower shfs: shfs: ../lib/fuse.c:1451: unlink_node: Assertion `node->nlookup > 1' failed.

     

    Rebooting fixes it until it happens again. I remember seeing at least 5 or 6 different users with the same issue in the last couple of months, and it was reported here that it's possibly this issue:

     

    https://github.com/libfuse/libfuse/issues/128

     

    Attached diags from latest occurrence.

     

     

     

    tower-diagnostics-20200514-1444.zip
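
    For anyone checking whether their own server has hit the same failure, here is a minimal sketch against the default syslog location (the second pattern is the usual follow-on error mentioned later in this thread; exact wording may vary by release):

        # Search the live syslog for the fuse assertion and the follow-on mount error.
        grep -E 'unlink_node|Transport endpoint is not connected' /var/log/syslog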




    User Feedback

    Recommended Comments



    On 10/27/2021 at 6:00 PM, VBilbo said:

    I have the same issue with disappearing /mnt/user every couple of days after installing Tdarr.

    I had this too; just disable NFS shares if you can get away with it.

    I went a step further and disabled NFS entirely.

    Link to comment

    Not sure if this is already known, but today this issue suddenly started happening to me and I began looking for solutions. I stumbled on this post:

    I changed the path for my media share from /mnt/user/media to /mnt/user0/media in the Tdarr docker containers, and to be safe I also made sure the /temp path is now on a share that the mover never runs against.

     

    I have no Tdarr-related errors in the logs anymore, so fingers crossed!
    I have NFS disabled and still have Hard Links enabled under Global Share Settings.
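
    For reference, that remapping looks roughly like the following container volume mappings. This is only a sketch: the share name, the container paths, and the cache pool location are illustrative rather than the real Tdarr template defaults, which also need their usual ports and config mounts.

        # Hypothetical Tdarr volume mappings (illustrative names, not the template defaults):
        # media is pointed at /mnt/user0/<share> instead of /mnt/user/<share>, and the
        # transcode/temp path lives on a cache-only location the mover never touches.
        docker run -d --name tdarr \
          -v /mnt/user0/media:/media \
          -v /mnt/cache/tdarr_transcode:/temp \
          haveagitgat/tdarr:latest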

    Edited by VlarpNL
    Link to comment

    I have just seen this for the first time and found my way here by accident: I ran Fix Common Problems and there was a comment about Tdarr, which I also have installed.

     

    The server is on 6.9.2 and is always very stable, as I generally don't touch it. Today I added a new physical NVMe drive and moved some files/folders with bash. On completion of this task I noticed that /mnt/user was missing. I should also note that some of the folders moved were shares that Tdarr interacts with.

     

    A reboot resolved the issue.

    Edited by bandiboo
    Link to comment

    Hello everyone,

     

    Just wanted to chime in here and potentially give some people a temporary solution. I am also having the Tdarr warning pop up in Fix Common Problems:

     

    "tdarr (haveagitgat/tdarr:latest) has the following comments: Anecdotal evidence is implicating tdarr with Unraid issues."

     

    which refers me to this thread. In doing some research and reading through here, it looks like I'm not truly having the issues everyone is experiencing.

     

    I believe that's due to dumb luck. The way I have Tdarr configured, my transcode folder is on an NVMe drive that is my second cache pool, and the share it uses is set to cache-only, specifically on that drive. I guess because the mover is never invoked on this share, I don't have a true issue.

     

    I haven't had the /mnt/user folder disappear on me or hit any of the other issues I have read about with this warning.

     

    Maybe this can help some other people experiencing this.

     

    Although, due to my OCD, I would like to figure out how to get Fix Common Problems to stop sending me this warning and ultimately fix the overall issue. (I know I can ignore the warning, but that's not how I like to deal with these things.)

    Link to comment

    I can confirm I am having the same experience you are. I have my tdarr_temp folder set to prefer cache (NVMe) and have not experienced the /mnt/user share issue. I went with this approach because I didn't want unnecessary writes to my spinning drives. I've been running Tdarr for months now.

    Link to comment

    I just want to report my experience with Tdarr. I've been running Tdarr in multiple configurations with multiple nodes across two Unraid servers. Currently, I'm running two Tdarr servers with four nodes operating on two independent data sets totaling around 90k files on 20 disks with parity. Temporary files are written directly to disk, skipping cache. I have never had any issues like this, and I've saved almost 40 TB over the last 5 months.

    Edited by paloooz
    Link to comment

    FYI: I have been seeing this issue (the disappearing user shares) since upgrading to 6.10 RC4. I am using NFS for my shares as the majority of my systems are Linux or Mac. Macs in particular have speed issues with SMB shares, but NFS works great.

     

    The gotcha is that I don't use Tdarr... in fact I don't use any of the *arr apps. I grabbed diagnostics just now as it happened again. I will send them via PM if anyone wants to look at them, but I prefer not to post them here. Although I use anonymize, going through the diagnostics reveals they still contain information that I consider private.

     

    I'll be taking my own stab at the diagnostics shortly, but I've disabled hard links as suggested and will see if that helps.

     

    Link to comment

    Much like AgentXXL, I also don't use Tdarr and am having this issue. The real kicker for me is that I can't do a clean shutdown after this happens either, and since I'm running two parity drives, I have to rebuild parity every time (a total of about 30 hours).

     

    That being said, the last time this happened the parity drives went offline before the reboot, which seems to be related. In any event, shares randomly going away, which causes Docker to crash and then puts the machine in a bad state requiring a physical reboot to bring it back up, is a problem.

     

    Edit: and /mnt/user is offline again, in the middle of a parity rebuild.

    Edited by Jeremyb
    Link to comment

    I too have had the shares disappear overnight, with this shfs error, on a production server. So I will be watching this forum with great interest!

    Changes I have recently made:
    - consolidated 2x cache drives (production & server) into one larger SSD
    - added a new RAID6 cache pool
    - turned on a docker called prism and let it run all night; it writes its metadata to the new cache pool (not a public shared folder)
    - NFS was on, but I have turned it off since the shares disappeared, to see if that was the issue (rarely the CFO brings in her MacBook, and I make fun of her for its overpriced limitations)

    A reboot did bring everything back online for now. I have stopped the prism app in order to monitor the server for 24 hours without any other changes. This Unraid server is used heavily during the day in a production environment, with an average of 5 clients grabbing/saving files all day. Normally this server does not have any issues (besides the ones W10 bestows upon it with its random forced updates) and has been left on, issue-free, without a reboot for months.

    I have logs of the crash, but they contain sensitive information I do not feel safe posting publicly.

     

    Link to comment
    2 hours ago, miicar said:

    I too have had the shares disappear overnight, with this shfs error, on a production server. So I will be watching this forum with great interest!

    Changes I have recently made:
    - consolidated 2x cache drives (production & server) into one larger SSD
    - added a new RAID6 cache pool
    - turned on a docker called prism and let it run all night; it writes its metadata to the new cache pool (not a public shared folder)
    - NFS was on, but I have turned it off since the shares disappeared, to see if that was the issue (rarely the CFO brings in her MacBook, and I make fun of her for its overpriced limitations)

    A reboot did bring everything back online for now. I have stopped the prism app in order to monitor the server for 24 hours without any other changes. This Unraid server is used heavily during the day in a production environment, with an average of 5 clients grabbing/saving files all day. Normally this server does not have any issues (besides the ones W10 bestows upon it with its random forced updates) and has been left on, issue-free, without a reboot for months.

    I have logs of the crash, but they contain sensitive information I do not feel safe posting publicly.

     

    I am having the same problem, but don't have any of those same "recent changes". I got the issue when NFS was off, and I don't have or use prism. I have two drives in my cache pool in RAID1 Btrfs, and a second one that is just an NVMe. I don't use RAID6 for anything.

     

    As with most other people, this issue is relatively new for me; my machine was humming along pretty well (it's pretty new) before it started crashing. I've replaced most of the suspect hardware by now.

     

    The other challenging part is that it requires a hard reboot, which means that if I am away from the machine for a while, I won't be able to access or reboot it.

    Link to comment
    2 hours ago, Jeremyb said:

    The other challenging part is that it requires a hard reboot, which means that if I am away from the machine for a while, I won't be able to access or reboot it.

    Not the answer to the problem, but the MyServers plugin makes that painless to accomplish.

    Link to comment
    1 hour ago, Squid said:

    Not the answer to the problem, but the MyServers plugin makes that painless to accomplish.

     

    OK, I have the plugin, but I'm not sure how it helps. I can VPN into my network, but clicking the shutdown/restart button on the Unraid dashboard does nothing. SSHing in and typing "shutdown -r now" does nothing. Typing the same command from a connected keyboard and monitor also does nothing. The only way to access it again is a hard reboot.

    Link to comment

    I believe it is hung doing a soft reboot because it cannot unmount /mnt/user.  Even trying to access /mnt/user from the terminal hangs.  
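
    A minimal sketch to confirm that from a terminal, assuming the shfs user share mount is at the default /mnt/user. A lazy unmount sometimes lets a stuck shutdown continue, but as reported above, often only a hard reset works:

        # Check whether the shfs process is still alive and whether the mount responds.
        ps aux | grep '[s]hfs'
        timeout 5 ls /mnt/user >/dev/null && echo "mount responding" || echo "mount hung or gone"
        # Optional and not guaranteed to help: lazily detach the stuck FUSE mount.
        # umount -l /mnt/user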

    Link to comment
    On 4/19/2022 at 5:35 PM, Jeremyb said:

     

    OK, I have the plugin, but I'm not sure how it helps. I can VPN into my network, but clicking the shutdown/restart button on the Unraid dashboard does nothing. SSHing in and typing "shutdown -r now" does nothing. Typing the same command from a connected keyboard and monitor also does nothing. The only way to access it again is a hard reboot.

    This has happened to me as well. I am lucky I have a managed motherboard I can access outside of Unraid to force a reset... I don't know if I will ever build another server that doesn't have some sort of backdoor access.

    Link to comment

    I also had this issue (transport endpoint not connected) every few days.

     

    In the end it turned out to be caused by bad memory. I ran Memtest for several days but got only rare errors (one every few hours), so I wondered whether that could really cause the frequent shfs errors.

     

    However, replacing the memory module totally solved this issue for me (no errors for two months now). So maybe you should also check your RAM.

    Link to comment

    I'm another one referred here by the Fix Common Problems plugin. I use Tdarr extensively, having been in discussion with the developer since the beginning of its creation. I have never had, and still do not have, any of these issues. However, I do not use the Unraid array except as a dummy USB device to start the Docker services (I use ZFS), and I do not use NFS (I use SMB). I strongly suspect this issue is more about Tdarr triggering an Unraid bug of some kind than Tdarr itself being the issue.

    Link to comment

    I've experienced this twice in two days. Never seen it before now. 

     

    I streamed from Plex earlier and now my user shares are gone. 

     

    I do not use Tdarr but do use many of the *arrs.

    Link to comment

    Happened to me recently with 6.11.2.

     

    In my case it's always triggered by SMB file operations - macOS's poor implementation acting on a stale version of the directory tree, causing invalid operations and the fuse exception. I have to remember to navigate down or up then back to force a refresh.

    Link to comment
    On 9/9/2021 at 1:33 PM, niavasha said:

    Well, the only thing I can do is commiserate; this is obviously painful for all involved. However, I set up a User Script, as per an above post, that runs every minute on a custom schedule (* * * * *), looks for the error in the syslog, and reboots the machine safely if it finds it.

     

    My machine has rebooted over 200 times since May last year, but now I hardly notice it, rather than finding my machine inaccessible.

     

    Yes, it's a pain in the proverbial, but at least it's now automatic. Here's the script. Note that at one stage I was dumping core on mount.shfs (which involved removing the "ulimit -c none" set on boot) in the hope that @limetech might value a core file for debugging purposes, but at this stage, despite proffering this several times, I've given up:

     

    #!/bin/bash
    # Looks for the dreaded unlink_node error and logs occurrences to /boot/config/unlink_reboots.log.
    # Also reboots the machine using the prescribed powerdown -r.
    # Also backs up a core file from mount.shfs if it finds one in / (although this requires
    # adjusting the core ulimit, which is not covered here).

    if grep -q unlink_node /var/log/syslog; then
        echo "----------------------------------------------------------------------------" >> /boot/config/unlink_reboots.log
        grep unlink_node /var/log/syslog >> /boot/config/unlink_reboots.log
        date >> /boot/config/unlink_reboots.log
        uname -a >> /boot/config/unlink_reboots.log
        echo "----------------------------------------------------------------------------" >> /boot/config/unlink_reboots.log
        echo "" >> /boot/config/unlink_reboots.log
        if [ -f "/core" ]; then
            echo "Found core file, moving it" >> /boot/config/unlink_reboots.log
            mv /core /boot/logs/core.$(date +%Y%m%d-%H%M)
        fi
        powerdown -r
    fi

     


    Does this reboot take enough time to bring down all the containers?

    Link to comment

    @limetech Happy to say this looks completely fixed as of whatever release came out after September 14th, as that was the last time I had to auto-reboot to handle the issue.

     

    Currently I am running 6.11.5, which has the 5.19.17-Unraid kernel, and I've been up and running for 82 days... yay.

     

    Thanks!

    Link to comment




