[solved] ssdcache Unmountable: No file system


Go to solution Solved by JorgeB,

Recommended Posts

Unraid Version: 6.9.2

Hello folcs,

 

I run Unraid for a couple of months now, the server is often down and powred on via wol when needed. Yesterday I recognized that my VMs couldn't start. The reason: they're on ssdcache only and both SSDs are in state "Unmountable: No file system" 😲

 

Diagnostics: kunraid-diagnostics-20220706-2215.zip

Array and Pool: 1867123327_2022-07-0622_41_16-kunraid_Main.thumb.png.3cfffd7f340e549ca2c2d692d19dfd59.png

 

Shares (part):

909352288_2022-07-0622_47_49-kunraid_Shares.thumb.png.b24f7c6060d851ddca675a13338dc882.png

 

Any hint where to start to get this fixed?

 

 

2022-07-06 22_41_16-kunraid_Main.png

Edited by KaBo
Link to comment
  • Solution
Jul  6 21:38:14 kunraid kernel: BTRFS: device fsid 0ecc5969-4afc-4b9a-a299-3fab30cf63d9 devid 1 transid 153079 /dev/sdb1 scanned by udevd (1244)
Jul  6 21:38:14 kunraid kernel: BTRFS: device fsid 0ecc5969-4afc-4b9a-a299-3fab30cf63d9 devid 2 transid 147224 /dev/sdc1 scanned by udevd (1234)

 

devid 2 is on an older generation than devid 1, this can sometimes fix it:

 

btrfs-select-super -s 1 /dev/sdc1

 

If you rebooted since the diags check that ssdcache2 is still sdc, then reboot and if it still doesn't mount post new diags.

Link to comment
  • KaBo changed the title to [solved] ssdcache Unmountable: No file system

Thanks a lot @JorgeB! This exacly was it. But three questions remaining:

  • How did you find out? Is the last number in brackets from syslog lines the generation (the higher, the newer?) - and devid 2 has to be the newest?
  • Do I have to do anything else before the next reboot? sdb1 shows a lot of "bad tree block start" and "read error corrected" after reboot.
  • How did this happen and what can I do to avoid it? Any ideas?

Kai

Link to comment

Run a scrub on the pool, though like mentioned here nocow shares can't be fixed, since there are no checksums, your domains share is nocow, so assuming the vdisks are there you might need to restore them from backups if available, always recommend using cow shares for anything btrfs.

 

As for the cause of the problem, most likely a device/firmware/controller issue, some writes were lost, all should be recovered after a scrub (except nocow shares like mentioned).

Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.