This morning I couldn't reach the Internet, so I started to check my PiHole DNS docker.
When logging in to the Unraid server, the VM and Docker were not running (Service couldn't start)
The Cache drive was not mounted, so I stopped the array and started it again. Still not able to mount.
Then reading through a lot of logs etc just to see, that it might have been due to a unclean reboot.
Hours later, reading up on various blog posts around the Internet, I pretty much gave up, since nothing really helped, and didn't want to make it worse (which may be too late)
Anyways, it's my SSD Cache drive marked as /dev/sdf that is giving me issues.
It contains the system app data, but also the docker image and libvert image, along with some other backup stuff.
Ideally I would like to be able to recover some files, and have ordered an USB-> SATA converter, just in case I need to hook it up to my main pc and try and recover files that way.
But I wanted to reach out here also, just in case someone could help me on this.
root@Tower:~# fsck /dev/sdf
fsck from util-linux 2.36
e2fsck 1.45.6 (20-Mar-2020)
/sbin/e2fsck: Input/output error while trying to open /dev/sdf
The superblock could not be read or does not describe a valid ext2/ext3/ext4
filesystem. If the device is valid and it really contains an ext2/ext3/ext4
filesystem (and not swap or ufs or something else), then the superblock
is corrupt, and you might try running e2fsck with an alternate superblock:
e2fsck -b 8193 <device>
e2fsck -b 32768 <device>
root@Tower:~# smartctl -a /dev/sdf
smartctl 7.1 2019-12-30 r5022 [x86_64-linux-5.10.1-Unraid] (local build)
Copyright (C) 2002-19, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF INFORMATION SECTION ===
Model Family: Phison Driven SSDs
Device Model: KINGSTON SA400S37960G
Serial Number: 50026B778326A54D
LU WWN Device Id: 5 0026b7 78326a54d
Firmware Version: SBFK61K1
User Capacity: 960,197,124,096 bytes [960 GB]
Sector Size: 512 bytes logical/physical
Rotation Rate: Solid State Device
Device is: In smartctl database [for details use: -P show]
ATA Version is: ACS-3 T13/2161-D revision 4
SATA Version is: SATA 3.2, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is: Tue Mar 2 13:51:56 2021 CET
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
General SMART Values:
Offline data collection status: (0x00) Offline data collection activity
was never started.
Auto Offline Data Collection: Disabled.
Self-test execution status: ( 112) The previous self-test completed having
the read element of the test failed.
Total time to complete Offline
data collection: (65535) seconds.
Offline data collection
capabilities: (0x11) SMART execute Offline immediate.
No Auto Offline data collection support.
Suspend Offline collection upon new
No Offline surface scan supported.
No Conveyance Self-test supported.
No Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine
recommended polling time: ( 2) minutes.
Extended self-test routine
recommended polling time: ( 30) minutes.
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x0032 100 100 000 Old_age Always - 1460
9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 13332
12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 15
148 Unknown_Attribute 0x0000 100 100 000 Old_age Offline - 21
149 Unknown_Attribute 0x0000 100 100 000 Old_age Offline - 3363
167 Write_Protect_Mode 0x0000 100 100 000 Old_age Offline - 0
168 SATA_Phy_Error_Count 0x0012 100 100 000 Old_age Always - 0
169 Bad_Block_Rate 0x0000 100 100 000 Old_age Offline - 14
170 Bad_Blk_Ct_Erl/Lat 0x0000 100 100 010 Old_age Offline - 0/34
172 Erase_Fail_Count 0x0032 100 100 000 Old_age Always - 0
173 MaxAvgErase_Ct 0x0000 100 100 000 Old_age Offline - 86 (Average 23)
181 Program_Fail_Count 0x0032 100 100 000 Old_age Always - 0
182 Erase_Fail_Count 0x0000 100 100 000 Old_age Offline - 0
187 Reported_Uncorrect 0x0032 100 100 000 Old_age Always - 2
192 Unsafe_Shutdown_Count 0x0012 100 100 000 Old_age Always - 13
194 Temperature_Celsius 0x0022 032 041 000 Old_age Always - 32 (Min/Max 23/41)
196 Reallocated_Event_Count 0x0032 100 100 000 Old_age Always - 2
199 SATA_CRC_Error_Count 0x0032 100 100 000 Old_age Always - 0
218 CRC_Error_Count 0x0032 100 100 000 Old_age Always - 0
231 SSD_Life_Left 0x0000 097 097 000 Old_age Offline - 97
233 Flash_Writes_GiB 0x0032 100 100 000 Old_age Always - 22000
241 Lifetime_Writes_GiB 0x0032 100 100 000 Old_age Always - 28075
242 Lifetime_Reads_GiB 0x0032 100 100 000 Old_age Always - 6193
244 Average_Erase_Count 0x0000 100 100 000 Old_age Offline - 23
245 Max_Erase_Count 0x0000 100 100 000 Old_age Offline - 86
246 Total_Erase_Count 0x0000 100 100 000 Old_age Offline - 2040736
SMART Error Log Version: 1
No Errors Logged
SMART Self-test log structure revision number 1
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Short offline Completed: read failure 00% 13330 0
Selective Self-tests/Logging not supported
Guides I've been looking at;