HAMANY Posted February 9, 2023 Share Posted February 9, 2023 (edited) Hi all, Few days ago, one of my drives had read errors and got disabled (Disk 6). I stopped the array, remove the disk from the array, started the array again, saw the disk in unassigned but I wasn't able to select it in the array (No disk). So I shutdown the server, changed the disabled disk cables, started the server again, then the surprise. The cache (NVME) was shown under the unassigned devices, when I add it to the pool, it says it's a new device. Also there were many disks with UDMA CRC errors. I've attached the diags before the first restart and after the restart. Appreciate your help. Thank you. tower-diagnostics-20230209-0626 (After Restart).zip tower-diagnostics-20230206-2321 (Before Restart).zip Edited February 23, 2023 by HAMANY SOLVED Quote Link to comment
trurl Posted February 9, 2023 Share Posted February 9, 2023 4 minutes ago, HAMANY said: one of my drives had read errors and got disabled Which drive was this? 1 Quote Link to comment
trurl Posted February 9, 2023 Share Posted February 9, 2023 Start the array and post new diagnostics 1 Quote Link to comment
HAMANY Posted February 9, 2023 Author Share Posted February 9, 2023 23 minutes ago, trurl said: Which drive was this? Disk 6 the drive that got disabled Cache (nvme0n1) the cache ssd that is shown as unassigned 15 minutes ago, trurl said: Start the array and post new diagnostics Is there any risk of losing the pool data (cache)? It has all the dockers, I've a backup for it. Quote Link to comment
trurl Posted February 9, 2023 Share Posted February 9, 2023 4 minutes ago, HAMANY said: Is there any risk of losing the pool data (cache)? Should be OK, don't format it. 1 Quote Link to comment
HAMANY Posted February 9, 2023 Author Share Posted February 9, 2023 31 minutes ago, trurl said: Should be OK, don't format it. Attached. Disk 6 is rebuilding now. tower-diagnostics-20230209-0746.zip Quote Link to comment
HAMANY Posted February 17, 2023 Author Share Posted February 17, 2023 On 2/9/2023 at 7:13 AM, trurl said: Should be OK, don't format it. Thank you! The array is back online again, however, there are some issues. - Some dockers and plugins got corrupted. The Community App tab disappeared. - Loading the docker icons in the dashboard page is very slow. The server is overall slower than before. - Many disks have UDMA CRC error. Although recently, I've changed the power and Sata cables - Every time I restart the server, the Cache drive goes to unassigned. Appreciate your suggestion tower-diagnostics-20230217-1030.zip Quote Link to comment
trurl Posted February 17, 2023 Share Posted February 17, 2023 Corruption on flash drive. Feb 16 04:40:13 Tower root: Fix Common Problems: Error: /boot/config/plugins/dockerMan/templates-user/my-binhex-deluge.xml corrupted Feb 16 04:40:13 Tower root: Fix Common Problems: Error: /boot/config/plugins/dockerMan/templates-user/my-binhex-krusader.xml corrupted Feb 16 04:40:13 Tower root: Fix Common Problems: Error: /boot/config/plugins/dockerMan/templates-user/my-FileZilla.xml corrupted Feb 16 04:40:13 Tower root: Fix Common Problems: Error: /boot/config/plugins/dockerMan/templates-user/my-JDownloader2.xml corrupted Feb 16 04:40:13 Tower root: Fix Common Problems: Error: /boot/config/plugins/dockerMan/templates-user/my-PuTTY.xml corrupted Feb 16 04:40:13 Tower root: Fix Common Problems: Error: /boot/config/plugins/dockerMan/templates-user/my-transmission.xml corrupted Feb 16 04:40:13 Tower root: Fix Common Problems: Error: /boot/config/plugins/dockerMan/templates-user/my-tsMuxeR.xml corrupted Feb 16 04:40:14 Tower root: Fix Common Problems: Error: /boot/config/disk.cfg corrupted Feb 16 04:40:14 Tower root: Fix Common Problems: Error: /boot/config/domain.cfg corrupted Feb 16 04:40:14 Tower root: Fix Common Problems: Error: /boot/config/ident.cfg corrupted Do you have a current backup of flash? 1 Quote Link to comment
HAMANY Posted February 17, 2023 Author Share Posted February 17, 2023 3 minutes ago, trurl said: Corruption on flash drive. Feb 16 04:40:13 Tower root: Fix Common Problems: Error: /boot/config/plugins/dockerMan/templates-user/my-binhex-deluge.xml corrupted Feb 16 04:40:13 Tower root: Fix Common Problems: Error: /boot/config/plugins/dockerMan/templates-user/my-binhex-krusader.xml corrupted Feb 16 04:40:13 Tower root: Fix Common Problems: Error: /boot/config/plugins/dockerMan/templates-user/my-FileZilla.xml corrupted Feb 16 04:40:13 Tower root: Fix Common Problems: Error: /boot/config/plugins/dockerMan/templates-user/my-JDownloader2.xml corrupted Feb 16 04:40:13 Tower root: Fix Common Problems: Error: /boot/config/plugins/dockerMan/templates-user/my-PuTTY.xml corrupted Feb 16 04:40:13 Tower root: Fix Common Problems: Error: /boot/config/plugins/dockerMan/templates-user/my-transmission.xml corrupted Feb 16 04:40:13 Tower root: Fix Common Problems: Error: /boot/config/plugins/dockerMan/templates-user/my-tsMuxeR.xml corrupted Feb 16 04:40:14 Tower root: Fix Common Problems: Error: /boot/config/disk.cfg corrupted Feb 16 04:40:14 Tower root: Fix Common Problems: Error: /boot/config/domain.cfg corrupted Feb 16 04:40:14 Tower root: Fix Common Problems: Error: /boot/config/ident.cfg corrupted Do you have a current backup of flash? Yes I do. Should I do full data restore? Or just for the corrupted files? Thanks Quote Link to comment
Solution trurl Posted February 17, 2023 Solution Share Posted February 17, 2023 There is no complete list of corrupted files. Recreate flash drive as a new install of whatever version, copy the config folder from your backup. 1 Quote Link to comment
HAMANY Posted February 23, 2023 Author Share Posted February 23, 2023 On 2/17/2023 at 5:01 PM, trurl said: There is no complete list of corrupted files. Recreate flash drive as a new install of whatever version, copy the config folder from your backup. I've restored the backup in a new USB stick. Every thing looks stable now. Thank you @trurl Quote Link to comment
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.