Jump to content

slize

Members
  • Posts

    31
  • Joined

  • Last visited

Posts posted by slize

  1. 47 minutes ago, JorgeB said:

    Problem with the onboard SATA controller:

     

    
    
    
    Jun  1 14:00:59 SRVUNR1 kernel: ahci 0000:03:00.1: AMD-Vi: Event logged [IO_PAGE_FAULT domain=0x000f address=0xe7d60000 flags=0x0000]

     

    Unfortunately quite common with some Ryzen boards, BIOS update might help, or using a newer Unraid release when available due to the newer kernel, failing that best bet is to use an add-on controller (or a different model board).

    I am using an ASRock Rack X470 with the latest bios. Well thats sad when you pay 300€ for a board just to get such errors. I will get a cheap 50€ SATA HBA - that should be enough for this system. Thank you very much! 

  2. The issue:
    One of the two disks in the array are randomly going to "error state" which results in a crash of all VMs on the server. It happens ~once per month. I checked the disks and they are totally fine. They are about 10 months old and i replaced one of them last month (just to make sure that the issue is not caused by a broken disk). I also swapped the SATA cables and the location in the server backplane. The last time the error occurred was at the 01.06.2021, 14 hours and 2 minutes after a parity check started. And the "failed" disk was the new/replaced disk.

    Quote

    Event: Unraid Parity disk error
    Subject: Alert [SRVUNR1] - Parity disk in error state (disk dsbl)
    Description: WDC_WD80EDAZ-11TA3A0_VG033ZLG (sdg)
    Importance: alert


    I added some screenshots from the GUI after the crash and the diagnostics. To get the system back up running i have to:
    #1 Reboot the system
    #2 Remove the "error state" disk
    #3 Start the array

    #4 Stop the array
    #5 Add the disk
    #6 Start a parity rebuild/resync depending on the disk that got corrupted (parity disk/data disk)

    How can i stop this from happening?

    Unbenannt.png

    Unbenannt1.png

    Unbenannt2.png

    srvunr1-diagnostics-20210601-1839.zip srvunr1-smart-20210601-1839.zip srvunr1-smart-20210607-1814 (1).zip srvunr1-smart-20210607-1814.zip

  3. On 5/4/2021 at 2:48 AM, jbat66 said:

    Yes mine is working Great.  Make sure your underlying permissions are correct on the files as well as the dockers. My dockers are running as UMASK:000, PUID:99, PGID:100. I hope that helps.

     

    I just found the issue. Very strange but it seems like that a WireGuard VPN Connection "Remote Access To Server" is not enough. You need "Server to Server Access" to get Syncthing/rsync deamon to work.

  4. First of all thank you again for your time and answers!

    Now the ironic part: Yes, you can copy the header - kind of. The important values are marked here. I am comparing two valid drives. So unless unraid stores a backup of the highlighted values or the rebuild succeeds i'm definitely out of options. By getting myself locked out of the disk. I bet the data on the disk is still fine.

     

    image.thumb.png.808680b755fc3faed29c2efb12dc100c.png

     

    The part i don't understand is why the header got destroyed.

    Edit:

    @JorgeB do you know how the drive gets rebuilded from parity? Is it sequential from offset 0 up or random? In case its sequential i can already stop the rebuid because the header does not contain any valid luks section.

     

    root@nas:~# sudo hexdump -C /dev/sdo -n 100000
    00000000  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
    *
    000001c0  02 00 ee ff ff ff 01 00  00 00 ff ff ff ff 00 00  |................|
    000001d0  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
    *
    000001f0  00 00 00 00 00 00 00 00  00 00 00 00 00 00 55 aa  |..............U.|
    00000200  45 46 49 20 50 41 52 54  00 00 01 00 5c 00 00 00  |EFI PART....\...|
    00000210  1a 05 4d cf 00 00 00 00  01 00 00 00 00 00 00 00  |..M.............|
    00000220  ff ff 3f 8c 04 00 00 00  22 00 00 00 00 00 00 00  |..?.....".......|
    00000230  de ff 3f 8c 04 00 00 00  3f 80 c7 a7 bb 4f da 4a  |..?.....?....O.J|
    00000240  95 e0 23 75 8c ef 29 e1  02 00 00 00 00 00 00 00  |..#u..).........|
    00000250  80 00 00 00 80 00 00 00  1d af a0 8f 00 00 00 00  |................|
    00000260  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
    *
    00000400  af 3d c6 0f 83 84 72 47  8e 79 3d 69 d8 47 7d e4  |.=....rG.y=i.G}.|
    00000410  66 22 62 c1 7d 95 3b 40  a5 4b 11 28 01 2e 8a 8f  |f"b.}.;@.K.(....|
    00000420  40 00 00 00 00 00 00 00  de ff 3f 8c 04 00 00 00  |@.........?.....|
    00000430  00 00 00 00 00 00 00 00  00 00 00 00 00 00 00 00  |................|
    *
    

     

    Thats the current header of the disk that is getting rebuilded at 20%.

     

  5. Yes, i am trying to find the luks header. But as far as i can tell: Unraid is using LUKS2, the header is between from offset 0x0 to offset  0x00010000. Does unraid store backups of the luks header? Because it seems like the header is gone and without the header i am unable to access the data on the disk. I got my infos from here: https://security.stackexchange.com/questions/227359/how-to-determine-start-and-end-bytes-of-luks-header

  6. 10 minutes ago, JorgeB said:

    Yes.

    I will do it as soon as the rebuild is done. I tried to mount it via an external USB dock, but the button was just greyed out.

    As for my understanding:
    Unraid uses BTRFS as filesystem and than uses LUKS1 or LUKS2 to encrypt the whole disk?

  7. 33 minutes ago, JorgeB said:

    If the disk was empty might as well reformat, but you can try to mount it with UD plugin and if it doesn't post the diags (key needs to be the same as other encrypted disks in the server).

     

     

     

    The broken dies was not empty. It contained all the data from the other disks.

    UD cant mount it, the key is the same.

    Okay, so i should try to mount it and than post the diags? Can i mount it on any other linux system because the rebuild of the array is running and i dont have a free bay to put the old disk in.
     

    I dumped the first 10MB of the raw disk. I dont know if the header is correct - it has a lot of 0's until block 0x00010000.

    I compared the headers of the "broken" disk and one of the running disks in the array - they are totally different... 

     

    BrokenDisk.bin WorkingDisk.bin

  8. 14 minutes ago, JorgeB said:

    No valid filesystem is being detected, so there's serious damage to the superblock which may make recovery difficult, all your other disks are empty, was that the only disk with data?

    Yes, they were empty. Yesterday i moved all the data to the missing disk (which was the newest one and running BTRFS encrypted) because i wanted to switch the other disks from XFS to BTRFS encrypted. That worked just fine. 

    Today i the data disk reported that its unmountable - after a stop/start of the array.


    Edit:
    I just attached the "failed" disk to an other system and i think it's physically still fine. (I can read the SMART values and there are no errors) But i cant mount it because the filesystem seems to be invalid. I know that it was running BTRFS and the key that was used for the encryption. Is there a way i can recover the filesystem?

  9. 56 minutes ago, trurl said:

    According to the screenshot you added after those replies, the emulated disk is unmountable, which means filesystem corruption that will need repairing, possibly with some data loss.

     

    You must always have another copy of anything important and irreplaceable. Parity is NOT a backup.

     

    Usually we suggest repairing the emulated disk before rebuild.

     

    You probably would have gotten better advice if you had attached your diagnostics in your first post.

     

    Go to Tools - Diagnostics and attach the complete Diagnostics ZIP file to your NEXT post in this thread.

     

     

     

    The rebuild is now running. How can i repair an encrypted BTRFS filesystem?

    Yes, i am aware about the fact that parity is not a backup. I know, shame on me.

    I added the report.
    image.png.cdb07ece9f0cd31daf2969cb0b10ea51.png
    image.thumb.png.f20b618918226ea546b779104b736fbe.png

    nas-diagnostics.zip

  10. 10 minutes ago, JorgeB said:

    No, but you could move all the data from the emulated disk to other disk(s) then remove it from the array.

    Thanks for the fast reply. Okay the data is very important for me - im just going to get a new 10TB drive and i hope that it does lower the risk of losing anything by simplifying the process. (My backup is on year old because the rsync to an offsite server seems to have failed since a long time - my bad, my issue but lets hope i can recover the drive)

    So i just put the new drive in and start the array?

  11. Hallo,

    i got one failed drive in my array and need to rebuild the data on it.

    Can i rebuild it without a new drive? I got enough free space, the failed drive was a 10TB. 

     

    Also what is the propper way to recover the drive? 

     

    image.thumb.png.c80ee0b3de4d43318aaaa2eae2770107.png

     

     

    Shouldn't the disk be emulated and the data immediately accessible?
    image.thumb.png.e8452f8ff2508db74621f7ae67c4272a.png
     

    Edit: Does unraid backup the LUKS headers of the encrypted disks?

  12. I literally cant get syncthing to work with my shares.

    To my setup:
    I got two unraid server connected via the WireGuard VPN plugin. Both servers are running the latest version of this container and are connected in the syncthing gui. When im sharing the default folder (which is just running inside the container) everything is syncing fine and super fast.

    When i try to sync one of my unraid shares (by mounting it as RW into the docker of the main server) + sharing the folder with the other server it starts the sync process but gets stuck at 0%. The target server does not even state that a sync is running.

    So there has to be an issue with the mounted share/the files in the share. 

    As long as i mount an empty share and i am creating the files in the share from within the container the sync is running fine.

    "Ignore Permissions" is checked on both ends and "UMASK_SET" has the value "000".

    Can someone tell me whats wrong with my files in the share that causes the sync process to get stuck?

    Screenshot 2021-04-14 202131.png

    Screenshot 2021-04-14 202332.png

  13. 5 hours ago, bonienl said:

    Each individual pool can have encryption or not, just like the array.

     

    Stop the array and go to the pool (cache) settings and change the file system type.

    Note: when a pool consists of 2 or more devices, it will always use btrfs.

     

    image.thumb.png.75112ae99d2548eecf37891cffabfc85.png

    Awesome stuff! Thank you - i searched for that option in "Disk Settings". 
    image.thumb.png.430729fdc607d4cde4b7ad52fec733a1.png
    That looks way better now :-).

    Also thank you too @John_M! 

  14. @Squid, @bonienl, @dlandon sorry to ping you guys, but i really think this feature would add so much value for UNRAID especially in those two points:
    #1 Humans do make mistakes and a snapshotted version of an office document would help a lot when overwritten accidentally.
    #2 Ransomware has become an serious issue around the world from large companies to very small networks. As long as a user cannot delete a snapshot and the server is not compromised it would not be as big of an issue if you get hit at one day. (Sure you can create backups every day but snapshots would be much more convenient.)

     

    It would be awesome to get some prominent attention on this topic and maybe see an plugin coming in 2021.

     

    I think this topic would also fit very well on @SpaceInvaderOne's YouTube channel. :)

  15. Is there currently a way to do a unraid server - unraid server connection via the VPN Manager plugin? I would like to sync my server via a vpn to another location.

    I manged to get a vpn connection via a Windows VM and the unraid server but i cant figure out how to connect to an vpn via the VPN Manager plugin.

  16. I'm searching for a way to store my passphrase on the unraid usb/flash device to autostart the disk array.
    For what i'm using encryption: To be able to send a failed drive to the manufacturer in a warrenty case.
    For what i'm not using encryption: To secure my data in case of a stolen server. So yes, i know that it would be useless to store the key on the same machine as the encrypted disks.

    It would be great if someone got a solution for me :-).

×
×
  • Create New...