Jump to content

Salzgablah

Members
  • Posts

    25
  • Joined

  • Last visited

Posts posted by Salzgablah

  1. Both my array and cache drives are encrypted with the same key. So when pulling the keyfile, it unlocks and mounts all drives (array and cache). If you are using different encryption keys, I'm not sure how that would impact the process. I would recommend using the same key for all, unless you have a specific reason to use different key's...

  2. Very interesting. I'll connect it to the MOBO through the SATA port later today instead of the HBA card. I think I have one port left. wild that it impacts a specific HD model like that. How do i initiate a rebuild using the same drive? The "Replacing a Data Drive" seems to be using a new disk that hasn't been in the array before.
     

    Quote

     

    The procedure

    If you are running a very old version of unRAID, such as v4.7 or older, skip down to the next section.

    Stop the array

    Unassign the old drive if still assigned (to unassign, set it to No Device)

    Power down

    [ Optional ] Pull the old drive (you may want to leave it installed for Preclearing or testing)

    Install the new drive

    Power on

    Assign the new drive in the slot of the old drive

    Go to the Main -> Array Operation section

    Put a check in the Yes, I'm sure checkbox (next to the information indicating the drive will be rebuilt), and click the Start button

    The rebuild will begin, with hefty disk activity on all drives, lots of writes on the new drive and lots of reads on all other drives

     

     

    I actually picked up a new 16TB drive to increase parity, which would allow me to replace this 8TB with my 14TB parity. But it sounds like i'll need to rebuild the Disk3 before i can replace and rebuild parity.

  3. Today I received a warning that disk3 of my array had an error and is now disabled. The below is from the syslog in the diagnostics.

     

    Mar  1 14:41:09 Tower kernel: sd 6:0:0:0: [sde] tag#9163 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=DRIVER_OK cmd_age=19s
    Mar  1 14:41:09 Tower kernel: sd 6:0:0:0: [sde] tag#9163 Sense Key : 0x2 [current] 
    Mar  1 14:41:09 Tower kernel: sd 6:0:0:0: [sde] tag#9163 ASC=0x4 ASCQ=0x0 
    Mar  1 14:41:09 Tower kernel: sd 6:0:0:0: [sde] tag#9163 CDB: opcode=0x88 88 00 00 00 00 03 93 cc 06 60 00 00 00 40 00 00
    Mar  1 14:41:09 Tower kernel: I/O error, dev sde, sector 15364523616 op 0x0:(READ) flags 0x0 phys_seg 8 prio class 0
    Mar  1 14:41:09 Tower kernel: md: disk3 read error, sector=15364523552
    Mar  1 14:41:09 Tower kernel: md: disk3 read error, sector=15364523560
    Mar  1 14:41:09 Tower kernel: md: disk3 read error, sector=15364523568
    Mar  1 14:41:09 Tower kernel: md: disk3 read error, sector=15364523576
    Mar  1 14:41:09 Tower kernel: md: disk3 read error, sector=15364523584
    Mar  1 14:41:09 Tower kernel: md: disk3 read error, sector=15364523592
    Mar  1 14:41:09 Tower kernel: md: disk3 read error, sector=15364523600
    Mar  1 14:41:09 Tower kernel: md: disk3 read error, sector=15364523608
    Mar  1 14:41:09 Tower kernel: sd 6:0:0:0: [sde] tag#9164 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=DRIVER_OK cmd_age=19s
    Mar  1 14:41:09 Tower kernel: sd 6:0:0:0: [sde] tag#9164 Sense Key : 0x2 [current] 
    Mar  1 14:41:09 Tower kernel: sd 6:0:0:0: [sde] tag#9164 ASC=0x4 ASCQ=0x0 
    Mar  1 14:41:09 Tower kernel: sd 6:0:0:0: [sde] tag#9164 CDB: opcode=0x88 88 00 00 00 00 03 93 cc 06 c0 00 00 00 60 00 00
    Mar  1 14:41:09 Tower kernel: I/O error, dev sde, sector 15364523712 op 0x0:(READ) flags 0x0 phys_seg 12 prio class 0
    Mar  1 14:41:09 Tower kernel: md: disk3 read error, sector=15364523648
    Mar  1 14:41:09 Tower kernel: md: disk3 read error, sector=15364523656
    Mar  1 14:41:09 Tower kernel: md: disk3 read error, sector=15364523664
    Mar  1 14:41:09 Tower kernel: md: disk3 read error, sector=15364523672
    Mar  1 14:41:09 Tower kernel: md: disk3 read error, sector=15364523680
    Mar  1 14:41:09 Tower kernel: md: disk3 read error, sector=15364523688
    Mar  1 14:41:09 Tower kernel: md: disk3 read error, sector=15364523696
    Mar  1 14:41:09 Tower kernel: md: disk3 read error, sector=15364523704
    Mar  1 14:41:09 Tower kernel: md: disk3 read error, sector=15364523712
    Mar  1 14:41:09 Tower kernel: md: disk3 read error, sector=15364523720
    Mar  1 14:41:09 Tower kernel: md: disk3 read error, sector=15364523728
    Mar  1 14:41:09 Tower kernel: md: disk3 read error, sector=15364523736
    Mar  1 14:41:09 Tower  emhttpd: read SMART /dev/sde
    Mar  1 14:41:09 Tower kernel: sd 6:0:0:0: [sde] tag#9173 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=DRIVER_OK cmd_age=0s
    Mar  1 14:41:09 Tower kernel: sd 6:0:0:0: [sde] tag#9173 Sense Key : 0x2 [current] 
    Mar  1 14:41:09 Tower kernel: sd 6:0:0:0: [sde] tag#9173 ASC=0x4 ASCQ=0x0 
    Mar  1 14:41:09 Tower kernel: sd 6:0:0:0: [sde] tag#9173 CDB: opcode=0x8a 8a 00 00 00 00 03 93 cc 06 60 00 00 00 40 00 00
    Mar  1 14:41:09 Tower kernel: I/O error, dev sde, sector 15364523616 op 0x1:(WRITE) flags 0x0 phys_seg 8 prio class 0
    Mar  1 14:41:09 Tower kernel: md: disk3 write error, sector=15364523552
    Mar  1 14:41:09 Tower kernel: md: disk3 write error, sector=15364523560
    Mar  1 14:41:09 Tower kernel: md: disk3 write error, sector=15364523568
    Mar  1 14:41:09 Tower kernel: md: disk3 write error, sector=15364523576
    Mar  1 14:41:09 Tower kernel: md: disk3 write error, sector=15364523584
    Mar  1 14:41:09 Tower kernel: md: disk3 write error, sector=15364523592
    Mar  1 14:41:09 Tower kernel: md: disk3 write error, sector=15364523600
    Mar  1 14:41:09 Tower kernel: md: disk3 write error, sector=15364523608
    Mar  1 14:41:09 Tower kernel: sd 6:0:0:0: [sde] tag#9179 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=DRIVER_OK cmd_age=0s
    Mar  1 14:41:09 Tower kernel: sd 6:0:0:0: [sde] tag#9179 Sense Key : 0x2 [current] 
    Mar  1 14:41:09 Tower kernel: sd 6:0:0:0: [sde] tag#9179 ASC=0x4 ASCQ=0x0 
    Mar  1 14:41:09 Tower kernel: sd 6:0:0:0: [sde] tag#9179 CDB: opcode=0x8a 8a 00 00 00 00 03 93 cc 06 c0 00 00 00 60 00 00
    Mar  1 14:41:09 Tower kernel: I/O error, dev sde, sector 15364523712 op 0x1:(WRITE) flags 0x0 phys_seg 12 prio class 0
    Mar  1 14:41:09 Tower kernel: md: disk3 write error, sector=15364523648
    Mar  1 14:41:09 Tower kernel: md: disk3 write error, sector=15364523656
    Mar  1 14:41:09 Tower kernel: md: disk3 write error, sector=15364523664
    Mar  1 14:41:09 Tower kernel: md: disk3 write error, sector=15364523672
    Mar  1 14:41:09 Tower kernel: md: disk3 write error, sector=15364523680
    Mar  1 14:41:09 Tower kernel: md: disk3 write error, sector=15364523688
    Mar  1 14:41:09 Tower kernel: md: disk3 write error, sector=15364523696
    Mar  1 14:41:09 Tower kernel: md: disk3 write error, sector=15364523704
    Mar  1 14:41:09 Tower kernel: md: disk3 write error, sector=15364523712
    Mar  1 14:41:09 Tower kernel: md: disk3 write error, sector=15364523720
    Mar  1 14:41:09 Tower kernel: md: disk3 write error, sector=15364523728
    Mar  1 14:41:09 Tower kernel: md: disk3 write error, sector=15364523736

     

    I've also got the SMART results from the diag package, see below and attached.

     

    SMART Attributes Data Structure revision number: 10
    Vendor Specific SMART Attributes with Thresholds:
    ID# ATTRIBUTE_NAME          FLAGS    VALUE WORST THRESH FAIL RAW_VALUE
      1 Raw_Read_Error_Rate     POSR--   074   064   044    -    26762032
      3 Spin_Up_Time            PO----   082   080   000    -    0
      4 Start_Stop_Count        -O--CK   099   099   020    -    1587
      5 Reallocated_Sector_Ct   PO--CK   100   100   010    -    0
      7 Seek_Error_Rate         POSR--   082   060   045    -    175723966
      9 Power_On_Hours          -O--CK   080   080   000    -    17990
     10 Spin_Retry_Count        PO--C-   100   100   097    -    0
     12 Power_Cycle_Count       -O--CK   100   100   020    -    97
     18 Head_Health             PO-R--   100   100   050    -    0
    187 Reported_Uncorrect      -O--CK   100   100   000    -    0
    188 Command_Timeout         -O--CK   099   099   000    -    4295032833
    190 Airflow_Temperature_Cel -O---K   066   055   040    -    34 (Min/Max 27/40)
    192 Power-Off_Retract_Count -O--CK   100   100   000    -    27
    193 Load_Cycle_Count        -O--CK   090   090   000    -    20911
    194 Temperature_Celsius     -O---K   034   045   000    -    34 (0 23 0 0 0)
    195 Hardware_ECC_Recovered  -O-RC-   074   064   000    -    26762032
    197 Current_Pending_Sector  -O--C-   100   100   000    -    0
    198 Offline_Uncorrectable   ----C-   100   100   000    -    0
    199 UDMA_CRC_Error_Count    -OSRCK   200   200   000    -    0
    240 Head_Flying_Hours       ------   100   253   000    -    6332h+12m+55.571s
    241 Total_LBAs_Written      ------   100   253   000    -    29052003840
    242 Total_LBAs_Read         ------   100   253   000    -    555730377236
                                ||||||_ K auto-keep
                                |||||__ C event count
                                ||||___ R error rate
                                |||____ S speed/performance
                                ||_____ O updated online
                                |______ P prefailure warning

     

    This drive is through an LSI 9207 HBA SAS card with 3 other drives. I've reset the SAS to SATA cables and all power cords. I'm now running an extended SMART test on the drive. If that comes back clean, and with the above info, do you think the drive is OK or should I start an RMA with Seagate? If you think it's still got some life left, what is the process to re-enable the drive? Couldn't find much in the wiki and links in past posts went 404.

    ST8000VN004-2M2101-20230301-1548 disk3 (sde) - DISK_DSBL.txt

  4. 5 minutes ago, Nacho Acaso said:

    Today the plugin has stopped working, it does not show the folders or the files

    Captura_de_pantalla_2023-02-28_a_las_19.32.16.png

    I had this issue a couple days ago, scroll up a bit. I tried uninstalling then installing and it would still show up blank....but a restart of the server actually fixed it.

  5. No dice. Tried with the array stopped and with the cache drive out of the motherboard.

     

    Cache 2 was removed and only cache was left installed.

     

    image.png.50a465939032684437e28f7ac7769693.png

     

    Firmware listed on Samsung website:

     

    image.thumb.png.e69ffc6316f6cb90ec9d46479ae69a90.png

     

    image.thumb.png.c63284edf1efbf66e6ba0b1144c06882.png

     

    None of my other PC's have an nvme slot, otherwise i'd try to remove the old FW drive and update in another Mobo....I'm out of ideas.

  6. Once this parity check is complete, I'll first try stopping the array and running the tool again from an external drive.

    If that doesn't work I'll do as you suggest and remove the 'cache 2' drive to see if it'll pick up the old FW on the 'cache' drive.

    I'll report back with results. Much appreciated for your help with the instructions and troubleshooting.

  7. I was able to follow all the steps but the firmware updater does not detect all of my Samsung SSD's.

     

    image.png.4969d6647e537db432fbe6272a2df0aa.png

     

     

    I have two 970 EVO Plus drives in a cache pool together for mirroring. The drive named "Cache" has an older firmware starting with '2B' and Cache 2 already has the updated firmware starting '4B'. I'm looking to update the "Cache" NVME.

     

    When running the updater, it only detects "Cache 2" and says that the firmware is already up to date (which is true), but it does not list out the first drive "Cache". Any ideas?

  8. With the new version 2023.02.19, I'm unable to see any files when opening up shares. The screen just shows zero files. Everything worked great until this updated version. Is there a way to downgrade versions for plugins?

     

    image.thumb.png.cb930e9abba822b4bef13d41d38b5748.png

     

    Edit: a server reboot appears to have resolved the issue.

  9. That worked. It removed the windows formatting. I also updated the go file to use the new install commands and that worked as well. What's the main reason for using the three install commands instead of the copy and changing permissions? Just reducing the amount of lines in go?

     

    Thanks for the pointers and help.

  10. I realize this is an old thread, but I just went through the process and hope i can help someone else in the future.

     

    I tried using the fetch_key/delete_key from another thread on the forum, but apparently that broke a few versions of UnRaid ago. The below was pulled and compiled from other posts on the forum, spaceinvader videos linked above, plus some trial and error. I should note that I'm currently on 6.11.0.

     

    I have a backup server using a passphrase to decrypt and wanted to switch it to a keyfile with the ultimate goal of having a keyfile on a second machine and pulling a copy at startup. This would only allow the server to auto start when the main server is online. To convert my passphrase into a keyfile, the only method I was able to use successfully was the following command in terminal

     

    echo -n passphrase >/boot/keyfile

     

    This put the new keyfile in boot but that isn't enough as the keyfile needs to be in /root/ upon startup to decrypt and autostart the array. However, root is deleted upon system shutdown. So I added a 'cp' line in the /boot/config/go file to copy this keyfile to root as the system starts.

     

    #!/bin/bash
    
    cp /boot/keyfile /root/keyfile
    
    # Start the Management Utility
    /usr/local/sbin/emhttp &

     

    Once confirmed that the key works, I was able to move the keyfile off /boot as it is a BAD idea to have your key on the flashdrive that's plugged into the machine.  So I copied the keyfile on a second server in the 'keys' share and modified the above 'go' file mount the share, copy the keyfile to root, then unmount the drive.

     

    #!/bin/bash
    
    # cp /boot/keyfile /root/keyfile #commented out the previous method used to test the keyfile. Can remove this line if you have a working keyfile.
    
    mkdir -p /keys
    mount -t cifs -o user=username,password=userpassword,iocharset=utf8 //IPADDRESS/keys /keys
    cp -f /keys/keyfile /root/keyfile
    umount /keys
    rm -r /keys
    
    # Start the Management Utility
    /usr/local/sbin/emhttp &

     

    You'll need to modify the username, userpassword and IPADDRESS for what your settings.

     

    You can also use the above code from the go file in a user script to run 'At Stopping of Array' so it'll be loaded when you start the array back up. I also created the below script to remove the /root/keyfile after the array has started, that way the keyfile isn't sitting in root while the machine is one (probably not necessary as it is removed automatically at shutdown/restart).

     

    #!/bin/bash
    sleep 120
    rm /root/keyfile

     

    Hope this helps.

    • Thanks 1
  11. I'm able to run each step of the below 'fetch_key' script in terminal without issue,

    #!/bin/bash
    
    if [[ ! -e /root/keyfile ]]; then
      mkdir -p /keys
      mount -t cifs -o user='username',password='secret_key',iocharset=utf8 //'IP ADDRESS'/keys
     /keys
      cp -f /keys/AMD_array/keyfile /root/keyfile
      umount /keys
      rm -r /keys
    fi

     

    but when trying to run the script itself, i'm getting two errors.

    root@Tower:~# bash /usr/local/emhttp/webGui/event/starting/fetch_key
    /usr/local/emhttp/webGui/event/starting/fetch_key: line 2: $'\r': command not found
    /usr/local/emhttp/webGui/event/starting/fetch_key: line 11: syntax error: unexpected end of file

     

    My go file for reference.

    #!/bin/bash
    
    # auto unlock array by making use of events to fetch keyfile and delete it after decryption
    mkdir -p /usr/local/emhttp/webGui/event/starting
    mkdir -p /usr/local/emhttp/webGui/event/started
    mkdir -p /usr/local/emhttp/webGui/event/stopped
    cp -f /boot/custom/bin/fetch_key /usr/local/emhttp/webGui/event/starting
    cp -f /boot/custom/bin/delete_key /usr/local/emhttp/webGui/event/started
    cp -f /boot/custom/bin/fetch_key /usr/local/emhttp/webGui/event/stopped
    chmod a+x /usr/local/emhttp/webGui/event/starting/fetch_key
    chmod a+x /usr/local/emhttp/webGui/event/started/delete_key
    chmod a+x /usr/local/emhttp/webGui/event/stopped/fetch_key

     

    Did something major change?

  12. Redundant pool.  I went the path of moving all cache files off the pool onto the array, then started fresh with only one drive in the cache pool. Took a bit longer to move everything off then back on (appdata and vm's), but didn't want to risk it.

  13. On 7/18/2016 at 4:46 AM, JorgeB said:

    How do I remove a cache pool disk?

     

    A few notes:

    -unRAID v6.4.1 or above required, upgrade first if still on an older release.

    -Always a good idea to backup anything important on the current cache in case something unexpected happens

    -You can only remove devices from redundant pools (raid1, raid5/6, raid10, etc) but make sure to only remove one device at a time, i.e., you can't remove 2 devices at the same time from any kind of pool, you can remove them one at a time after waiting for each balance to finish (as long as there's enough free space on the remaining devices).

    -You can't remove devices past the minimum number required for the profile in use, e.g., 3 devices for raid1c3/raid5, 4 devices for raid6/raid10, etc, exception is removing a device from a two device raid1 pool, in this case Unraid converts the pool to single profile.

    -Some Unraid releases like current stable v6.8.3 permit removing devices from non redundant pools, but this is not officially supported, the pool itself can't be encrypted and the device being removed must remain connected while the pool is balanced to the new profile, there might be other gotchas so proceed with care.

     

     

    Procedure:

     

    • stop the array
    • unassign pool disk to remove
    • it's OK to reorder remaining pool members if you want (but see point below), i.e., you had a three device pool and removed cache2, you can assign cache3 to slot2 before starting the array (you can also do it later)
    • while it's OK to reorder members it's currently not OK to change the number of cache slots at the same time you remove a device, you can do that after the pool is balance down, i.e., just don't do both things at the same time.
    • start the array (after checking the "I'm sure" box next to the start array button)
    • a balance and/or a device delete will begin depending on the profile used and number of pool members remaining, wait for cache activity to stop, the stop array button will be inhibited during the operation, this can take some time depending on how much data is on the pool and how fast your devices are.
    • when the cache activity stops or the stop array button is available the replacement is done.

     

     

     

    Quote

    -Some Unraid releases like current stable v6.8.3 permit removing devices from non redundant pools, but this is not officially supported, the pool itself can't be encrypted and the device being removed must remain connected while the pool is balanced to the new profile, there might be other gotchas so proceed with care.

     

    So I'm on Version: 6.10.3 and have two 1TB SSDs as part of the cache pool. I'd like to split the pool up into two, which means i need to remove one of the drives, but they are encrypted. Per the above, I can't use this method.

     

    How do I remove one of the two cache disks in a pool where the drives are encrypted?

  14. On 4/27/2022 at 1:11 PM, Salzgablah said:

     

    I switched from SWAG to nginx proxy manager to try out other reverse proxy options to see if it fixed my issue.

     

    Still have a "Socket Failed to Connect" error after logging in through the reverse proxy. Making me think it's an ABS container issue instead of reverse proxy issue, but could be wrong.

    @jimmy898 and @Froberg, you both were right. Wasn't container based.

     

    It was addons with Firefox causing conflicts. It worked on chrome so started digging into the browser. Now it's working as intended. All good on my end.

    • Like 1
  15. On 3/22/2022 at 12:45 PM, Froberg said:

    Noticed a lot of people asking about nginx proxy manager, here's my working setup:

     

    image.png.e6af4c57f302b706fcc4a1a75661bcb2.png

    image.png.ae22ac7d150c26e2222918c3644c7d8d.png

    Hope it helps someone. 

     

    I switched from SWAG to nginx proxy manager to try out other reverse proxy options to see if it fixed my issue.

     

    Still have a "Socket Failed to Connect" error after logging in through the reverse proxy. Making me think it's an ABS container issue instead of reverse proxy issue, but could be wrong.

  16. On 11/1/2021 at 4:29 PM, Greygoose said:

    If anyone is trying to get this working with SWAG i managed to get it working with the following config.  I adapted the one on the socket.io website to listen on 443 as SWAG wouldnt allow the HTTP set with the socket.io template.

     

    Make sure to put your unraid IP address where i have put in caps below.  Also make sure you name your file as below and put in your SWAG proxy-confs folder

    audiobookshelf.subdomain.conf


     

      server {
        listen 443 ssl;
        listen [::]:443 ssl;
        
        server_name audiobookshelf.*;
        
        include /config/nginx/ssl.conf;
    
        location / {
          proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
          proxy_set_header Host $host;
    
          proxy_pass http://YOURUNRAIDIPADDRESS:13378;
    
          proxy_http_version 1.1;
          proxy_set_header Upgrade $http_upgrade;
          proxy_set_header Connection "upgrade";
        }
      }
    
    

     

     

    When connecting locally via the IP address, everything is working fine. Once i go through my swag reverse proxy, with the above conf, I can connect and login, but continually see a "Socket Failed to Connect" error. This happened on v2.0.1 and the new v2.0.2 version that was just released.

  17. So many fails with the overnight mem test, I'm talking 2,500+ errors. Took one stick out and tried again and passed without any. I'll see if the server is more stable now and come back with follow up after a few days to a week.

     

    Know the suggest was super simple, but helps solve one problem and hope I'm all set. Thanks.

  18. Over the last few weeks, i've had the server lockup on me. I've scoured through the various lockup threads and can't find a solution that works. Even added Spaceinvaders rc_nocbs=0-7 to the Syslinux. Granted my mobo is on the older side, but was hoping to use the hardware as a nice NAS. Didn't realize AMD had such issues with UNRAID. Is Intel really the way to go? Now the lockouts are happening every 10-20 min...

     

    Log is the most recent lock plus the following boot.

     

    Asus M5A99X EVO ATX AM3+ Motherboard

    AMD FX-8320 3.5 GHz 8-Core Processor

    G.Skill Ripjaws X Series 32 GB (4 x 8 GB) DDR3-1600 CL9 Memory

    Seagate IronWolf NAS 8 TB 3.5" 7200RPM Internal Hard Drive (Parity + 1 active drive + Trying to add second active drive)

    Samsung 860 Evo 1 TB 2.5" Solid State Drive (Cache)

    One other SSD but it is not mounted (UNUSED)

    EVGA GeForce GTX 1060 6GB 6 GB SC GAMING Video Card

    Antec Three Hundred Illusion ATX Mid Tower Case

    Corsair GS 600 W 80+ Bronze Certified ATX Power Supply

     

    I'm tempted to replace the mobo, chip and memory for a newer intel build if nothing else works.

    tower-diagnostics-20210209-1632.zip 2021_02_09 Lockup Syslog.txt

×
×
  • Create New...