RallyGallery

Members
  • Posts

    38

RallyGallery's Achievements

Newbie

Newbie (1/14)

3

Reputation

  1. Sorted! I added the new drive again and now it works. As you said, it was a bug. For reference: once you add the larger drive, stop the array, unassign it from the pool, and let the balance run so that you have one drive in the pool. Then stop the array, add the new drive back in again and, hey presto, it works. Thanks ever so much for your help.
  2. Many thanks for your help. I will give it a go. The new disk is in the array as an unassigned device, so I will see what happens. I will report back the results and hopefully this thread can be closed. Only having 480GB is not a problem. My VMs, downloads and transcode stuff are on two separate NVMe drives in their own separate pools. Appreciate your time.
  3. Thanks for your help. As you suggested, I have removed the new SSD and it is now an unassigned device. The cache pool has rebalanced and the dockers are working fine. There is just a single cache drive currently. New diagnostics attached. To replace the old cache drive I:
      • Stopped the docker service.
      • Stopped the array.
      • Removed the 'defective' SSD from the cache pool.
      • Shut down the server.
      • Removed the old drive and put in the new 500GB drive.
      • Restarted the server.
      • Assigned the new SSD to the cache pool.
      • Started the server.
      The cache pool rebalanced and we end up with my initial problem as per the post. Thanks for any help you can give. pcserver-diagnostics-20210911-1159.zip
  4. I have had a cache pool of two SSDs in RAID 1 running with no issues. In the last two weeks I have noticed many errors in the main log for one of the SSDs, so I decided to buy a new SSD and replace it. The cache pool drives have been 500GB. The drive was replaced with no problems, but... The new drive was purchased as 500GB as per the original drive, yet the original 'good' drive still in the cache reports 480GB (sdf). I have tried to put the pool in RAID 1 but nothing happens. Is this because unRAID thinks the two drives are not of the same capacity? I suspect it is, which is a nuisance as I purchased 500GB drives to match. The log file shows:
      Sep 11 10:53:02 PCServer ool www[7241]: /usr/local/emhttp/plugins/dynamix/scripts/btrfs_balance 'start' '/mnt/cache' '-dconvert=raid1,soft -mconvert=raid1,soft'
      Sep 11 10:53:02 PCServer kernel: BTRFS error (device sdf1): balance: invalid convert data profile raid1
      Full diagnostics file attached. Is there a workaround for this? pcserver-diagnostics-20210911-1053.zip
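      On the capacity question in the post above, here is a minimal sketch of how usable space in a two-copy (raid1) btrfs pool is usually estimated. It assumes every chunk is mirrored on two different devices and ignores metadata overhead; the function name `raid1_usable_gb` is made up for this illustration. Under those assumptions a 480GB and a 500GB drive can be mirrored, with roughly 480GB usable, so the size mismatch alone would not be expected to block the conversion. The sketch also refuses a single-device pool, since raid1 needs at least two devices.

```python
def raid1_usable_gb(sizes_gb):
    """Rough usable space of a two-copy (raid1) pool, ignoring metadata overhead.

    Hypothetical helper for illustration only; sizes are in GB.
    """
    if len(sizes_gb) < 2:
        # raid1 keeps two copies on different devices, so one device is not enough.
        raise ValueError("raid1 needs at least two devices")
    total = sum(sizes_gb)
    largest = max(sizes_gb)
    # The largest device can only be filled as far as the remaining devices
    # can hold its mirror copies; otherwise half of the raw total is usable.
    return min(total / 2, total - largest)


if __name__ == "__main__":
    print(raid1_usable_gb([480, 500]))
```

      With the two drives from the post, the extra 20GB on the larger drive simply goes unused.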
  5. I have run an extended SMART test and no errors were found.
  6. I checked all the connections and made sure they were all OK. I rebooted the server and can run a SMART test now. Diagnostics attached, along with a screenshot of the disk status. I think this disk is failing. pcserver-diagnostics-20210907-1823.zip
  7. Full Diagnostics file. pcserver-diagnostics-20210906-2139.zip
  8. I have a cache pool made up of two 480GB SSDs in RAID 1. Over the last few days I have noticed that my trim cron job has errored. I just looked at the server and the log file is nearly full. It is full of entries such as:
      Sep 6 21:01:19 PCServer kernel: sd 10:0:0:0: [sdi] tag#28 UNKNOWN(0x2003) Result: hostbyte=0x04 driverbyte=0x00 cmd_age=0s
      Sep 6 21:01:19 PCServer kernel: sd 10:0:0:0: [sdi] tag#28 CDB: opcode=0x2a 2a 00 11 52 0f 00 00 00 80 00
      Sep 6 21:01:19 PCServer kernel: blk_update_request: I/O error, dev sdi, sector 290590464 op 0x1:(WRITE) flags 0x1800 phys_seg 16 prio class 0
      Sep 6 21:01:19 PCServer kernel: BTRFS warning (device sdf1): lost page write due to IO error on /dev/sdi1 (-5)
      Sep 6 21:01:19 PCServer kernel: BTRFS warning (device sdf1): lost page write due to IO error on /dev/sdi1 (-5)
      Sep 6 21:01:19 PCServer kernel: BTRFS warning (device sdf1): lost page write due to IO error on /dev/sdi1 (-5)
      Sep 6 21:01:19 PCServer kernel: BTRFS error (device sdf1): error writing primary super block to device 2
      Sep 6 21:01:24 PCServer kernel: scsi_io_completion_action: 53 callbacks suppressed
      Sep 6 21:01:24 PCServer kernel: sd 10:0:0:0: [sdi] tag#4 UNKNOWN(0x2003) Result: hostbyte=0x04 driverbyte=0x00 cmd_age=0s
      Sep 6 21:01:24 PCServer kernel: sd 10:0:0:0: [sdi] tag#4 CDB: opcode=0x2a 2a 00 00 12 9c 88 00 00 18 00
      Sep 6 21:01:24 PCServer kernel: print_req_error: 54 callbacks suppressed
      Sep 6 21:01:24 PCServer kernel: blk_update_request: I/O error, dev sdi, sector 1219720 op 0x1:(WRITE) flags 0x0 phys_seg 3 prio class 0
      I have a feeling that the second cache drive is failing. I also cannot run a SMART report. I would say the drive is failing or has failed? Unraid is not showing any errors in the 'Main' tab. The drive in question has a lot more reads and writes. As it is part of a cache pool, I should be able to stop the array, unassign the drive, power down the server, install the new SSD, assign it, and it should rebuild?
      Any help or advice is greatly appreciated.
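      As a reading aid for the log excerpt in the post above, here is a small sketch that tallies block-layer I/O errors per device and decodes the WRITE(10) command bytes shown on the `CDB:` lines. The function names and the `SAMPLE` list are made up for this illustration; the byte layout assumed is the standard 10-byte WRITE(10) format (opcode 0x2a, big-endian logical block address in bytes 2 to 5, transfer length in bytes 7 to 8).

```python
import re
from collections import Counter

# Matches kernel lines such as "I/O error, dev sdi, sector 290590464".
IO_ERROR = re.compile(r"I/O error, dev (\w+), sector (\d+)")


def io_errors_by_device(lines):
    """Count block-layer I/O errors per device name."""
    counts = Counter()
    for line in lines:
        match = IO_ERROR.search(line)
        if match:
            counts[match.group(1)] += 1
    return counts


def decode_write10(cdb_hex):
    """Return (start_sector, block_count) from a 10-byte WRITE(10) CDB."""
    cdb = bytes.fromhex(cdb_hex)
    if len(cdb) != 10 or cdb[0] != 0x2A:
        raise ValueError("not a WRITE(10) command block")
    start_sector = int.from_bytes(cdb[2:6], "big")  # logical block address
    block_count = int.from_bytes(cdb[7:9], "big")   # transfer length
    return start_sector, block_count


# Two of the lines quoted in the post, used as sample input.
SAMPLE = [
    "Sep 6 21:01:19 PCServer kernel: blk_update_request: I/O error, dev sdi, "
    "sector 290590464 op 0x1:(WRITE) flags 0x1800 phys_seg 16 prio class 0",
    "Sep 6 21:01:24 PCServer kernel: blk_update_request: I/O error, dev sdi, "
    "sector 1219720 op 0x1:(WRITE) flags 0x0 phys_seg 3 prio class 0",
]

if __name__ == "__main__":
    print(io_errors_by_device(SAMPLE))
    print(decode_write10("2a 00 11 52 0f 00 00 00 80 00"))
```

      Decoding the two command blocks gives start sectors 290590464 and 1219720, which match the sectors in the `blk_update_request` lines, so each failed write is the command logged just before it on `sdi`.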
  9. Thanks for the post. The NVME was formatted with version 6.9 so maybe it will keep the data when I assign it to the drive pool? It will be the only one in the pool, so that should also not complicate anything. The new NVME will be in a separate pool and it will be brand new so no issues with losing data there.
  10. I have just watched the latest @SpaceInvaderOne video on drive pools and managing them. My server is two years old and I have two SSDs in RAID 1 for my docker containers, which I will leave well alone. I also have an NVMe drive mounted as an unassigned device which has my Windows VM running, and it works really well. Now with Unraid 6.9 I am thinking of creating some drive pools. I have another NVMe drive coming which I will install, and I will create a drive pool on that one device and use it for downloaded files etc. That's easy to set up. However, I was thinking of creating another drive pool with the original NVMe for VMs. If I move the current unassigned devices NVMe drive into this new pool, it will wipe the drive, if I am correct? What is the best way to retain the VM data held on the currently installed NVMe drive and move it all to the new drive pool? I just want to ensure I use the best (and easiest) procedure to minimise any problems along the way and thought the community would have the best ideas. Thanks for any help.
  11. Just a quick shout-out for some guidance. I used the @SpaceInvaderOne video to set up a Minecraft server and also used Cloudflare. Worked a treat. It ran for a week or so. Then there was an update for Binhex's container (great work on all your containers, I use them for everything!) and it updated with no errors, but afterwards the Minecraft server was not available on the local network or via the internet. When you connect to the WebUI for the container (user name and password work fine) you get the following message in the window: "There is no screen to be attached matching minecraft." and that's it. Previously the WebUI showed all the data for the Minecraft server: who connected, what the world was doing, etc. There was another container update and I had the glibc error, which I corrected with the post in this thread, so that's all fixed and no problems there. However, I still get the console window error and cannot connect to the Minecraft server. The container starts and runs; I had a look through the logs and couldn't see anything obvious. Log posted below. Any help is greatly appreciated (username and password deleted from logs).
      Created by...
      ___.   .__       .__
      \_ |__ |__| ____ |  |__   ____ ___  ___
       | __ \|  |/    \|  |  \_/ __ \\  \/  /
       | \_\ \  |   |  \   Y  \  ___/ >    <
       |___  /__|___|  /___|  /\___  >__/\_ \
           \/        \/     \/     \/      \/
      https://hub.docker.com/u/binhex/
      2021-04-05 03:28:04.575012 [info] Host is running unRAID
      2021-04-05 03:28:04.620424 [info] System information Linux 85aa981c4a94 5.10.21-Unraid #1 SMP Sun Mar 7 13:39:02 PST 2021 x86_64 GNU/Linux
      2021-04-05 03:28:04.675189 [info] OS_ARCH defined as 'x86-64'
      2021-04-05 03:28:04.730666 [info] PUID defined as '99'
      2021-04-05 03:28:04.793458 [info] PGID defined as '100'
      2021-04-05 03:28:04.905287 [info] UMASK defined as '000'
      2021-04-05 03:28:04.956405 [info] Permissions already set for volume mappings
      2021-04-05 03:28:05.109976 [info] Deleting files in /tmp (non recursive)...
      2021-04-05 03:28:05.163616 [info] CREATE_BACKUP_HOURS defined as '12'
      2021-04-05 03:28:05.217617 [info] PURGE_BACKUP_DAYS defined as '14'
      2021-04-05 03:28:05.276984 [info] ENABLE_WEBUI_CONSOLE defined as 'yes'
      2021-04-05 03:28:05.329073 [info] ENABLE_WEBUI_AUTH defined as 'yes'
      2021-04-05 03:28:05.382607 [info] WEBUI_USER defined as '*********'
      2021-04-05 03:28:05.435596 [info] WEBUI_PASS defined as '********'
      2021-04-05 03:28:05.489763 [info] WEBUI_CONSOLE_TITLE defined as 'Minecraft Java'
      2021-04-05 03:28:05.540730 [info] CUSTOM_JAR_PATH defined as '/config/minecraft/minecraft_server.jar'
      2021-04-05 03:28:05.593004 [info] JAVA_VERSION defined as '8'
      2021-04-05 03:28:05.671902 [info] JAVA_INITIAL_HEAP_SIZE defined as '2048'
      2021-04-05 03:28:05.724184 [info] JAVA_MAX_HEAP_SIZE defined as '4096M'
      2021-04-05 03:28:05.780979 [info] JAVA_MAX_THREADS defined as '8'
      2021-04-05 03:28:05.834088 [info] Starting Supervisor...
      2021-04-05 03:28:06,622 INFO Included extra file "/etc/supervisor/conf.d/minecraft-server.conf" during parsing
      2021-04-05 03:28:06,622 INFO Set uid to user 0 succeeded
      2021-04-05 03:28:06,628 INFO supervisord started with pid 6
      2021-04-05 03:28:07,631 INFO spawned: 'backup-script' with pid 124
      2021-04-05 03:28:07,635 INFO spawned: 'purge-script' with pid 125
      2021-04-05 03:28:07,638 INFO spawned: 'shutdown-script' with pid 126
      2021-04-05 03:28:07,641 INFO spawned: 'start-script' with pid 127
      2021-04-05 03:28:07,642 INFO reaped unknown pid 7 (exit status 0)
      2021-04-05 03:28:07,650 DEBG 'backup-script' stdout output: [info] Waiting 12 hours before running worlds backup...
      2021-04-05 03:28:07,650 INFO success: backup-script entered RUNNING state, process has stayed up for > than 0 seconds (startsecs)
      2021-04-05 03:28:07,651 INFO success: purge-script entered RUNNING state, process has stayed up for > than 0 seconds (startsecs)
      2021-04-05 03:28:07,651 INFO success: shutdown-script entered RUNNING state, process has stayed up for > than 0 seconds (startsecs)
      2021-04-05 03:28:07,651 INFO success: start-script entered RUNNING state, process has stayed up for > than 0 seconds (startsecs)
      2021-04-05 03:28:07,652 DEBG 'purge-script' stdout output: [info] Removing any Minecraft worlds backups with a creation date older than 14 days...
      2021-04-05 03:28:07,659 DEBG 'start-script' stdout output: [info] Minecraft folder '/config/minecraft' already exists, rsyncing newer files...
      2021-04-05 03:28:07,726 DEBG 'start-script' stdout output: [info] Checking EULA is set to 'true'...
      2021-04-05 03:28:07,728 DEBG 'purge-script' stdout output: [info] Checking for old backups in 12 hours...
      2021-04-05 03:28:07,731 DEBG 'start-script' stdout output: [info] EULA set to 'true'
      2021-04-05 03:28:07,733 DEBG 'start-script' stdout output: [info] Starting Minecraft Java process...
      2021-04-05 03:28:07,740 DEBG 'start-script' stdout output: [info] Minecraft Java process is running
      2021-04-05 03:28:07,740 DEBG 'start-script' stdout output: [info] Starting Minecraft console Web UI...
      2021-04-05 03:28:07,797 DEBG 'start-script' stderr output: 2021/04/05 03:28:07 Permitting clients to write input to the PTY.
      2021/04/05 03:28:07 Using Basic Authentication
      2021/04/05 03:28:07 Server is starting with command: screen -x minecraft
      2021-04-05 03:28:07,798 DEBG 'start-script' stderr output: 2021/04/05 03:28:07 URL: http://127.0.0.1:8222/
      2021/04/05 03:28:07 URL: http://172.17.0.2:8222/
      2021-04-05 11:12:06,558 DEBG 'start-script' stderr output: 2021/04/05 11:12:06 192.168.1.180:50349 401 GET /
      2021-04-05 11:12:07,979 DEBG 'start-script' stderr output: 2021/04/05 11:12:07 Basic Authentication Succeeded: 192.168.1.180:50349
      2021-04-05 11:12:07,980 DEBG 'start-script' stderr output: 2021/04/05 11:12:07 192.168.1.180:50349 200 GET /
      2021-04-05 11:12:08,008 DEBG 'start-script' stderr output: 2021/04/05 11:12:08 Basic Authentication Succeeded: 192.168.1.180:50349
      2021-04-05 11:12:08,020 DEBG 'start-script' stderr output: 2021/04/05 11:12:08 Basic Authentication Succeeded: 192.168.1.180:50352
      2021-04-05 11:12:08,020 DEBG 'start-script' stderr output: 2021/04/05 11:12:08 Basic Authentication Succeeded: 192.168.1.180:50350
      2021/04/05 11:12:08 192.168.1.180:50350 200 GET /auth_token.js
      2021/04/05 11:12:08 192.168.1.180:50352 200 GET /js/gotty.js
      2021-04-05 11:12:08,021 DEBG 'start-script' stderr output: 2021/04/05 11:12:08 192.168.1.180:50349 200 GET /js/hterm.js
      2021-04-05 11:12:08,052 DEBG 'start-script' stderr output: 2021/04/05 11:12:08 New client connected: 192.168.1.180:50353
      2021-04-05 11:12:08,366 DEBG 'start-script' stderr output: 2021/04/05 11:12:08 Command is running for client 192.168.1.180:50353 with PID 151 (args="-x minecraft"), connections: 1
      2021/04/05 11:12:08 192.168.1.180:50353 101 GET /ws
      2021-04-05 11:12:08,400 DEBG 'start-script' stderr output: 2021/04/05 11:12:08 Command exited for: 192.168.1.180:50353
      2021-04-05 11:12:08,401 DEBG 'start-script' stderr output: 2021/04/05 11:12:08 read tcp 172.17.0.2:8222->192.168.1.180:50353: use of closed network connection
      2021/04/05 11:12:08 Connection closed: 192.168.1.180:50353, connections: 0
      2021-04-05 11:12:08,422 DEBG 'start-script' stderr output: 2021/04/05 11:12:08 Basic Authentication Succeeded: 192.168.1.180:50349
      2021-04-05 11:12:08,422 DEBG 'start-script' stderr output: 2021/04/05 11:12:08 192.168.1.180:50349 200 GET /favicon.png
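      One detail in the log above is that the web console command (`screen -x minecraft`) is started for a client and exits again within the same second, which fits the "There is no screen to be attached" message. Here is a minimal sketch that picks such short-lived console sessions out of the log text. It assumes the "Command is running for client" and "Command exited for:" lines keep the format shown in the post; the function name `short_lived_sessions` and the one-second threshold are made up for this illustration.

```python
import re
from datetime import datetime

STAMP = r"(\d{4}/\d{2}/\d{2} \d{2}:\d{2}:\d{2})"
STARTED = re.compile(STAMP + r" Command is running for client (\S+)")
EXITED = re.compile(STAMP + r" Command exited for: (\S+)")
TIME_FORMAT = "%Y/%m/%d %H:%M:%S"


def short_lived_sessions(log_text, max_seconds=1):
    """Return (client, lifetime_seconds) for console commands that exited quickly."""
    # Remember when the console command was started for each client.
    started = {
        client: datetime.strptime(stamp, TIME_FORMAT)
        for stamp, client in STARTED.findall(log_text)
    }
    short = []
    for stamp, client in EXITED.findall(log_text):
        if client not in started:
            continue
        ended = datetime.strptime(stamp, TIME_FORMAT)
        lifetime = (ended - started[client]).total_seconds()
        if lifetime <= max_seconds:
            short.append((client, lifetime))
    return short


# Two lines quoted from the post, used as sample input.
SAMPLE_LOG = (
    '2021/04/05 11:12:08 Command is running for client 192.168.1.180:50353 '
    'with PID 151 (args="-x minecraft"), connections: 1\n'
    "2021/04/05 11:12:08 Command exited for: 192.168.1.180:50353\n"
)

if __name__ == "__main__":
    print(short_lived_sessions(SAMPLE_LOG))
```

      A session flagged here only shows that the attach command ended almost immediately; it does not say why the `minecraft` screen session was missing.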
  12. Success! I have installed the VM. I changed the mount point as you pointed out and it works. Thank you to everyone who has helped, it is really appreciated. unRaid is not just a great system, the community is superb!
  13. Update! The server is working well, with no more failures. I decided to create a new VM on the NVMe disk because the main one I use was causing issues for some reason (as discussed above, and it will be deleted). I also have another VM running on the cache disk which just runs Blue Iris. The new VM is causing a strange problem. The VM creates and starts with no problem. Windows 10 setup loads and I can select the virtio storage driver and select where to create the primary partition, but when I click 'Next' it freezes. Unraid says the VM is paused and shows an orange dot next to the icon in the Unraid GUI. I have also ensured that the Primary disk location is 'manual' and selected 'remote' for the unassigned NVMe so it's not creating it on the cache pool. I have fiddled around with the unassigned devices settings for the NVMe drive, and at times when you select the Windows partition you see both the actual physical NVMe disk and the 30G Unraid-assigned space on the NVMe as targets for the Windows 10 install. The NVMe is mounted, not shared (I only want to use it to store VMs on), and not passed through. I will also create a test VM on the cache drive to see if I can reproduce the same behaviour. Any thoughts?
  14. The good news for the screenshot is that it’s for the ‘old’ VM which I am going to delete and not worry about. I just included it for info but it actually caused more confusion! Apologies for that! I have had no more issues as such. The external hard drive says missing in the historical section of unassigned devices but it’s passed through to the vm so that all works perfectly. In terms of the two options I want to place all the vm’s on the nvme and run them off there. Can you assist on how to move them over? I can normally work this stuff out but you are such a big help and it really saves time and stress! So to recap the unraid server boots up perfectly. The nvme drive is installed and working and mounted. I have a Windows VM that works fine with no errors and sees the external hard drive for Blue Iris use. Just want to move all VMs onto the nvme disk and run them all from there. Then have my cache pool just for docker containers. As ever, thank you!