trevisthomas

Members
  • Posts

    108

Everything posted by trevisthomas

  1. I got the dreaded error warning today after a hard reboot (the system was unresponsive). It's running now and a parity check is going. Anyone have any clue if this is telling me something? I attached the full log, but these don't sound like good news:

     Jun 25 05:50:04 TheBronze kernel: mce: [Hardware Error]: Machine check events logged
     Jun 25 05:50:04 TheBronze kernel: mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 3: fe00000000800400
     Jun 25 05:50:04 TheBronze kernel: mce: [Hardware Error]: TSC 0 ADDR ffffffff8108843d MISC ffffffff8108843d
     Jun 25 05:50:04 TheBronze kernel: mce: [Hardware Error]: PROCESSOR 0:a0655 TIME 1624618181 SOCKET 0 APIC 0 microcode e0
     Jun 25 05:50:04 TheBronze kernel: mce: [Hardware Error]: Machine check events logged
     Jun 25 05:50:04 TheBronze kernel: mce: [Hardware Error]: CPU 0: Machine Check: 0 Bank 4: fe00000000800400
     Jun 25 05:50:04 TheBronze kernel: mce: [Hardware Error]: TSC 0 ADDR fffff8044651bb59 MISC fffff8044651bb59
     Jun 25 05:50:04 TheBronze kernel: mce: [Hardware Error]: PROCESSOR 0:a0655 TIME 1624618181 SOCKET 0 APIC 0 microcode e0

     syslog
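For anyone who wants a first look at those `Bank N: <hex>` values before running them through mcelog, here is a minimal Python sketch. It only pulls out the generic flag bits that sit at the top of an x86 machine-check status register and splits off the two 16-bit code fields. It does not try to interpret what the codes mean, and the function names are made up for illustration.

```python
import re

# Generic flag bits in the upper part of an x86 IA32_MCi_STATUS value.
STATUS_FLAGS = {
    63: "VAL",    # the register holds a valid error
    62: "OVER",   # an earlier error was overwritten
    61: "UC",     # uncorrected error
    60: "EN",     # error reporting was enabled
    59: "MISCV",  # the MISC register is valid
    58: "ADDRV",  # the ADDR register is valid
    57: "PCC",    # processor context may be corrupt
}

# Matches the "Machine Check: 0 Bank 3: fe00000000800400" part of a kernel line.
LINE_RE = re.compile(r"Machine Check: \d+ Bank (\d+): ([0-9a-fA-F]{16})")


def decode_status(status):
    """Split a 64-bit status value into flag names and the two code fields."""
    flags = [
        name
        for bit, name in sorted(STATUS_FLAGS.items(), reverse=True)
        if (status >> bit) & 1
    ]
    return {
        "flags": flags,
        "mca_code": status & 0xFFFF,            # bits 15:0
        "model_code": (status >> 16) & 0xFFFF,  # bits 31:16
    }


def parse_mce_lines(text):
    """Return (bank, decoded status) for every machine-check line in text."""
    return [
        (int(bank), decode_status(int(hexval, 16)))
        for bank, hexval in LINE_RE.findall(text)
    ]


SAMPLE = (
    "Jun 25 05:50:04 TheBronze kernel: mce: [Hardware Error]: "
    "CPU 0: Machine Check: 0 Bank 3: fe00000000800400\n"
    "Jun 25 05:50:04 TheBronze kernel: mce: [Hardware Error]: "
    "CPU 0: Machine Check: 0 Bank 4: fe00000000800400\n"
)

if __name__ == "__main__":
    for bank, info in parse_mce_lines(SAMPLE):
        print(bank, info["flags"], hex(info["mca_code"]), hex(info["model_code"]))
```

This is only a rough triage aid. A proper decode still needs mcelog or the CPU vendor's documentation.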
  2. Thanks. I found it. I have mcelog installed now. I've done a dump, but I'm not noticing anything obviously different (for example, I don't see an mcelog file in it). I'll upload the diagnostic. thebronze-diagnostics-20201113-1335.zip
  3. Ah, thanks. I see it. I have Nerd Tools but not this piece. So I guess at this point there are no forensics, because I needed to have mcelog installed when the failure was detected? Or is there a way to run something now?
  4. My system was DOA on Monday. I had to press the reset button to get back in. I'm running 6.9 beta 30. The system has been running without issues since the event, but I just noticed that I have a "Machine Check Events" error logged in Fix Common Problems. It suggested that I install mcelog, but I can't find that. I was able to go to diagnostics and get a dump, though. Any... suggestions? thebronze-diagnostics-20201112-1751.zip
  5. I just had my 6.9 beta 30 server hard lock as well. I couldn't SSH in, all VMs were gone, and I couldn't get to the web console. Resetting the box got me back in. VMs and Dockers look to be back. I've never had Unraid hard freeze that I can even remember, except maybe in the 5.x days. This particular system was set up with 6.9 beta 25, then upgraded to 29, and then to 30 about a week ago. Now that I'm back in, is there any forensic data that I can pull from the system?
  6. Is there a currently supported solution for this problem? I have a website running in Docker and I need the DB to be up and running before the web server container starts.
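One generic workaround, independent of anything Unraid itself offers, is to have the web container wait until the database port accepts connections before launching the web server. The sketch below is a minimal Python version of that idea. It assumes an open TCP port is a good enough sign that the DB is ready, and the host name `db` and port `3306` in the usage line are placeholders, not values from this thread.

```python
import socket
import time


def wait_for_port(host, port, timeout=60.0, interval=1.0):
    """Poll host:port until a TCP connection succeeds or timeout expires.

    Returns True as soon as a connection is accepted, False on timeout.
    """
    deadline = time.monotonic() + timeout
    while True:
        try:
            # A successful connect is treated as "service is up" (an assumption).
            with socket.create_connection((host, port), timeout=interval):
                return True
        except OSError:
            if time.monotonic() >= deadline:
                return False
            time.sleep(interval)


if __name__ == "__main__":
    # Hypothetical host and port; replace with the real DB address.
    ready = wait_for_port("db", 3306, timeout=120)
    raise SystemExit(0 if ready else 1)
```

Run as the first step of the web container's start command, a non-zero exit would stop the web server from starting against a database that never came up.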
  7. Wow, that is interesting. Thanks. I rebooted and I'm watching the log (after closing a bunch of old webgui tabs), and... so far so good.
  8. Not sure if this is the correct place to put this, but I'm having an issue with a log filling up. My first hint of trouble was an email saying:

     error: error setting owner of /var/log/nginx/error.log to uid 0 and gid 0: Operation not permitted

     I SSH'd in, ran a couple of commands, and see this:

     root@TheBronze:~# df -h /var/log
     Filesystem      Size  Used Avail Use% Mounted on
     tmpfs           128M  128M     0 100% /var/log
     root@TheBronze:~# du -sm /var/log/*
     1 /var/log/apcupsd.events
     0 /var/log/btmp
     0 /var/log/cron
     0 /var/log/debug
     1 /var/log/dmesg
     0 /var/log/docker.log
     0 /var/log/faillog
     1 /var/log/lastlog
     1 /var/log/libvirt
     1 /var/log/maillog
     0 /var/log/messages
     0 /var/log/nfsd
     53 /var/log/nginx
     0 /var/log/packages
     0 /var/log/pkgtools
     0 /var/log/plugins
     0 /var/log/preclear.disk.log
     0 /var/log/removed_packages
     0 /var/log/removed_scripts
     1 /var/log/samba
     0 /var/log/scripts
     0 /var/log/secure
     0 /var/log/setup
     0 /var/log/spooler
     0 /var/log/swtpm
     1 /var/log/syslog
     74 /var/log/syslog.1
     2 /var/log/syslog.2
     1 /var/log/vfio-pci
     1 /var/log/wtmp

     The nginx log has megs of this:

     2020/10/04 11:01:18 [alert] 9375#9375: worker process 9232 exited on signal 6
     ker process: ./nchan-1.2.7/src/store/spool.c:479: spool_fetch_msg: Assertion `spool->msg_status == MSG_INVALID' failed.
     2020/10/04 11:01:20 [alert] 9375#9375: worker process 9243 exited on signal 6
     ker process: ./nchan-1.2.7/src/store/spool.c:479: spool_fetch_msg: Assertion `spool->msg_status == MSG_INVALID' failed.
     2020/10/04 11:01:22 [alert] 9375#9375: worker process 9257 exited on signal 6
     ker process: ./nchan-1.2.7/src/store/spool.c:479: spool_fetch_msg: Assertion `spool->msg_status == MSG_INVALID' failed.
     2020/10/04 11:01:24 [alert] 9375#9375: worker process 9265 exited on signal 6
     ker process: ./nchan-1.2.7/src/store/spool.c:479: spool_fetch_msg: Assertion `spool->msg_status == MSG_INVALID' failed.
     2020/10/04 11:01:26 [alert] 9375#9375: worker process 9279 exited on signal 6
     ker process: ./nchan-1.2.7/src/store/spool.c:479: spool_fetch_msg: Assertion `spool->msg_status == MSG_INVALID' failed.
     2020/10/04 11:01:28 [alert] 9375#9375: worker process 9323 exited on signal 6

     The syslog looks like this:

     Oct 5 07:12:57 TheBronze root: error: /plugins/user.scripts/exec.php: wrong csrf_token
     Oct 5 07:12:57 TheBronze nginx: 2020/10/05 07:12:57 [alert] 9375#9375: worker process 18834 exited on signal 6
     Oct 5 07:12:58 TheBronze root: error: /plugins/user.scripts/exec.php: wrong csrf_token
     Oct 5 07:12:58 TheBronze nginx: 2020/10/05 07:12:58 [alert] 9375#9375: worker process 8219 exited on signal 6
     Oct 5 07:12:58 TheBronze nginx: 2020/10/05 07:12:58 [alert] 9375#9375: worker process 8232 exited on signal 6
     Oct 5 07:12:59 TheBronze root: error: /plugins/user.scripts/exec.php: wrong csrf_token
     Oct 5 07:13:00 TheBronze root: error: /plugins/user.scripts/exec.php: wrong csrf_token
     Oct 5 07:13:00 TheBronze nginx: 2020/10/05 07:13:00 [alert] 9375#9375: worker process 8233 exited on signal 6
     Oct 5 07:13:01 TheBronze root: error: /plugins/user.scripts/exec.php: wrong csrf_token
     Oct 5 07:13:02 TheBronze nginx: 2020/10/05 07:13:02 [alert] 9375#9375: worker process 8241 exited on signal 6
     Oct 5 07:13:02 TheBronze root: error: /plugins/user.scripts/exec.php: wrong csrf_token
     Oct 5 07:13:03 TheBronze root: error: /plugins/user.scripts/exec.php: wrong csrf_token
     Oct 5 07:13:04 TheBronze root: error: /plugins/user.scripts/exec.php: wrong csrf_token
     Oct 5 07:13:04 TheBronze nginx: 2020/10/05 07:13:04 [alert] 9375#9375: worker process 8315 exited on signal 6
     Oct 5 07:13:05 TheBronze root: error: /plugins/user.scripts/exec.php: wrong csrf_token
     Oct 5 07:13:06 TheBronze root: error: /plugins/user.scripts/exec.php: wrong csrf_token
     Oct 5 07:13:06 TheBronze nginx: 2020/10/05 07:13:06 [alert] 9375#9375: worker process 8353 exited on signal 6
     Oct 5 07:13:07 TheBronze root: error: /plugins/user.scripts/exec.php: wrong csrf_token
     Oct 5 07:13:08 TheBronze root: error: /plugins/user.scripts/exec.php: wrong csrf_token
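With that many entries in the `du -sm` listing, the two or three directories that actually fill the 128M tmpfs are easy to miss. A small Python sketch that ranks the entries by size could look like this. The helper name is made up, and the sample only reuses a few rows from the listing in the post.

```python
def largest_entries(du_output, top=3):
    """Return the top (size_mb, path) pairs from `du -sm` style output."""
    entries = []
    for line in du_output.strip().splitlines():
        # Each line is "<size in MB> <path>"; split only on the first gap.
        size, path = line.split(None, 1)
        entries.append((int(size), path.strip()))
    return sorted(entries, key=lambda entry: entry[0], reverse=True)[:top]


# A few rows copied from the du listing in the post.
SAMPLE = """\
1 /var/log/libvirt
53 /var/log/nginx
1 /var/log/syslog
74 /var/log/syslog.1
2 /var/log/syslog.2
"""

if __name__ == "__main__":
    for size_mb, path in largest_entries(SAMPLE):
        print(f"{size_mb:>4} MB  {path}")
```

On this sample the rotated syslog and the nginx directory come out on top, which matches the two logs quoted in the post.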
  9. Updated from Beta 25 to Beta 29. 0 issues so far. System continues to be stable. Four VMs (2 VNC, 2 dedicated GPU). Maybe a dozen Docker containers, including Tomcat, SQL Server, Letsencrypt, and a number of others. The only issue impacting me so far is the lack of daily email. I saw that there was a hotfix for that, but I haven't applied it yet. I wanted to make sure that things were running well first.
  10. Thanks to everyone for sharing their knowledge in this thread. I've been curious about getting a SQL Server Docker image working for some time but never dabbled until today. I was losing faith when I saw that the official container was not supported anymore, but this thread got me up and running. I'm not very strong with Linux or Docker, but this thread made it very easy. What I did: I started with Merrrp's config on GitHub. I went to the Dockerman folder (on my Unraid flash drive) and created an XML file with the contents of Meerp's config. I restarted the Unraid Docker engine, went to the Docker tab, and chose 'Add Container'. I was able to choose Meerp's config from there. I set the password, port, etc. and hit Apply. In a few minutes SQL Server was up and running... sort of. It seems that a new variable, 'MSSQL_COLLATION', is required. (I figured this out by looking at the error log, which was located in the appdata folder.) I added the parameter and set it to 'SQL_Latin1_General_CP1_CI_AS' (which, Google told me, was the default). After adding the parameter, SQL Server started up. I went to a Windows system, installed the latest Management Studio, and was able to connect. I'm running Unraid 6.9 Beta 25 and Microsoft SQL Server 2019 (RTM-CU7) (KB4570012) - 15.0.4063.15 (X64). Best of luck to anyone else who finds this thread.
  11. Ok, yeah, that worked like a charm. Glad I asked. I was halfway down the road to reading up on the dd command for creating these from the command line in Linux. It didn't look that hard, but it was much nicer to be able to create it from the GUI.
  12. Interesting. So I was overthinking this. I'll give that a shot, thanks.
  13. I've got an UnRAID host with a VM running (two, actually), and I'm running the 6.9 Beta 25 build with its epic multi cache disk setup. What I'd like to do is have the OS vdisk on one drive and the Photoshop scratch disk on a different drive. The problem is that Photoshop won't let me make a mapped drive the scratch disk. So I was thinking that maybe I could make a second vdisk (which lives on a different SSD) and map that in the XML. The question is, can I create a blank vdisk from the command line? Is that the best way to do this in UnRAID? (Very open to alternative solutions too 🙂) Oh, and PS: I'd really rather not grow the primary vdisk. I like keeping them small so that they are easy to back up. Also, I'd like to get the OS and scratch on two separate SSDs.
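For what it's worth, a blank raw-format vdisk is just a file of the right size with nothing written to it, so it can be created by extending an empty file. The Python sketch below does that. It assumes the VM will attach the file as a raw image, the path in the usage line is hypothetical, and `qemu-img create` is the more usual tool for this job.

```python
import os


def create_raw_vdisk(path, size_bytes):
    """Create an empty raw disk image of size_bytes and return its size.

    The file is extended without writing data, so on filesystems that
    support sparse files it takes almost no space until the guest writes.
    """
    # Mode "x" refuses to overwrite an image that already exists.
    with open(path, "xb") as image:
        image.truncate(size_bytes)
    return os.path.getsize(path)


if __name__ == "__main__":
    # Hypothetical location on a second SSD pool; 50 GiB scratch disk.
    create_raw_vdisk("/mnt/scratch_pool/domains/photoshop/scratch.img", 50 * 1024**3)
```

The guest would still need to partition and format the new disk before Photoshop can use it.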
  14. Hm, I didn’t try that. Maybe I’m overthinking this issue. It’s probably quite the edge case that I need to delete something that she’s created, and vice versa. And when that happens I can SSH in and chown. It just took me by surprise that she couldn’t delete files, since I was logged in as me when I migrated her data over. I didn’t think of it at the time.
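Before reaching for chown, it can help to see which files on a share are actually owned by someone else. The following is a minimal Python sketch that walks a directory tree and lists everything not owned by a given uid. The function name and the share path in the usage line are made up for illustration.

```python
import os


def files_not_owned_by(root, uid):
    """List every file and directory under root whose owner is not uid."""
    foreign = []
    for dirpath, dirnames, filenames in os.walk(root):
        for name in dirnames + filenames:
            full = os.path.join(dirpath, name)
            # lstat so that symlinks are checked themselves, not their targets.
            if os.lstat(full).st_uid != uid:
                foreign.append(full)
    return sorted(foreign)


if __name__ == "__main__":
    # Hypothetical share path; reports anything the current user does not own.
    for path in files_not_owned_by("/mnt/user/photos", os.getuid()):
        print(path)
```

The resulting list could then be passed to `os.chown` (or plain `chown`) as root to fix only the files that need it.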
  15. Sorry, I didn't have notifications turned on for this post. (I wish it did that by default; rookie mistake on my part.) Your test results don't match mine. When I mount a Secure drive as a specific user from Windows and then SSH into Unraid, I can see that the file is owned by that user. And from Windows I cannot delete files that the other person created. I mounted the shares via 'net use'. My goal for user accounts was that each person could read/write anything (no matter the owner) on shares where they have read/write permissions, but not be able to delete files on drives where they only have read permissions. As it stands, I have us both using the same credentials, which limits my ability to decide which shares each of us can read/write to. If Windows allowed me to have different users for different shares, that would solve it too, but... that's not allowed. --- Oh, and I mostly use a Sony A7ii these days.
  16. My wife and I are part-time photographers and can generate lots of data. I use Unraid to house the terabytes of images that we have. In order to mitigate risk, I would like each of us to have our own user account but be able to read/write to some of the same shares. The problem is that even if we both have read/write access, if she creates files as herself, I can't delete them, and vice versa. Is there not a way to have a read/write share create the files as 'nobody' so that we can both manage them? It seems the only solution is to share an account. Windows is no help since we can't have different folders mounted with different accounts. It really limits how I can do account management. What are others doing?
  17. In the foreground of the photo is a tiny USB 3 card connected with the adaptor in the Amazon link. In the background, the other end of that USB cable is connected to the part that fits into the PCIe slot. After this shot I was able to reinstall the 5500 GPU, which fits perfectly with its fans and heat sinks extended over the PCIe slot.
  18. I'm building a new UnRAID host to replace the desktop PCs in my house, and I needed two GPUs and 3 USB controllers. The problem was that none of the motherboards that satisfied my other criteria had enough accessible PCIe slots if GPUs were in the long 16x and 4x slots. The solution: https://www.amazon.com/gp/product/B07N38Y799/ref=ppx_yo_dt_b_asin_title_o01_s00?ie=UTF8&psc=1
  19. Solved. It's pretty easy. The steps under 'what if I can't backup my device' led me to the answer. https://wiki.unraid.net/UnRAID_6/Changing_The_Flash_Device#:~:text=Simply install Unraid to a,button the Tools%2FRegistration page. The scenario wasn't quite the same, but this worked. Since I had the build that I wanted on a new USB drive that I also wanted, all I had to do was replace config/Trial.key with the Pro.key from my old drive (registered in 2011!). Once Unraid was booted, it saw the Pro license and gave me the option to generate a new key file for the new drive. Up and running now.
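The key swap described above is just a delete and a copy inside the flash drive's `config` folder. As a rough Python sketch, assuming both drives are mounted on the same machine (the mount points in the usage line are hypothetical, and the helper name is made up):

```python
import shutil
from pathlib import Path


def swap_license_key(new_config_dir, old_key_file):
    """Remove Trial.key from new_config_dir and copy the old key file in.

    Returns the path of the copied key.
    """
    config = Path(new_config_dir)
    old_key = Path(old_key_file)
    # Check the source first so the trial key is never removed for nothing.
    if not old_key.is_file():
        raise FileNotFoundError(old_key)
    trial = config / "Trial.key"
    if trial.exists():
        trial.unlink()
    target = config / old_key.name
    shutil.copy2(old_key, target)
    return target


if __name__ == "__main__":
    # Hypothetical mount points for the new and the old flash drive.
    swap_license_key("/mnt/new_flash/config", "/mnt/old_flash/config/Pro.key")
```

This only moves the file. As the post says, Unraid itself then offers to generate a new key file for the new drive after booting.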
  20. Interesting. Thanks. I guess that should be fairly obvious if I boot it and it’s complaining about the config. Or... doesn’t show the disk drop downs populated.
  21. Ok, so my build is in a pretty happy place, but I need to move the new configuration onto a thumb drive with my Pro license. I have a Pro license, but built the new system using a trial key on a fresh thumb drive. Now that I'm happy with the build, I want to move the config from the trial thumb drive to my licensed one. Anyone got any tips on how to do that? I'm assuming that I can take the backup zip of the trial thumb drive, unzip it onto the licensed thumb drive, delete /config/Trial.key, and replace it with the old /config/Pro.key? Is that all there is to it?
  22. Thanks for the reply. Right now I have the AMD GPU installed and things are working well. I'm still debating whether it's worth trying to diagnose the Nvidia issue, or just returning the card and sticking with the 5500. The restart script to work around the AMD card getting stuck works pretty well, and with normal use it doesn't seem like it will happen often.
  23. Same problem, 6.9 beta 25. Shutting down Docker made it stop. I only have one container in there and it wasn't even started.
  24. New UnRAID build (6.9 beta 25!). The goal is two VMs with GPU/USB pass-through. I would like to use two Nvidia GPUs in the box, but for the life of me, it will not let the card in the second PCIe slot start. I've not seen talk about this, which makes me think that it is something I'm doing wrong, but I can't figure it out. The two cards in question are a 970 and a 1650, both from Asus. I have an exported BIOS for each card that absolutely works, no problem, in the primary slot. I can put either card in the first slot and the VMs boot no problem. No matter which card I install in the second slot, it won't start if an Nvidia card is in the first one. This is on an ASRock Velocita Z490 motherboard. Any thoughts? Is this a known limitation of Nvidia cards? A bug in the beta? What can I do to diagnose it? When I say that the second card fails to start, I mean that when you tap the button to start the VM, you get a very simple "QEMU failed to initialize" message.
  25. Got it. The transfer process looks fully self-service. No need to even contact support. Cool. Sounds like the old GUID is blacklisted in this process, so there's no going back. Thanks.