suchamoneypit

Members • Posts: 42
Everything posted by suchamoneypit

  1. I've been running Unraid for several years and have been using my 5950X CPU for several months now. The Fix Common Problems plugin just notified me that Machine Check Events were detected and told me to post on the forums with logs. I think I see the errors in the logs; is there enough info here for someone to explain what the issue was? Notably, I think these all occurred during a parity check, which is currently 87% complete.

     Apr 7 01:23:49 S-Cartographer kernel: mce: [Hardware Error]: Machine check events logged
     Apr 7 01:23:49 S-Cartographer kernel: [Hardware Error]: Corrected error, no action required.
     Apr 7 01:23:49 S-Cartographer kernel: [Hardware Error]: CPU:1 (19:21:2) MC21_STATUS[-|CE|-|-|PCC|-|-|Poison|Scrub]: 0x839d8de800015080
     Apr 7 01:23:49 S-Cartographer kernel: [Hardware Error]: IPID: 0x0000000000000000
     Apr 7 01:23:49 S-Cartographer kernel: [Hardware Error]: Bank 21 is reserved.
     Apr 7 01:23:49 S-Cartographer kernel: [Hardware Error]: cache level: RESV, tx: INSN
     Apr 8 16:26:44 S-Cartographer kernel: mce: [Hardware Error]: Machine check events logged
     Apr 8 16:26:44 S-Cartographer kernel: [Hardware Error]: Corrected error, no action required.
     Apr 8 16:26:44 S-Cartographer kernel: [Hardware Error]: CPU:1 (19:21:2) MC23_STATUS[-|CE|-|AddrV|-|-|UECC|-|-|-]: 0x854824048b480084
     Apr 8 16:26:44 S-Cartographer kernel: [Hardware Error]: Error Addr: 0x0000000000000000
     Apr 8 16:26:44 S-Cartographer kernel: [Hardware Error]: IPID: 0x0000000000000000
     Apr 8 16:26:44 S-Cartographer kernel: [Hardware Error]: Bank 23 is reserved.
     Apr 8 16:26:44 S-Cartographer kernel: [Hardware Error]: cache level: RESV, tx: DATA
     Apr 14 04:30:07 S-Cartographer root: Fix Common Problems: Error: Machine Check Events detected on your server
     Apr 14 04:30:07 S-Cartographer root: mcelog: ERROR: AMD Processor family 25: mcelog does not support this processor. Please use the edac_mce_amd module instead.

     s-cartographer-diagnostics-20240415-1227.zip
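For anyone who wants to pull the Machine Check entries out of a syslog like the one in this post, here is a minimal Python sketch. It only extracts the CPU, bank number, and raw status value from the `MCnn_STATUS` lines and decodes the handful of high status bits that are architectural on x86 (Val, Over, UC, En, MiscV, AddrV, PCC). The bit positions are my understanding of the standard MCi_STATUS layout, and the function name and sample list are made up for illustration. It does not decode the vendor-specific fields and it does not diagnose the cause.

```python
import re

# Architectural x86 MCi_STATUS flag bits (assumed standard layout).
# Vendor-specific bits (ECC type, Poison, Scrub, ...) are deliberately left undecoded.
STATUS_BITS = {
    "Val": 63,    # entry is valid
    "Over": 62,   # an earlier error was overwritten
    "UC": 61,     # uncorrected error (clear means corrected or deferred)
    "En": 60,     # error reporting was enabled
    "MiscV": 59,  # MISC register holds valid data
    "AddrV": 58,  # ADDR register holds valid data
    "PCC": 57,    # processor context corrupt
}

# Matches lines such as "... CPU:1 (19:21:2) MC21_STATUS[...]: 0x839d8de800015080"
STATUS_RE = re.compile(r"CPU:(\d+) .*?MC(\d+)_STATUS\[[^\]]*\]: (0x[0-9a-fA-F]+)")


def parse_mce_lines(lines):
    """Return one dict per MCnn_STATUS line; all other lines are skipped."""
    events = []
    for line in lines:
        match = STATUS_RE.search(line)
        if not match:
            continue
        status = int(match.group(3), 16)
        flags = {name: bool((status >> bit) & 1) for name, bit in STATUS_BITS.items()}
        events.append({
            "cpu": int(match.group(1)),
            "bank": int(match.group(2)),
            "status": status,
            "flags": flags,
        })
    return events


# Sample lines copied from the post above.
SAMPLE_LINES = [
    "Apr 7 01:23:49 S-Cartographer kernel: [Hardware Error]: CPU:1 (19:21:2) MC21_STATUS[-|CE|-|-|PCC|-|-|Poison|Scrub]: 0x839d8de800015080",
    "Apr 7 01:23:49 S-Cartographer kernel: [Hardware Error]: Bank 21 is reserved.",
    "Apr 8 16:26:44 S-Cartographer kernel: [Hardware Error]: CPU:1 (19:21:2) MC23_STATUS[-|CE|-|AddrV|-|-|UECC|-|-|-]: 0x854824048b480084",
]

if __name__ == "__main__":
    for event in parse_mce_lines(SAMPLE_LINES):
        set_flags = [name for name, on in event["flags"].items() if on]
        print(f"CPU {event['cpu']} bank {event['bank']}: {hex(event['status'])} {set_flags}")
```

The decoded flags should agree with the bracketed summary the kernel already prints, so this is mainly useful for filtering a long syslog down to the relevant entries before posting.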
  2. Ah, at first I was confused, as I have. But then I realized: log out of Connect on the server itself, then log back in. That did it. Thank you, Jorge!
  3. I changed my server name months ago, and I've done several flash backups since. I have tried looking and searching, and I cannot seem to find a way to get this darn name updated... Is this not a thing yet? I've checked the online interface, the Unraid server itself, etc.
  4. I did an additional parity check and it's zero errors again. Not sure exactly why I suddenly saw repeated large parity error counts that then went away. I will say that I never rebooted the server in between those checks, so the only thing that's changed is that I fully shut down and restarted the server, and then the parity errors stopped showing up. Anyway, it seems to be in a good spot now. Thanks for the advice. And yes, Itimpi, I am running that plugin now. Thanks for making me aware of that!
  5. Interestingly enough, after the 2nd non-correcting parity check, no errors. Which is weird, because I was consistently getting them from the checks before. Attached the new diag, though. Also, I see the history tells you whether a check was error-correcting or not. I thought I had unchecked that option previously, but apparently I didn't, because the history shows the only non-correcting check being the one I just did. So I guess for now I wait and see if the errors resurface. s-cartographer-diagnostics-20240324-1849.zip
  6. I just wanted to say that today I rebooted my Unraid server to clear my logs while troubleshooting another issue, and it did not come back up because I had no display connected. The key takeaway is that I didn't take my entire network down while I figured that out, because I learned my lesson and have since migrated OPNsense to dedicated hardware, so nothing went down 😎 (the server had no display connected because the display was hooked up to the OPNsense box from when I set that up). Notably, after updating and rebooting with the Nvidia plugin installed again, my WebGUI works locally again. Like I said before, though, the WebGUI was working without the plugin before; I only got the plugin for hardware transcoding in Jellyfin/Plex. It seems that once I had set up the plugin, taking it away somehow made the local WebGUI stop loading when using my GPU. Maybe there was something else going on, but it is all working now.
  7. ok, thank you. I will start those now. They take a while to complete so it might be a day or two before I can come back with the new diag.
  8. On my last few parity checks I've been getting thousands of parity errors, when before I consistently got zero, or 1 or 2. Doing some light research, I keep seeing people ask for diagnostics files, specifically after running a parity check without error correction. I've done so, but I'm having trouble finding anyone who explains where to look in them for what's causing the errors. Could someone help me figure out what is causing the parity errors, and ideally point me to where specifically I should be looking to see where the errors are occurring? My uneducated guess would be a hardware issue. There are no reported errors on any of my drives, though. Attached are the diagnostics I ran recently with error correction off. Below is also a history of my parity checks. 3/5 and 3/11 were error-correcting, and once I saw it repeat with a similar error count I ran 3/18 without error correction. s-cartographer-diagnostics-20240320-1752.zip
  9. Although I am incredibly thankful to everyone who provided advice and took the time to read my issue and try to help, ultimately your post telling me to set up a second router so I could get internet and get my Unraid server an IP is what got me on the road to victory. I was convinced the whole server was down, but it was only the local GUI. Getting the second router set up and editing my network.cfg allowed me to get access back to the WebGUI and begin setting up my server after restoring my flash drive backup onto a new one. I did say $20 to anyone who leads me to the solution or tells me the solution, and because your comment is what got me to the WebGUI, that is you. I am a man of my word, so please private message me something like PayPal or Venmo and I will happily buy you a couple cups of coffee.
  10. Yes, thank you very much for the help on the plugins. At first they wouldn't work because the internet source was the VM, which comes up after the plugins load. I used the secondary router from earlier to get to the WebGUI, do the initial config, and get the server working again: I edited my network.cfg to put the server on an IP address matching the secondary router's network, booted up, and got my plugins loaded. Then I shut down the machine, edited network.cfg back to my OPNsense IP scheme, and switched my internet back to my OPNsense VM. Everything is 100% working now, with the new CPU and RAM installed. One weird thing was that my Plex and Jellyfin containers removed themselves, but the data was still there. I had appdata backups, though, and it was pretty easy to restore those. I have not yet verified that the local GUI loads now that I have the Nvidia driver plugin loaded. The important lesson learned here is that the Unraid machine also being the router is obviously the major thing that made troubleshooting such a pain. What started the issue was my USB dying in the middle of the hardware upgrade, and the fact that my network went down because of that just made it worse. I am definitely working on setting up a second machine and migrating OPNsense to it so this never happens again. I appreciate everyone's help and advice on this. I was SO relieved to get the server back up.
  11. Do you know of a good way to restore my plugins? The folders are there, but unraid seems to not like it and lists errors for them all. And notably before I ever installed the nvidia driver plugin, the GUI worked just fine through my 1060. I only ever got the plugin once I wanted to do GPU transcoding with Plex/Emby.
  12. No motherboard change, only CPU and RAM. I'm going to post a better explanation later; I'm tired and have to work again now that I have internet. Long story short: my server is back up and running. The only exception currently is that although the plugins folder on my USB has all my plugins, Unraid shows zero installed plugins. The local GUI is still not working, but with zero plugins, the Nvidia driver one is not installed. I'm going to have to dig into that later, unless someone has a solution or answer in the meantime. The server is running with the new CPU but the old RAM. Now that the flash drive is working, later today I'm also going to put the new RAM back in.
  13. More to come regarding this, but this seemed to be the key (I think you're gonna get the $20 bounty). Initially no: I had nothing else to hook up to the ISP modem, so I had no internet or wifi. I had to do all the troubleshooting over my phone's data or hotspot, which was terribly slow. I do now have Unraid connected to a router that's getting internet. I actually have two ISPs. One was essentially dormant, as I was going to disconnect that service, since Unraid and OPNsense run on my new provider, which is fiber. I hooked my old ISP's router back up, so I have wifi and internet.
  14. I will say when I checked my USB, the .plg file did not exist. I'm guessing it never downloaded because the server never had Internet access since the new flash drive
  15. I am still holding my breath, but I got an older router working with my old ISP, which I still have an account with and have yet to disconnect (I switched ISPs a few months ago). After doing this and then manually editing my network.cfg, I have been able to get to the WebGUI on that network. I'm currently slowly restoring things to the way they were and verifying everything is still working at each step. This may have been the key. As I suspected, when I got in nothing was running, because before I could even start the array it made me transfer my license to the new USB. I will provide an update soon, but I'm feeling quite hopeful, which has not been the case for nearly 10 hours of troubleshooting prior.
  16. I'll see if I can get that going and whether I can get to the WebGUI. I will say that even without the Nvidia plugin, it worked just fine with my 1060 before for basic display output.
  17. Yes, it won't boot into GUI mode. And I assume because it's a new USB, if I boot into CLI, nothing works, no VMs or Dockers start. Because the array needs to be set back to its initial config and started. unRAID runs an OPNsense VM, so even if I manually assign an IP, there is no router for another computer on my network to connect to unRAID. GUI refuses to load when testing locally. I have a kbm and monitor hooked up right to the machine. Previously when troubleshooting network down situations when I was getting the VM working with OPNsense, loading the GUI locally was of no issue. If the GUI won't load locally, I can't imagine I'll get it up on the network.
  18. The server is my router. OPNsense runs as a VM, which is down. The server won't get an IP currently. No services will start and I can't get to the WebGUI to check anything. I am troubleshooting with a monitor hooked up to the server.
  19. Diagnostics from running with my backup USB files. Also the results of running the "df" command, which should show if there was a flash mounting issue. I am now heading to bed, unsuccessful, at 3am, 9 hours after I started what was supposed to be a standard hardware upgrade. Praying to the computer gods that someone out there takes pity on me and knows enough to help me get my stuff back. I mean hell, if anyone directly leads me to or tells me the fix, $20 is yours. s-cartographer-diagnostics-20240227-2135.zip
  20. I got this diagnostics file from the CLI when trying to boot to non-GUI mode with a fresh install. tower-diagnostics-20240227-2117.zip
  21. My server is completely screwed, and I am desperate for help or guidance. I have spent 8 hours straight trying to get this working: I started at 5:45pm and I'm writing this at 2am. The server was also my home router, with OPNsense running as a VM. I work from home, so this is literally taking me offline. I was planning to separate the two, but now I'm paying the ultimate price for not getting to it sooner. Once this is (hopefully) resolved, that's getting done ASAP.

      I had a Ryzen 2700X and 48GB of RAM. I upgraded to a 5950X and 128GB. On the first reboot after the upgrade, Unraid failed to start at all. After hours, I realized the Unraid USB would not load even in Windows. A dead USB after a reboot? Super odd. I realized I had a very recent backup from Unraid Connect, from just 8 hours prior. I thought this was my saving grace and I would be back up and running shortly. NOPE. I set up a new USB, tried to boot, and now I'm completely stuck. No matter what I do, Dockers and VMs don't run and it never gets to the GUI. It just boots to a black screen with a typing cursor in the top left. After booting up, the initial few hundred lines of text from Unraid show, but then for GUI boots it goes to a blank screen with a cursor.

      For my BIOS, I have CSM enabled, SVM enabled, legacy USB support enabled, and fast boot and secure boot off. The system boots Ubuntu with the live installer/preview without any issues. I tried switching back to my old RAM. No matter what I do, the GUI will not come up. I get display output throughout the initial boot process. If I try to go to the GUI from there, it goes to the blank screen with a cursor once I enter "slim". Same thing in GUI safe mode. No Dockers or VMs will launch. I tried launching a VM from the CLI with "virsh start OPNsense" and got "failed to connect to hypervisor" and "operation not supported. Cannot use direct socket mode if no URI is set". I've tried this with a fresh Unraid install using the official tool, both with a new download and by using a ZIP. Exact same behavior. I'm using a Samsung 128GB USB 3.0 flash drive formatted as FAT32.

      You would have thought restoring a flash drive backup from Unraid Connect would be a simple ordeal, but it has honestly become a nightmare for me. I followed the exact steps in their "Manual method" for restoring the backup. I'm literally going to take the day off work tomorrow because my whole network is down and I can't get Unraid working.

      BIOS settings:
      SVM enabled (virtualization)
      CSM enabled
      legacy USB enabled
      fast boot off
      (I also tried with the BIOS defaults, with SVM off)

      Main hardware:
      Ryzen 5950X CPU
      Gigabyte Aorus Master B550, BIOS F15 (latest is F16 with a minor update; F10 supported the 5950X)
      Tried many RAM configs, including one that was working prior

      I'm booting with UEFI; my folder on the USB is "EFI". It won't boot if I try non-UEFI. I was doing this before as well. And again, this system has POSTed a hundred times at this point, and live Linux (Ubuntu) loads and runs just fine. Unraid refuses to. If anyone knows what's going on I would be IMMENSELY thankful.

      EDIT: SOLUTION: What really made troubleshooting a problem was that my Unraid server was also my router, with an OPNsense VM. The main problem that kicked this all off was my USB dying during my hardware upgrade. Without the Unraid array started, I also had no router or network, meaning no WebGUI access. For some reason, the local GUI stopped working for me. I believe it was because I also lost my Nvidia Driver plugin, but I will note it did work before without that. In retrospect, the WebGUI was likely up the entire time once I set up my backup USB, but I could not access it with no network, as the OPNsense VM was down. Setting up an old secondary router allowed me to get my network online. I connected the Unraid server to this router and got it on the network (manually setting the correct IP scheme by editing network.cfg on the USB), and from there I was able to access the WebGUI, set up the new USB license, then get my array started again. From there it was standard setup getting everything back to working order.
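For anyone repeating the network.cfg step from this solution, the part that is easy to get wrong is picking a static address that actually sits in the temporary router's subnet. Here is a small Python sketch of that check using the standard `ipaddress` module. The function name and every address below are hypothetical placeholders, not values from my setup, and this does not read or write network.cfg itself.

```python
import ipaddress


def in_same_subnet(host_ip: str, router_ip: str, netmask: str) -> bool:
    """Return True if host_ip falls inside the subnet defined by router_ip/netmask."""
    # strict=False lets us pass the router's own address instead of the network address.
    network = ipaddress.ip_network(f"{router_ip}/{netmask}", strict=False)
    return ipaddress.ip_address(host_ip) in network


if __name__ == "__main__":
    # Hypothetical example: temporary router at 192.168.1.1 with a /24 mask.
    print(in_same_subnet("192.168.1.50", "192.168.1.1", "255.255.255.0"))
    # A server still using an address from a different scheme would not be reachable.
    print(in_same_subnet("10.0.0.5", "192.168.1.1", "255.255.255.0"))
```

If the check fails for the address you plan to use, a client plugged into the temporary router will not be able to reach the WebGUI at that address.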
  22. Thank you, that took it from 1.07GB to 6.04kB. I will keep an eye on it and see if it climbs in size again.
  23. Hey, for some reason Project Zomboid specifically is taking up significantly more space in my docker img file than my other services. I don't think this is right?
  24. I have 3, 3TB SSDs in a pool for my cache. The Size correctly reports 3TB, but the "Used" and "Unused" values incorrectly total around 2.15TB. Notably, when doing a large amount of downloading the other day, the Used space got up to 2.8TB+. This seems to confirm the reported free space is being incorrectly calculated or displayed. The true capacity of 3TB seems correct, but the free space figure never reflects it. It should be showing 2.7-2.8TB of free space right now but only lists 2.04TB. Diagnostics ZIP is attached. My Unraid has gone through a couple of upgrades, and this issue has been around since I installed these 3 drives in the pool. diagnostics.zip