Posts posted by leeknight1981

  1. On 8/4/2021 at 2:31 PM, ich777 said:

    This tells the plugin not to create a cron job, and from what I've seen in your crontab the cron job for the update check was not created, so it is impossible for your server to notify you when a new version is released...

     

    Which plugin version are you on? I'm asking because of the screenshot.

    The notification is the green one, time and date stamped. My server was running all OK on the previous version. I realised there was an issue when I tried to clear that notification: I had to click on the tab to reload, and that's when it locked up! You say it's not possible for my server to notify me, yet there it is, notifying me that the update is done and I need to reboot. Before I even rebooted, the logs were full of the error and the GUI had locked up. I'm not saying it's your plug-in, but it's also not my server / hardware, as it had been working faultlessly until I got that green notification of an updated version, which is exactly what broke it last time, in exactly the same way.

    40F16525-91C4-497A-862A-C1D7F0C77964.jpeg

  2. 3 minutes ago, ich777 said:

    That would be super nice.

    Also check if the plugin looks like in my screenshot above, the Update Notification should be in between System Info and the Download button.

     

    Do you boot in GUI mode or do you leave a page from unRAID open on your local PC?

    Are you sure that the server was actually locked up and not only the GUI on this instance from your browser?

    No, it boots into normal Unraid mode, not GUI mode. I do have a tab open on my iMac during the day, and that's when I noticed the GUI had locked up: I clicked close on the notification and it wouldn't go, so I clicked Main and it just sat there with the three wavy lines.

  3. 45 minutes ago, ich777 said:

    What do you mean with nothing changed?

    The automatic update is a cron job that runs between 8am and 10am. If a newer driver is found, it downloads it in the background and sends the message. Nothing is installed when the message is sent; that is all done on reboot.

     

    But then there is something else wrong, when 470.42.01 was fine until you rebooted and now you have to run 460.80...

     

    Can you be a little more precise please? What was forced? What does the plugin page say about the automatic update, have you clicked Update after you set automatic update to False?

     

    Please open up a unRAID terminal and send me the output from 'crontab -e' (without quotes) and also from 'cat /boot/config/plugins/nvidia-driver/settings.cfg' (without quotes).

     

    Which version of the plugin itself are you running? "2021.07.30" is the latest one.

    # If you don't want the output of a cron job mailed to you, you have to direct
    # any output to /dev/null.  We'll do this here since these jobs should run
    # properly on a newly installed system.  If a script fails, run-parts will
    # mail a notice to root.
    #
    # Run the hourly, daily, weekly, and monthly cron jobs.
    # Jobs that need different timing may be entered into the crontab as before,
    # but most really don't need greater granularity than this.  If the exact
    # times of the hourly, daily, weekly, and monthly cron jobs do not suit your
    # needs, feel free to adjust them.
    #
    # Run hourly cron jobs at 47 minutes after the hour:
    47 * * * * /usr/bin/run-parts /etc/cron.hourly 1> /dev/null
    #
    # Run daily cron jobs at 4:40 every day:
    40 4 * * * /usr/bin/run-parts /etc/cron.daily 1> /dev/null
    #
    # Run weekly cron jobs at 4:30 on the first day of the week:
    30 4 * * 0 /usr/bin/run-parts /etc/cron.weekly 1> /dev/null
    #
    # Run monthly cron jobs at 4:20 on the first day of the month:
    20 4 1 * * /usr/bin/run-parts /etc/cron.monthly 1> /dev/null
    0 2 28 * * /usr/local/emhttp/plugins/ca.backup2/scripts/backup.php &>/dev/null 2>&1
    Read /var/spool/cron/crontab.C8zSyX, 23 lines, 1125 chars                                                    1,1   Command

     

     

     

     

     

     

    root@R720XD:~# cat /boot/config/plugins/nvidia-driver/settings.cfg
    first_installation=false
    driver_version=460.80
    local_version=460.80
    disable_xconfig=false
    update_check=false
    root@R720XD:~#

     

     

     

     

    Can you be a little more precise please? What was forced?

    Yes, my server locked up and showed that error in the logs, along with the update notification, before I restarted. It also showed the newest driver as installed, as if it had been installed without a reboot.
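The two checks requested above can also be run without opening an editor: `crontab -e` opens the crontab for editing (which is why the paste ends with an editor status line), while `crontab -l` only prints it. The following is a minimal sketch, not the plugin's own tooling. The helper names `get_setting` and `cron_mentions` are hypothetical, and searching for the word "nvidia" is only an assumption about how an update-check job would appear in the crontab. The settings path is the one quoted in the thread.

```shell
# Hypothetical helper: print the value of KEY ($1) from a key=value file ($2).
get_setting() {
    awk -F= -v k="$1" '$1 == k { print $2; exit }' "$2"
}

# Hypothetical helper: read crontab text on stdin and succeed if any
# non-comment line mentions PATTERN ($1), case-insensitively.
cron_mentions() {
    grep -v '^[[:space:]]*#' | grep -qi -- "$1"
}

# Path as quoted in the thread; skipped when the file is not present.
cfg=/boot/config/plugins/nvidia-driver/settings.cfg
if [ -r "$cfg" ]; then
    echo "update_check=$(get_setting update_check "$cfg")"
    # Assumption: an update-check job would contain "nvidia" somewhere.
    if crontab -l 2>/dev/null | cron_mentions nvidia; then
        echo "a cron entry mentioning nvidia exists"
    else
        echo "no cron entry mentioning nvidia"
    fi
fi
```

With the settings file and crontab pasted in this post, this would be expected to report `update_check=false` and no matching cron entry.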

  4. 34 minutes ago, ich777 said:

    What do you mean with nothing changed?

    The automatic update is a cron job that runs between 8am and 10am. If a newer driver is found, it downloads it in the background and sends the message. Nothing is installed when the message is sent; that is all done on reboot.

     

    But then there is something else wrong, when 470.42.01 was fine until you rebooted and now you have to run 460.80...

     

    Can you be a little more precise please? What was forced? What does the plugin page say about the automatic update, have you clicked Update after you set automatic update to False?

     

    Please open up a unRAID terminal and send me the output from 'crontab -e' (without quotes) and also from 'cat /boot/config/plugins/nvidia-driver/settings.cfg' (without quotes).

     

    Which version of the plugin itself are you running? "2021.07.30" is the latest one.

    Will have to do all that after work! Basically the server's been running fine all week! This morning the GUI had locked up and that notification was on the screen. Auto plug-in update is off, yet it says found and downloaded, which caused the usual error in the logs. Just to clarify: the server has been working, and the GPU has been working and transcoding all week with no issue, not one. Then this morning my server locked up and that update was installed, which broke it.
     

    regards 

     

    lee 

    9AAE22D7-CD23-4FFA-A57D-C781CD4A7506.jpeg

  5. 54 minutes ago, ich777 said:

    Can you please double check that Auto Update is off?

     

    What crashed your GUI? The download from the update?

     

    The first thing I have to say: v470.57.02 was uploaded to the repository on 2021.07.21, so the update notification should have been triggered on 2021.07.22.

    What I can imagine is that something was wrong when the plugin tried to fetch the versions.

     

    For troubleshooting reasons I would try one last time the latest driver since your card should be supported.

     

    Also please include the Diagnostics, otherwise troubleshooting is really hard.

    Nope, nothing has changed, nothing at all. My server GUI locked up and I had a notification of the update. I'm running the first one and it's working fine, so it's not my server or GPU.
    The server's running OK, not locking up, no fault in the logs, and it's transcoding fine. So I don't know, but v470.42.01 worked absolutely fine until today, when it updated and then the GUI locked up with the fault in the logs. I'm now running 460.80 and again it's running fine with no issues, so if it was my server / GPU, why was it OK on 470.42.01 and 460.80? I don't want it auto-updating. I turned auto update off, yet it's being forced, like this morning.

    4F521F13-C68B-4244-990E-6BD20B1BA212.jpeg

    46448BF1-3D8D-4048-9592-6E1D09C8868B.jpeg

    64659496-D148-4BC2-80A1-41A11D711220.jpeg

  6. On 8/2/2021 at 8:23 PM, ich777 said:

    Really interesting how an update that changed nothing in terms of the drivers can magically fix the problem. 😂

    OK, so my server locked up at 09:56 today (04.08.21). When I went on, it said a new Nvidia update, v470.57.02, had been found and downloaded and to please reboot my server, even though Auto Update is off. So it crashed my GUI and I had that same error in the logs. I rebooted using the one that was working, v470.42.01, and it's working fine! So the latest one breaks my Unraid, and I am back on v470.42.01 and all is working...
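For reference, the 09:56 lock-up time falls inside the 8am to 10am window that ich777 gives elsewhere in this thread for the update-check cron job. A small sketch of that comparison follows. The function name `in_update_window` is hypothetical, and treating the window as 08:00 up to but not including 10:00 is an assumption, since the exact schedule is not shown in the thread.

```shell
# Hypothetical helper: succeed if a HH:MM time ($1) falls in the assumed
# update-check window, taken here as 08:00 <= time < 10:00.
in_update_window() {
    h=${1%%:*}   # keep the hour part before the colon
    h=${h#0}     # drop one leading zero so "09" compares as 9
    [ "${h:-0}" -ge 8 ] && [ "${h:-0}" -lt 10 ]
}

if in_update_window 09:56; then
    echo "09:56 is inside the assumed update-check window"
fi
```

This only shows that the timing is consistent with a scheduled check having run; it does not show which job, if any, was responsible.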

  7. On 7/31/2021 at 4:26 PM, ich777 said:

    The update only added an update helper for when you upgrade to a newer unRAID version; nothing else was changed in terms of the driver. ;)

     

    Pretty certain that this is not the case, because only a few lines were added that, strictly speaking, have nothing to do with the drivers themselves. Look at the changelog. :)

     

    Are you sure that you didn't change anything?

    No mate, nothing. I simply waited until I saw that the plugin had been updated, installed it, and all works OK :)

  8. Well, I was told it was my server, hardware or GPU, yet the plug-in was updated on 30.07.21 and now everything's working. So it appears it was the plug-in and not my hardware. I am grateful for the work the guys do, but I did say the card and everything was tested and working.

  9. 1 hour ago, ich777 said:

    Sorry, I really can't help with that since @HellraiserOSU hasn't reported back yet and since you haven't tried HW transcoding with Unraid on the gaming machine.

    As said, the drivers are still the same and nothing was changed there at all, so if you roll back to the driver version that worked before, I see no reason why it shouldn't work now.

    No worries. I set it to the one prior to the update and rebooted, and all was OK. Then as soon as I set Enable Docker to No and back to Yes, I get the error, so it's not my card or server. So I guess it's wait and see.

    Screen Shot 2021-07-02 at 11.20.12.png

  10. On 6/28/2021 at 3:06 PM, HellraiserOSU said:

    I'm having an issue where my card appears for a little bit and then disappears. It's an EVGA GeForce RTX 3060 and on the plugins it'll show the driver version and the Installed GPU fine after I reboot. After I check it and refresh the page, it's gone and says No devices found.

    I don't have a VM

    I do see in the logs

    NVRM: GPU 0000:01:00.0: RmInitAdapter failed! (0x23:0xffff:1199)

    NVRM: GPU 0000:01:00.0: rm_init_adapter failed, device minor number 0

    image.thumb.png.b26884559e6ea173765d0e7ad20f6c10.png

    I'm having an identical problem and am being advised it's my card, which works in other machines and shows up in my Unraid, then disappears. If you find a fix please let me know :))
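Since the same `RmInitAdapter failed` line shows up for more than one user and more than one PCI address in this thread, a quick way to see which GPUs logged it is to filter the system log. This is a minimal sketch: the function name `nvrm_failed_gpus` is hypothetical, and it assumes log lines have the same shape as the ones quoted above.

```shell
# Hypothetical helper: read log text on stdin and print the distinct PCI
# addresses of GPUs that logged "RmInitAdapter failed".
nvrm_failed_gpus() {
    grep 'NVRM: GPU .*RmInitAdapter failed' \
        | sed -E 's/.*NVRM: GPU ([0-9a-fA-F:.]+): RmInitAdapter failed.*/\1/' \
        | sort -u
}

# Example use on a live system (the log path is an assumption):
#   nvrm_failed_gpus < /var/log/syslog
```

The follow-up `rm_init_adapter failed, device minor number 0` line is deliberately not matched, so each failure is only counted once per address.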

  11. 3 hours ago, ich777 said:

    If it showed up and did not freeze on the gaming machine, then it seems to me that this is a hardware combination issue on your machine.

     

    Have you already tried the steps that I've linked above?

    You also said something about a mining rig, do you have a chance to put another Nvidia card in the system?

     

    As said, at the time there was only a driver update of the plugin and nothing else; a rollback should have it enabled again. The changes to the plugin itself are only cosmetic and/or add new features, like the notification when a new driver is available, and do nothing to the drivers themselves.

     

    If you search this thread for this issue, you'll see it was reported a few times, but after a BIOS reset/update/change everything seemed to work again. One user even told me that it worked after he put the card in another system and then put it back in his server.

     

    This issue can also happen with risers that are too long or insufficiently shielded, or with defective risers.

    The mining machine is mining, and I don't think the card in that will fit in my R720XD. It just seems odd: when I install the plugin it's all OK and the card is there, but as soon as I disable and enable Docker it instantly says No Devices Found in the plug-in and I get the NVRM: GPU 0000:42:00.0 RmInitAdapter failed error. So I'm convinced it's not my hardware, as it all works and did work fine until after that update. I give up, to be honest. I'll wait till it's fixed, I guess, and say it wasn't my server / GPU, as I'm not the only person with the issue :-/

    61FC4359-7564-45B5-ACE2-519CA878C17B.jpeg

    BD09F250-D8E8-44AD-89AC-F46DBD399801.jpeg

  12. Yes, we installed Unraid on the gaming machine. Yes, the card showed up, and yes, it showed in the plugin... but there was nothing on that machine to test with, as it was just set up to see if the card was visible etc., since it's not my machine :(

    I'm not saying there is a problem; I was showing what was in the logs after I removed it :) I'd just like to get to the bottom of this, as I think someone else is having the same issue. All was working fine, then there was an update, and then it stopped working. I've tested the card: it outputs and games fine... It shows up in a fresh Unraid with the plugin. I put the card back in mine and it's fine, it shows up. I install the plugin, disable/re-enable Docker, and then I get the fault, and what's in the logs is the same as for the chap above :)

  13. 9 minutes ago, ich777 said:

    Have you tried it already in another system?

    Also, I haven't heard back from @HellraiserOSU, so I'm not sure if it's solved now or not.

    removed Plugin 

    Jul 1 22:30:44 R720XD emhttpd: cmd: /usr/local/emhttp/plugins/dynamix.plugin.manager/scripts/plugin remove nvidia-driver.plg
    Jul 1 22:30:44 R720XD root: plugin: running: anonymous
    Jul 1 22:31:01 R720XD kernel: docker0: port 1(veth049e849) entered disabled state
    Jul 1 22:31:01 R720XD kernel: veth6194793: renamed from eth0
    Jul 1 22:31:01 R720XD avahi-daemon[12225]: Interface veth049e849.IPv6 no longer relevant for mDNS.
    Jul 1 22:31:01 R720XD avahi-daemon[12225]: Leaving mDNS multicast group on interface veth049e849.IPv6 with address fe80::5468:5dff:fe57:d2c7.
    Jul 1 22:31:01 R720XD kernel: docker0: port 1(veth049e849) entered disabled state
    Jul 1 22:31:01 R720XD kernel: device veth049e849 left promiscuous mode
    Jul 1 22:31:01 R720XD kernel: docker0: port 1(veth049e849) entered disabled state
    Jul 1 22:31:01 R720XD avahi-daemon[12225]: Withdrawing address record for fe80::5468:5dff:fe57:d2c7 on veth049e849.

  14. Guess Who's back... 

    OK, the GPU works flat out in a gaming machine. It also works in Unraid, i.e. it shows up, including in the plugin.

    So I put it back in my server and it works fine! Then I installed the plugin and the card showed up with version number, GUID, etc. Then I re-enabled Docker and boom, that same fault in the logs, and the web GUI locks up and freezes :( So it's definitely not the GPU or server, as it only happens when I install the plugin and/or re-enable Docker. More than happy to assist with this via Teams or whatever :) Logs below in order of time: before shutdown, reboot, install plugin, re-enable Docker, etc. :)

    r720xd-diagnostics-20210701-2212.zip r720xd-diagnostics-20210701-2208.zip r720xd-diagnostics-20210701-2201.zip r720xd-diagnostics-20210701-2157.zip

  15. On 6/28/2021 at 4:48 PM, ich777 said:

    Please post your Diagnostics (Tools -> Diagnostics -> Download -> drop the downloaded zip file here in the text box).

     

    For how long is the card recognized, can you use it in Docker containers or does it drop instantly?

    Do you boot with Legacy or UEFI?

     

    Please also see here: Click

    Changing the PCIe slot or the PCIe generation can help, also check if you got a setting in the BIOS named "Above 4G decoding” or “large/64bit BARs" and make sure to enable it.

    This is what mine's doing also! I'll let you know in another one :)

  16. 2 hours ago, ich777 said:

    But what else should it be? On my server the plugin just works fine; I have now tried 3 different driver versions and all work just fine.

    Don't get me wrong, but if there was a problem with the plugin or the drivers themselves, I think more people would have reported that here... ;)

    I know mate, I'm at a loss. We will try a fresh Unraid and slap the GPU in and see what happens. I'll come back to you with the findings :)

  17. 10 minutes ago, ich777 said:

    Can you try to do the steps from above with a spare USB Key and Unraid and see if it works on this machine?

    Not really, no. I don't have a spare USB right now, I'm not at home, and this is my mate's gaming machine. So I'll have to leave the card here and he will set Unraid up when he has 5 mins, but I'm 99.6% certain it's not my hardware / server at fault 😜

    I'll come back to you, maybe in a day or two.

  18. 54 minutes ago, ich777 said:

    There seems to be something wrong when the plugin is installed and you get this output.

     

    Downgrade to the driver that was installed previously. This might solve your issues.

     

    This was after the driver upgrade and the reboot I think.

     

    You get the error that the GUI locks up and that Unraid is laggy?

     

    Have you double checked that you boot in legacy mode?

    You can also try a BIOS reset, but only if you know what you are doing.

     

     

    Do you have a second machine where you can test the card with Unraid, I would do it as follows:

    1. Create a new USB Boot stick on a spare USB Key
    2. Put the card in the other machine
    3. Boot the other machine with your card installed from the USB Key
    4. Register for Trial on the WebGUI
    5. Install the CA App
    6. Install the Nvidia Driver plugin from the CA App
    7. See if the WebGUI becomes laggy on this machine too, if not open up a terminal and issue the command: 'nvidia-smi'

    OK, I'll have to set up Unraid on the tower, as the R710 and Supermicro servers won't take a GPU! I'm working till 18:00 UK time, so I'll have a play tonight... It's just strange that it stopped working 1 hour after that update ;(

    WhatsApp Image 2021-06-23 at 15.26.52 (1).jpeg

    WhatsApp Image 2021-06-23 at 15.26.52.jpeg

  19. 1 minute ago, ich777 said:

    When you are saying GUI you mean the WebGUI from Unraid and not the plugin page?

     

    Are you on the Dashboard when the GUI locks up or does it lock up in general? Can you close the window and reopen the GUI again?

    Have you installed any custom build of Unraid or is this a stock 6.9.2 build?

     

    Please open up a Terminal from Unraid and type in: 'nvidia-smi' (without quotes) and post the output here.

     

    I really can't help; this is usually a sign that the card can't initialize because of too little power or some other hardware-related issue (sometimes it can also happen when you are booting with UEFI).

     

    Did you mine with it, or did you test it with a 3D load and connect it to a display?

     

    And rebooted in between?

     

     

    The problem with this error is that it is not easy to identify. I think you are booting with Legacy (CSM)?

    Please double check if you are booting with Legacy mode.

     

    I put it in the machine, installed Windows 10 on it and installed the drivers; the card showed up all OK and worked on an external display. On the GUI: if I click, say, the Plugins or Stats tabs, I get the 3 wavy orange lines and it does nothing. It's totally standard Unraid, I don't play about with it. nvidia-smi: command not found. Honestly, it had been working perfectly fine until that Nvidia driver update; Emby worked and the card transcoded everything correctly. On the 22nd at about 10am UK time I went onto Unraid and the GUI was weird and laggy and sort of locked up, so I rebooted and it was the same. So I removed the Nvidia plugin and rebooted, and it's OK. When I reinstall it, I get that error. I can't get another card at the moment, as there are none in stock and they're well overpriced since they're all being bought for mining. This had been happily working fine for well over 12 months until that update on the 22nd.
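A "command not found" reply to `nvidia-smi` most likely just means the driver package that ships the tool was not installed at that moment, which would fit the plugin having been removed. A small sketch that separates that case from a driver that is installed but cannot talk to the card follows. The helper name `have_cmd` is hypothetical; `nvidia-smi -L` is the tool's option for listing detected GPUs.

```shell
# Hypothetical helper: succeed if a command ($1) is available on PATH.
have_cmd() {
    command -v "$1" >/dev/null 2>&1
}

if have_cmd nvidia-smi; then
    # The tool exists; list the GPUs the driver can see.
    nvidia-smi -L || echo "nvidia-smi is installed but reported an error"
else
    echo "nvidia-smi not found: the driver is probably not installed"
fi
```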

  20. 16 minutes ago, ich777 said:

    When does it update or what do you mean exactly?

     

    Can you please be a little bit more specific? Also, you didn't answer this question:

    Did the driver update or did the plugin update?

     

    Have you already tried to pick a driver from the stable branch?

    Sorry, the Nvidia driver updated on 22nd June according to Telegram. Yes, the GUI does lock up, so I had to MC in and remove the Nvidia plugin, and now it does not lock up and there's no error in the logs. When I reinstall the plugin I get this: R720XD kernel: NVRM: GPU 0000:42:00.0: RmInitAdapter failed! and the GUI locks up. The card works; I've tested it in my mining machine. Yes, I tried latest and the option below that.

    Telegram.jpg

  21. 4 minutes ago, ich777 said:

    Yes but first a de-installation of such "scripts" is necessary to properly troubleshoot the card.

     

    Does your GUI also lock up and freeze with the Nvidia Driver installed?

     

    Have you upgraded recently (Unraid, BIOS, Driver version,...)?

    This error is normally a sign that the card doesn't initialize properly, can you test the Card in a desktop computer, install the driver (driver installation is necessary because the basic display output mostly works properly) and put a 3D load on it?

     

    Please also try to swap the PCIe slot if possible and/or reseat the card.

    This seems like a power issue or a failure of the card, but that's only a guess.

     

    Btw: sadly enough your Intel iGPU is a little too old for transcoding HEVC but h.264 should work fine on it if you need a temporary solution.

    It's all been working 100% OK with no issues until the Nvidia plugin updated; transcoding was fine too, with multiple streams. It's in a Dell R720XD server with two 1100W PSUs. The card works fine. I don't believe this to be a hardware fault at all.

  22. 14 hours ago, ich777 said:

    The driver is already auto-compiled, but it isn't listed on their download site, so you actually can't install it, because I grab the driver versions from there (otherwise this would be a complete mess). Or you switch to latest, then it should be listed if I'm not mistaken... :D

    Driver

    MD5

     

    I would first try to remove your "script"; maybe that's the problem.

    100% it's the Nvidia plugin: when I removed it, my GUI doesn't lock up and there are no errors. Reinstall the plugin and it fails with the same error in the logs.

    r720xd-diagnostics-20210623-1209.zip

  23. Could anyone have a quick look into this for me please? My Emby GPU has stopped working and I can't understand why: kernel: NVRM: GPU 0000:42:00.0: RmInitAdapter failed!

    I have attached logs. I have removed the plugin and restarted the server, re-added the plugin and restarted, and disabled the Docker service and restarted, etc., but I can't understand it. Nothing's been touched or changed and the server's never moved :(

     

    Cheers

     

    Lee

    r720xd-diagnostics-20210622-1052.zip