Jump to content

alexdac99

Members
  • Posts

    3
  • Joined

  • Last visited

Posts posted by alexdac99

  1. 23 hours ago, ich777 said:

    Then roll back to an even older version, the drivers won't change and the package is always the same since they are precompiled.

    I've just tried v525.78.01 as well and issue still appears after around 5 hours at idle.

     

    1 hour ago, ich777 said:

    If it was working before something in terms of hardware or BIOS must have been changed since otherwise the older driver will work as expected, I think you get the point what I'm trying to say.

    Maybe the power supply is dying or something similar, if you haven't changed anything and you've rolled back the driver and you have the exact same issue with the previous driver there must be something different now...

    The only thing changed was that I switched out the RAM about a month ago to 4x32GB sticks running on their XMP profile at 3200MHz. I have noticed though since this issue appeared that when I shutdown the system, if I turn it on the EZDebug CPU LED is red (MSI B550 Gaming Gen3 Motherboard) which apparently means CPU not detected or initial checks failed at start, but if I press the Reset button there, it boots up fine... I know the CPU is alright, and temps have never gotten high, so I'm starting to think possible PSU or Motherboard failure...

     

    1 hour ago, ich777 said:

    Can you please describe what you are doing? Do you update the BIOS on Unraid itself?

    I updated the GPU's VBIOS driver right now through a bootable usb to enable Resizable BAR Support using Asus' tool (GPU is Asus 3060ti Dual OC). Was trying to do it through Unraid before but ended up doing it using their provided exe since safer.

     

    I've also updated the Motherboard BIOS right after the issue started to occur, thinking it might've been an issue with the new driver and the old BIOS.

  2. 12 hours ago, ich777 said:

    What do you mean exactly, do you mean the driver itself or the plugin?

    Have you yet tried to roll back to the previous driver that was working?

     

    Have you changed something in terms of hardware or did you maybe update your BIOS or change some settings in the BIOS?

    Have you yet tried to disable C-States in the BIOS?

     

    Please don't do this manually until advised since the plugin does this on it's own.

     

    The driver itself I think. I tried to roll back to v525.116.04 (production branch) and still had the same issues. I tried disabling Global C-States today, as well as adding pcie_aspm=off to the grub boot options (some people recommended that on nvidia forums) and I am still having it fall off the bus, even while the GPU is just idle. 

     

    I also changed nothing in terms of hardware, and I only updated my BIOS after the issue started happening to see if that would help. I didn't change anything in terms of the BIOS until after the issue started.

     

    EDIT: I just noticed that my VBIOS for my 3060ti isn't updated to supported Resizable BAR. I am trying to update it with the tool from Nvidia but when I rmmod the kernel drivers, the nvidia kernel gets reinitialized, I'm guessing from the plugin?

  3. Having issues recently since the last driver update. For some reason my GPU falls off the bus. I've tried reseating it multiple times, cleaning the connector (no riser), updating motherboard bios, replacing CMOS battery, and uninstalling and reinstalling the nvidia-driver plugin, including clearing the plugin kernel folder and then downloading and updating the driver again. When I reboot sometimes it seems to connect, other times i start to get:

     

    Atlantis kernel: NVRM: GPU 0000:2b:00.0: RmInitAdapter failed! (0x22:0x56:760)
    Atlantis kernel: NVRM: GPU 0000:2b:00.0: rm_init_adapter failed, device minor number 0

     

    I've attached the diagnostics as well as the nvidia-bug-report.log if that's of any help. There is also no power savings options for the PCIe slots on my motherboard in the BIOS. I also have the Re-Size BAR Support, above 4G decoding, and IOMMU BIOS settings enabled, and I disabled Secure Boot to see if that would help. At the end of the syslog is the stack trace from the gpu driver issue.

    atlantis-diagnostics-20230508-2114.zip nvidia-bug-report.log.gz

×
×
  • Create New...