mikeyosm

Members
  • Posts

    646
  • Joined

  • Last visited

Posts posted by mikeyosm

  1. On 2/18/2024 at 7:34 PM, jakea333 said:

    Glad it worked out. I am curious as to the root cause as well, as other boards with the Aspeed BMC don't seem to suffer in the same way. It's beyond my ability to troubleshoot, but I know that something changed between 6.12.4 and 6.12.6 that introduced this bug for me.

     

    Maybe someone else can identify the specific fix that's needed. I'm planning to leave it blacklisted and check after each Unraid release. Hopefully it's fixed in time with kernel updates.

    Have you tried 6.12.9 to see if the Aspeed blacklist is still required?

  2. 49 minutes ago, FlamongOle said:

    No specific reason why that should happen.. but it should list device by device as before, then when that's done it will write into the database. Nothing that really changed a lot to not make this not work like before.

     

    If it's stuck for so long, take a look if the lock.db file exists in /tmp/disklocation/, if it does, delete it and check if the install continues. Might be that I should delete the temp lock file during upgrade, so I will add that now.

    Yes, that fixed it.

  3. 12 hours ago, FlamongOle said:

    Update 2024.03.07

    • Commit #296 - Added background blinking (blue) on a device during "Locate" click, if assigned.
    • Commit #290 - Changed how the background task store the SMART input to the database, collect all info before writing it instead of writing it per device as found.

    Should probably not "brick" any servers out there :P GL;HF ❤️ 

    Not bricked anything but it's stuck on the below for 10 mins at least now....

     

    "Extraction done. Package file disklocation.2024.03.07.zip extracted. Adding devices into database, please wait... this might take a while..."

  4. This video inspired me to look at virgl in more detail.

    Proxmox has had this capability for a while and the 3d acceleration seems decent enough for most people, especially those looking to emulate Android and have limited space for a GPU. Can us UNRAIDers get this feature?

  5. 6 minutes ago, theDrell said:

    Yeah Noctua claims it’s medium for even overclocking the 14900k. I turned all the fans to 100% last night and turned off some of the turbo boost for now. Temps are manageable now. 
     

     

    I ordered a different Larger Noctua cooler and Amazon agreed to return this one. I’ve built pcs for over 20 years, so although I won’t say the thermal paste is perfect, it should have been good enough to not be maxing temps. 

     

    I’ll swap to the bigger one and let you know. I’m building in a 4u Supermicro sc846 case so am limited some on cooler height. 

    Not sure if you've seen this but should help get temps down even more

    https://forums.unraid.net/topic/145432-i7-13700k-and-asus-pro-ws-w680-ace-ipmi-optimizations/

    • Like 1
  6. 29 minutes ago, sharpling said:


    Thank you,

    Think I found the problem, 

    The mobo seems to use port sharing between slimsas and pch nvme slots. I have slimsas connected to a sas backplane - disconnect it and the nvme drives are detected / working.

    I will test if breakout cable allows any sata drives at all but i think it's the whole port - so no slimsas ports in case you want use pch m.2_2 or m.2_3.

     

    Oh no, hope not. I have the mATX variant and planned on using the slimSAS in SATA mode as well as both the m.2 slots. I guess we'll see as soon as I receive the cable.

  7. 11 hours ago, FlamongOle said:

    Update 2024.02.21

    • Commit #278 - BUG: Commit #274 should be fixed, turned around some variables, but also added the checks as a background tasks instead of loading and checking every refresh. The status of the devices is checked every 5 minutes instead.

    This should improve the loading time and performance quite a lot on large systems. The power mode data is stored in memory and is updated every 5 minutes.

    Fixed my issue - thank you.

     

    image.png.d26e3ec9a3a82f071916deb967576c02.png

  8. FWIW - here's mine, really happy with the results...

     

    W680M-ACE-SE

    14700

    128GB DDR5

     

    No VMs running

    Approx 13 dockers running

     

    Sits at around 36-60W idle depending on UNRAID process / dokcer usage.

    Temps are pretty decent given I'm using the Thermalright Silver Soul 92mm cooler. 

     

    image.thumb.png.637fe0f854124845cdacdce0a73e1ee8.png  

     

  9. 11 hours ago, FlamongOle said:

    Update 2024.02.19

    • Commit #274 - ISSUE: As some or all NVMe devices does not output standby mode on SMART info, I have set the drive to be overridden as "ACTIVE" regardless as long as it is a  NVMe device and is detected by SMART and the system at all.

    @mikeyosm you can try to update the plugin now and see if it helps.

     

    Updated, did a force scan all, refreshed but unfortunately no change 😞

     

    image.png.40146d261836fffc1dae8e38f30de5f6.png

  10. 9 minutes ago, FlamongOle said:

    I think I know what it is. The nvme device does not give any status of if it's in standby or not, and as this is an unassigned device it won't receive the typical status from Unraid either.

     

    I must find another way of checking the nvme devices. There's probably no reason to why it should report as active, idle or standby as an SSD. I have mine in ZFS and that gives different infos, so haven't seen or tested unassigned nvme.

     

    I might come up with a fix later tonight, but I'll see what I manage and bother :P Regardless, nothing to worry about as the you get temps and SMART OK (which is enough to know it's present really).

    I thought i might have something to do with unassigned devices/nvme, thanks for confirming. It would be awsome if you could fix it, the plugin is very useful and for me would make it 'complete'. 

    • Thanks 1
  11. 3 minutes ago, FlamongOle said:

    As long as it has it's own serial number, you should be good to go regardless of what SMART-info it find from the devices.

     

    The "Device not present" is just looking up on what Unraid has put it's state as. Do you see anything odd under "Main" tab looking at the drive there? Is the drive icon grey, or does it show something else?

    Looks OK on the main tab

    image.thumb.png.e2f394b182dc96e0fe0370321da83269.png

  12. 7 minutes ago, FlamongOle said:

    Why I am asking is, did you try "Force scan all"?

    Yes, a few times, same result

     

    The only difference with my nvme device is no LUN or FF is listed compared with my other drives which are all OK.

     

    image.png.84fcface2f16f27086e2bd893116f436.png

  13. 10 minutes ago, FlamongOle said:

    I dunno... hard to say. What did you try to do? Have you read the forum thread for possible solutions or which logs/output from commands you might give me that might be useful?

    I haven't tried anything special, just wanted to add my single m.2 to the dash. I am curious how it can show device not present yet show me the SMART status and temperature. Let me know what logs you need.

    image.png.ca21cf691e6a7c537c59661c8baad6d1.png

     

    image.png.32228b7e2b995f19aec67feb1360b12b.png

  14. 42 minutes ago, jakea333 said:

     

    I've had issues with the W680M board and iGPU passthrough that appear to be related to the BMC that I wasn't able to resolve with the BIOS changes mentioned in this thread. These weren't present on Unraid 6.12.4, but began when I attempted to update to 6.12.6. Thanks to JorgeB in the 6.12.6 announcement thread, blacklisting the ast driver allows the iGPU to work again:

     

    echo "blacklist ast" > /boot/config/modprobe.d/ast.conf

     

    You'll lose the BMC during Unraid startup, but I don't generally use it at that point anyway.

    Yes, this resolved the issue and now the iGPU is shown in gpustats properly. I wonder why the aspeed is causing issues?

  15. 6 hours ago, mikeyosm said:

    ~# dmesg |grep i915
    [    8.365381] i915 0000:00:02.0: [drm] VT-d active for gfx access
    [    8.376517]  i915_driver_probe+0x83f/0xc11 [i915]
    [    8.383725]  i915_init+0x1f/0x7f [i915]
    [    8.391432] Modules linked in: x86_pkg_temp_thermal intel_powerclamp coretemp i915(+) kvm_intel kvm crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel sha512_ssse3 sha256_ssse3 ast sha1_ssse3 iosf_mbi drm_vram_helper drm_buddy aesni_intel i2c_algo_bit drm_ttm_helper drm_display_helper crypto_simd ttm cryptd drm_kms_helper rapl intel_cstate mei_hdcp mei_pxp drm i2c_i801 intel_gtt tpm_crb ipmi_ssif nvme i2c_smbus wmi_bmof tpm_tis ahci mei_me intel_uncore agpgart cdc_ether syscopyarea sr_mod input_leds sysfillrect usbnet video sysimgblt acpi_ipmi i2c_core igc joydev led_class cdrom mei nvme_core mii libahci corsair_psu vmd fb_sys_fops thermal fan tpm_tis_core wmi ipmi_si tpm backlight intel_pmc_core acpi_pad acpi_tad button unix

     

     

    ~# intel_gpu_top
    No device filter specified and no discrete/integrated i915 devices found

     

     

    SYSLOG---

     

    image.thumb.png.02bfcbd60a7c2b597535b6e846868950.png

    Adding echo "blacklist ast" > /boot/config/modprobe.d/ast.conf has helped.

    Not sure why I am having to blacklist the ASPEED driver though. Thanks to @jakea333 and @JorgeB for pointing me in the right direction.

  16. On 1/27/2024 at 8:36 PM, demps said:

    Sorry if this was answered in this thread already but what is the trick to getting iGPU transcode working with this board? I'm assuming I'm need to change something in BIOS. 

     

    I can plug a monitor in and get output but I can't get the iGPU to show up. /dev/dri only shows card0 in it. 

    Same for me btw, I'm using a 14700 CPU. Did you fix it?