
hsingh314

Members, 34 posts
Posts posted by hsingh314

  1. So I started it up. It was missing all the drives except the ones connected directly to the MB. I saw JorgeB's message and decided to reseat the HBA, and now things seem to be back to 'normal', but I still have the millions of errors. What should I do about this? Should I run a parity check without writing corrections?

     

    I have also attached a diag from before starting the array and afterwards below. 

     

     

    bishop-diagnostics-20240502-0909.zip bishop-diagnostics-20240502-0913.zip

  2. I just started getting messages this morning from my Discord bot that files were missing, hundreds of them. When I woke up I tried to log in and my root password wasn't working. I'm restarting the server right now, hoping I'll at least be able to log in. It showed this error: Warning: fread(): Length parameter must be greater than 0 in /usr/local/emhttp/plugins/dynamix/include/.login.php on line 23

  3. - Removed GPU Top

    - Blacklisted the i915 driver (just did the echo command thing in the link you sent). 

    - Installed 6.12.10 (only one it would let me). 

    - No webgui

    - tried /etc/rc.d/rc.nginx-fpm restart & /etc/rc.d/rc.php-fpm restart

    - Login screen showed up

    - logged in

    - blank screen

    - did diagnostic command through ssh

    - no array, dockers or vm's will start up. 

    - downgraded back to 6.11.5

     

    I have attached the diagnostic thing from when it was "running" 6.12.10 - I am not sure what to do about the update :(

     

     

     

     

     

    tower-diagnostics-20240407-1448.zip
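A minimal sketch for confirming that the blacklist step above actually took effect. It assumes the blacklist line was written to a `.conf` file under `/boot/config/modprobe.d`; that default directory and the function name `is_blacklisted` are illustrative assumptions, not something taken from the linked guide.

```shell
#!/bin/bash
# Report whether a kernel module is blacklisted by any *.conf file in a
# modprobe.d-style directory.
# Assumption: the default directory below is where the blacklist file lives.
is_blacklisted() {
    local module="$1"
    local dir="${2:-/boot/config/modprobe.d}"
    # Match lines such as "blacklist i915", allowing surrounding whitespace.
    # -q: no output, -s: stay quiet if no .conf files exist.
    grep -Eqs "^[[:space:]]*blacklist[[:space:]]+${module}[[:space:]]*$" "$dir"/*.conf
}

# Example usage:
#   if is_blacklisted i915; then
#       echo "i915 is blacklisted"
#   else
#       echo "no blacklist entry found for i915"
#   fi
```

The function only checks the config files; whether the driver is currently loaded is a separate question that `lsmod` can answer.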

  4. Apologies, it's been kinda hectic. 
     

    - Ran update assistant (everything said ok)

    - installed 6.12.8

    - no web ui or array working

    - ran diagnostic but could not get the results through SSH (would crash every time)

    - downgraded to 6.11.5

    - able to get diagnostics (the one posted, from 6.12.8)

    - array spins up

    - no VMs work

    - restart system

    - All VMs gone, all plug-ins gone, all dockers gone

    - Community Apps will not install because Unraid is too old

    - Remake VM and gives error: qemu-system-x86_64: terminating on signal 15 from pid 18676 (/usr/sbin/libvirtd)
    2024-03-23 23:57:39.580+0000: shutting down, reason=shutdown

    - Run update assistant ( everything ok)

    - install 6.12.8

    - same issue, no web ui no array 

    - downgrade back to 6.11.5

    - ui and array back. VMs gone

    - remake VM and gives error qemu-system-x86_64: terminating on signal 15 from pid 18676 (/usr/sbin/libvirtd)
    2024-03-23 23:57:39.580+0000: shutting down, reason=shutdown

     

    - that’s all I’ve done so far (besides questioning my life decisions). 

  5. I tried to downgrade to 6.11.5 which broke everything. Dockers, passthrough, VM's disappeared. So I tried to update back to 6.12.8 .... I am extremely regretting upgrading from 6.12.6 to 6.12.8 and it will not let me downgrade to 6.12.6. I still cannot get ssh to download the diagnostics file. Nothing is working and now I am worried that all my data might have been lost just by pushing the button to upgrade. 

  6. Hi, 

     

    I updated to 6.12.8 (from what I read it was OK to update, but maybe I didn't read enough). I cannot access the webui. I cannot access the array and have no way to get the diagnostic zip file. I did try the suggested solution here:

    But to no avail. 

     

     I am not sure what to do. It was working fine for 90+ days straight without a reboot before this update. Any suggestions?
     

  7. Hi, 

     

    My VM keeps crashing and going into a 'paused' state. I checked the logs and keep getting the same error. I think it has something to do with my GPU, as the ID in the error corresponds to the GPU I am passing through (Tesla P4). Has anyone else encountered this before? Any suggestions on what to do next?

    2023-08-05T22:35:43.826885Z qemu-system-x86_64: vfio_err_notifier_handler(0000:09:00.0) Unrecoverable error detected. Please collect any data possible and then kill the guest

    Thanks!
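A rough sketch for pulling the affected PCI address out of a VM log that contains `vfio_err_notifier_handler` lines like the one above, so the device can be inspected with `lspci`. The function name `vfio_error_devices` and the log path in the usage comment are hypothetical placeholders.

```shell
#!/bin/bash
# Print each distinct PCI address named in vfio_err_notifier_handler
# lines read from stdin.
vfio_error_devices() {
    sed -n 's/.*vfio_err_notifier_handler(\([0-9a-fA-F:.]*\)).*/\1/p' | sort -u
}

# Example usage (the log path is a placeholder, not a known location):
#   vfio_error_devices < /path/to/vm.log | while read -r addr; do
#       lspci -vv -s "$addr"   # verbose details for the reported device
#   done
```

This only identifies which device reported the error; it does not say why the error occurred.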

  8. 1 hour ago, ich777 said:

    It's the same for i-Core series CPUs. There are some i-Core series CPUs out there which are also not supported but I can't put a list together because it's all over the place, you can't find a list which are really supported or not and this sentence is actually from the documentation for GVT-g from Intel... :D

     

    I can only tell you that all i-Core series chips are supported which don't have a letter at the end, except for 'K' these are also supported.

    That's fair. 

  9. 16 hours ago, alturismo said:

    image.png.f938268458aea6635eae2f908b2fa6e1.png

     

    the P630 iGPU is not supported afaik ... so this (sadly) wont work

    Welp! That sucks. Maybe it's worth indicating that in the documentation, as it says that all Xeon E3 v6 CPUs would work, but I believe all the v6's have this GPU. 

    Well... onwards and upwards... I'll try to figure something else out. 

     

    Thanks!

  10. I tried using this to passthrough my Intel iGPU to my VM while still being able to use it for a container. However, when I opened it, it said my system wasn't compatible even though my CPU is a E3-1245v6. The dialogue said I should download my diagnostics (attached) and post here..... so here I am.... 

    bishop-diagnostics-20230804-1824.zip

  11. 40 minutes ago, SimonF said:

    You need to have a file ignore guc bit but we need to find code for your card.

     

     

     

    root@computenode:~# cat /boot/config/modprobe.d/i915.conf 
    options i915 force_probe=56a0
    options i915 enable_guc=7
    root@computenode:~# 

     

    run this command and in the messages it should say the value for force probe. (Mine doesn't show, as it's already force probed or bound to vfio)

     

    create the file as above with the correct value and reboot. or if you have intel_gpu_top installed it should do the modprobe for you without the file.

     

    root@computenode:/tmp# dmesg | grep i915
    [   50.400020] i915 0000:00:02.0: [drm] Incompatible option enable_guc=7 - undocumented flag
    [   50.400647] i915 0000:00:02.0: [drm] VT-d active for gfx access
    [   50.400677] i915 0000:00:02.0: [drm] Using Transparent Hugepages
    [   50.401279] i915 0000:00:02.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=none:owns=io+mem
    [   50.401727] mei_hdcp 0000:00:16.0-b638ab7e-94e2-4ea2-a552-d1c54b627f04: bound 0000:00:02.0 (ops i915_hdcp_component_ops [i915])
    [   50.403380] i915 0000:00:02.0: [drm] Finished loading DMC firmware i915/adls_dmc_ver2_01.bin (v2.1)
    [   50.516061] i915 0000:00:02.0: [drm] GuC firmware i915/tgl_guc_70.bin version 70.5.1
    [   50.516066] i915 0000:00:02.0: [drm] HuC firmware i915/tgl_huc.bin version 7.9.3
    [   50.518499] i915 0000:00:02.0: [drm] HuC authenticated
    [   50.518697] i915 0000:00:02.0: [drm] GuC submission enabled
    [   50.518698] i915 0000:00:02.0: [drm] GuC SLPC enabled
    [   50.518989] i915 0000:00:02.0: [drm] GuC RC: enabled
    [   50.519673] mei_pxp 0000:00:16.0-fbf6fcf1-96cf-4e2e-a6a6-1bab8cbe36b1: bound 0000:00:02.0 (ops i915_pxp_tee_component_ops [i915])
    [   50.519738] i915 0000:00:02.0: [drm] Protected Xe Path (PXP) protected content support initialized
    [   51.024746] [drm] Initialized i915 1.6.0 20201103 for 0000:00:02.0 on minor 0
    [   51.025749] i915 0000:00:02.0: [drm] Cannot find any crtc or sizes
    [   51.025887] i915 0000:00:02.0: [drm] Cannot find any crtc or sizes

    Ugh - I am not sure what I am doing wrong. Also I do have intel gpu top installed but the a380 doesn't show up there either. 

    SCR-20230617-m0c.png
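A hedged sketch of one way to find the "code for your card" mentioned in the quote: read the device ID of Intel display devices from `lspci -nn` output and print a matching `force_probe` line in the same form as the quoted `i915.conf`. The function names are illustrative assumptions, and `8086` is the Intel PCI vendor ID.

```shell
#!/bin/bash
# Print the PCI device IDs of Intel (vendor 8086) display devices found in
# `lspci -nn` output read from stdin.
intel_gpu_ids() {
    grep -Ei 'VGA compatible controller|Display controller' \
        | sed -n 's/.*\[8086:\([0-9a-fA-F]\{4\}\)\].*/\1/p'
}

# Print a force_probe option line for the given device ID, in the same form
# as the i915.conf quoted above.
i915_force_probe_line() {
    printf 'options i915 force_probe=%s\n' "$1"
}

# Example usage:
#   lspci -nn | intel_gpu_ids        # lists one ID per Intel display device
#   i915_force_probe_line 56a0       # prints: options i915 force_probe=56a0
```

The output is printed rather than written to `/boot/config/modprobe.d/i915.conf` so the value can be checked before the file is edited.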
