Jump to content

amstel

Members
  • Posts

    74
  • Joined

Posts posted by amstel

  1. Hi,

     

    Long (you can skip to the Short section down):

    for around 2 years that I'm using unRAID server on my house.

     

    now it is the time that I need to use 100% of my PC's power for some hardcore software (such MatLab),

    and I got poor results using the windows 10 VM machine over the unRaid system.

     

    basiclly,

    now I'm booting directly into win10 that is sitting on SSD,

    and I am trying to run the unRaid (from its USB) over virtualbox machine.

     

    well,

    it is working, booting up.

     

    but,

    I cannot see the NW at all.

    the unraid gets some internal IP address over its br0 network config.

    cannot go to the webgui, not from the VM itself nor outside of it.

     

    Short:

    so,

    when running the unraid server on top of windows 10,

    using virtualbox,

    how to properly configure the NETWORK parameters

    so it would get an IP from my main router?

     

     

    Thanks.

  2. 8 hours ago, SSD said:

    Sort of points back to an issue with Matlab. Similar to running Prime95 I guess.

     

    Not sure if VNC server is part of VM by default. You can try to connect via VNC to the running VM. If that does not work, you can run a VNC server in the VM and then connect from the laptop or something. Splashtop Desktop and NoMachine are similar software. I had passthrough issues with my first VM install and used SplashTop to remote in while passthrough was active and was able to fix the passthrough problem. Without it I was blind and would have never have been able to get the driver installation issue solved.

     

    Not surprised, but worth a chance. Thought maybe unRAID was being starved. Removing one CPU is another thing to try. 

     

    You could try on bare metal and see if it continues.

     

    Did you try the logging? Might tell you more about the consistency of the hang on successive runs.

     

     

    will try that,

    but each of this tests takes alot of time..

     

    I dunno if it is related,

    but also trying now "CPU Mode" "Emulated" instead of "Host Passthrough".

     

     

    maybe I should also try changing between SeaBIOS and OVMF..

     

     

    instead of using the OS as is I became a tester for the OS.

  3. 7 hours ago, nuhll said:

     

    Thats interesting. So this might be the problem. If u dont need GPU, then youre fine, i guess.

    well, I do need the GPU, I also mine with it, never got stuck.

     

    8 hours ago, SSD said:

    It sounds as though memory may be filling up in the Windows VM and processes are starting to get cancelled. The fact that the CPU is still pegged means that the VM is still running. Will the process complete eventually and CPU utilization come back down to baseline? Or is it your assessment that the process is hung at this high CPU is indicating some sort of an infinite loop?

     

    Obviously this is not something other users are observing. But running at that level of utilization is not common, except maybe crypto mining.

     

    The fact you did not see the problem with VNC may just be coincidence, but it may be very relevant.

     

    Curious if you are using ACS override? I know it can cause some issues.

     

    I would try the following in no particular order:

    - Upgrade the GPU drivers. If the only symptom of failure relates to use of the video card, it is worth a try. You could also update your motherboard BIOS.

     

    - Try running both VNC (or something like SplashTop or NoMachine) and graphics card at the same time with the passthru in place. Would be interesting to see if the other video interface continues to operate after passthrough shuts down. You mention teamviewer is showing offline, so not optimistic here. If the network goes, none of these types of tools will work.

     

    - Set up a batch file that outputs cpu and memory utilization to a text file on your cache and on your c: drive. I'm sure you can find command line programs to output this type of data that can be piped to two different files. Also output the date/time. Have it sleep for say 5 minutes between logs. Would be interesting to see if the logging continues even after the display shuts down, and if the c: drive logging continues after the network logging. (After a reboot you should see c: drive logging). It is possible that resources (like memory) are being consumed and processes / services are shutting down. So you may see memory utilization increasing. And if the local goes longer than the network, this would imply some sort of gradual loss of function vs a single event that shuts everything down. Multiples runs that are pretty consistent in terms of duration before the logging stops would point to a resource / software issue, whereas a large amount of variance would point to something more random like hardware or heat.

     

    - download Prime95. It is a program that can stress the CPU to similar levels as your application. It is frequently used to verify stable overclock. See if running Prime95 has similar results as the program you are running. Prime95 does not leak memory and would exclude using your proprietary app which no one could try and reproduce. Be careful, as Prime95, depending on the settings, can produce a lot of heat. There is a blended test that may be suitable. But I would run it and watch your CPU temps for at least 15-30 mins to make sure they are not getting out of hand before letting run unattended. Tests that say no memory mean the CPU is able to run harder, because memory accesses give the CPU a little breathing room. The harder the CPU is pushed the hotter it runs.

     

    - Try playing with the CPU allocation. Remove the emulator pin. Remove another core. See if reducing the extent of CPU engagement has an effect (e.g., delays or eliminates the hang)

     

    Good luck. These types of issues are complex to debug. Hope this gives some ideas of how to narrow it down.

    thanks for the detailed reply.

    well I did try with and without the ACS override -->> same results.

    crypto mining is working well for me, even when doing it 24/7 (CPU + GPU).

    GPU drivers are being updated constantly.

     

    how can I run VNC + passthru at the same time?

     

    I have also checked and this software consume almost 100% of the CPU at run time,

    but consumes really small amount of RAM while running, 210MB.

    I also ran the software on my laptop and on VNC and it works great.

     

    I also tried removing the emulator pin -->> same results.

     

     

    well I guess that if this would be a lot of trouble for me I would have to boot into the winOS instead to unRaid and see how it works there.

     

     

    Thanks for trying till now.

     

  4. 7 hours ago, 1812 said:

     

    no, just trying to isolate the problem area. stay on 6.4

     

    is this software you are running in another os, like windows? if so, what guide did you follow to set it up? If you didn't follow a guide, then there is a decent chance that is your problem.

     

    well, the software configuration is just fine.

     

    I ran the software all night running on VNC instead of passthru,

    and it is working fine.. didn't get stuck.

     

     

  5. Hi,

     

    I'm running win10 on VM, using cpu isolation for cpus 1,2,3,5,6,7 out of 8 .

     

    my xml settings for the cpus:

      <cputune>
        <vcpupin vcpu='0' cpuset='1'/>
        <vcpupin vcpu='1' cpuset='2'/>
        <vcpupin vcpu='2' cpuset='3'/>
        <vcpupin vcpu='3' cpuset='5'/>
        <vcpupin vcpu='4' cpuset='6'/>
        <vcpupin vcpu='5' cpuset='7'/>
        <emulatorpin cpuset='0,4'/>
      </cputune>

    I have also noticed that after moving to unraid 6.4,

    the automatic settings have changed the order of the cpus, so I also tried this:

      <cputune>
        <vcpupin vcpu='0' cpuset='1'/>
        <vcpupin vcpu='1' cpuset='5'/>
        <vcpupin vcpu='2' cpuset='2'/>
        <vcpupin vcpu='3' cpuset='6'/>
        <vcpupin vcpu='4' cpuset='3'/>
        <vcpupin vcpu='5' cpuset='7'/>
        <emulatorpin cpuset='0,4'/>
      </cputune>

     

     

    I'm using software like Matlab that does calculations.

     

    the software uses 100% off all 6 cores.

    I run that software for 15 hours to finish come calculations.

    I leave it to run over the night,

    but in the morning the VM doesn't wakes up when I move the mouse nor hit the keyboard.

    also teamviewer shows the VM is offline.

     

    unraid's dashboard shows that all 6's cpus are using 98%-100% also when VM is stuck.

     

    VM log and unraid's log don't show anything irregular..

     

    well,

    any ideas what the problem is?

     

    my HW  for the VM is:

    6 isolated CPUS.

    12GB RAM.

    unraid's cache SSD also contains the VM's C (main) drive.

    passthru of GTX 1060 6GB with 3 connected monitors.

    windows auto turn off monitor is OFF.

    screen saver turns on after 5 minutes (problem also occurs without screensaver on).

     

     

    where do I start investigating the issue?

     

    Thanks.

  6. 10 hours ago, 1812 said:

     

    6 threads only used by unraid. Rest for vm's. all 3 vm's use an emulator pin set to use one of the 6 unraid threads. 2 use vnc, 1 uses gpu. If you leave vnc connected, it will never idle at next to nothing because the vnc needs CPU power to process. I set each of the vm's to allow thier "monitor" to sleep and disable screen saver (otherwise it is using CPU to run a screensaver on a closed vnc connection- a waste.

    well,

    mmy VM configured like this:

      <cputune>
        <vcpupin vcpu='0' cpuset='1'/>
        <vcpupin vcpu='1' cpuset='2'/>
        <vcpupin vcpu='2' cpuset='3'/>
        <vcpupin vcpu='3' cpuset='5'/>
        <vcpupin vcpu='4' cpuset='6'/>
        <vcpupin vcpu='5' cpuset='7'/>
        <emulatorpin cpuset='0,4'/>
      </cputune>

    screen saver is not configured,

    I only set the turn off screen after 5 minutes of inactivity.

     

    btw,

    I only use GPU passing and not VNC.

     

     

  7. Hey,

    started the routine but got confused O.o

     

    I am using:

    USB 2.0 on board(2 ports),

    USB 3.0 on board(4 ports),

    USB 3.0 Header (2 ports)

     

    my output:

    ~#lspci | grep USB
    00:14.0 USB controller: Intel Corporation Sunrise Point-H USB 3.0 xHCI Controller (rev 31)
    
    ~#lsusb
    Bus 002 Device 002: ID 0951:1666 Kingston Technology DataTraveler G4
    Bus 002 Device 003: ID 0781:5590 SanDisk Corp.
    Bus 002 Device 001: ID 1d6b:0003 Linux Foundation 3.0 root hub
    Bus 001 Device 005: ID 04b3:3025 IBM Corp. NetVista Full Width Keyboard
    Bus 001 Device 004: ID 03f0:2b17 Hewlett-Packard LaserJet 1020
    Bus 001 Device 003: ID 045e:0083 Microsoft Corp. Basic Optical Mouse
    Bus 001 Device 002: ID 046d:c52b Logitech, Inc. Unifying Receiver
    Bus 001 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
    
    IOMMU group 3
    	[8086:a12f] 00:14.0 USB controller: Intel Corporation Sunrise Point-H USB 3.0 xHCI Controller (rev 31)

    well,

    looks like the 'USB 2.0 onboard' and the 'USB 3.0 Header',

    sharing the same bus (BUS 001).


    I only want to passthru all the devices from Bus 001,
    how do I do that?

  8. On 3/23/2017 at 5:55 PM, darianf said:

    hi, try the recommendations from the cpu pinning sticky thread...

    also try the tweaks plugin and reduce the cache values from the default ones (you could make them half if you have more than 16GB RAM on the host)

    -d

     

     

     

    Hey,

    thanks for the reply and sorry for the delayed response.

     

    I used the tweaks plugin and set the values as was suggested.

     

    but I'm still getting the same behavior with that wireless mouse.

     

    what else should I do?

     

    the battery is full,

    the usb dongle is 1meter from the mouse.

     

  9. Hello,

     

    I'm using win10 VM that is passed thru nvidia GPU.

    I also passed the logitech wireless usb dongle to that machina.

    mouse is working mostly fine,

    but sometimes it is very laggy.

     

    when I move the mouse, the mouse pointer gets stuck in one place and after 1-2 seconds jumps straight to the next position.

     

    what should/could I do in order to fix this?

     

    Thanks.

     

  10. well,

    after last shutdown of the VM,

    I started the VM again and it got stuck again on the startup screen.

    after few minutes it shutdown itself.

     

    I got this from the log:

    " LC_ALL=C PATH=/bin:/sbin:/usr/bin:/usr/sbin HOME=/ QEMU_AUDIO_DRV=none /usr/local/sbin/qemu -name guest=win10,debug-threads=on -S -object secret,id=masterKey0,format=raw,file=/var/lib/libvirt/qemu/domain-7-win10/master-key.aes -machine pc-i440fx-2.7,accel=kvm,usb=off,dump-guest-core=off,mem-merge=off -cpu host -m 10240 -realtime mlock=off -smp 4,sockets=1,cores=2,threads=2 -uuid 21fb2763-0e7e-e9f0-8a35-3265378dc719 -display none -no-user-config -nodefaults -chardev socket,id=charmonitor,path=/var/lib/libvirt/qemu/domain-7-win10/monitor.sock,server,nowait -mon chardev=charmonitor,id=monitor,mode=control -rtc base=localtime,driftfix=slew -global kvm-pit.lost_tick_policy=discard -no-hpet -no-shutdown -boot strict=on -device nec-usb-xhci,id=usb,bus=pci.0,addr=0x7 -device ahci,id=sata0,bus=pci.0,addr=0x3 -device virtio-serial-pci,id=virtio-serial0,bus=pci.0,addr=0x4 -drive file=/mnt/user/vms/win10/vdisk1.img,format=raw,if=none,id=drive-sata0-0-2,cache=writeback -device ide-hd,busdr=2,id=hostdev2,bus=usb.0,port=1 -device usb-host,hostbus=1,hostaddr=4,id=hostdev3,bus=usb.0,port=2 -device virtio-balloon-pci,id=balloon0,bus=pci.0,addr=0x8 -msg timestamp=on
    Domain id=7 is tainted: high-privileges
    Domain id=7 is tainted: host-cpu
    char device redirected to /dev/pts/0 (label charserial0)
    2017-03-17T20:49:16.605981Z qemu-system-x86_64: terminating on signal 15 from pid 8719
    2017-03-17 20:49:18.207+0000: shutting down, reason=destroyed "

     

    btw,

    after the VM shutdown itself I started it again, and it loaded fine..

     

    Thanks.

  11. 23 hours ago, 1812 said:

    I consistently feed 32GB from several sticks to a vm, so not a bug that I know of.

     

     

    try to start the vm, wait for it to stall, then export diagnostics and post them up here.

     

    well,

    after few reboots and shutdowns of the unraid server it started to load for some reason...

  12. Hello,

     

    I've added one more DDR4 8GB stick to my unRaid server, so now I've got 2 sticks of 8GB each.

     

    I have tried to increase the size of the RAM for my win10 VM, but it getting stuck on the win10 logo.

     

    until today I used 6GB of RAM with the one stick (which works well even with 2 installed sticks).

    but the problem occurs when I assign more than this, for example 8GB/10GB/12GB.

     

    is this an unraid bug? can't it assign ram from 2 different RAM sticks to a VM?

    or, is it a way of passing thru an entire RAM stick to the VM?

     

    btw,

    I'm passing thru GTX1060 6GB GPU.

     

    Thanks.

  13. 20 minutes ago, 1812 said:

     

    your core assignments look soooo wonky, which is fine because windows 10 doesn't care about where the cores come from (as shown through benchmarking tests) or even using threaded pairs.

     

    If you're using all the cores, there is no need to really specify an emulator pin. There might be a super small gain? But if you want to, make it 2 or 3

     

    mmm,

    well, I tested 4 different combinations with GeekBench cpu benchmark

    and those are the results (singlecore, multicore):

    4 cpu's, no isolation: 3805, 6951

    2 isolated cpus (1,3): 3323, 4041

    2 isolated cpus (1,3) + 2 shared cpus: 3632, 6588

    2 isolated cpus (1,3) + 2 shared cpus + emulatorpin=2: 3554, 6781

     

    it seems that the normal configurations gives the best benchmark result.

    what am I missing here?

     

    Thanks.

  14. 1 hour ago, 1812 said:

     

    Yes. Just because a cpu is isolated/not isolated from unRaid doesn't keep it from being assigned to a vm.

     

    If you do that, my suggestion would be to ensure that vcpu 0 of the vm is an isolated core, as windows and most other operating systems will use that as a primary resource and favor it, especially during initial booting of the vm. If you did not change it, then the vm and unraid would try to use that as primary, resealing in decreased performance. Also note that using the shared cores (non isolated) may contribute to latency, diminishing he point of isolating cores.

    well,

    I have changed cpu (1,3) to be isolated.

     

    now,

    I'd like to assign all 4 of them to the VM.

     

      <cputune>
        <vcpupin vcpu='0' cpuset='1'/>
        <vcpupin vcpu='1' cpuset='3'/>
        <vcpupin vcpu='2' cpuset='0'/>
        <vcpupin vcpu='3' cpuset='2'/>
        <emulatorpin cpuset='2'/>
      </cputune>

     

    do I still need to use the emulatorpin feature?

    if yes, which CPU should I assign there?

     

    Thanks.

  15. 20 minutes ago, 1812 said:

     

    Yes. Just because a cpu is isolated/not isolated from unRaid doesn't keep it from being assigned to a vm.

     

    If you do that, my suggestion would be to ensure that vcpu 0 of the vm is an isolated core, as windows and most other operating systems will use that as a primary resource and favor it, especially during initial booting of the vm. If you did not change it, then the vm and unraid would try to use that as primary, resealing in decreased performance. Also note that using the shared cores (non isolated) may contribute to latency, diminishing he point of isolating cores.

     

    yes I know that,

    I just don't want the VM to be too slow, cause using 'only' 2 hyperthreaded cores (1 cpu).

     

    I will give those 2 different configurations a try and will test each to see which will act better.

     

    Thanks for the help!

×
×
  • Create New...